Wading Through Treacle: Carl Shulman

Showing posts with label Carl Shulman. Show all posts

Wednesday, February 10, 2016

"My simulated sister is smarter than me"

Apologies that this is a bit techie - and it won't make sense without reading the previous post.

Yesterday I did some simple stats to show that my sister is most likely 8 IQ points smarter than me (to be fair and by symmetry, the converse could also be true).

Health warnings:

expected value only when averaged over large numbers of copies of my sister and myself;
equally true for my brother and myself - the stats are gender-blind.

How far is one sibling likely to be from the parental midpoint average?

Intuitively, you wouldn't expect every sibling to be exactly the average (they're not clones) but over a large family the pluses and minus would sort of average out to the mid-parental mean. But what about if we're just considering the deviation from average, without caring about the sign?

We seem to have a choice: halve the expected difference between two siblings, or find the average (absolute) deviation from the mean. As we saw yesterday, these give different answers.

I therefore decided to run an Excel simulation using the built-in RAND() function. Here's the four coin-set (taking values from 0 to 4):

IF(RAND()>0.5,1,0) + IF(RAND()>0.5,1,0) + IF(RAND()>0.5,1,0) + IF(RAND()>0.5,1,0)

and here is the last part of the spreadsheet model showing 100 tosses of two 4-coin sets (random variables X and Y) showing the number of heads.

If you like, you can consider this a simple four gene model for intelligence, with each gene presenting as two alleles, each of which code up or down for IQ by 7.5 points.

I ran each 100 toss simulation ten times and noted the results in the table on the right.

The heading "Mean-IQ" refers to ten runs of the IQ (7.5) value in the "abs(X-Y)" column on the left, showing the mean difference in IQ between the two siblings;

the heading "Dec-IQ" refers to ten runs of the IQ (7.5) value in the "abs(X-2)" column on the left, showing the average deviation (+ and -) of a single sibling's IQ from the parental-midpoint mean.

From yesterday's post the computed values are respectively 8.2 and 5.625.

If we go back to selecting embryos for implantation, which is the right statistic to use to measure our likely IQ gain over the biological default of just taking what comes?

The leftmost statistic, 5.625 IQ points above the mean, would be sort of accurate if we were conceptually considering two embryos, one randomly varying and the other always exactly on the parental midpoint mean. But it wouldn't work, not least because the random embryo might well be below the mean but we're counting all variation as positive. So it's not realistic.

The statistic we get by halving the expected inter-sibling gap of 8.2 IQ points is better as we always select the smarter of the two embryos. However, since both X and Y are varying freely on the range {0,1,2,3,4} it's a bit difficult to correlate the abs(X-Y)/2 gap with the range-midpoint (mean) of 2. At this point we handwave and mutter about symmetry.

And what do you do when the presented embryos are all below the expected average?*

---

* Which with two embryos will occur 25% of the time. I feel like spending some more money and genotyping a few more ...

Tuesday, February 09, 2016

"My sister is smarter than me"

Well, maybe.

Your brother or sister could be exactly the same intelligence as yourself, but that's not the way to bet. Yesterday I wrote about the Carl Shulman and Nick Bostrom paper, "Embryo Selection for Cognitive Enhancement: Curiosity or Game-changer?". They said that the average distance in intelligence between two siblings was 8.4 IQ points.

Hmm.

When I saw that figure I was mightily impressed by their mathematical skills. After much thinking I decided that what they had done was this. They had taken the Gaussian distribution for the IQ of offspring with

mean = parental midpoint IQ; standard deviation = 7.5,

and considered two random variables from this distribution. Let's call them, with a bow to genetics, X for daughter and Y for son.

They had then computed the expected value of abs(X-Y). The absolute value is important here: the expected value of the mere difference (X-Y) is zero, as one sibling taken at random will on average be no brighter or dimmer than the other.

I hate these messy integrals incorporating the bivariate Gaussian distribution with weird boundaries! And apparently so do Carl and Nick: they got their answers by simulation.

But there is another way where we can still be analytic. The answer is in the power of four.

Take four coins and toss them. Count the number of heads - there are five possible outcomes: 0. 1. 2, 3, 4. The respective probabilities are 1/16, 4/16, 6/16, 4/16. 1/16.

The mean number of heads is 2 and the standard deviation of the number of heads is 1. How convenient.

[SD = √(npq) = √(4 * 0.5 * 0.5) = 1].

When you draw the histogram you get something like the above, which is not a million miles away from looking rather ... normal.

If we wanted to make this about sibling intelligence, map the number of heads into IQ scores:

2 heads = IQ 100
3 heads = IQ 107.5
4 heads = IQ 115.

Warning, IQ can go down as well as up.

Now imagine if we had one set of four coins to our left "the daugher IQ" and another set to our right "the son IQ". Throw simultaneously and count the number of left heads and right heads.

The difference (abs, remember, absolute value) in the number of heads is the IQ difference - each head counts as 7.5 IQ points. What do we get?

There's a spreadsheet for that.

Each cell in the bottom table is the product of the corresponding cells in the top two

Now these three tables are either self-evident after a little consideration, or too long to explain. I simply draw your attention to the figure in the bottom right in bold. The sum of all the entries in the third table = 1.09375.

This is the expected value of the difference in number of heads between left and right, "daughter" and "son" coin sets.

To change it into an IQ figure we multiply by 7.5 giving 8.2 ("my sister is 8 IQ points smarter than me!" ... or maybe conversely ...).

If we want the gap between each sibling's expected IQ from the mean expected IQ (over all possible siblings) we halve it to get 4.1. Think about it - don't you just love stats *.

This compares to Carl's and Nick's number of 4.2 based on ten million simulations (table below). Not a bad approximation to the normal distribution and you can see what's going on.

---

Admittedly, extending to the case of 'best of ten siblings' seems a few coin sets too far.

---

* and then I worry about this ...

which equates to 0.75 * 7.5 = 5.625 IQ points.

Time for a simulation ...

Monday, February 08, 2016

Choose the best of your virtual children

I wrote critically of Nick Bostrom in my review of his "Superintelligence" book. But when he's not writing in a philosophical straightjacket, his intelligence and creativity produce rather better results, as here with Carl Shulman.

Suppose you have nine brothers/sisters. Your nine siblings will, of course vary in height, weight ... and intelligence. We know how to think about the relationship between the IQ of parents and their offspring (Steve Hsu explains here).

We take the average of the parents' IQ, and that is the mean IQ of their children.

The children do not of course have identical IQs, they're not clones; instead they populate the usual Gaussian distribution with standard deviation in the 7-11 IQ point range (rather than the usual population IQ SD of 15). The authors were conservative and used 7.5 in their simulation below.

So Carl and Nick have this table in their paper, from which you can see that the maximum IQ gap within ten children (dimmest to smartest) might be as much as 23 IQ points. Yes, I find that surprising too.

The table is based on a large scale simulation (10 million couples) and what I take it to be saying is this: if you use ten embryos and decide to implant the smartest, then you'll get an average 11.5 IQ point gain over just selecting a random embryo with no pre-screening for intelligence at all.

Is that important? It's the difference between a clerical and a professional job.

If you come from a large family, consider your siblings and consider whether any of this makes any sense.

We already do embryo selection for single-mutation diseases. A fertilised egg (in vitro) is allowed to divide until you have, say, an eight-cell clump - at this stage there is no functional differentiation. One cell is then extracted and its genome sequenced looking for the faulty gene. If the genome is fine, the seven remaining cells are implanted and the embryo grows to term with no ill effects; otherwise, discard and repeat. (It is more complex than this).

For IQ-based embryo selection to catch on we need a predictive model which can take a sequenced genome and predict with tight accuracy the resulting IQ (assuming decent nutrition, no abuse etc). We can't do this now as the relevant genes haven't yet been identified. But that should change within five to ten years. And we need to get the cost right down: doing 10 whole-genome sequences could be pricey.

Apart from the hassle of IVF and any legalistic hurdles, the way would then be open. What might be the consequences? Carl and Nick have a table - click on it to make it bigger.

---

Just a note about IES on the right-hand side.

"The effectiveness of embryo selection would be vastly increased if multiple generations of selection could be compressed into less than a human maturation period. This could be enabled by advances in an important complementary technology: the derivation of viable sperm and eggs from human embryonic stem cells. Such stem cell derived gametes would enable iterated embryo selection (henceforth, IES):

1. Genotype and select a number of embryos that are higher in desired genetic characteristics;

2. Extract stem cells from those embryos and convert them to sperm and ova, maturing within 6 months or less;

3. Cross the new sperm and ova to produce embryos;

4. Repeat until large genetic changes have been accumulated."
...
"Using IES could deliver much more extreme results, and the fixed costs of using IES to produce enhanced embryos could be spread across large numbers of enhanced children. On the other hand, IES would compromise the typical genetic relationship between parents and children. To avoid negative effects of inbreeding, IES would require either a large starting supply of donors, or the expenditure of substantial selective power to reduce harmful recessive alleles. These factors would tend to push towards IES offspring being less genetically related to their parents (though more related to one another), and could reduce the appeal of IES."

All this and we haven't even mentioned genetic engineering, or CRISPR-Cas9.

I blogged that Toby Young had a piece about this back in September last year.