Administrator/Researcher: Angela J. Cone
Costa Tsirigakis - Founder J2 Y DNA project & admin/researcher from 2006 - mid 2008
See bottom of page for Project Admin. information

Analysis Phase 2
  Marker distributions in J2 sub-clades
   First 12 markers
   • Second results panel
   • Third results panel
Analysis Phase 3
  J2 Cluster analysis
   Marker panel comparison
   • Cluster analysis
   • J2b-Delta hypotheses
Please note:
• This analysis is derived from the SNP tested haplotypes that were in the J2 Y-DNA project on 19 July 2006. The reliability, and applicability of these analyses is dependant on how globally representative these haplotypes are for their sub-clades.
It is possible, due to the demographics of people who DNA test, and the relatively low sample size, that these marker values are not globally representative. In future - after more results are added - allele frequency distributions for some sub-clades may change slightly (which, accordingly, may change some of the preliminary observations and/or conclusions). The reliability and representativeness of this data set should increase, as more haplotypes are added.
• J2*( in YCC nomenclature) is a paraphyletic clade (ie. probably contains many different "sub-clades"). The J2 project is aware that there are very distinct clusters in the J2* results, and that these clusters being lumped together may confound the statistical analysis.
However, - if we were to divide J2* using subjective criteria, this would invalidate the scientifically objective nature of the analysis.
When there are adequately scientifically verified UEP's ("unique event polymorphisms", ie. SNP markers, or equivalent) to define these clusters, and there are sufficient numbers of project members who have tested for these UEP's, then these additional scientifically/objectively defined clades will be included in the analyses. For instance, the project is currently beginning to receive results for DYS 413, so we will be able to separate J2a* from J2a1* (Both of these clades are paraphyletic (J2a1* nested within J2a*), however the increased resolution will still be informative).

Last analysed: July 2006

Marker distributions in J2 sub-clades*
(The potential significance of each marker in distinguishing sub-clades is Statistically quantified)

Second marker panel
(in FTDNA order)

There are (weak) statistically significant differences in the allele values for DYS 458 between the different sub-clades.

T-Test comparing J2a with J2b
T= 0.29 P= 0.7751 *NS*

ANOVA comparing J2*/J2f/J2e
F= 4.02 P= 0.0240 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2* A
J2e AB
J2f  B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 2.28 P= 0.0738 *NS*
Tukey HSD All-Pairwise Comparisons Test
J2f  A
J2e2 AB
J2f1 AB
J2e1 AB
J2*   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

 
According to current data, DYS 458 is of weak (statistically significant) value in distinguishing J2* and J2f*, but is of no value in distinguishing between J2a and J2b. There is a strong nearly bimodal separation of values within J2f*. There is one grouping of haplotypes (comprised of one main cluster plus some additional haplotypes), that have a value of 18, and there are a couple of other smaller clusters (plus singleton haplotypes) that have a value of 15 or 16. In most instances the latter are the J2f's that have DYS 390=22 (both J2f1 and J2f*).
In practical terms, a value of 18 most likely means J2f or J2f1, however J2f's that have DYS 390=22 often have DYS 458 values of 15 or 16 (so a value other than 18 does not exclude J2f).
There are statistically significant differences in the allele values for DYS 459a between the different sub-clades.

T-Test comparing J2a with J2b
Cannot be calculated, as the variation for one variable is constant.

ANOVA comparing J2*/J2f/J2e
F= 27.4 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e A
J2*   B
J2f     C
All clades are significantly different

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 13.4 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e1 is significantly different to J2*, J2f and J2f1; J2e2 is significantly different to J2f and J2f1; J2* is also significantly different to J2f.

 
According to current data, DYS 459a is of great significant value in distinguishing between J2e and J2f. There are statistically significant differences between each of the main sub-clade divisions. Within J2e values are currently always 8, and within J2f most values are 9. Each value occurs more or less equally within J2*, and many (but not all) of those within J2* that have a value of 8, are within the two main putative J2* clusters.
In practical terms (according to current data) a value of 9 means that the haplotype is not within J2e. A value of 8 may be suggestive of the haplotype not being J2f (unless the haplotype also has DYS 390=22, DYS 439=10, which may represent a putative cluster within J2f* - additional haplotypes need to be tested to confirm this).
There are no statistically significant differences in the allele values for DYS 459b between the different sub-clades.

T-Test comparing J2a with J2b
T= 0.71 P= 0.4786 *NS*

ANOVA comparing J2*/J2f/J2e
F= 1.06 P= 0.3537 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 1.10 P= 0.3678 *NS*

 

According to current data, DYS 459b is of no significant value in distinguishing J2 sub-clades.

There are no statistically significant differences in the allele values for DYS 455 between the different sub-clades.

T-Test comparing J2a with J2b
T= 0.62 P= 0.5381 *NS*

ANOVA comparing J2*/J2f/J2e
F= 0.25 P= 0.7784 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 0.15 P= 0.9611 *NS*

 
According to current data, DYS 455 is of no significant value in distinguishing J2 sub-clades.
There are statistically significant differences in the allele values for DYS 454 between the different sub-clades.

T-Test comparing J2a with J2b
T= -1.68 P= 0.0996 *NS*

ANOVA comparing J2*/J2f/J2e
F= 1.48 P= 0.2379 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 11.4 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e2 A
J2f    B
J2e1   B
J2*    B
J2f1   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

 

According to current data, DYS 454 is of no significant value in distinguishing between J2a and J2b. Neither is it of any significance differentiating between J2*/J2f/J2e. It is however, of significant value in distinguishing between J2e1 and J2e2 (but owing to the small sample size of J2e2, - we should be cautious in making this conclusion until more J2e2 haplotypes are seen).
In practical terms, in haplotypes with other marker values that are consistent with J2e, DYS 454 values of 12 or 13 might be suggestive for J2e2 (but not conclusive, - as there is a J2e1 haplotype with a value of 12). The results for DYS 437 should also be taken into consideration when assessing whether a J2e haplotype might be J2e2.

There are (weak) statistically significant differences in the allele values for DYS 447 between the different sub-clades.

T-Test comparing J2a with J2b
T= -3.76 P= 0.0005 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 7.44 P= 0.0015 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2f  AB
J2*   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 5.77 P= 0.0007 *significant*
Tukey HSD All-Pairwise Comparisons Test
J2e1 is significantly different to J2*.

 
According to current data, DYS 447 is of (statistically significant) value in distinguishing between J2a and J2b. There are significant statistical differences between J2e, and J2*. J2f is not significantly different to either J2* or J2e.
In practical terms, a value of 28 and above is unlikely to be J2*, and values of 27 and above are generally more indicative of J2e, and values of 26 and below are generally more indicative of J2a (ie J2f and J2*)..
There are statistically significant differences in the allele values for DYS 437 between the different sub-clades.

T-Test comparing J2a with J2b
T= -5.50 P= 0.0000 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 19.5 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2f   B
J2*   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 22.7 P= 0.0000 *Significant*
J2e1 is significantly different to J2*, J2e2, J2f, and J2f1; J2f is also significantly different to J2*.

 

According to current data, DYS 437 is of significant value in distinguishing between J2a and J2b. It is of greatest potential value in distinguishing between J2e1 and J2e2 (but owing to the small sample size of J2e2, - we should be cautious in making this conclusion until more J2e2 haplotypes are seen). A significant difference was also seen between J2f and J2* (but only in the third analysis).
In practical terms, values of 16 or 17 are indicative of J2e1 in most cases (but not all) ; Values of 15 are most likely to occur in J2f (but can occur in J2* as well); values of 14 are most likely to be indicative of J2*, - but if the other marker values are consistent with J2e, then this may indicate the haplotype is J2e2 (especially if the DYS 454 values are 12 or 13).

There are statistically significant differences in the allele values for DYS 448 between the different sub-clades.

T-Test comparing J2a with J2b
T= 4.82 P= 0.0000 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 13.3 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2f   B
J2*   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 8.09 P= 0.0000 *Significant*
J2e1 is significantly different to J2* and J2f1.

 
According to current data, DYS 448 is of significant value in distinguishing between J2a and J2b. J2e is significantly different to J2f and J2*. There is no significant difference between J2* and J2f.
In practical terms, a value of 19 is in most cases indicative of J2e, and haplotypes with a value of 20 or above (or 18 and below) are unlikely to be J2e (however there is one J2e haplotype with a value of 20). Values of 21 appear to predominant in one of the putative clusters within J2*.
There are (weak) statistically significant differences in the allele values for DYS 449 between the different sub-clades.

T-Test comparing J2a with J2b
T= 2.26 P= 0.0280 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 3.03 P= 0.0573 *NS*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2f  AB
J2*   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 2.00 P= 0.1092 *NS*

 

According to current data, DYS 449 is of weak (statistically significant) value in distinguishing between J2a and J2b.
However, this statistically significant difference cannot at this time be easily translated into practical terms. There is a tighter range of values for J2e, and a wider range for both J2* and J2f. It is possible that the statistically significant effect was spurious, as a result of a slight peak for repeat values of 32 in J2*. Whether this small peak within J2* is correlated with a biologically distinct clade lumped within J2* remains to be seen.
In practical terms, all we can say is that within J2e values of 29 are somewhat more likely to occur, and values above 31 and below 28 are (according to current data) unlikely to occur in J2e.

DYS 464 was omitted from the statistical analysis, due to the fact that is a multicopy marker - that is ordered by numerical value, rather than by specific marker copies - and therefore would not lead to a meaningful analysis.

 

T-Test comparing
J2a with J2b

J2a   N=35
J2b   N=18

ANOVA comparing
J2*/J2f/J2e

J2*   N=18
J2f   N=17
J2e   N=18

ANOVA comparing
J2*/J2f*/J2f1/J2e1/J2e2

J2*    N=18
J2f*   N=14
J2f1   N= 3
J2e1   N=16
J2e2   N= 2

 

Creative Commons License
This work is licensed under a
Creative Commons Attribution-No Derivative Works 3.0 License.
This work can be freely cited, if it is attributed to:
The J2 Y-DNA project
http://www.j2-ydnaproject.net

Angela Cone - Co-administrator from mid 2006 - mid 2008
Administrator from mid 2008 - Present
Click here to read about Angela.

Costa Tsirigakis - Founder J2 Y DNA project & admin from 2006 - mid 2008