Administrator/Researcher: Angela J. Cone
Costa Tsirigakis - Founder J2 Y DNA project & admin/researcher from 2006 - mid 2008
See bottom of page for Project Admin. information

Analysis Phase 2
  Marker distributions in J2 sub-clades
   First 12 markers
   • Second results panel
   • Third results panel
Analysis Phase 3
  J2 Cluster analysis
   Marker panel comparison
   • Cluster analysis
   • J2b-Delta hypotheses
Please note:
• This analysis is derived from the SNP tested haplotypes that were in the J2 Y-DNA project on 19 July 2006. The reliability, and applicability of these analyses is dependant on how globally representative these haplotypes are for their sub-clades.
It is possible, due to the demographics of people who DNA test, and the relatively low sample size, that these marker values are not globally representative. In future - after more results are added - allele frequency distributions for some sub-clades may change slightly (which, accordingly, may change some of the preliminary observations and/or conclusions). The reliability and representativeness of this data set should increase, as more haplotypes are added.
• J2*( in YCC nomenclature) is a paraphyletic clade (ie. probably contains many different "sub-clades"). The J2 project is aware that there are very distinct clusters in the J2* results, and that these clusters being lumped together may confound the statistical analysis.
However, - if we were to divide J2* using subjective criteria, this would invalidate the scientifically objective nature of the analysis.
When there are adequately scientifically verified UEP's ("unique event polymorphisms", ie. SNP markers, or equivalent) to define these clusters, and there are sufficient numbers of project members who have tested for these UEP's, then these additional scientifically/objectively defined clades will be included in the analyses. For instance, the project is currently beginning to receive results for DYS 413, so we will be able to separate J2a* from J2a1* (Both of these clades are paraphyletic (J2a1* nested within J2a*), however the increased resolution will still be informative).

Last analysed: July 2006

Marker distributions in J2 sub-clades*
(The potential significance of each marker in distinguishing sub-clades is Statistically quantified)

Third marker panel
(markers tested by all labs first, in FTDNA order, and then markers tested by FTDNA only)

There are statistically significant differences in the allele values for DYS 460 between the different sub-clades

T-Test comparing J2a with J2b
T= -4.30 P= 0.0002 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 11.7 P= 0.0002 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2*  A
J2f   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 5.56 P= 0.0018 *Significant*
J2e1 is significantly different to J2f.

 

According to current data, DYS 460 is of significant value in distinguishing J2 sub-clades. J2f is significantly different to J2e and J2*. It is most useful in distinguishing between J2f and J2e, - J2e almost always has values of 11, and J2f almost always has values of 10. Some J2* haplotypes have a value of 11, but others have a value of 10 - and the latter are usually in one of the two main putative clusters lumped within J2* (with a few exceptions).
In practical terms, if a haplotype has a value of 10 for DYS 460, in almost all cases it will not be J2e (with just one exception recorded so far). If a haplotype has a value of 11 for DYS 460, in almost all cases it will not be J2f (with just one exception recorded so far).

There are no statistically significant differences in the allele values for GATA H4 between the different sub-clades

T-Test comparing J2a with J2b
T= -0.22 P= 0.82754 *NS*

ANOVA comparing J2*/J2f/J2e
F= 0.03 P= 0.9676 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 0.25 P= 0.9089 *NS*

 
According to current data, GATA H4 is of no significant value in distinguishing J2 sub-clades.
There are no statistically significant differences in the allele values for YCA II a between the different sub-clades.

T-Test comparing J2a with J2b
T= 0.61 P= 0.5479 *NS*

ANOVA comparing J2*/J2f/J2e
F= 0.68 P= 0.5136 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 0.61 P= 0.6560 *NS*

 
According to current data, YCA IIa is of no significant value in distinguishing J2 sub-clades.
There are highly statistically significant differences in the allele values for YCA II b between the different sub-clades.

T-Test comparing J2a with J2b
T= 16.79 P= 0.0000 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 127 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2*   B
J2f   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 61.00 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e1 A
J2e2 A
J2f    B
J2f1   B
J2*    B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

 

According to current data, YCA II b is the marker which is of most statistically significant value in distinguishing between J2a and J2b. J2e1 and J2e2 are both significantly different to J2* and J2f. J2e1 and J2e2 are not significantly different to each other, and J2f and J2 are not significantly different to each other . According to the SNP tested data currently within the project, values of 19 and 20 are found exclusively within J2b (ie. J2e), and values of 21 and above are found exclusively within J2a. This currently appears to be a 100% conclusive way of distinguishing between J2a and J2b, - however this pattern may not be quite as definitive as the current data suggest, as there is a haplotype within the project with YCA II values of 20, 20 which otherwise has marker values that are not typical of J2e.
In practical terms, (according to the SNP tested haplotypes currently in the project), a value of 19 or 20 is conclusive for J2e, and a value of 21-23 is conclusive for J2*/J2f.

There are statistically significant differences in the allele values for DYS 456 between the different sub-clades

T-Test comparing J2a with J2b
T= 5.67 P= 0.0000 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 17.6 P= 0.0000 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2*   B
J2f   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 8.73 P= 0.0001 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e1 A
J2e2 AB
J2f   B
J2*   B
J2f1  B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

 

According to current data, DYS 456 is of significant value in distinguishing between J2a and J2b. J2e is significantly different to both J2f and J2*, however there are no statistically significant differences between J2* and J2f.
In practical terms, values of 12 and 13 are highly suggestive of J2e, and values of 12-14 are highly suggestive of the haplotype not being J2f. Also, a value of 15 is most often found in J2f, and values of 16 are most often found in J2* (these differences are however not significant in statistical terms).

There are (weak) statistically significant differences in the allele values for DYS 442 between the different sub-clades

T-Test comparing J2a with J2b
T= 2.49 P= 0.0179 *Significant*

ANOVA comparing J2*/J2f/J2e
F= 3.99 P= 0.0284 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2e  A
J2f  AB
J2*   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 1.88 P= 0.1404 *NS*

 

According to current data, DYS 442 is of weak (statistically significant) significant value in distinguishing between J2e and J2*. J2f is not significantly different to either J2e nor J2*.
In practical terms (according to current data) values below 11, and above 12 are more likely to be J2*. Also, if the DYS 442 repeat value is not 11, then it is unlikely to be J2e (but a value other than 11 does not exclude the haplotype from being J2e). In one of the putative clusters within J2*, a value of 12 is most common.

There are no statistically significant differences in the allele values for DYS 438 between the different sub-clades

T-Test comparing J2a with J2b
T= -1.01 P= 0.3211 *NS*

ANOVA comparing J2*/J2f/J2e
F= 1.55 P= 0.2267 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 0.87 P= 0.4954 *NS*

 
According to current data, DYS 438 is of no significant value in distinguishing J2 sub-clades.
There are (weak) statistically significant differences in the allele values for DYS 607 between the different sub-clades

T-Test comparing J2a with J2b
T= -0.60 P= 0.5572 *NS*

ANOVA comparing J2*/J2f/J2e
F= 7.83 P= 0.0019 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2*  A
J2e  A
J2f   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 4.25 P= 0.0086 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2*  A
J2e2 AB
J2e1 AB
J2f   B
J2f1  B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another

 

According to current data, DYS 607 is of weak (statistically significant) value in distinguishing J2f from J2* and J2e. Most J2f haplotypes have DYS 607 values of 13, and most J2* and J2e haplotypes have DYS 607 values of 14.
In practical terms, a value of 13 is suggestive of J2f (but not conclusive), and values of 14 or greater are suggestive of J2e or J2*. According to current data, values of 15 and 16 are unlikely to be J2f.

There are no statistically significant differences in the allele values for DYS 576 between the different sub-clades

T-Test comparing J2a with J2b
T= 0.40 P= 0.6959 *NS*

ANOVA comparing J2*/J2f/J2e
F= 3.09 P= 0.0608 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 2.49 P= 0.0669 *NS*

 
According to current data, DYS 576 is of no significant value in distinguishing J2 sub-clades.
There are (weak) statistically significant differences in the allele values for DYS 570 between the different sub-clades

T-Test comparing J2a with J2b
T= -0.28 P= 0.7809 *NS*

ANOVA comparing J2*/J2f/J2e
F= 4.50 P= 0.0199 *Significant*
Tukey HSD All-Pairwise Comparisons Test
J2*  A
J2e  AB
J2f   B
There are 2 groups (A, B, etc.) in which the clades are not significantly different from one another.

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 3.60 P= 0.0177 *Significant*
J2* is significantly different to J2f

 

According to current data, DYS 570 is of weak (statistically significant) value in distinguishing J2 sub-clades. There is a significant difference between J2f and J2*. J2e is not significantly different to either J2* or J2f.
In practical terms, according to current data, haplotypes with DYS 570 values above 18 are unlikely to be J2f. Haplotypes with values of 16 or lower are more likely to be J2f (but this is not strongly definitive).

There are no statistically significant differences in the allele values for CDYa between the different sub-clades

T-Test comparing J2a with J2b
T= 0.32 P= 0.7493 *NS*

ANOVA comparing J2*/J2f/J2e
F= 0.54 P= 0.5862 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 0.51 P= 0.7288 *NS*

 
According to current data, CDYa is of no significant value in distinguishing J2 sub-clades.
It is noted that J2f has a very restricted range of values.
There are no statistically significant differences in the allele values for CDYb between the different sub-clades

T-Test comparing J2a with J2b
T= -1.34 P= 0.2011 *NS*

ANOVA comparing J2*/J2f/J2e
F= 0.88 P= 0.4244 *NS*

ANOVA comparing J2*/J2f*/J2f1/J2e1/J2e2
F= 1.49 P= 0.2339 *NS*

 
According to current data, CDYb is of no significant value in distinguishing J2 sub-clades.

T-Test comparing
J2a with J2b


J2a   N=24
J2b   N=11





J2a   N=23
J2b   N= 9

ANOVA comparing
J2*/J2f/J2e

DYS 460, GATA H4,YCAIIa,b DYS 456, DYS 442, DYS438

J2*   N=13
J2f   N=11
J2e   N=11

DYS 607, DYS 576, DYS 570, CDYa, CDYb

J2*   N=13
J2f   N=10
J2e   N= 9

ANOVA comparing
J2*/J2f*/J2f1/J2e1/J2e2


J2*    N=13
J2f*   N= 9
J2f1   N= 2
J2e1   N=10
J2e2   N= 1



J2*    N=13
J2f*   N= 8
J2f1   N= 2
J2e1   N= 8
J2e2   N= 1

 

Creative Commons License
This work is licensed under a
Creative Commons Attribution-No Derivative Works 3.0 License.
This work can be freely cited, if it is attributed to:
The J2 Y-DNA project
http://www.j2-ydnaproject.net

Angela Cone - Co-administrator from mid 2006 - mid 2008
Administrator from mid 2008 - Present
Click here to read about Angela.

Costa Tsirigakis - Founder J2 Y DNA project & admin from 2006 - mid 2008