1 Introduction

At present, short tandem repeat (STR) loci are applied broadly in paternity testing and individual identification of forensic cases in forensic DNA laboratories all over the world (Rosenberg et al., 2002; Deng et al., 2013; Wang et al., 2013; Zhu et al., 2013; 2014). X-chromosome STRs (X-STRs), which contain the characteristics of both autosomal and uniparental genetic markers, are verified to be of high-efficiency in cases such as the tests of mother-son kinship, half-sisters having a common biological father without the father’s DNA, or grandmother-granddaughter relationships (Liu et al., 2008; Nadeem et al., 2009; Luo et al., 2011). Although a few panels of X-STRs have been used, these X-STR kits were not enough for forensic applications, because of insufficient X-STR loci. Thus, 19 X-STRs (DXS8378, DXS7423, DXS10148, DXS10159, DXS10134, DXS7424, DXS10164, DXS10162, DXS7132, DXS10079, DXS6789, DXS101, DXS10103, DXS10101, HPRTB, DXS6809, DXS10075, DXS10074, and DXS10135) were selected to build up a novel X-STR panel.

Uygur is one of the important ethnic minorities in the People’s Republic of China and a large proportion of Uygurs live in the Xinjiang Uygur Autonomous Region, China (Xu, 2003; Jin and Chu, 2006). It is significant to obtain the information of various genetic markers in Uygur for forensic identification and population genetics. As in the previous reports, we investigated the genetic polymorphisms of 24 Y-chromosomal STR haplotypes (Zhu et al., 2014), killer cell immunoglobulin-like receptor genes (Wang et al., 2012), 21 autosomal STR loci (Deng et al., 2013), and HLA-A, -B, and -DRB1 loci with sequence-based typing (Shen et al., 2010) in the Chinese Uygur ethnic group, but the genetic polymorphism analyses of the novel panel of 19 X-STRs mentioned above have not yet been reported until now. In the present study, we used the panel to estimate the allelic frequencies of the 19 X-STR loci, calculate the important forensically statistical parameters for each locus in a sample of 233 unrelated individuals, and evaluate the allelic frequency differentiations of these loci between the Uygur group and other groups, in order to gain a better understanding of the Uygur overall genetic background.

2 Materials and methods

2.1 Sample preparation and DNA extraction

Informed consent was obtained from all the eligible individuals, and bloodstain samples were collected from 233 unrelated healthy Uygur individuals (139 females and 94 males) living in Ili of the Xinjiang Uygur Autonomous Region, China. The criteria of sample selection were: the ancestors should be unrelated within at least three generations, come from the Uygur ethnic group, and have no family migration. The study was conducted according to the human and ethical research principles of the Stomatological Hospital, Xi’an Jiaotong University, China. After sample collection, the Chelex-100 method was used to extract genomic DNA from the bloodstain samples as described by Walsh et al. (2013).

2.2 PCR amplification and X-STR genotyping

Multiplex polymerase chain reaction (PCR) was performed by the AGCU X19 STR fluorescence amplification reagents (AGCU ScienTech Inc., Wuxi, Jiangsu, China) in a single PCR system in accordance with the manufacturer’s instructions. PCR was performed on a GeneAmp PCR System 9700 Thermal Cycler (Applied Biosystems, Foster City, CA, USA) in a 25-µl reaction volume containing reaction mix, X19 primers, C-Taq, and sterile distilled H2O (sdH2O). Capillary electrophoresis was carried out on the ABI Genetic Analyzer 3500 (Applied Biosystems, Foster City, CA, USA) following the manufacturer’s instructions. Alleles of 19 X-STR loci were genotyped using the GeneMapper ID software V3.2 (Applied Biosystems, Foster City, CA, USA), based on peak heights reaching a set threshold value of 50 relative fluorescence units.

2.3 Statistical analyses

The allelic frequencies of 19 X-STR loci and Hardy-Weinberg equilibrium (HWE) were analyzed by the modified Powerstate V1.2 spreadsheet (Promega, Madison, WI, USA) (Tereba, 1999). The P-value of HWE tests was adjusted using the Bonferroni correction (P=0.05/19=0.0026). HWE was only calculated in the female samples. Forensic statistical parameters including polymorphism information content (PIC), heterozygosity (HET), the power of discrimination in females (PDF) and males (PDM), and mean exclusion chances (MEC) were performed with ChrX-STR.org 2.0 (http://www.chrx-str.org) based on allelic frequencies. Pairwise linkage disequilibrium (LD) analysis was estimated by the Genepop Version 4.0.10 (http://genepop.curtin.edu.au). In order to measure the differences of allele frequencies among different populations, the locus-by-locus Fst and P-values were calculated based on 19 X-STR allele frequencies using the analysis of molecular variance (AMOVA) method by ARLEQUIN Version 3.0 software (Excoffier et al., 2007).

3 Results and discussion

3.1 Forensic parameter analysis

HWE was tested in female samples and no significant deviations were observed after Bonferroni correction (P>0.0026). There were no significant differences for the allelic frequencies of 19 X-STRs between male and female subgroups according to exact tests, except DXS10134, DXS7424, DXS101 and DXS10074 loci. Therefore, allelic frequencies of male and female samples in these 4 loci are shown separately, while the remaining loci are shown together. The allele frequency distributions are shown in Tables 1 and 2. Forensic statistical parameters of the 19 X-STR loci including HET, PDF, PDM, probability of exclusion (PE), paternity index (PI), PIC, and MEC are shown in Table 3. The allelic frequencies of 19 X-STR loci ranged from 0.0021 to 0.5644. The HET ranged from 0.5493 at the DXS 10164 locus to 0.9258 at the DXS10135 locus. The highest and the lowest values of PE were observed at the DXS10135 locus (PE=0.8572) and the DXS10164 locus (PE=0.2928), respectively. The PDF ranged from 0.7889 at the DXS10164 locus to 0.9908 at the DXS10135 locus, and the PDM ranged from 0.6016 at the DXS10164 locus to 0.9301 at the DXS10135 locus. The cumulative PDF, PDM, and PE of the 19 X-STR loci were 0.999 999 999 999 999 999 998 091, 0.999 999 999 999 966, and 0.999 999 986 35, respectively. In the 19 X-STRs, the highest and the lowest values of PIC were surveyed at the DXS10164 locus (PIC=0.54923) and the DXS10135 locus (PIC=0.9258), respectively. The highest value of MEC in deficiency cases, normal trios and duo cases were observed at the DXS10135 locus (MEC=0.8585, 0.9256, and 0.8663, respectively). The lowest values were found at the DXS10164 locus (MEC=0.3595, 0.5493, and 0.4021, respectively). The cumulative MEC in deficiency cases, normal trios and duo cases were 0.999 999 992 849, 0.999 999 999 999 628 and 0.999 999 998 722, respectively. The parameters mentioned above revealed that the panel of 19 loci had great potential for forensic personal identification and forensic paternity testing.

Table 1 Allele frequency distributions in Chinese Uygur group at 15 X-STR loci with no difference between male and female samples
Table 2 Allele frequency distribution in Chinese Uygur group at 4 X-STR loci with difference between male and female samples
Table 3 Forensic efficiency parameters of 19 X-STR loci in Chinese Uygur group

3.2 Linkage disequilibrium analyses

It is necessary to estimate the value of LD before these STR loci are used for forensic application and population genetics. In this study, among the 171 pairwise loci of 19 X-STRs, P-values of 8 pairwise loci were found to be <0.05 in the Chinese Uygur ethnic minority. However, only one (between DXS10075 and DXS10074, P<0.0001) remained significant after Bonferroni correction (P=0.05/171=0.0003). The value of LD is affected by various factors, for example, genetic linkage, natural selection, and demographic structure. The distance between the two markers (DXS10075 and DXS10074) is 0.021 Mbp (http://chrx-str.org), and therefore genetic linkage is likely to be a major cause of the observed LD.

3.3 Haplotype diversities

According to previously reports, there were seven linkage groups observed in the 19 X-STRs. These clusters were listed as follows: DXS10135-DXS8378-DXS10148, DXS7132-DXS10074-DXS 10075-DXS10079, DXS10103-DXS10101-HPRTB, DXS10134-DXS7423, DXS10159-DXS10162-DXS 10164, DXS6809-DXS6789, and DXS7424-DXS101. The first four clusters were obtained from a database (http://xdb.qualitype.de/xdb/linkageTable.jsf), and the remaining clusters from previous studies (Edelmann et al., 2002; 2010; Szibor et al., 2005). Haplotype frequencies for the seven clusters in 94 male samples are displayed in the Table S1. The most common haplotypes in the Uygur ethnic minority were observed at DXS10134-DXS7423 (H36-15) and DXS7424-DXS101 (H16-24) with a frequency of 0.1064, followed by DXS10134-DXS7423 (H36-14, H37-15) and DXS6809-DXS6789 (H32-21) at 0.0957. Haplotype diversities for the seven cluster groups were 0.9954, 0.9977, 0.9920, 0.9556, 0.9792, 0.9669 and 0.9634, respectively.

3.4 Interpopulation differentiation

Population differentiation between the Uygur and 11 other previously published groups was analyzed by the means of AMOVA based on the allelic frequencies of 9 overlapping STR loci. The Fst and P-values of these loci are shown in the Table S2. The results showed that statistically significant differences (P<0.05) were observed between the Uygur and Greenlander (Tomas et al., 2012) and Somali (Tomas et al., 2012) at 8 loci; East Timor (Moreira et al., 2015), Japanese (Uchigasaki et al., 2013), Shenyang Han population (Uchigasaki et al., 2013) and Dane (Tomas et al., 2012) at 6 loci; Shanghai Han population (Zhang et al., 2012) at 5 loci; Bhil tribal population (Shrivastava et al., 2015) and Taiwanese (Chen et al., 2014) at 4 loci; Malay (Samejima et al., 2012) and Guangdong Han population (Zeng et al., 2011) at 3 loci. The results of AMOVA demonstrated that the allelic frequency in most of these loci distributed differently among different ethnic groups. Therefore, more studies on STR loci in different ethnic groups should improve understanding of the genetic relationships between the different ethnic groups and their genetic backgrounds.

4 Conclusions

In summary, the 19 X-STR loci were used to investigate the genetic polymorphisms of the Uygur ethnic minority and the population differentiations between the Uygur group and other populations. The value of cumulative PDF, PDM, and PE showed that these 19 X-STR loci could be used as complement for the application of autosomal STR. Moreover, these results provided basic data for population genetics and forensic science research.

Compliance with ethics guidelines

Yu-xin GUO, Jian-gang CHEN, Yan WANG, Jiang-wei YAN, Jing CHEN, Tian-hua YAO, Li-ping ZHANG, Guang YANG, Hao-tian MENG, Yu-dang ZHANG, Ting MEI, Yao-shun LIU, Qian DONG, and Bo-feng ZHU declare that they have no conflict of interest.

All procedures followed were in accordance with the ethical standards of the responsible committee on human experimentation (institutional and national) and with the Helsinki Declaration of 1975, as revised in 2008 (5). Informed consent was obtained from all volunteers for being included in the study.

List of electronic supplementary materials

Table S1 Haplotype diversities of seven clusters in Uygur male individuals (n=94)

Table S2 Fst and P-values for allele frequency distribution between Uygur ethnic group and the other groups previously published