Introduction

Porcine epidemic diarrhea (PED), caused by porcine epidemic diarrhea virus (PEDV), is an acute and highly contagious enteric disease in swine. The disease is characterized by severe enteritis and watery diarrhea, leading to death with a mortality rate of up to 80%. PED was first reported in England in 1971 [1], since then it has been reported in many other swine-producing countries [2]. In China, PED was first reported in 1976 [3]. So far, it has been prevailing in many provinces and become one of the most important viral diarrhea diseases in China, resulting in heavy economic losses in swine industry.

PEDV was first reported in Belgium and the United Kingdom in 1978 [4, 5]. It is a member of Group 1a, genus Coronavirus, family Coronaviridae within the Order Nidovirales. PEDV is an enveloped, single-stranded, positive-sense RNA virus. The genome of it is approximately 28 kb in length and encodes four mainly structural proteins, which are the spike (S) protein, the membrane (M) protein, the envelope (E) protein, and the nucleocapsid (N) protein [68].

PEDV occurs frequently in swine-raising farms in spite of using the vaccine strategy at present, damage caused by PEDV infection is continuous and serious in many provinces in China. According to an accurate analysis on genome of prevalent PEDVs, we can prevent outbreaks of the PEDV-induced diarrhea more effectively and choose the correct PEDV vaccine strain. By the phylogenetic analysis of the partial spike glycoprotein genes, the Korean PEDV isolates were divided into three groups (G1, G2, G3), which had three subgroups (G1-1, G1-2, G1-3) [9]. Like S glycoprotein gene, M protein gene is also an important component in the phylogenetic analysis of the epidemiological situation of PEDV in the field. On the basis of sequence analysis of M protein gene, Korean PEDV isolates were shown to be different from foreign PEDV strains [10]. Therefore, the purpose of the present study was to investigate the diversity among the Chinese PEDV isolates and was to find out more prevalent PEDVs in China by the sequence analysis and the phylogenetic relationship of the M protein genes with Chinese PEDV isolates and reference PEDV strains.

Materials and methods

Isolates

Porcine fecal samples had been taken from piglets showing watery diarrhea and dehydration on six different swine-raising farms of five provinces in China. These fecal samples had been confirmed positive for PEDV by Anigen Rapid PED Ag Test Kit (Animal Genetics Inc., Korea). PEDV positive fecal samples were diluted with phosphate-buffered saline (PBS; 0.1 M, pH 7.2) to be 10% (V/V) suspensions. The suspensions were vortexed and clarified by centrifugation for 10 min at 5,000 rpm. The supernatants were collected for the amplification of M protein gene by RT-PCR. The isolates, used for nucleotide sequence analysis, amino acid sequence analysis and phylogenetic analysis in this study, were described in Table 1.

Table 1 Six Chinese PEDV isolates used for sequence alignment, sequence analysis, and phylogenetic analysis

Extraction of viral RNA

The supernatant (200 μl) containing PEDV was mixed with 400 μl TRIzol Reagent (Invitrogen Corp., Carlsbad, CA) and RNA was isolated according to the manufacturer’s instructions. The RNA was air-dried for 10 min and redissolved in 15 μl diethyl-pyrocarbonate (DEPC)-treated deionized water.

Primers

Published primers [11] designed based on the published sequence of M gene were used for generating the full M genes of PEDV isolates. Briefly, P1 (5′-CCCCAGTACTGTTATTGACGTATAAAC-3′) and P2 (5′-GTTTAGACTAAATGAAGCACTTTC-3′) were used for PCR to amplify the M gene. The size of amplified products was 715 bp. P3 (5′-GCCATAAAGTTTCTGTTTAGACTAA-3′) was used as the primer for the synthesis of complementary DNA.

Synthesis of cDNA

For reverse transcription, 15 μl of extracted viral RNA was mixed with 1 μl P3 and incubated at 70°C for 10 min followed by 2 min on ice. After adding a reaction mixture consisting of 5 μl 5× First Strand Buffer (Fermentas International Inc., Ltd., USA), 2 μl 2.5 mM dNTPs (TaKaRa Biotechnology Co., Ltd., Dalian, China), 100 units of M-MLV reverse transcriptase (Fermentas International Inc., Ltd., USA), 20 units RNAsin (TaKaRa Biotechnology Co., Ltd., Dalian, China), the mixture was incubated at 37°C for 2 h. The reaction was terminated by heating at 98°C for 6 min and chilling on ice. The cDNA was either stored at −70°C or amplified immediately.

PCR

In PCR, P1 and P2 were used to amplify the M genes of six isolates. Exactly, 1 μl cDNA was mixed with a reaction mixture containing 2.5 μl 10× Ex Taq buffer (TaKaRa Biotechnology Co., Ltd., Dalian, China), 2 μl 2.5 mM dNTPs (TaKaRa Biotechnology Co., Ltd., Dalian, China), 0.5 μl each specific primer (10 pmol), 0.25 μl Ex Taq DNA polymerase (TaKaRa Biotechnology Co., Ltd., Dalian, China), and 18.25 μl sterile deionized water. The amplification was performed with 30 cycles of denaturation at 95°C for 1 min, annealing at 50°C for 1 min, and extension at 72°C for 1 min, followed by a final extension step of 72°C for 10 min. PCR products were analyzed on 1% agarose gels and were purified using AxyPrep™ Gel Extraction Kit (Axygen Scientific, Inc., USA) according to the manufacturer’s instructions.

Cloning and sequencing

Purified PCR products were cloned into a T-tailed vector, pMD18-T (TaKaRa Biotechnology Co., Ltd., Dalian, China) and transformed using JM109 competent cells (TaKaRa Biotechnology Co., Ltd., Dalian, China) according to the manufacturer’s instructions. Exactly, for cloning of cDNA, 9 μl of purified PCR products, 1 μl of pMD18-T (50 ng/μl), and 10 μl of 2× Solution I were mixed gently and incubated for 6 h at 16°C. The ligation-reaction mixture was then subjected to the transformation protocol, which renders cells competent through heat-shock. For transformation, a tube of 100 μl JM109 competent cells was thawed on ice following which 20 μl of ligation-reaction mixture was added to the tube, mixed gently and incubated on ice for 30 min. The tube was heated in a 42°C water bath for 60 s and incubated on ice for 2 min immediately, then 800 μl room temperature SOC medium was added to the tube and mixed gently, incubated at 37°C for 1 h with vigorous shaking. 200 μl of each transformation mixture was immediately plated onto LB agar plate containing ampicillin. The plate was incubated at room temperature until the transformation mixture had been absorbed into the agar, following which it was inverted and then incubated at 37°C overnight. Colonies grown in LB agar plate were cultured in LB medium with vigorous shaking at 37°C overnight, and DNA was extracted using the AxyPrep™ Plasmid Miniprep Kit (Axygen Scientific, Inc., USA). PCR and restriction enzyme digestion, with enzymes such as EcoRI, followed by electrophoresis through a 1.0% agarose gels was employed for identification of recombinant DNA clones.

Three independent DNA clones of each isolate were sequenced by using M13 sequencing primers by Shanghai Sangon Biological Engineering Technology & Services CO., Ltd. (China). All sequencing reactions were performed in duplicate and all sequences were confirmed by sequencing both strands.

Sequence analysis

Nucleotide and deduced amino acid sequences of six PEDV isolates were aligned, edited and analyzed with DNAMAN software (Version 5.2.2, Lynnon BioSoft. Inc., USA), and MegAlign software of Lasergene (version 7.1.0, DNASTAR Inc., USA). A phylogenetic tree was then generated using an alignment of M glycoprotein gene nucleotide sequences with reference PEDV strains by Jotun Hein Method in MegAlign software. The reference strains used for sequence alignment, sequence analysis, and phylogenetic analysis with Chinese PEDV isolates were described in Table 2.

Table 2 Ten PEDV reference strains used for sequence alignment, sequence analysis, and phylogenetic analysis

Results

Sequence analysis

The M protein genes of six Chinese PEDV isolates consisted of 692 nucleotides containing a single open reading frame (ORF) of 681 nucleotides, which encoded a 226aa-long peptide. There were no nucleotides deletion or insertion in the ORFs of six isolates. Six isolates had a conserved intergenic motif (ATAAAC) of 5 nucleotides upstream of the initiator ATG, as previously recognized in Br1/87 [11]. The hexamer motif was also found in CV777, JMe2, LZC, and QH. There was a five-nucleotide conserved sequence (GAAAT) between the conserved intergenic motif and the initiator ATG of M protein genes of six isolates. The five-nucleotide conserved sequence was found in the corresponding regions in CV777, JMe2, LZC, and QH, too. Compared to PEDV CV777, CH/JSX/06 had 4 nucleotide mutations, CH/IMT/06 had 6 nucleotide mutations, CH/HLJH/06 had 7 nucleotide mutations, CH/IMB/06, CH/HNCH/06, and CH/SHH/06 had 8 nucleotide mutations (Fig. 1).

Fig. 1
figure 1figure 1figure 1

Comparison of the nucleotide sequences of the membrane glycoprotein genes of Chinese PEDV isolates and PEDV reference strains by DNAMAN software. The dashes (–) indicate regions where the sequences are identical to those of CV777. The dots (·) indicate unavailable sequence. The asterisk (*) indicates the numbers are calculated from the start codon. The conserved intergenic motif is in shade, the start codon and the stop codon is underlined, respectively

The nucleotide changes would lead to changes in the predict amino acid sequences of PEDV isolates. Compared to PEDV CV777, CH/IMB/06, CH/HNCH/06, and CH/SHH/06 had 1 amino acid change (from E to Q at 13), CH/IMT/06 had 2 amino acid changes (from S to F at 5, from N to D at 167), CH/HLJH/06 had 2 amino acid changes (from V to A at 42, from A to S at 214). Although CH/JSX/06 had nucleotide changes, it had no amino acid change (Fig. 2). The M proteins of six Chinese PEDV isolates were 226 amino acid residues in length. The M proteins of five isolates had two potential N-linked glycosylation sites in theirs exposed amino-terminal domain, excepting for that of CH/IMT/06, which had only one N-linked glycosylation site because 1 nucleotide change (from C to T at 14) (Fig. 1) destroying N-linked glycosylation site through amino acid change (from S to F at 5) (Fig. 2) (predicted by http://www.cbs.tdu.dk/CBS Prediction Servers/NetNGlyc 1.0 Server). The glycosylation sequence Asn-Phe-Thr (NFT) was highly conserved in the M proteins of six isolates. The M proteins of six isolates were highly hydrophobic, and had four hydrophobic domains alternating with short hydrophilic regions in the amino-terminal half of the protein. The locations of three main transmembrane regions were aa20–38, aa43–65 and aa75–97, respectively, but the location of the fourth hydrophobic domain was not shown (predicted by http://www.cbs.tdu.dk/CBS Prediction Servers/TMHMM Server v. 2.0). In the center of the M protein, directly adjacent to the third hydrophobic domain, was a stretch of eight amino acids (SWWSFNPE) (Fig. 2), which was well conserved in the coronavirus M proteins [12].

Fig. 2
figure 2

Comparison of the deduced amino acid sequences of the membrane glycoprotein genes of Chinese PEDV isolates and reference PEDV strains by DNAMAN software. The dashes (–) indicate regions where the sequences are identical to those of CV777. The potential glycosylation sites predicted by NetNGlyc 1.0 Server are shaded. Three transmembrane domains (aa20–38, aa43–65, and aa75–97) predicted by TMHMM Server v. 2.0 are indicated by single underline, respectively. The conserved stretch of eight amino acids (SWWSFNPE) is indicated by double underline

Sequence homology analysis

Nucleotide and deduced amino acid sequence homology results are described in Table 3. The six Chinese PEDV isolates have 98.8–100% DNA sequence identities with each other. Among the isolates, CH/IMB/06, CH/HNCH/06, and CH/SHH/06 have 100% sequence homologies with each other, while CH/IMT/06 has 98.8% homology with them. Six isolates have 97.2–99.4% DNA sequence identities with ten PEDV reference strains. CH/HLJH/06 has the lowest sequence identities (97.2%) with strain LJB/03, and CH/JSX/06 has the highest sequence identities (99.4%) with strain CV777. The nucleotide identities of six isolates and four Chinese PEDV strains, JS-2004-2, LJB/03, LZC, and QH, are less than 90.0%. The isolates have 98.2–100% homologies with deduced amino acid sequences of each other and have 96.9–100% homologies with deduced amino acid sequences of ten reference strains.

Table 3 Comparison of the nucleotide and deduced amino acid sequences of the membrane protein genes of Chinese PEDV isolates and PEDV reference strains

Phylogenetic relationship

The Chinese PEDV isolates and ten PEDV reference strains were divided into three clusters based on the phylogenetic relationship of the nucleotide sequences of M protein genes (Fig. 3). One cluster (I) comprised three Korean strains (KPEDV-9, KPEDV-9F, Chinju99), two Chinese strains (LJB/03, QH), and one Japanese strain JMe2. The second cluster (II) consisted of CV777, Br1/87 and one Chinese strain LZC. The third cluster (III) was composed of six Chinese isolates and one Chinese strain, JS-2004-02.

Fig. 3
figure 3

Phylogenetic relationship based on the membrane glycoprotein genes of Chinese PEDV isolates and PEDV reference strains. Phylogenetic tree generated by Jotun Hein Method in MegAlign software is a rooted tree. The Chinese PEDV isolates are in bold

Discussion

The M protein is a multispanning membrane protein with a small, amino-terminal domain located on the exterior of the virion, or, intracellularly, in the lumen of the endoplasmic reticulum (ER). The ectodomain is followed by three transmembrane segments and then a large carboxy terminus comprising the major part of the molecule. This latter domain is situated in the interior of the virion or on the cytoplasmic face of intracellular membranes. The M protein is the most abundant viral constituent and is almost invariably glycosylated. Intriguingly, whereas the group 1 and group 3 coronaviruses—with transmissible gastroenteritis virus (TGEV) and infectious bronchitis virus (IBV) as important representatives, respectively—all contain M proteins with only N-linked sugars, the M proteins of group 2 coronaviruses such as mouse hepatitis virus (MHV) are O-glycosylated [13], An exception is MHV-2, the M protein of which carries both O- and N-linked sugars [14].

The M protein plays a central role in coronavirus assembly. Together with the minor E protein it is responsible for the assembly of the coronavirus envelope [1517]. In addition, the M protein directs the incorporation of the S protein [18, 19] and the nucleocapsid [20] into the budding particle.

In the present study, the complete nucleotide and deduced amino acid sequences of M protein genes of six Chinese PEDV isolates were determined and compared with ten PEDV reference strains in order to find genetic diversity among prevailing PEDVs in China. Although PEDV is known to have only one serotype at present, the genome of PEDV is widely to show genetic diversity, according to the analysis results using molecular biological methods [2123]. The M protein genes of four Chinese strains (LJB/03, JS-2004-2, LZC, and QH) have been sequenced and reported. Although the phylogenetic analysis based on M protein gene of LJB/03 was reported [24], it could not reflect the phylogenetic relationship among the PEDV field strains in China. Although the bi-combined attenuated vaccine against TGEV and PEDV infection is authorized to use in swine farms, PED still occurs in the immunized swine herds in China. So it is necessary to further investigate the prevalence status of PEDV based on the phylogenetic analysis of M protein genes of PEDV field strains in China.

There are no insertion and deletion in the M protein genes of six Chinese PEDV isolates except point mutations. These point mutations may lead to genetic diversity among these isolates. Our findings show that Chinese PEDV isolates are genetically diverse in their M protein genes, both among themselves and as compared with reference strains. The hexamer motif (ATAAAC), which locates at the 5′ end of M protein gene, are conserved among the six isolates. They are identical to those found in the corresponding sites in strains CV777, Br1/87, JMe2, LZC, and QH. The M proteins of six isolates have four hydrophobic domains alternating with short hydrophilic regions in the amino-terminal half of the protein, and the forth hydrophobic domain may has functions as a cleavable signal peptide. In the center of M proteins of six isolates, directly adjacent to the third hydrophobic domain, are also the stretches of eight amino acids (SWWSFNPE) that are well conserved as mentioned by de Haan [12]. The coronavirus M protein is almost invariably glycosylated in its exposed amino-terminal domain. N-linked glycosylation is initiated in the endoplasmic reticulum by the cotranslational linkage of a large oligosaccharide structure to the polypeptide at asparagine residues within the consensus sequence NXS/T (where X is any amino acid). By prediction, five isolates have two N-linked glycosylation sites except CH/IMT/06 that has one N-link glycosylation site. The M proteins of group1 and 3 coronaviruses and SARS-CoV all contain with only N-linked sugars. The glycosylation status of the MHV M protein was found to influence the ability of the virus to replicate in the liver but not in the brain [25], yet that of PEDV M protein has not been reported.

Sequence homology of M protein genes indicated that six Chinese PEDV isolates were highly homologous to CV777, Br1/87, Chinju99, JMe2, and JS-2004-2 rather than to QH, LZC, LJB/03, KPEDV-9, and KPEDV-9F. Although CH/SHH/06, CH/HNCH/06, CH/IMB/06 were isolated in different provinces, they had identical nucleotide sequences. It showed that the three isolates maybe originated from the same ancestor. CH/HLJH/06 was highly homologous to JS-2004-2, but they were isolated in different provinces at different time. Although CH/IMB/06 and CH/IMT/06 were isolated in the same province, the sequence homology between them was lower. These results indicated that the Chinese PEDV isolates had genetic diversity with respect to the M protein genes. By phylogenetic analysis, the six Chinese PEDV isolates with JS-2004-2 formed a separate group, which excluded European strains, Korean strains, Japanese strain and three Chinese strains (QH, LZC, LJB/03). The phylogenetic relationship indicated that Chinese PEDV isolates were different from foreign PEDV strains and a new genotype PEDV was prevailing in China. We expect to prevent outbreaks of the PEDV-induced diarrhea more effectively, according to choosing the more efficient PEDV vaccine strain by accurate analysis on genome of prevalent PEDVs in China. Moreover, genetic variation existence in M protein genes of Chinese PEDV isolates and more prevalent PEDVs discovery in China will form the basis for further functional exploration of PEDV.