Porcine epidemic diarrhea virus (PEDV), a member of the family Coronaviridae, is an enveloped, single-stranded RNA virus [13]. It causes a devastating enteric disease with acute diarrhea, dehydration and significant mortality in swine, thereby incurring heavy economic losses in Asia [4, 5]. Although serologically unrelated, PEDV and transmissible gastroenteritis virus (TGEV), cause digestive tract infections which are extremely difficult to differentiate clinically [68]. Both viruses belong to the family Coronaviridae.

The spike (S) gene of TGEV is an important site of virus neutralization [911]. In addition, it is known that determinants that confer TGEV enteropathogenicity are associated with the S gene [12]. Similarly for mouse hepatitis virus (MHV), it has been reported that mutations or deletions in its S gene markedly affect its neurovirulence [13, 14].

Genetic changes have been reported in the S gene of cell culture-adapted PEDV [15, 16]. These changes appear to have resulted from passage of the virus through cell cultures. Similarly, in vivo, the pathogenicity of PEDV in piglets was reduced through serial passage in Vero cell cultures [16]. Moreover, the S gene has been suggested as an important determinant for PEDV biological properties.

Reading in the 5′–3′ direction, the PEDV genome contains genes for pol1 (P1) protein, spike (S) protein, an open reading frame (ORF3), envelope (E) protein, membrane (M) protein and nucleocapsid (N) protein [10, 1721]. Among the proteins encoded by these genes, S protein, a glycoprotein peplomer (surface antigen) on the viral surface, plays an important role in binding to specific host cell receptor glycoproteins with subsequent penetration into the cells occurring via membrane fusion. The S protein also stimulates induction of neutralizing antibodies in the host [10].

The PEDV DR13 strain, which is highly adapted to cell culture, exhibited reduced pathogenicity and induced immunogenicity in pigs [16]. These changes may have resulted from adaptation and attenuation through serial passage in Vero cell cultures [16, 22]. Although unexpected, this attenuation of PEDV DR13 through serial passage may be of strategic interest.

In this study, we constructed DNA clones of the PEDV DR13 S gene. In order to elucidate the genetic basis of the markedly different wild type and attenuated PEDV phenotypes, the nucleotide and deduced amino acid sequences of the S gene were determined and were further analyzed and aligned with those of reference PEDVs [16]. Furthermore, phylogenetic trees were constructed and analyzed on the basis of the S gene nucleotide and deduced amino acid sequences. The similarities and differences between reference PEDVs and attenuated PEDV DR13 were elucidated. This analysis helped to elucidate the phylogenetic relationships between attenuated PEDV DR13 and other PEDV strains.

The continuous Vero cell line (ATCC, CCL-81) was regularly maintained in α-minimum essential medium (α-MEM) supplemented with 5% fetal bovine serum, penicillin (100 units/ml), streptomycin (100 μg/ml) and amphotericin B (0.25 μg/ml).

Porcine epidemic diarrhea virus strain named DR13 was isolated from the intestinal tissues of piglets suspected with porcine epidemic diarrhea (PED), which had been submitted to the Department of Veterinary Medicine Virology Laboratory, College of Veterinary Medicine, Seoul National University, Seoul, Korea. Intestinal samples were made into 10% (v/v) suspensions through homogenization with phosphate buffered saline (PBS; 0.1 M, pH 7.2). The suspensions were vortexed and clarified by centrifugation for 10 min at 4,800 x g. Supernatants passed through a 0.2 μm syringe filter (Acrodisk, Gelman) were used for virus isolation in Vero cells. Prior to inoculation, the growth media of confluent cells grown in 25-cm2 flasks (Falcon, USA) were removed and the cells were washed three times with PBS (pH 7.4). Cells were inoculated with 1 ml per flask of the supernatants. After adsorption at 37°C for 1 h, the cells were incubated in α-MEM supplemented with 0.02% yeast extract, 0.3% tryptose phosphate broth, and 2 μg of trypsin as described previously [16]. Serial passages of the DR13 isolate of PEDV were continued in a 25-cm2 flask by level 100 according to the method described above. PEDV was identified by RT-PCR [23].

Infected cell cultures were prepared for the extraction of viral RNA. Infected cells were harvested when the cells reached 70–80% cytopathic effect (CPE). RNA was extracted from infected cells using TRIzol LS (Invitrogen Corp., Carlsbad, CA) according to the manufacturer’s instructions. For PEDV-infected cells, 250 μl suspensions were lysed directly in 1.7 ml microtubes by adding 750 μl TRIzol LS reagent. Then 200 μl of chloroform was added to the mixture, and the suspension was centrifuged for 10 min at 12,000 x g. The RNA-containing aqueous phase was precipitated with isopropanol of the same volume, maintained at −70°C for 2 h, and centrifuged for 10 min at 12,000 x g. The RNA pellet was washed with 1 ml of 75% ethanol, centrifuged for 10 min at 12,000 x g, and dried, following which it was resuspended in 30 μl of diethyl-pyrocarbonate (DEPC)-treated deionized water.

Pairs of sense and antisense primers were designed and aligned based on the nucleotide sequence of the S gene of CV777 and Br1/87 [10, 15] from the GenBank database (National Center for Biotechnology Information, USA). These primers were used to generate cDNA for the S gene of attenuated PEDV DR13. The nucleotide sequences and relative position of the primers are shown in Table 1 and Fig. 1, respectively.

Table 1 Primers used in cDNA synthesis of the PEDV S gene
Fig. 1
figure 1

Construction of cDNA clones for the full-length S gene of attenuated PEDV DR13 by RT-PCR using pairs of sense (SF) and antisense (SR) primers: diagrammatic representation of the S gene of viral RNA (long solid rectangle) and S gene ORF (long open rectangle) show primer-binding sites (small open rectangles). Three DNA fragments amplified by RT-PCR and cloned into the pDrive Cloning Vector are denoted as recombinant DNA clones Sfrag I, Sfrag II and Sfrag III

RT-PCR was performed using a Maxime RT-PCR PreMix Kit (iNtRON BIOTECHNOLOGY, Korea), according to the manufacturer’s instructions.

Briefly, for RT-PCR, 2 μl aliquots of extracted RNA and 2 μl of each specific primer (10 pmol) were added into the Maxime RT-PCR PreMix tubes and brought to 20 μl with autoclaved, filtered (0.2 μm) distilled water. RT-PCR was performed using a commercial amplification system (Perkin–Elmer, Applied Biosystems, Foster City, Calif) and employed a program of 1 cycle of 30 min at 45°C and 5 min at 94°C and 40 cycles of 1 min at 94°C, 1 min at 57°C and 2 min at 72°C, and a final extension at 72°C for 5 min. RT-PCR products were visualized by electrophoresis in a 1.5% agarose gel containing ethidium bromide. Bands of the correct size were excised and purified using a QIAquick Gel Extraction Kit (QIAGEN) according to the manufacturer’s instructions.

Purified RT-PCR products corresponding to the S gene were cloned using a QIAGEN PCR Cloningplus Kit (QIAGEN) according to the manufacturer’s instructions with simple modifications.

For cloning of cDNA, 4 μl of purified RT-PCR product, 1 μl of pDrive Cloning Vector (50 ng/ul) and 5 μl of 2× ligation Master Mix were mixed gently and incubated for 4 h at 16°C. The ligation-reaction mixture was then subjected to the transformation protocol, which renders cells competent through heat-shock. For transformation, a number of tubes of QIAGEN EZ Competent Cells were thawed on ice and SOC medium was warmed to room temperature following which 5 μl of ligation-reaction mixture was added per tube of cells, mixed gently for 3 s and incubated on ice for 30 min. The tubes were heated in a 42°C water bath for 90 s and incubated on ice immediately. Room temperature SOC medium (250 μl) was added to each tube and 100 μl of each transformation mixture was immediately plated onto LB agar plates containing ampicillin. The plates were incubated at room temperature until the transformation mixture had absorbed into the agar, following which they were inverted and then incubated at 37°C overnight. Colonies grown in LB agar plates were cultured in LB broth with shaking at 37°C overnight, and DNA was extracted using the Wizard® Plus Minipreps DNA Purification System (Promega). Restriction enzyme digestion, with enzymes such as EcoRI, followed by electrophoresis through 1.5% agarose gels was employed for identification of recombinant DNA clones.

All S gene recombinant DNA clones were sequenced by Genotech Co. Ltd. (Korea). All sequencing reactions were performed in duplicate and all sequences were confirmed by sequencing both strands.

Nucleotide and deduced amino acid sequences were analyzed with the CLUSTALX v1.83 program and MegAlign software (DNAStar Inc., Madison, WI, USA) for alignment and sequence analysis. S gene nucleotide and deduced amino acid sequences were compared with the PEDV CV777 (GenBank Accession No. AF353511), Br1/87 (EMBL Accession No. Z25483), JS-2004-2 (GenBank Accession No. AY653204), Spk1 (GenBank Accession No. AF500215), Chinju99 (GenBank Accession No. AY167585) and parent DR13 (GenBank Accession No. DQ862099) strains.

Nucleotide and deduced amino acid sequences were edited and aligned with the CLUSTALX v1.83 and Bioedit v7.0.5.2 programs. The resulting subsets were edited manually. A phylogenetic tree was then generated using an alignment of S gene nucleotide and deduced amino acid sequences with the above-mentioned reference PEDVs by applying the neighbor-joining method in the MEGA 3.1 program. To assess the relative support for each clade, bootstrap values were calculated from 1,000 replicate analyses and the cut-off point for bootstrap replication was 70%.

To synthesize ds-cDNA of the attenuated PEDV DR13 S gene, three overlapping DNA fragments were amplified by RT-PCR using a proper pair of sense (SF) and antisense (SR) primers. The DNAs, designated as Sfrag I (1,654 bp), Sfrag II (1,593 bp) and Sfrag III (1,422 bp) were each cloned into the pDrive Cloning Vector DNA (Fig. 1) and subjected to sequencing.

Alignment of nucleotide and deduced amino acid sequences is presented in Fig. 2. This revealed that the nucleotide sequence encoding the entire attenuated PEDV DR13 S gene is 4,159 bases in length and contains a single 4,149-base ORF starting with an initiator, ATG, at position 11 nt and ending with a terminator, TGA, at position 4,157 nt. The coding region of the gene has 142 (146) and 141 nucleotide mismatches compared to CV777 (Br1/87) and parent DR13, and 3 missing nucleotides compared to CV777, Br1/87 and DR13, respectively. It consists of 1,011 adenine (24.31%), 842 cytosine (20.25%), 879 guanine (21.13%) and 1,427 thymine (34.31%) nucleotides, and has a GC content of 41.38%.

Fig. 2
figure 2figure 2figure 2figure 2

Comparison of the (a) nucleotide and (b) deduced amino acid sequence of the S gene of attenuated PEDV DR13 with CV777 (GenBank Accession No. AF353511), Br1/87 (EMBL Accession No. Z25483) and parent DR13 (GenBank Accession No. DQ862099) strains. Asterisks represent (a) nucleotides and (b) amino acids that are identical to those in the attenuated PEDV DR13. Dashed lines represent missing (a) nucleotides and (b) amino acids compared to the PEDV CV777, Br1/87 and parent DR13 strains. Start codon ATG and stop codon TGA are underlined. Only the (a) 142 (146) and 141 nucleotides, and (b) 59 (63) and 59 amino acids of CV777 (Br1/87) and parent DR13 which mismatched those of attenuated PEDV DR13 are included. Three missing nucleotides and one missing amino acids compared to the PEDV CV777, Br1/87 and parent DR13 strains are included. Regions corresponding to the six primers used for cloning are underlined and labeled above the sequence as SF1-3 and SR1-3. Asn-Xaa-Ser/Thr sequons in the sequence are underlined and bold letters indicate asparagine residues that are predicted to be N-glycosylated

The attenuated PEDV DR13 S gene encodes a protein of 1,382 amino acids with predicted M r of 151 kDa. There are 28 Asn-Xaa-Ser/Thr sequons and 21 asparagine residues that are predicted to be N-glycosylated in the protein. The attenuated PEDV DR13 S protein has 59, 63 and 59 amino acid mismatches compared to CV777, Br1/87 and parent DR13, respectively, and 1 missing amino acids compared to CV777, Br1/87 and parent DR13. There is also a stretch of highly hydrophobic residues at positions 1,327–1,347 (>1.6 on the Kyte-Doolittle scale). Maximum value was 3.978 at position 1,333 and minimum was −2.444 at position 914.

Nucleotide and deduced amino acid sequence homology results are described in Table 2. We found that the attenuated PEDV DR13 S gene shares 96.5, 96.4, 96.1, 93.9, 93.5 and 96.6% DNA sequence identities with CV777, Br1/87, JS-2004-2, Spk1, Chinju99 and parent DR13, respectively. Likewise, it shares 95.7, 95.4, 95.6, 92.0, 91.6 and 95.7% homologies with the deduced amino acid sequences of the same genes.

Table 2 Nucleotide and deduced amino acid sequence homology of the S gene of attenuated PEDV DR13 and reference PEDVs

Phylogenetic trees were generated on the basis of nucleotide and deduced amino acid sequences (Fig. 3). The left hand phylogenetic tree (Fig. 3a) was generated based on nucleotide sequences and the right hand tree (Fig. 3b) was based on deduced amino acid sequences. While these phylogenetic trees did differ slightly, overall they showed high similarity. In brief, all seven PEDVs, which were used for comparison, including attenuated PEDV DR13, fell into two groups. One group comprised CV777, Br1/87, JS-2004-2, parent DR13 and attenuated DR13. The second group consisted of Spk1 and Chinju99. The group containing CV777, Br1/87, JS-2004-2, parent DR13 and attenuated DR13 had two subgroups. Attenuated PEDV DR13 formed one subgroup with JS-2004-2 and the others formed another subgroup.

Fig. 3
figure 3

Phylogenetic trees generated on the basis of (a) nucleotide and (b) deduced amino acid sequences of the S gene region of attenuated PEDV DR13 and reference PEDVs. Trees constructed with neighbor-joining method using MEGA 3.1 program. Horizontal branch lengths are proportional to genetic distances between PEDV strains. Bootstrap figures are shown in italics for the major nodes. The GenBank Accession Nos. of reference PEDVs for the S gene are AF353511 (CV777), AY653204 (JS-2004-2), AF500215 (Spk1), AY167585 (Chinju99), DQ862099 (parent DR13) and the EMBL Accession No. for Br1/87 is Z25483

The S gene of the attenuated PEDV DR13 strain was successfully cloned and sequenced as a series of three overlapping cDNA clones. The sequencing results showed a single large ORF of 4,149 nucleotides encoding a protein of 1,382 amino acids with a predicted M r of 151 kDa. A single ORF of 4,149 nucleotides, with the potential to encode the coronavirus S protein, was identified [24]. The PEDV DR13 S gene had a sequence (GUAAAC) of 8 nucleotides upstream of the initiator ATG, as previously recognized in Br1/87 [10]. This sequence is a hexameric motif common to coronaviruses and is similar to the hexameric motifs XUA(A/G)AC found adjacent to other PEDV ORFs. These hexameric motifs have been proposed as a starting site for the transcription of the subgenomic mRNAs [24].

Previous studies showed that wild type and cell culture adapted PEDV exhibit remarkably different phenotypes in terms of pathogenicity in piglets [16, 25]. Moreover, those two PEDV types have 5 nucleotide differences within their S gene coding sequences, and all of those changes are meaningful in that they produce changes in the predicted amino acid sequence [15]. However, these regions may not be crucial for pathogenicity, because these 5 nucleotide changes were not found in the attenuated PEDV DR13 strain.

The coding region of the S gene of attenuated PEDV DR13 has nucleotide and amino acid differences compared to CV777, Br1/87 and parent DR13 as described above. Out of all differences, 50 nucleotide and 20 amino acid changes of the attenuated PEDV DR13 appear to be meaningful because we reveal other changes found in wild type PEDV, including JS-2004-2, Spk1, Chinju99 and parent DR13. Notably, only 20 nucleotide changes, however, are thought to be significant for pathogenicity because they lead to changes in the predicted amino acid sequence of attenuated PEDV DR13. In addition, attenuated PEDV DR13 exhibited reduced pathogenicity in pigs when subjected to a high number of serial passages in Vero cell cultures [16].

The attenuated PEDV DR13 S protein was found to contain 28 Asn-Xaa-Ser/Thr sequons, 21 asparagines predicted to be N-glycosylated and a region of highly hydrophobic residues at positions 1,327–1,347, which is predicted to form an α-helix and to function as a membrane anchor. Similar to attenuated PEDV DR13, the Br1/87 S protein has 29 potential N-linked glycosylation sites and a hydrophobic stretch at positions 1,322–1,337 [10]. Although CV777 was a little different, it did contain 29 potential N-linked glycosylation sites [15]. Prediction of N-glycosylation sites using the ExPASy (Expert Protein Analysis System) Proteomics Server of the Swiss Institute of Bioinformatics (SIB) revealed that CV777 S protein contains 29 Asn-Xaa-Ser/Thr sequons and 22 asparagines that are predicted to be N-glycosylated. The Br1/87 and parent PEDV DR13 S proteins are a little different but still have 29 Asn-Xaa-Ser/Thr sequons and 22 asparagines that are predicted to be N-glycosylated. In the case of attenuated PEDV DR13, two (from N to K at 378, from T to I at 1,260) of the changes in the predicted amino acid sequence destroy N-linked glycosylation sites, while another change (from N to S at 114) creates a new glycosylation site when it compare to CV777 and Br1/87, respectively. There are two amino acid changes (from N to K at 378, from N to T at 1,193) destroying N-linked glycosylation sites and another change (from N to S at 114) creating a new glycosylation site when the attenuated PEDV DR13 is compared with parent PEDV DR13. Taken together, it appears that 2 nucleotide changes of 5 changes are thought to be simply strain differences because we reveal 1 nucleotide change (from A to C at 3,588) destroying N-linked glycosylation site through amino acid change (from N to T at 1,193) found in not other wild type PEDV but only parent PEDV DR13, and another change (from C to T, A in parent DR13 at 3,789) destroying N-linked glycosylation site through amino acid change (from T to I, N in parent DR13 at 1,260) found in both parent and attenuated PEDV DR13. Therefore, the fundamental cause of these differences in N-linked glycosylation sites was 3 nucleotide changes and these are suggested to influence the pathogenicity of attenuated PEDV DR13.

Sequence homology analysis and phylogenetic analysis of S genes indicated that attenuated PEDV DR13 was highly homologous to CV777, Br1/87, JS-2004-2 and parent DR13, rather than to Spk1 and Chinju99 at the nucleotide and deduced amino acid sequence levels. In addition, attenuated PEDV DR13 was found to belong to a group that includes CV777, Br1/87, JS-2004-2 and parent DR13. More precisely, attenuated PEDV DR13 formed one subgroup with JS-2004-2. Taken together, it appears that attenuated PEDV DR13 is closely related to CV777, Br1/87, JS-2004-2 and parent DR13, rather than to Spk1 and Chinju99. It is notable that attenuated PEDV DR13 is especially close to the Chinese PEDV strain JS-2004-2 rather than to the Korean PEDV strains Spk1 and Chinju99, even though it is of Korean origin.

In the present study, the complete nucleotide and deduced amino acid sequences of the attenuated PEDV DR13 S gene were determined and compared to reference PEDVs, to find determinants of PEDV pathogenicity in the S gene. Phylogenetic trees were constructed and analyzed according to S gene nucleotide and deduced amino acid sequences. Similarities and differences among reference PEDVs, including attenuated PEDV DR13, were demonstrated, and these helped to elucidate the phylogenetic relationship of attenuated PEDV DR13 to other PEDVs. Moreover, the complete nucleotide and deduced amino acid sequences of the attenuated PEDV DR13 S gene will now form the basis for further functional exploration of both wild type and attenuated PEDV.