Introduction

Progressive familial intrahepatic cholestasis type 3 (PFIC3) is a very rare autosomal-recessive human disease (Online Mendelian Inherited in Man (OMIM) no. 602347), which causes a chronic and progressive cholestatic liver disease in patients with high γ-glutamyltranspeptidase activity (γ-GT) levels and histologic evidence of bile duct proliferation.1 Age of onset may range from 1 month to 20.5 years, symptoms may include jaundice, pruritus, or there may be only biochemical cholestasis. Most patients progress to cirrhosis, portal hypertension and liver failure and often require liver transplantation2 even during the first years of life. Response to treatment with ursodeoxycholate is variable.

PFIC3 is caused by mutations of ATP-binding cassette subfamily B (MDR/TAP) member 4 gene (ABCB4; OMIM*171060), also known as multidrug resistance 3 gene (MDR3), a member of the superfamily of ABC transporters. This ubiquitous molecular family uses energy derived from ATP hydrolysis to efflux a wide range of substrates across the cell membrane.3 The name MDR3 was originally assigned because of the high homology (78% identical residues) with the ABC transporter P-glycoprotein coded by the multidrug resistance 1 (MDR1) gene, also called ABCB1 gene, which shares with MDR3 the same domain organization.4 Both ABC transporters contain four distinct protein domains and their ancestor appears to have arisen by gene duplication.3 ABCB4 polypeptide chain is organized in two repeats, each about 610-amino-acid long, joined by a 60-residue-long linker region. Each repeat contains two structural modules (Figure 1a), a transmembrane (TM) domain (TMD) comprised of six TM α-helices and a cytoplasmatic nucleotide-binding domain (NBD). On the cytoplasmatic side of the protein four small linker peptides, also referred to as intracellular domains (ICDs), connect the TM helices 2–3, 4–5, 8–9 and 10–11; on the extracellular side six short loops (ECs) connect TM segments5, 6, 7 (Figure 1a). The two TMDs contain specific sites for substrate binding and translocation, whereas the two NBDs, which display a high degree of sequence similarity with the equivalent domain of ABC transporters, couple the energy obtained from ATP hydrolysis to substrate transport.8 The ICDs are deemed to be involved in mediating the coupling between NBD conformational changes and the reorientation of TM helices concomitant with substrate extrusion.9

Figure 1
figure 1

(a) Localization of the 29 mutations identified in this study in the ABCB4 protein, schematically represented in its domains. All P-glycoproteins are composed of 12 TM segments (TM1–TM12) organized in two TMDs, connected by six intracellular segments (ICD1–ICD6) and by six short extracellular loops (EC1–EC6). A 60-amino-acid-long linker peptide connects the N terminal to the C terminal TMD–NBD domain. Of the 29 mutations identified in this study, 28% are located in the one of the two NBDs, 34% in TMDs, and 31% in ICDs. The 25 mutations newly identified in this study are marked in bold. Homozygous mutations are underlined; amino-acid numbering derived from ABCB4 isoform B is used throughout this report. (b) Stereo representation of the three-dimensional model of ABCB4, build by homology with the multidrug ABC transporter Sav1866. It is assumed that the N- and C-terminal domains of ABCB4 maintain relative to each other the same position of the two molecules of the (functional) crystallographic Sav1866 dimer. The N- and C-terminal domains are represented in blue and red, respectively. The mutated residues described in this work, which could be mapped in the homology model, are depicted as green spheres. Previously described mutations are represented as yellow spheres.

The ABCB1 gene, one of the most extensively studied ABC transporters, is responsible for the human multidrug resistance phenotype that is a rapidly growing obstacle to the treatment of numerous infectious diseases, including human immunodeficiency10 and malaria.11 The properties of this transporter are also exploited in cancer pharmacological therapy where ABCB1 translocates the chemotherapeutic drugs and other molecules with a broad but defined specificity.12

A gene duplication of ABCB1 and additional mutations selected as advantageous have created in mammals the paralog ABCB4 gene13 that, instead of drugs, translocates phosphatidylcholine (PC) from the inner to the outer leaflet of the canalicular membrane of the hepatocyte. The mammalian ABCB4-mediated PC translocation, also known as floppase activity, makes the phospholipids available for extraction into the canalicular lumen by bile salts.14 In the mouse, ABCB4 floppase activity has been discovered by inactivation of the orthologous Mdr2 gene, which resulted in progressive liver damage and cirrhosis that highly resemble the pathology of PFIC3 patients.15 The pathophysiology resides in lack of phospholipid protection in the bile against the detergent effect of bile salts, resulting in damage to the biliary epithelium, bile ductular proliferation, and progressive portal fibrosis.

The human ABCB4 gene is located on chromosome 7q21.1 and spans approximately 74 kb (GenBank accession no. CH236949, region 10391396…10464818). It contains 27 coding exons and the B isoform of the transcript encodes a 1286-amino-acid-long polypeptide.

To date, about 20 distinct ABCB4 mutations causing PFIC3 have been reported.1, 2, 16 In this study, we describe 29 distinct mutations that have been identified in a cohort of 68 unrelated patients with PFIC3 phenotype. The elucidation of the three-dimensional structure of ABCB4 bacterial homolog makes it possible to map the position of the mutated residues in the protein structure, under the assumption of conserved three-dimensional fold. Although not conclusive, this study contributes to complete the picture of the molecular variability associated to PFIC3 that will eventually provide a clear insight on the molecular mechanisms leading to the onset of the disease.

Materials and methods

Patients and controls

This study includes 68 proband index cases affected by PFIC3 enrolled in a multicenter study coordinated by one of us (CC) and carried out within the Italian Society for Pediatric Gastroenterology, Hepatology and Nutrition.

Inclusion criteria were increased values of γ-GT at presentation, associated with at least one of the following: (a) symptomatic cholestasis with pruritus; (b) jaundice; (c) abnormal alanine aminotransferase (ALT) at presentation. The exclusion of other known causes of cholestatic liver disease in our patient population reduced to 68 the number of patients enrolled for the molecular analysis. Each of the 13 participating centers collected the same clinical information using a common database and sent the blood samples to our laboratory for ABCB4 analysis. Molecular analysis of ABCB4 was also carried out on a control set of 100 healthy subjects.

DNA amplification

The DNA was extracted from peripheral blood by Puregene DNA Isolation Kit (Gentra Systems, Minneapolis, MN, USA) according to the manufacturer's instructions. Mutation analysis of all coding regions of the human ABCB4 gene was performed using intronic primers (Table 1). Exon numbering was used according to Lincke et al.17 Polymerase chain reaction (PCR) was carried out in a 30 μl reaction volume containing 50–100 ng DNA, 1.5 Unit Taq DNA Polymerase (Roche Applied Science, Mannheim, Germany; URL: http://www.roche.com/home/science.htm), 6–30 pmol of each primer, 3.75 nmol dNTP (Finnzymes Company, Espoo, Finland; URL: http://www.finnzymes.fi) and 1.0–2.0 mM MgCl2 in the buffer supplied by the manufacturer. The amplification was performed using a denaturation step at 94°C for 30 s, an annealing step (temperature primer-specific) for 30 s, extension step at 72°C for 30 s, repeated for 35 cycles. PCR products were cleaned up in Microcon® Centrifugal Filter (Millipore Corporation, Billerica, MA, USA; URL: http://www.millipore.com/offices.nsf/home).

Table 1 Oligonucleotides and specific annealing temperature used for DNA amplification of the ABCB4 gene

DNA analysis

Sequence reactions were performed using 5–15 ng of PCR product, 5 pmol of the same primers used for PCR reactions, 1 μl BigDye® Terminator v1.1. Cycle Sequencing kit (Applied Biosystems, Foster City, CA, USA; URL: http://www.appliedbiosystems.com), and 2.5 μl of a dilution buffer (400 mM Tris–HCl, 10 mM, pH 9.0) in a total volume of 10 μl. Thermal cycling was done at 96°C for 60 s followed by 25 cycles of 96°C for 10 s, 50°C for 5 s, and 60°C for 4 min. The sequencing reactions were purified by Montage SEQ96 Sequencing Reaction Cleanup Kit (Millipore Corporation, Billerica, MA, USA; URL: http://www.millipore.com/offices.nsf/home) and separated on a 16-capillary ABI PRISM® 3100 Genetic Analyzer (Applied Biosystems). Sequence data were analyzed using the software Sequencer 4.2 (Gene Codes, Ann Arbor, MI, USA; URL: http://www.genecodes.com). Each individual mutation was confirmed in two independent PCR reactions and sequencings and all mutations were verified in at least two different members of the family. To exclude the possibility that the identified missense mutations could be polymorphisms, we analyzed 200 chromosomes from the general population by nucleotide. The fragments with abnormal chromatograms were sequenced to identify the exact denaturing high-performance liquid chromatography (DHPLC) change.

Sequence and structural analysis

Amino-acid multiple sequence alignment of human ABCB4 protein (NP_061337) was produced, after the inclusion of the paralogous human ABCB1 enzyme (NP_000918), four orthologous Eutheria mammalians ABCB4 proteins (Pan troglodytes (XP_001160982), Mus musculus (NP_032856), Rattus norvegicus (NP_036822) and Bos taurus (XP_602101)), the ABCB4 orthologous Metatheria mammalian Monodelphis domestica or Opossum (from Ensembl release 44: ENSMODP00000032941), the orthologous MsbA proteins from Escherichia coli (AP_001544), Salmonella typhimurium (NP_459959), Vibrio cholera (Q9KQW9) and the ABC transporter Sav1866 from Staphylococcus aureus (2ONJA). The alignment was generated using the ClustalW web server (http://www.ebi.ac.uk/clustalw/). The three-dimensional model of ABCB4 was generated by homology model building with the SwissModel server18 using the three-dimensional structure of Sav1866 determined crystallographically9 (PDB code, 2HYD) and visually analyzed with the O program suite.19 For this purpose, the sequence alignment Sav1866 with either the N- and C-terminal domain of ABCB4 was determined with BL2SEQ program suite.20 Significant sequence homology of the N- and C-terminal ABCB4 domain with Sav1866 spans amino-acid positions 56–600 (41% identical residues) and positions 807–1280 (32% identity), respectively. On the basis of the obtained sequence alignment, the models of the N- and C-terminal ABCB4 domains were built and energy minimized with SwissModel, using the molecule A and B of the crystallographic asymmetric unit, respectively.

Results

The mutation analysis of the human ABCB4 gene carried out on 68 patients from different families led to the identification of 29 distinct mutations, of which 25 are not yet described. The observed mutations include two short insertions, 23 missense mutations and four nonsense mutations. The new sequences were submitted to Genbank (accession numbers are shown in Table 2). Overall, in this study 31 mutated alleles were identified in 18 families, for a total number of 35 mutations. Clinical inclusion criteria of each patient with identified mutations are described in Table 3. In these patients, in addition to increased values of γ-GT at presentation (18/18), other associated signs and/or symptoms were symptomatic cholestasis with pruritus (11/18), abnormal ALT at presentation (11/18) and jaundice (3/18).

Table 2 Mutations identified in ABCB4
Table 3 Clinical inclusion criteria and ABCB4 genotype in our PFIC3 patients with identified mutations

In 13 patients mutations were found on both alleles, whereas in five cases only one mutated allele was identified. Four patients carried a double mutant allele showing a genotype with three mutations, one on the maternal allele and two on the paternal allele (Table 3). In three index cases (IC2, IC7 and IC10) with an affected brother, mutations were concordantly segregating.

In summary, more than half (51.7%) of the observed mutations were located in exons 6, 9, 15 and 17 (Table 2), with highest mutation frequency observed on exon 17. From a structural viewpoint, the mutations described here were located in one of the NBDs (eight cases, accounting for the 27.6% of the total; Figure 1a), in one of the TMDs (10 cases, 34.5%) and in one of the ICDs (nine cases, 31%). Only one mutation was found in the EC4 and one was found in the linker region.

Insertions

Two families showed two distinct short insertions. In one patient, the homozygous insertion c.2169_2170insG (exon 17) was observed, leading to the formation of a stop codon at position 744 (p.L724AfsX744; Table 3); the resulting truncated protein is therefore missing the C-terminal TMD and NBD. In the second case, the insertion c.1135_1136insAA (exon 11) was identified. This mutation leads to the formation of a stop codon at position 413 (p.S379KfsX413) resulting into a truncated protein comprised of the first TMD, with likely impairment of floppase activity; the patient was compound heterozygous with the missense mutation c.2102T>C (p.L701P) (Table 3).

Missense mutations

This is the largest group of ABCB4 mutations, with 23 distinct nucleotide substitutions identified, 19 of which are novel (Table 2). Two patients were homozygous, one for the mutation p.G954S and the second for p.Y403H (Table 3). Notably, two mutations already described were found in members of different families that are not part of a genetic isolate: p.T175A and p.S320F were identified in unrelated Caucasian patients (three and two cases, respectively). The comparison among six orthologous ABCB4 proteins and paralogous human ABCB1 shows that most (80%) of the mutated amino-acids identified in this study are conserved in eukaryotic P-glycoproteins (red and blue residues in Figure 2). Moreover, 200 alleles from the general population were analyzed by DHPLC, and none of the missense mutations identified in the PFIC3 families were observed.

Figure 2
figure 2

Multiple sequence alignment of ABCB4 protein sequences. The alignment includes the amino-acid sequences of the human ABCB4 protein (Hs_ABCB4), Pan troglodytes (Pt_ABCB4), Mus musculus (Mm_ABCB4), Rattus norvegicus (Rn_ABCB4), Bos taurus (Bt_ABCB4) and Monodelphis domestica or Opossum (Md_ABCB4). The sequence of the human paralogous ABCB1 (Hs_ABCB1) and that of four ABC bacterial proteins, Escherichia coli (Esch-coli_MsbA), Salmonella typhimurium (Salm-typh_MsbA), Vibrio cholera (Vibrio-ch_MsbA) and Staphylococcus aureus (Staph-au_Sav1866) have also been included. The multiple alignment was obtained with ClustalW. Only sequence stretches containing amino-acid positions involved in new changes in PFIC3 are shown. Residues fully conserved are shown in red, amino-acid residues highly conserved are depicted in blue, residues less conserved are represented in green.

Nonsense mutations

The four novel nonsense mutations identified in this study are p.R159X, p.Y279X, p.G762X and p.E888X (Table 2). These mutations result in the synthesis of truncated proteins likely lacking floppase activity. The patient carrying the p.R159X mutation is heterozygous and the second mutated allele is still unknown; the other three cases show one nonsense mutation associated with three distinct missense mutations (Table 3).

Discussion

In this study, we analyzed the genomic DNA from 68 patients with PFIC3 diagnosis enrolled in the context of a pediatric multicenter study.

There are no PFIC3 epidemiologic data available to date; however, knowing that the number of newborns in Italy has been on average 500 000/year in the last 14 years (http://demo.istat.it/), since we observed 18 patients with ABCB4-mutated alleles born within a 14-year period (with an average of 1.28 cases per year), we can assume that the frequency of the disease is not less than 1/400 000 live births in Italy. However, considering that the clinical centers participating in this study encompass no more than half of Italian population, and that the positive subjects for the molecular analysis are about one-fourth of the patients with PFIC3 clinical phenotype, we can estimate a more realistic frequency of 1/50 000.

To evaluate the effect of the mutations described in this study on protein function, the positions of the mutated residues were located on the predicted tertiary structure of ABCB4; the latter was obtained by homology building as previously illustrated starting from the known three-dimensional structure of homologous bacterial ABC transporter Sav1866 determined with crystallographic methods.9

The bacterial ABC transporter Sav1866 from S. aureus displays significant sequence homology with either the N- and C-terminal half of ABCB4 (41 and 32% identical residues spanning 545 and 474 amino-acid positions, respectively). Therefore, they are likely to share in common with Sav1866 the same tertiary structure, consisting of a TMD composed of six membrane-spanning helical segments followed by a NBD. The structure of the multidrug ABC transporter Sav1866, determined with crystallography at 3.0 Å resolution9 (PDB entry code, 2HYD), was used as a template structure to locate the position of mutated amino-acids in the protein tertiary structure, under the hypothesis of conserved overall topology between ABCB4 and the two molecules of the asymmetric unit of Sav1866 crystals (Figure 1b). In fact, the functional Sav1866 dimer is observed in the crystallographic asymmetric unit. It is assumed here that the relative position of the N- and C-terminal domain of ABCB4 is the same as the dimeric arrangement of the molecules A and B in the crystallographic Sav1866 structure, respectively. Although this has to be considered a working hypothesis, the deduced considerations may provide a useful framework to address validating site-directed mutagenesis experiments, aimed also at determining the functional role of the residues involved.

The amino-acid variability as observed in 10 distinct ABC transporters was also considered. For this purpose, the multiple sequence alignment was built with six mammalian proteins and four bacterial proteins (Figure 2). The multiple alignment provides a clear view of the amino-acids conserved among mammalian species, as well as those conserved across the whole molecular family encompassing eukaryota and prokaryota.

The missense mutations described in this study can be classified into three distinct categories, according to the degree of amino-acid conservation observed in the multiple alignment:

  1. a)

    Mutations affecting fully conserved residues (represented in red in Figure 2). This category (named A in Table 3) includes seven cases: p.Y403H, p.V475A, p.A511T, p.E558K, p.R590Q, p.T593A and p.A1193T. Six of them (see Figure 1a) are located in the N-terminal NBD (spanning amino-acid positions 394–630), whereas only one is found in the C-terminal NBD (residues 1034–1272). Notably, among the 11 missense mutations previously described in the literature for PFIC3 phenotype,2 six involve the N-terminal NBD, and none involve the C-terminal NBD. This bias is indicative of a distinct functional role of the two NBDs in the floppase activity. The strict conservation of the residues involved suggests either a functional or a structural role maintained in both mammalian and bacterial ABC transporters. In particular, the two missense mutations p.Y403H and p.E558K are located in protein sites already extensively investigated in other ABC transporters. Amino-acid Y403 forms the A-loop and represents the aromatic residue interacting with the adenine ring of ATP. Biochemical and site-directed mutagenesis studies carried out on human ABCB1 (Y401) and on the HisP subunit of the bacterial ABC transporter histidine permease (Y16) have shown that this residue is critical for ATP binding and hydrolysis.21, 22 The amino-acid E558 is part of the ‘extended’ Walker B motif; this carboxylate residue is completely conserved in the sequence of 266 ABC transporters;23 site-directed mutagenesis studies involving the homologous position (E552) in mouse ABCB1 and the equivalent position (E179) in the HisP subunit of the bacterial histidine permease indicate that this residue is involved in ATP hydrolysis but not in ATP binding.21, 23

  2. b)

    Mutations involving amino-acid positions conserved in mammals ABCB4 and human ABCB1 P-glycoprotein (represented in blue in Figure 2); nine residues, located in functional domains, have been identified in this study (category named B in Table 3). Apart M630, located in the NBD, the remaining mutations involve either the helical loops (A250, A286, F357, A364) or TM segments (G126, P726, A840, G954; Figure 1b); the changes p.G126E, p.P726T, p.A840D and p.G954S that involve substitution of neutral residue with a polar residue, could cause structural perturbations affecting the correct interaction of the respective TM segment with the cell membrane phospholipid bilayer. Previous mutagenesis and biochemical studies carried out on human ABCB1 have assessed that Ala841, part of TM9, structurally equivalent to Ala840 in ABCB4, is involved in drug binding24, 25 and interferes with the ability of the two structural modules (TMDs and NBDs) to form correct interactions.25 Furthermore, the position Gly954 was already described as ABCB1 mutant G955V that shows altered drug-stimulated ATPase activities.26

  3. c)

    The remaining four novel missense mutations described here (green bars, Figure 2) are affected residues conserved only in mammals ABCB4 (category named C in Table 3). Among them, p.L701P involves the linker region (Figure 1a). Although no information is available concerning the structural role of this amino-acid, its substitution with the sterically constrained Pro residue may result in conformational stress. In this regard, transfected mammalian cell lines with ABCB1 mutants showed that insertion of a rigid structure (α-helical peptide) in this flexible connecting segment affects the cell surface localization of ABCB1.27 The mutation p.G723E brings a charged residue within the hydrophobic environment of the membrane, with very unfavorable energetic effects. In fact, in the multiple alignment only ambivalent or hydrophobic residues are observed at this position; it has been previously shown that the Gly–Ala change at the adjacent position Gly722 in human ABCB1 disrupts interactions between the NBDs.25

By cysteine-scanning mutagenesis, it has been recently shown that seventh transmembrane (TM7) of human ABCB1 forms part of the drug-binding pocket: the missense substitutions affecting the TM7 residues F728 and A729 with Cys residues resulted in significant decrease in the drug substrate affinity for Verapamil;28 in ABCB4 it is likely that TM7 is part of the PC-translocation pathway. As a matter of fact, our results show that (i) TM7, encoded by exon 17, is subject to frequent mutational events (Table 2) and (ii) the four less conserved amino-acids are all clustered in exon 17 (green bars, Figure 2). Moreover, this exon is the part of the ABCB4 most involved in PFIC3 mutations (20.7% in our study and 14.9% considering also data from other published mutations1, 2, 16).

In summary, for 18 of the 25 new mutations described in this study, the predicted impairment effect on ABCB4 is quite evident; six mutations resulting in truncated proteins and 12 consisting in missense mutations. Among the latter, six mutations are located in predicted functional domains with involved residues fully conserved in eukaryotic P-glycoproteins and in prokaryotic ABC proteins, and six mutations result in radical amino-acid substitutions in TMDs and linker region. The structural or functional relevance of the remaining seven amino-acid substitutions (p.A250P, p.A286V, p.F357L, p.A364V, p.M630V, p.T715I and p.A737V) is less clear and the induced effect on ABCB4 is less evident. Further characterization is required to understand better their causative role.

The low percentage of identified ABCB4-mutated alleles in our cohort of patients with PFIC3 phenotype (26.5%) suggests that the direct sequencing technique used in this study has a high sensitivity for point mutations and small insertions/deletions, but it misses the large allelic deletions; in addition, genetic heterogeneity underlying the PFIC3 phenotype can also be present. Finally, although all other known causes of liver disease were excluded, some degree of overlap in clinical phenotype could be present.

According to the category ABCB4 mutations (Table 3), our 13 patients with two ABCB4-mutated alleles can be divided into two groups:

  1. 1)

    Six patients with more severe genotype which predicts substantial impairment of ABCB4 floppase activity (IC2, IC7, IC8, IC10, IC13 and IC18); patients in this group present on the first allele one mutation of the category X (truncated proteins) or category A (mutations affecting fully conserved residues) and on the second allele one mutation of the categories X or A or one mutations with predicted structural perturbations in TMDs of the category B.

  2. 2)

    Seven patients with less severe genotype (IC3, IC4, IC5, IC9, IC12, IC14 and IC17) who present on the first allele one mutation of the category X (truncated proteins) or category B (mutations affecting conserved residues in functional domains of mammals ABCB4 and human ABCB1) and on the second allele one mutation of the categories C (mutations affecting conserved residues only in functional domains of mammals ABCB4) or B.

Although detailed clinical data on long-term follow up are needed to define genotype–phenotype correlation, it is of interest that two of the three patients (IC2 and IC8) with the more severe clinical picture at presentation belong to the more severe genotype group.

In conclusion, our data indicate that (a) the large majority of the new mutations of the ABCB4 gene are found in functional domains of the protein, and likely impair ABCB4 floppase activity; (b) ABCB4 deficiency is characterized by wide allelic heterogeneity, with private mutations in each affected family, with the N-terminal NBD as preferred hot spot for causative missense mutations and with more than half mutations located in exons 6, 9, 15 and 17; (c) frequent and advantageous mutational events in exon 17, and in TM7 segment in particular, of the ancestral ABCB1 gene could have very likely contributed to definition of the new PC floppase activity in the mammalian genome.