Introduction

Chronic obstructive pulmonary disease (COPD) is now becoming the most prevalent respiratory disease, and it is estimated that COPD will be the fifth most prevalent disease in the world in 2020.1 As only 10–20% of heavy smokers develop symptomatic COPD,2 susceptibility to COPD is thought to depend on environmental and genetic factors.

Genetic factors other than α-1-antitrypsin (A1AT) might influence the susceptibility of an individual to COPD.3 Some genes related to nicotine dependence, including nicotine receptors and those involved in the dopamine and serotonin pathways,4 are believed to be good candidates for genetic risk factors of COPD. CYP2A6 and α-nicotinic acetylcholine receptor (CHRNA 3/5) were investigated as candidate genes involved in COPD pathogenesis;5, 6 however, the association between many genes and COPD remains to be studied.

A serotonin transporter gene, solute carrier family 6, member 4 (SLC6A4) (or 5-hydroxytryptamine transporter: 5-HTT), which is involved in serotonin reuptake in the serotonin pathway, is thought to be a good candidate for a gene associated with tobacco consumption and COPD and also depression, which is one of the important comorbidities of COPD.7 SLC6A4 is thought to be closely related to nicotine dependence4 and smoking behaviour in the Japanese8and Chinese.9 SLC6A4 variation has also been reported to be related to depression.10 Depression markedly affects the quality of life, exacerbation and hospitalisation, and mortality in COPD.11, 12 As a twin study with a large population suggested that the association between smoking and depression arises largely from genetic factors,13 it could be postulated that smoking, COPD and depression are affected by the same genetic factors, including SLC6A4.

In this study, we investigated whether single-nucleotide polymorphisms (SNPs) of SLC6A4 are correlated with tobacco consumption, susceptibility to COPD and its depressive symptoms.

Materials and methods

Study subjects

The subjects were consecutive patients who initially consulted in the outpatient clinic of the Respiratory Care Clinic, Nippon Medical School, Tokyo, Japan, a secondary care specialist COPD clinic. They received ambulatory treatment without exacerbations from May 2007 to February 2009. Eligible patients fulfilled the following criteria: their clinical course, clinical symptoms and laboratory data satisfied the criteria for the clinical diagnosis of COPD, according to the COPD guidelines published by the Global Initiative for Chronic Obstructive Lung Disease,14 and they were current or ex-smokers with airflow limitation (forced expiratory volume in 1 s (FEV1)/forced vital capacity (FVC), <70%). The control subjects were current or ex-smokers (number of pack-years, >20) who had visited our clinic with some respiratory symptoms but no airflow limitation; these were considered as ‘stage 0’ subjects.15

Blood samples were obtained from 367 current or ex-smokers (67.8±9.8 years old, 328 males and 39 females), comprising COPD patients (n=247) and control subjects (n=119). The basic characteristics of the subjects are shown in Table 1.

Table 1 Basic characteristics of the study population

This study was approved by the ethics committee of the institute, and the subjects were enroled after appropriate informed consent was obtained.

Pulmonary function testing

Pulmonary function parameters, including the post-bronchodilator FEV1, carbon monoxide diffusing capacity (DLCO/VA), vital capacity (VC) and forced vital capacity, were measured according to the American Thoracic Society guidelines.16 using equipment for lung function testing with computer processing (CHESTAC; CHEST, Tokyo, Japan). The reference values of post-bronchodilator FEV1 and VC used were according to the standards of the Japanese Respiratory Society.17

High-resolution computed tomography (HRCT) of the chest

Emphysema severity and airway remodelling were determined on the basis of HRCT scans obtained at full inspiration, as in our previous study.18 Helical computed tomography scans were acquired with a 1.25-mm collimation, a scan time (rotation time) of 0.8 s, 120 kV and 100600 mA, using a Light Speed Pro16 computed tomography scanner (GE, Tokyo, Japan).

Analysis of the low-attenuation area

We calculated the percentage low-attenuation area (LAA%) on HRCT scans, using the method of Nakano et al.19 with a minor modification. Briefly, six slices were obtained of the upper, middle and lower lobes of the bilateral lungs. The upper slice was obtained at the aortic arch, the middle section was obtained at the carina and the lower section was obtained at 3 cm above the top of the diaphragm. The voxels within each field were categorised as emphysema based on a grey scale image that was reconstructed using Hounsfield units. A density histogram, based on the distribution of the number of voxels at each Hounsfield unit within the lung, was plotted; values <–940 Hounsfield units, which correspond to severe emphysema, were obtained. The arithmetical mean value for each case was used as the representative value, LAA%.

Analysis of airways

A cross-section of the apical bronchus of the right upper lobe (rB1) was obtained on HRCT to assess airway thickness.20 Airway dimensions were measured using the method described in previous studies21, 22 with a modification. The cross-section of the right B1 bronchus, which appeared on the HRCT film was scanned, and the images were magnified 50 times for the following measurements. Both the inner and outer perimeters of rB1 were traced. The airway wall area (WA) was calculated as the area obtained with the outer perimeter of rB1 (total airway area) subtracted from both the dark plus intermediate areas (luminal area). The percentage of the airway wall area (WA%) was calculated using the formula 100 × WA/total airway area. Major and minor axes in the inner area of rB1 were determined on the enlarged, traced images, and the ratio of the major and minor axes was calculated; this ratio was the distortion index. The WA% was obtained only for cases with a distortion index >0.75, indicating the best circular shape of the rB1 cross-section. Analyses of all images on HRCT films were performed on a Macintosh computer using the public domain NIH Image programme (developed at the US National Institutes of Health and available on the Internet at http://rsb.info.nih.gov/nih-image/).

Depression score based on the hospital anxiety and depression scale (HADS)

Depression and anxiety were assessed at baseline, using HADS.23 HADS is a validated tool for screening for the severity of symptoms of depression and anxiety in both hospitalised and primary care patients24 with chronic diseases, including COPD,11 and in the general population.24 HADS comprises seven items for depression (HAD-D) and seven items for anxiety (HAD-A). The scores range from 0 to 21 for each subscale, with a score of 0–7 denoting a non-case; 8–10, a possible case; and 11 or higher, a probable case.23 This scale was translated into Japanese and has also been validated as a screening tool in Japan.25

Methods for genotyping

SNP selection

We selected SNPs of SLC6A4 based on information from Hapmap (http://www.hapmap.org) for a Japanese population, as shown in Supplementary Table 1 and Supplementary Figure 1. These SNPs were selected using a tagging algorithm in the Haploview 4.1 software (Whitehead Institute for Biomedical Research, http://www.broad.mit.edu/mpg/haploview). Haplotype blocks were defined according to Gabriel et al.,26 and tag SNPs were selected using an linkage disequilibrium threshold defined by r20.8 and a minor allele frequency 0.10.

Genotyping assays

Genomic DNA was isolated from whole blood using the QIAamp DNA Blood Mini Kit, according to the manufacturer's protocol (Qiagen KK, Tokyo, Japan). The SNPs described above were genotyped using 5′ to 3′ exonuclease TaqMan assays27 with ABI TaqMan SNP Genotyping Assays (Life Technologies Japan, Tokyo, Japan).

Statistical analysis

All values are presented as means±standardized deviation (s.d.). The basic characteristics between two groups were compared using Fisher’s exact test and an unpaired t-test or a non-parametric test (Wilcoxon test). Statistical analyses were performed using JMP Genomics software, version 3.1 (SAS Institute, Cary, NC, USA). The analyses included testing Hardy–Weinberg equilibrium and the other genetic analyses that are described below.

We used multivariate logistic regression analysis to compare the COPD incidence among the genotypes tested (represented in the tables as genotype analyses) or in a trend model (an additive model in which the genotypes were coded as 0, 1 and 2 as per the number of minor alleles). The odds ratio of COPD development was also calculated according to the number of a specific risk allele in a trend model. The effect of each SNP genotype on continuous variables, including pulmonary function testing, quantitative computed tomography scan data, HADS score and pack-years, were analyzed using multivariate linear regression. Data were adjusted for age, gender and smoking status.

We implemented mediation analysis to identify the potential mediation effects of pack-years on the relation between rs2020936 and risk of COPD incidence.28 The significance of the indirect effect (mediating effect) is tested using the Sobel test, and the mediating effect is also calculated as previously described.29, 30, 31 In our study, because of the dichotomous outcome (COPD), the calculation had to be slightly modified as previously described.32, 33 Because the distribution of the pack-years was not normal, we had to apply box-cox transformation before initiating the mediation analysis.34 All regression analyses for the mediation analysis were adjusted for age and gender. The genetic variant (rs2020936) was coded assuming an additive model.

The expectation-maximisation algorithm was used to estimate the unobserved haplotype frequencies, given a multilocus sample of genetic marker genotypes under the assumption of Hardy–Weinberg equilibrium. The haplotypes for each subject were estimated. The effect of each haplotype on the formation of COPD or the continuous variables was determined via a logistic or linear regression analysis with a trend model adjusted for age, gender and smoking status.

P<0.05 was considered significant. Owing to the nature of the analysis and multiple testing of the dependent and independent variables, Bonferroni correction was applied.

Results

Tag SNP selection

We investigated the linkage disequilibrium block structure for SLC6A4 and its 5′ flanking sequence and selected 5 tag SNPs for SLC6A4. The 5 tag SNPs were considered to represent the genetic variations for the region and were used in this study (Supplementary Figure 1).

Relationship between SLC6A4 variation and tobacco consumption and COPD incidence

We found a significant correlation between COPD incidence and rs2020936 in the trend model (Table 2). rs2020936 was also associated with pack-years in the trend model before Bonferroni correction (Table 3). The number of T alleles of rs2020936 positively correlated with COPD and pack-years in a parallel manner (Figures 1 and 2), and the association between COPD and rs2020936 weakened after adjustment for pack-years (P-value adjusted for pack-years=0.006). Therefore, rs2020936 is thought to be associated with COPD, and this association is thought to depend partly on tobacco consumption. We also performed a mediation analysis to assess the mediating effect of pack-years on the relation between rs2020936 and COPD incidence. The Sobel test results showed that this mediating effect was significant (P=0.042), and that the pack-years contributed partly to the effect of the SNP on COPD (11.06%) (Supplementary Table 2). The odds ratio of the T allele for COPD was 2.20 (95% confidence interval, 1.31–3.74).

Table 2 The association between SNPs of SLC6A4 and COPD
Table 3 The association between SNPs of SLC6A4 and phenotypes of COPD, smoking history, and depressive symptoms
Figure 1
figure 1

The difference in the proportion of COPD between SLC6A4 genotypes (rs2020936). Refer to Table 2 for the results of statistical analyses.

Figure 2
figure 2

Comparison between the pack-years and the rs2020936 genotypes. The mean and s.d. values are represented in the bar graphs as error bars. The number of T alleles of rs2020936 showed a positive correlation with the number of pack-years (P=0.026). Refer to Table 3 for the results of statistical analyses.

There were significant differences in FEV1%predicted and %WA values between the rs2020936 genotypes (Table 3). These differences were also observed after the data were adjusted for pack-years (P=0.014 and 0.000, respectively). Further, five C-homozygotes for this SNP have significantly higher FEV1%predicted and lower %WA values than subjects with other genotypes (data not shown).

SLC6A4 variation and depressive symptoms

Another SNP of SLC6A4, rs3794808, significantly correlated with the depression score of HADS in a trend model of a whole population (Table 3) and also COPD subjects (P=0.022 for the trend model after adjustment) (Figure 3). This correlation was significant even after adjusting for FEV1% predicted values (P=0.019). The association between this SNP and the depression score was not significant after applying Bonferroni correction.

Figure 3
figure 3

Comparison between the HADS depression score and the rs3794808 genotypes. The mean and s.d. values are shown in the bar graphs as error bars. All genotypes of rs3794808 significantly correlated with the HADS depression score in the trend model of the whole population (P=0.016) and also with the COPD incidence (P=0.022). Refer to Table 3 for the results of statistical analyses.

Relationship between SLC6A4 haplotypes and tobacco consumption, COPD or depressive symptoms

The haplotype rs3794808 G/rs140701 G/rs140700 G/rs2020939 C/rs2020936 T was correlated with the HADS depression score, and there were no haplotypes that correlated with COPD, pack-years and depression scores simultaneously, as shown in Supplementary Table 3.

Discussion

In this study, the SNP rs2020936 of SLC6A4 significantly correlated with COPD, and this SNP was also associated with pack-years. Another SNP of SLC6A4, rs3794808, significantly correlated with the HADS depression score. These results indicate that SLC6A4 variation affects COPD pathogenesis, and this effect depends partly on tobacco consumption. Further, SLC6A4 variation might also contribute to the depressive symptoms in COPD.

An association between SLC6A4 variation and smoking behaviour has been reported. However, this is the first report on an association between SLC6A4 SNPs and COPD. SLC6A4 is thought to be closely related to nicotine dependence4 and has been reported to be associated with smoking behaviour in the Japanese8 and Chinese.9 CYP2A6 and CHRNA 3/5 are believed to be candidate genes involved in COPD pathogenesis,5, 6 there are no reports on an association between SLC6A4 and COPD. In addition, we also showed the mediation effect of pack-years on the relationship between genetic variation of SLC6A4 and COPD, and to the best of our knowledge, this finding has been reported for the first time in the literature.

Since an SLC6A4 SNP also correlated with %WA after adjusting for pack-years, SLC6A4 is possibly associated with an aspect of airway disease in COPD. Although serotonin does not constrict the human airway smooth muscle in vitro, it increases acetylcholine release from airway nerves via 5HT3 and 5HT4 receptors.35 The activation of the serotoninergic receptors on human airway epithelial cells regulates the release of IL-6 and IL-8.36 SLC6A4 could play a role in the inflammatory responses of human airway epithelial cells in COPD subjects by changing the concentration of serotonin in the peripheral lungs.

We also observed an association between another SNP of SLC6A4 and depressive symptoms, which is consistent with previous reports. There are many reports on the association between depression and SLC6A4 variation,37 especially variation in the polymorphic region (LPR) and the 17-bp variable number of tandem repeats (VNTR) in the second intron. Although reports on the association between 5-HTTLPR and depression are inconsistent, LPR or VNTR could be the only ‘tags’ of the causal SNPs and could be responsible for the inconsistency. Since we found this correlation after adjusting for COPD severity (FEV1% predicted), SLC6A4 variation is believed to be associated with depressive symptoms, regardless of COPD severity.

Since genetic variations of SLC6A4 were associated with depressive symptoms and COPD, there is a possibility that modification of this enzyme could be an option for preventing COPD and treating its depressive symptoms. Selective serotonin reuptake inhibitors (SSRI) and nortriptyline, inhibitors or modifiers of SLC6A4, are now used as standard therapy for depression and the depressive symptoms in COPD.38 These drugs are also candidates for smoking cessation treatments.39 Thus, via their role smoking cessation, some types of SSRIs might be effective for treating both depressive symptoms and COPD.

This study had some limitations. First, the associations between the different variables, particularly the association between the SNPs and pack-years, were relatively weak, as revealed by the P-values obtained. This weak association might possibly be because of the relatively small sample size (247 COPD cases versus 119 controls). In future studies, we aim to determine this association using a larger population of other races. Second, the tag SNPs in our study were located in the intron. Fine mapping of the SNPs of SLC6A4 is needed to investigate the association between all the SNPs and phenotypes. Functional analyses (for example, mRNA expression study) of the causal SNPs should also be performed after they have been identified.

We conclude that SLC6A4 variation affects COPD pathogenesis, and this effect depends partly on tobacco consumption. SLC6A4 variation also affects depressive symptoms in COPD. Therapy for COPD in the future could be optimised to include treatments for smoking cessation and depression by taking SLC6A4 variation into consideration.