Scolaris Content Display Scolaris Content Display

Teledermatology for diagnosing skin cancer in adults

Collapse all Expand all

Background

Early accurate detection of all skin cancer types is essential to guide appropriate management and to improve morbidity and survival. Melanoma and squamous cell carcinoma (SCC) are high‐risk skin cancers which have the potential to metastasise and ultimately lead to death, whereas basal cell carcinoma (BCC) is usually localised with potential to infiltrate and damage surrounding tissue. Anxiety around missing early curable cases needs to be balanced against inappropriate referral and unnecessary excision of benign lesions. Teledermatology provides a way for generalist clinicians to access the opinion of a specialist dermatologist for skin lesions that they consider to be suspicious without referring the patients through the normal referral pathway. Teledermatology consultations can be 'store‐and‐forward' with electronic digital images of a lesion sent to a dermatologist for review at a later time, or can be live and interactive consultations using videoconferencing to connect the patient, referrer and dermatologist in real time.

Objectives

To determine the diagnostic accuracy of teledermatology for the detection of any skin cancer (melanoma, BCC or cutaneous squamous cell carcinoma (cSCC)) in adults, and to compare its accuracy with that of in‐person diagnosis.

Search methods

We undertook a comprehensive search of the following databases from inception up to August 2016: Cochrane Central Register of Controlled Trials, MEDLINE, Embase, CINAHL, CPCI, Zetoc, Science Citation Index, US National Institutes of Health Ongoing Trials Register, NIHR Clinical Research Network Portfolio Database and the World Health Organization International Clinical Trials Registry Platform. We studied reference lists and published systematic review articles.

Selection criteria

Studies evaluating skin cancer diagnosis for teledermatology alone, or in comparison with face‐to‐face diagnosis by a specialist clinician, compared with a reference standard of histological confirmation or clinical follow‐up and expert opinion. We also included studies evaluating the referral accuracy of teledermatology compared with a reference standard of face‐to‐face diagnosis by a specialist clinician.

Data collection and analysis

Two review authors independently extracted all data using a standardised data extraction and quality assessment form (based on QUADAS‐2). We contacted authors of included studies where there were information related to the target condition of any skin cancer missing. Data permitting, we estimated summary sensitivities and specificities using the bivariate hierarchical model. Due to the scarcity of data, we undertook no covariate investigations for this review. For illustrative purposes, we plotted estimates of sensitivity and specificity on coupled forest plots for diagnostic threshold and target condition under consideration.

Main results

The review included 22 studies reporting diagnostic accuracy data for 4057 lesions and 879 malignant cases (16 studies) and referral accuracy data for reported data for 1449 lesions and 270 'positive' cases as determined by the reference standard face‐to‐face decision (six studies). Methodological quality was variable with poor reporting hindering assessment. The overall risk of bias was high or unclear for participant selection, reference standard, and participant flow and timing in at least half of all studies; the majority were at low risk of bias for the index test. The applicability of study findings were of high or unclear concern for most studies in all domains assessed due to the recruitment of participants from secondary care settings or specialist clinics rather than from primary or community‐based settings in which teledermatology is more likely to be used and due to the acquisition of lesion images by dermatologists or in specialist imaging units rather than by primary care clinicians.

Seven studies provided data for the primary target condition of any skin cancer (1588 lesions and 638 malignancies). For the correct diagnosis of lesions as malignant using photographic images, summary sensitivity was 94.9% (95% confidence interval (CI) 90.1% to 97.4%) and summary specificity was 84.3% (95% CI 48.5% to 96.8%) (from four studies). Individual study estimates using dermoscopic images or a combination of photographic and dermoscopic images generally suggested similarly high sensitivities with highly variable specificities. Limited comparative data suggested similar diagnostic accuracy between teledermatology assessment and in‐person diagnosis by a dermatologist; however, data were too scarce to draw firm conclusions. For the detection of invasive melanoma or atypical intraepidermal melanocytic variants both sensitivities and specificities were more variable. Sensitivities ranged from 59% (95% CI 42% to 74%) to 100% (95% CI 48% to 100%) and specificities from 30% (95% CI 22% to 40%) to 100% (95% CI 93% to 100%), with reported diagnostic thresholds including the correct diagnosis of melanoma, classification of lesions as 'atypical' or 'typical, and the decision to refer or to excise a lesion.

Referral accuracy data comparing teledermatology against a face‐to‐face reference standard suggested good agreement for lesions considered to require some positive action by face‐to‐face assessment (sensitivities of over 90%). For lesions considered of less concern when assessed face‐to‐face (e.g. for lesions not recommended for excision or referral), agreement was more variable with teledermatology specificities ranging from 57% (95% CI 39% to 73%) to 100% (95% CI 86% to 100%), suggesting that remote assessment is more likely recommend excision, referral or follow‐up compared to in‐person decisions.

Authors' conclusions

Studies were generally small and heterogeneous and methodological quality was difficult to judge due to poor reporting. Bearing in mind concerns regarding the applicability of study participants and of lesion image acquisition in specialist settings, our results suggest that teledermatology can correctly identify the majority of malignant lesions. Using a more widely defined threshold to identify 'possibly' malignant cases or lesions that should be considered for excision is likely to appropriately triage those lesions requiring face‐to‐face assessment by a specialist. Despite the increasing use of teledermatology on an international level, the evidence base to support its ability to accurately diagnose lesions and to triage lesions from primary to secondary care is lacking and further prospective and pragmatic evaluation is needed.

What is the diagnostic accuracy of teledermatology for the diagnosis of skin cancer in adults?

Why is improving the diagnosis of skin cancer important?

There are different types of skin cancer. Melanoma is one of the most dangerous forms and it is important to identify it early so that it can be removed. If it is not recognised when first brought to the attention of doctors (also known as a false‐negative test result) treatment can be delayed resulting in the melanoma spreading to other organs in the body and possibly causing early death. Cutaneous squamous cell carcinoma (cSCC) and basal cell carcinoma (BCC) are usually localised skin cancers, although cSCC can spread to other parts of the body and BCC can cause disfigurement if not recognised early. Calling something a skin cancer when it is not really a skin cancer (a false‐positive result) may result in unnecessary surgery and other investigations that can cause stress and worry to the patient. Making the correct diagnosis is important. Mistaking one skin cancer for another can lead to the wrong treatment being used or lead to a delay in effective treatment.

What is the aim of the review?

The aim of this Cochrane Review was to find out whether teledermatology is accurate enough to identify which people with skin lesions need to be referred to see a specialist dermatologist (a doctor concerned with disease of the skin) and who can be safely reassured that their lesion (damage or change of the skin) is not malignant. We included 22 studies to answer this question.

What was studied in the review?

Teledermatology means sending pictures of skin lesions or rashes to a specialist for advice on diagnosis or management. It is a way for primary care doctors (general practitioners (GPs)) to get an opinion from a specialist dermatologist without having to refer patients through the normal referral pathway. Teledermatology can involve sending photographs or magnified images of a skin lesion taken with a special camera (dermatoscope) to a skin specialist to look at or it might involve immediate discussion about a skin lesion between a GP and a skin specialist using videoconferencing.

What are the main results of the review?

The review included 22 studies, 16 studies comparing teledermatology diagnoses to the final lesion diagnoses (diagnostic accuracy) for 4057 lesions and 879 malignant cases and five studies comparing teledermatology decisions to the decisions that would be made with the patient present (referral accuracy) for 1449 lesions and 270 'positive' cases.

The studies were very different from each other in terms of the types of people with suspicious skin cancer lesions included and the type of teledermatology used. A single reliable estimate of the accuracy of teledermatology could not be made. For the correct diagnosis of a lesion to be a skin cancer, data suggested that less than 7% of malignant skin lesions were missed by teledermatology. Study results were too variable to tell us how many people would be referred unnecessarily for a specialist dermatology appointment following a teledermatology consultation. Without access to teledermatology services however, most of the lesions included in these studies would likely be referred to a dermatologist.

How reliable are the results of the studies of this review?

In the included studies, the final diagnosis of skin cancer was made by lesion biopsy (taking a small sample of the lesion so it could be examined under a microscope) and the absence of skin cancer was confirmed by biopsy or by follow‐up over time to make sure the skin lesion remained negative for melanoma. This is likely to have been a reliable method for deciding whether people really had skin cancer. In a few studies, a diagnosis of no skin cancer was made by a skin specialist rather than biopsy. This is less likely to have been a reliable method for deciding whether people really had skin cancer*. Poor reporting of what was done in the study made it difficult for us to say how reliable the study results are. Selecting some patients from specialist clinics instead of primary care along with different ways of doing teledermatology were common problems.

Who do the results of this review apply to?

Studies were conducted in: Europe (64%), North America (18%), South America (9%) or Oceania (9%). The average age of people who were studied was 52 years; however, several studies included at least some people under the age of 16 years. The percentage of people with skin cancer ranged between 2% and 88% with an average of 30%, which is much higher than would be observed in a primary care setting in the UK.

What are the implications of this review?

Teledermatology is likely to be a good way of helping GPs to decide which skin lesions need to be seen by a skin specialist. Our review suggests that using magnified images, in addition to photographs of the lesion, improves accuracy. More research is needed to establish the best way of providing teledermatology services.

How up‐to‐date is this review?

The review authors searched for and used studies published up to August 2016.

*In these studies, biopsy, clinical follow‐up or specialist clinician diagnosis were the reference comparisons.

Authors' conclusions

Implications for practice

Studies were generally small and heterogeneous and methodological quality was difficult to judge due to poor reporting. Considering concerns regarding the applicability of study participants and of lesion image acquisition in specialist settings, our results suggest that teledermatology can be relied upon to correctly identify most malignant lesions. Using a more widely defined threshold to identify 'possibly' malignant cases or lesions that should be considered for excision is likely to appropriately triage those lesions requiring face‐to‐face assessment by a specialist.

Implications for research

Despite the increasing use of teledermatology on a national and international level, the evidence base to support its ability to accurately triage lesions from primary to secondary care is lacking and further prospective and pragmatic evaluation is needed. Consecutive series of participants with suspicious skin lesions judged to require a specialist opinion by general practitioners should be recruited (i.e. excluding those clearly judged to be malignant or benign) and referred for store‐and‐forward teledermatology in comparison to routine referral to a dermatologist. The reason for referral (e.g. exclusion of melanoma to avoid an urgent or 'two‐week wait' referral, exclusion of cSCC or basal cell carcinoma, or exclusion of 'any skin cancer') should be clearly recorded. 'State‐of‐the‐art' digital photography should be used (potentially utilising mobile phone cameras) to allow the full benefit of current technology to be exploited, and compared with smart phone applications, and systematic follow‐up of non‐excised lesions implemented to avoid over‐reliance on a histological reference standard. The level of training and experience of both the referring and specialist clinicians should be explicit to allow the generalisability of results to be judged. Any future research study needs to be clear about the diagnostic pathway followed by study participants, and should conform to the updated Standards for Reporting of Diagnostic Accuracy (STARD) guideline (Bossuyt 2015).

Summary of findings

Open in table viewer
Summary of findings Summary of findings table

Question:

What is the diagnostic accuracy of teledermatology for the detection of skin cancer in adults?

Population:

Adults with lesions suspicious for skin cancer

Index test:

TD using photographic or dermoscopic (or both) images

Comparator test:

Face‐to‐face diagnosis using visual inspection or dermoscopy (or both)

Target condition:

Any skin cancer, including invasive melanoma and atypical intraepidermal melanocytic variants, BCC and cSCC

Reference standard:

Histology with or without long‐term follow‐up (diagnostic accuracy); expert face‐to‐face diagnosis (for referral accuracy)

Action:

If accurate, positive results ensure the malignant lesions are not missed but are appropriately referred for specialist assessment or are treated appropriately in a non‐referred setting, and those with negative results can be safely reassured and discharged.

Quantity of evidence

Number of studies

Total lesions

Total cases

Diagnostic accuracy

16

4057

879

Referral accuracy

6

1449

270

Limitations

Risk of bias:

Low risk for participant selection in 7 studies; high risk (5) from case‐control design (2) or inappropriate exclusion criteria (4). Low risk for teledermatology assessments (22). Low risk for comparison with face‐to‐face diagnosis (2/5); unclear (3). Low risk for reference standard (10/22); high risk from use of expert diagnosis alone – referral accuracy (6) or inadequate reference standard (6). High risk for participant flow (17) due to differential verification (5), and exclusions following recruitment (14); timing of tests not mentioned in 14 studies.

Applicability of evidence to question:

High concern (14/22) for applicability of participants due to recruitment from secondary care or specialist clinics (12) or inclusion of multiple lesions per participant (6). High concern for applicability of teledermatology assessments (12/22) due to images acquired by dermatologists secondary care settings or in medical imaging units rather than images acquired in primary care. Low concern for reference standard (6/22); unclear concern due to lack of information concerning the expertise of the histopathologist (13) or expert face‐to‐face diagnosis (3).

Findings:

7 studies reported diagnostic accuracy data for the primary target condition of any skin cancer; 9 studies for the detection of invasive melanoma or atypical intraepidermal melanocytic variants; 2 studies for invasive melanoma alone; and 4 studies for BCC alone. 6 studies reported only referral accuracy data (teledermatology decisions versus face‐to‐face decisions). The findings presented are based on results for the primary target condition of any skin cancer and for the detection of invasive melanoma or atypical intraepidermal melanocytic variants.

Diagnostic accuracy data

Number of datasets

Total lesions

Total malignant

Test: TD using photographic images for any skin cancer

4

717

452

3 studies reported a cross‐tabulation of lesion final diagnoses against the diagnosis on teledermatology such that data could be extracted for the detection of any malignancy, regardless of any misclassification of 1 skin cancer for another, e.g. a BCC diagnosed as a melanoma or vice versa; and 1 study presented data for the detection of 'malignant' versus benign cases with no breakdown of individual lesion diagnoses given. Summary sensitivity was 94.9% (95% CI 90.1% to 97.4%) and summary specificity 84.3% (95% CI 48.5% to 96.8%). 2 studies providing a direct comparison between TD assessment and in‐person diagnosis by a dermatologist the data suggested similar accuracy between approaches; however, data were too scarce to draw firm conclusions.

Test: TD using clinical and dermoscopic images for any skin cancer

3

928

215

Sensitivities were 100% in all 3 studies. Specificities ranged from 25% (95% CI 5% to 57%) to 92% (95% CI 74% to 99%). Studies used varying thresholds to decide test positivity and included highly selected populations. No statistical pooling was undertaken.

Test: TD using photographic images for invasive melanoma or atypical intraepidermal melanocytic variants

4

1834

106

Sensitivities ranged from 59% (95% CI 42% to 74%) to 100% (95% CI 48% to 100%) and specificities from 30% (95% CI 22% to 40%) to 100% (95% CI 93% to 100%). Diagnostic thresholds were correct diagnosis of melanoma (3) or classification as 'atypical' or 'typical.' Populations also varied, some including only atypical or higher‐risk pigmented lesions and excluding equivocal lesions and others including both pigmented and non‐pigmented lesions who were either self‐referred or were deemed to require lesion excision. The number of melanomas missed ranged from 0 to 17.

Test: TD using photographic and dermoscopic images for invasive melanoma or atypical intraepidermal melanocytic variants

4

664

93

Summary estimates were 85.4% (95% CI 68.3% to 94.1%) for sensitivity and 91.6% (95% CI 81.1% to 96.5%) for specificity. Sensitivities were lower for the correct diagnosis of melanoma (71% to 81%) compared to the decision to refer or to excise a lesion (96% to 100%). The number of melanomas missed ranged from 0 to 7.

All data (referral accuracy)

6

1449

270

TD diagnoses were reported based on photographic images alone (4), photographic and dermoscopic images (1) and using live‐link TD (1). Diagnostic decisions on TD varied, including the diagnosis of malignancy, the decision to excise a lesion, the decision to refer versus not refer, or to excise or follow‐up at a later date. For store‐and‐forward TD sensitivities were generally above 90% indicating good agreement between remote image‐based decisions with the face‐to‐face reference standard for lesions considered to require some positive action by face‐to‐face assessment. Specificities were more variable ranging from 57% (95% CI 39% to 73%) to 100% (95% CI 86% to 100%) suggesting that remote assessment is more likely to recommend excision, referral or follow‐up for lesions considered of less concern when assessed face‐to‐face.

BCC: basal cell carcinoma; CI: confidence interval; cSCC: cutaneous squamous cell carcinoma; TD: teledermatology.

Background

This review is one of a series of Cochrane Diagnostic Test Accuracy (DTA) reviews on the diagnosis and staging of melanoma and keratinocyte skin cancers conducted for the National Institute for Health Research (NIHR) Cochrane Systematic Reviews Programme. For the purposes of these reviews, diagnostic accuracy is assessed by the sensitivity and specificity of a test. Appendix 1 shows the content and structure of the programme, Appendix 2 provides a glossary of terms used.

Target condition being diagnosed

There are three main forms of skin cancer. Melanoma has the highest skin cancer mortality (Cancer Research UK 2017); however, the most common skin cancers in Caucasian populations are those arising from keratinocytes: basal cell carcinoma (BCC) and cutaneous squamous cell carcinoma (cSCC) (Gordon 2013; Madan 2010). In 2003, the World Health Organization estimated that between two and three million 'non‐melanoma' skin cancers (of which BCC is estimated to account for around 80% and cSCC around 16% of cases) and 132,000 melanoma skin cancers occur globally each year (WHO 2003).

In this DTA review there are three target conditions of interest melanoma, BCC and cSCC.

Melanoma

Melanoma arises from uncontrolled proliferation of melanocytes – the epidermal cells that produce pigment or melanin. Cutaneous melanoma refers to any skin lesion with malignant melanocytes present in the dermis, primarily including superficial spreading, nodular, acral lentiginous and lentigo maligna melanoma variants (see Figure 1). Melanoma in situ refers to abnormal melanocytes that are contained within the epidermis and have not yet invaded the dermis, but are at risk of progression to melanoma if left untreated. Lentigo maligna, a subtype of melanoma in situ in chronically sun‐damaged skin, denotes another form of proliferation of abnormal melanocytes. Melanoma in situ and lentigo maligna are both atypical intraepidermal melanocytic variants. All forms of melanoma in situ can progress to invasive melanoma if growth breaches the dermo‐epidermal junction during a vertical growth phase; however, malignant transformation is both lower and slower for lentigo maligna than for melanoma in situ (Kasprzak 2015). Melanoma is one of the most dangerous forms of skin cancer, with the potential to metastasise to other parts of the body via the lymphatic system and bloodstream. It accounts for only a small percentage of skin cancer cases but is responsible for up to 75% of skin cancer deaths (Boring 1994; Cancer Research UK 2017).


Sample photograph of superficial spreading melanoma(left), basal cell carcinoma (centre) and squamous cell carcinoma (right). Copyright © 2012 Dr Rubeta Matin: reproduced with permission.

Sample photograph of superficial spreading melanoma(left), basal cell carcinoma (centre) and squamous cell carcinoma (right). Copyright © 2012 Dr Rubeta Matin: reproduced with permission.

The incidence of melanoma rose to over 200,000 newly diagnosed cases worldwide in 2012 (Erdmann 2013; Ferlay 2015), with an estimated 55,000 deaths (Ferlay 2015). The highest incidence was observed in Australia with 13,134 new cases of melanoma of the skin in 2014 (ACIM 2017), and in New Zealand with 2341 registered cases in 2010 (HPA and MelNet NZ 2014). In the USA, the predicted incidence in 2014 was 73,870 per annum and the predicted number of deaths was 9940 (Siegel 2015). The highest rates in Europe are in north‐western Europe and the Scandinavian countries, with the highest incidence reported in Switzerland of 25.8 per 100,000 people in 2012. Rates in the England have tripled from 4.6 and 6.0 per 100,000 in men and women respectively, in 1990, to 18.6 and 19.6 per 100,000 in 2012 (EUCAN 2012). Indeed, in the UK, melanoma has one of the fastest rising incidence rates of any cancer, with the biggest projected increase in incidence between 2007 and 2030 (Mistry 2011). In the decade leading up to 2013, age‐standardised incidence increased by 46%, with 14,500 new cases in 2013 and 2459 deaths in 2014 (Cancer Research UK 2017). Rates are higher in women than in men; however, the rate of incidence in men is increasing faster than in women (Arnold 2014). The rising incidence in melanoma is thought to be primarily related to an increase in recreational sun exposure and tanning bed use and an increasingly ageing population with higher lifetime recreational ultraviolet (UV) exposure, in conjunction with possible earlier detection (Belbasis 2016; Linos 2009). Belbasis 2016 provides a detailed review of putative risk factors, including eye and hair colour, skin type and density of freckles, history of melanoma, sunburn and presence of particular lesion types.

A database in the USA of over 40,000 patients from 1998 onwards, which assisted the development of the 8th American Joint Committee on Cancer (AJCC) Staging System indicated a five‐year survival of 97% to 99% for stage I melanoma, which dropped to 32% to 93% in stage III disease depending on tumour thickness, the presence of ulceration and number of involved nodes (Gershenwald 2017). While these are substantial increases relative to survival in 1975 (Cho 2014), increasing incidence between 1975 and 2010 means that mortality rates have remained static during the same period. This observation, coupled with increasing incidence of localised disease, suggests that improved survival rates may be due to earlier detection and heightened vigilance (Cho 2014). New targeted therapies for advanced (stage IV), melanoma (e.g. BRAF inhibitors), have improved survival, and immunotherapies are evolving such that long‐term survival is being documented (Pasquali 2018; Rozeman 2017). No new data regarding the survival prospects for patients with stage IV disease were analysed for the AJCC 8 staging guidelines due to lack of contemporary data (Gershenwald 2017).

Basal cell carcinoma

BCC can arise from multiple stem cell populations, including from the bulge and interfollicular epidermis (Grachtchouk 2011). BCC growth is usually localised, but it can infiltrate and damage surrounding tissue, sometimes causing considerable destruction and disfigurement, particularly when located on the face (Figure 1). The four main subtypes of BCC are superficial, nodular, morphoeic or infiltrative, and pigmented. They typically present as slow‐growing asymptomatic papules, plaques or nodules which may bleed or form ulcers that do not heal (Firnhaber 2012). People with a BCC often present to healthcare professionals with a non‐healing lesion rather than specific symptoms such as pain. Many lesions are diagnosed incidentally (Gordon 2013).

BCC most commonly occurs on sun‐exposed areas on the head and neck (McCormack 1997), and they are more common in men and in people over the age of 40. Different authors have attributed a rising incidence of BCC in younger people to increased recreational sun exposure (Bath‐Hextall 2007a; Gordon 2013; Musah 2013). Other risk factors include Fitzpatrick skin types I and II (Fitzpatrick 1975; Lear 1997; Maia 1995); previous skin cancer history; immunosuppression; arsenic exposure; and genetic predisposition, such as in basal cell naevus (Gorlin's) syndrome (Gorlin 2004; Zak‐Prelich 2004). Annual incidence is rising worldwide; Europe has experienced a mean increase of 5.5% per year since the late 1970s, the USA 2% per year, while estimates for the UK show incidence appears to be increasing more steeply at a rate of an additional 6 per 100,000 people per year (Lomas 2012). The rising incidence has been explained by an ageing population; changes in the distribution of known risk factors, particularly UV radiation; and improved detection due to the increased awareness among both practitioners and the general population (Verkouteren 2017). Hoorens 2016 points to evidence for a gradual increase in the size of BCCs over time, with delays in diagnosis ranging from 19 to 25 months.

According to National Institute for Health and Care Excellence (NICE) guidance (NICE 2010), low‐risk BCCs are nodular lesions occurring in people older than 24 years who are not immunosuppressed and do not have Gorlin syndrome. Furthermore, they should be located below the clavicle; should be small (diameter of less than 1 cm) with well‐defined margins; not recurrent following incomplete excision; and not in awkward or highly visible locations (NICE 2010). Superficial BCCs are also typically low risk and may be amenable to medical treatments such as photodynamic therapy (PDT) or topical chemotherapy (Kelleners‐Smeets 2017). Assigning BCCs as low or high risk influences the management options (Batra 2002; Randle 1996).

Advanced locally destructive BCC can be found on 'high‐risk' anatomical areas such as the eyebrow, eyelid, nose, ear and temple (these are at higher risk of invisible spread and therefore are more at risk of being incompletely excised (Baxter 2012; Lear 2014)), and they can arise from long‐standing untreated lesions or from a recurrence of aggressive BCC after primary treatment (Lear 2012). Very rarely, BCC metastasises to regional and distant sites resulting in death, especially cases of large neglected lesions in people who are immunosuppressed or those with Gorlin syndrome (McCusker 2014). Rates of metastasis are reported at 0.0028% to 0.55% (Lo 1991), with very poor survival rates. It is recognised that basosquamous carcinoma (more like a high‐risk SCC in behaviour and not considered a true BCC) is likely to have accounted for many cases of apparent metastases of BCC hence the spuriously high reported incidence in some studies of up to 0.55% which is not seen in clinical practice (Garcia 2009).

Squamous cell carcinoma of the skin

Primary cSCC arises from the keratinocytes of the outermost layer (epidermis) of the skin. People with cSCC often present with an ulcer or firm (indurated) papule, plaque or nodule (Firnhaber 2012; Griffin 2016), often with an adherent crust and poorly defined margins (Madan 2010). This type of carcinoma can arise in the absence of a precursor lesion or it can develop from pre‐existing actinic keratosis or Bowen's disease (considered by some clinicians to be squamous cell carcinoma in situ); the estimated annual risk of progression being less than 1% to 20% for newly arising lesions (Alam 2001), and 5% for pre‐existing lesions (Kao 1986). It remains locally invasive for a variable length of time, but it has the potential to spread to the regional lymph nodes or via the bloodstream to distant sites, especially in immunosuppressed individuals (Lansbury 2010). High‐risk lesions are those arising on the lip or ear, recurrent cSCC, lesions arising on non‐exposed sites, scars or chronic ulcers, tumours more than 20 mm in diameter and with depth of invasion more than 4 mm, and poor differentiation on pathological examination (Motley 2009). Perineural invasion of nerves at least 0.1 mm in diameter is a further documented risk factor for high‐risk cSCC (Carter 2013).

Chronic ultraviolet light exposure through recreation or occupation is strongly linked to cSCC occurrence (Alam 2001). It is particularly common in people with fair skin and in less common genetic disorders of pigmentation, such as albinism, xeroderma pigmentosum and recessive dystrophic epidermolysis bullosa (Alam 2001). Other recognised risk factors include immunosuppression; chronic wounds; arsenic or radiation exposure; certain drug treatments, such as voriconazole and BRAF mutation inhibitors; and previous skin cancer history (Baldursson 1993; Chowdri 1996; Dabski 1986; Fasching 1989; Lister 1997; Maloney 1996; O'Gorman 2014). In solid organ transplant recipients, cSCC is the most common form of skin cancer; the risk of developing cSCC has been estimated at 65 to 253 times that of the general population (Hartevelt 1990; Jensen 1999; Lansbury 2010). Overall, local and metastatic recurrence of cSCC at five years is estimated at 8% and 5% respectively (Rowe 1992). The five‐year survival rate of metastatic cSCC of the head and neck is around 60% (Moeckelmann 2018).

Treatment

For primary melanoma, the mainstay of definitive treatment is wide local surgical excision of the lesion, to remove both the tumour and any malignant cells that might have spread into the surrounding skin (Garbe 2016; Marsden 2010; NICE 2015a; SIGN 2017; Sladden 2009). Recommended lateral surgical margins vary according to tumour thickness (Garbe 2016), and to stage of disease at presentation (NICE 2015a).

Treatment options for BCC and cSCC include surgery, other destructive techniques such as cryotherapy or electrodesiccation and topical chemotherapy. A Cochrane Review of 27 randomised controlled trials (RCTs) of interventions for BCC found very little good‐quality evidence for any of the interventions used (Bath‐Hextall 2007b). Complete surgical excision of primary BCC has a reported five‐year recurrence rate of less than 2% (Griffiths 2005; Walker 2006), leading to significantly fewer recurrences than treatment with radiotherapy (Bath‐Hextall 2007b). After apparent clear histopathological margins (serial vertical sections) after standard excision biopsy with 4 mm surgical peripheral margins taken there is a five‐year reported recurrence rate of around 4% (Drucker 2017). Mohs micrographic surgery , whereby surgeons microscopically examine horizontal sections of the tumour peri‐operatively, undertaking re‐excision until the margins are tumour‐free, are options for high‐risk lesions on the face where standard wider excision margins might lead to incomplete excision or considerable functional impairment (Bath‐Hextall 2007b; Lansbury 2010; Motley 2009; Stratigos 2015). Bath‐Hextall 2007b found one trial comparing Mohs micrographic surgery with a 3 mm surgical margin excision in BCC (Smeets 2004); the update of this study showed non‐significantly lower recurrence at 10 years with Mohs micrographic surgery (4.4% with Mohs micrographic surgery compared to 12.2% after surgical excision; P = 0.10) (van Loo 2014).

The main treatments for high‐risk BCC are standard surgical excision, Mohs micrographic surgery or radiotherapy. For low‐risk or superficial subtypes of BCC, or for people with small or multiple (or both) BCCs at low‐risk sites (Marsden 2010), destructive techniques other than excisional surgery may be used (e.g. electrodesiccation and curettage or cryotherapy (Alam 2001; Bath‐Hextall 2007b)). Alternatively, non‐surgical ('non‐destructive') treatments may be considered (Bath‐Hextall 2007b; Drew 2017; Kim 2014), including topical chemotherapy such as imiquimod (Williams 2017), 5‐fluorouracil (Arits 2013), ingenol mebutate (Nart 2015), and photodynamic therapy (PDT) (Roozeboom 2016). Non‐surgical treatments are most frequently used for superficial forms of BCC, with one head‐to‐head trial suggesting topical imiquimod is superior to PDT and 5‐fluorouracil (Jansen 2018). Although non‐surgical approaches are increasingly used, they do not allow histological confirmation of tumour clearance, and their efficacy is dependent on accurate characterisation of the histological subtype and depth of tumour. The 2007 Cochrane review of BCC interventions found limited evidence from very small RCTs for these approaches (Bath‐Hextall 2007b), which have only partially been addressed by subsequent studies (Bath‐Hextall 2014; Kim 2014; Roozeboom 2012). Most BCC trials have compared interventions within the same treatment class, and few have compared medical versus surgical treatments (Kim 2014).

Vismodegib, a first‐in‐class Hedgehog signalling pathway inhibitor is now available for the treatment of metastatic or locally advanced BCC based on the pivotal study ERIVANCE BCC (Sekulic 2017). It is licensed for use in these patients where surgery or radiotherapy is inappropriate, e.g. for treating locally advanced periocular and orbital BCCs with orbital salvage of patients who otherwise would have required exenteration (Wong 2017). However, NICE has recommended against the use of vismodegib based on cost effectiveness and uncertainty of evidence (NICE 2017).

A systematic review of interventions for primary cSCC found only one RCT eligible for inclusion (Lansbury 2010). Current practice therefore relies on evidence from observational studies, as reviewed in Lansbury 2013, for example. Surgical excision with predetermined margins is usually the first‐line treatment (Motley 2009; Stratigos 2015). Estimates of recurrence after Mohs micrographic surgery, surgical excision or radiotherapy, which are likely to have been evaluated in higher‐risk populations, have shown pooled recurrence rates of 3%, 5.4% and 6.4%, respectively, with overlapping confidence intervals (CI); the review authors advised caution when comparing results across treatments (Lansbury 2013).

Index test(s)

Teledermatology is a term used to describe the delivery of dermatological care through information and communication technology (Bashshur 2015). It uses imaging modalities to provide specialist dermatology services either to other healthcare professionals (such as general practitioners (GP)), or to patients directly (Ndegwa 2010). It is considered a valuable tool in the diagnosis and management of skin disease, because of the visual nature of skin lesions and rashes (Warshaw 2011). Teledermatology allows an increased information flow between primary care physicians and dermatologists, which could lead to a reduction in waiting times and limit unnecessary referrals (Ndegwa 2010; Warshaw 2011; Bashshur 2015). In rural areas, where access to speciality services can have significant and potentially off‐putting travel and time implications for the patient, teledermatology has the potential to widen access to specialist opinion.

Teledermatology consultations can be conducted in two main ways, store‐and‐forward or 'asynchronous,' and live interactive or 'synchronous' (Ndegwa 2010). With the store‐and‐forward approach, clinicians and patients are separated by both time and space, as electronic digital images are taken and then transmitted to a dermatologist for review at a later unspecified time (Warshaw 2011). The pictures can be digital photographic (or 'macroscopic') images, or can be magnified dermoscopic images taken using a dermatoscope. Images are often accompanied by a summary of the patient history and demographic information as part of a consultation package (Ndegwa 2010). Furthermore, recent developments in smartphone technology have also introduced a new platform for transferring lesion images from one setting to another (Chuchu 2018). The store‐and‐forward approach is advantageous as it requires less sophisticated technology and lower‐cost equipment (Warshaw 2011); however, it does not allow the specialist to take a direct history, request additional views or communicate in detail the purpose of management to the patient or referrer (Ndegwa 2010).

Live interactive teledermatology uses videoconferencing and image transmission to connect the patient, referrer and dermatologist in real time (Ndegwa 2010). The dermatologist and patient can interact verbally in a similar manner to a traditional clinic‐based encounter, but more extensive telecommunications infrastructure and time are needed (Ndegwa 2010).

Clinical pathway

The diagnosis of melanoma can take place in primary, secondary and tertiary care settings by both generalist and specialist healthcare providers. In the UK, people with concerns about a new or changing skin lesion will usually present first to their GP or, less commonly, directly to a specialist in secondary care, which could include a dermatologist, plastic surgeon, other specialist surgeon (such as an ear, nose and throat specialist or maxillofacial surgeon), or ophthalmologist (Figure 2). Current UK guidelines recommend that all suspicious pigmented lesions presenting in primary care should be assessed by taking a clinical history and visual inspection guided by the revised seven‐point checklist (MacKie 1990). Clinicians should refer those with suspected melanoma or cSCC for appropriate specialist assessment within two weeks (Chao 2014; Marsden 2010; NICE 2015a). Evidence is emerging, however, to suggest that excision of melanoma by GPs is not associated with increased risk compared with outcomes in secondary care (Murchie 2017). In the UK, low‐risk BCC are usually recommended for routine referral, with urgent referral for those in whom a delay could have a significant impact on outcomes, for example due to large lesion size or critical site (NICE 2015b). Appropriately qualified generalist care providers increasingly undertake management of low‐risk BCC in the UK such as by excision of low‐risk lesions (NICE 2010). Similar guidance is in place in Australia (CCAAC Network 2008).


Current clinical pathway for people with skin lesions.

Current clinical pathway for people with skin lesions.

Teledermatology consultations can aid more appropriate triage of lesions, providing reassurance for benign lesions and referral via urgent or non‐urgent routes to secondary care (e.g. for suspected BCC. The distinction between setting and examiner qualifications and experience is important, as specialist clinicians might work in primary care settings (e.g. in the UK, GPs with a special interest in dermatology and skin surgery who have undergone appropriate training), and generalists might practice in secondary care settings (e.g. GPs working alongside dermatologists in secondary care, or plastic surgeons who do not specialise in skin cancer). The level of skill and experience in skin cancer diagnosis varies for both generalist and specialist care providers and impacts on test accuracy.

For referred lesions, the specialist clinician will use history‐taking, visual inspection of the lesion (in comparison with other lesions on the skin) usually in conjunction with dermoscopic examination, and palpation of the lesion and associated regional nodal basins to inform a clinical decision. If melanoma is suspected, then urgent 2 mm excision biopsy is recommended (Lederman 1985; Lees 1991); for cSCC, predetermined surgical margin excision or a diagnostic biopsy may be considered. BCC and premalignant lesions potentially eligible for non‐surgical treatment may undergo a diagnostic biopsy before initiation of therapy. Equivocal melanocytic lesions for which a definitive clinical diagnosis cannot be reached may undergo surveillance to identify any lesion changes that would indicate excision biopsy or reassurance and discharge for lesions that remain stable over a period of time.

Prior test(s)

Although smartphone applications and community‐ or high street pharmacy‐based teledermatology services (e.g. the Boots 'Mole Scanning Service' www.boots.com/health‐pharmacy‐advice/skin‐services/mole‐scanning‐service) can increasingly be accessed directly by people who have concerns about a skin lesion (Kjome 2017), the diagnosis of skin cancer is still based on history taking and clinical examination by a suitably qualified clinician. In the UK, this is typically undertaken at two decision points – first in primary care where the GP makes a decision to refer or not to refer, and then a second time by a dermatologist or other secondary care clinician where a decision is made to biopsy or excise or not.

Visual inspection of the skin is undertaken iteratively, using both implicit pattern recognition (non‐analytical reasoning) and more explicit 'rules' based on conscious analytical reasoning (Norman 2009), the balance of which will vary according to experience and familiarity with the diagnostic question. Various attempts have been made to formalise the "mental rules" involved in analytical pattern recognition for melanoma (Friedman 1985; Grob 1998; MacKie 1985; MacKie 1990; Sober 1979; Thomas 1998); however, visual inspection for keratinocyte skin cancers relies primarily on pattern recognition. Accuracy has been shown to vary according to the expertise of the clinician. Primary care physicians have been reported to miss over 50% of BCCs (Offidani 2002) and to misdiagnose around 33% of BCCs (Gerbert 2000). In contrast, one Australian study found that trained dermatologists were able to detect 98% of BCCs, but with a specificity of only 45% (Green 1988).

A range of technologies have emerged to aid diagnosis to reduce the number of diagnostic biopsies or inappropriate surgical procedures. Dermoscopy using a hand‐held microscope has become the most widely used tool used by clinicians to improve diagnostic accuracy of pigmented lesions, in particular for melanoma (Dinnes 2018a); it is less well established for the diagnosis of BCC or cSCC. Dermoscopy (also referred to as dermatoscopy or epiluminescence microscopy) uses a hand‐held microscope and incident light (with or without oil immersion) to reveal subsurface images of the skin at increased magnification of ×10 to ×100 (Kittler 2001). Used alongside clinical examination, dermoscopy has been shown in some studies to increase the sensitivity of clinical diagnosis of melanoma from around 60% to as much as 90% (Bono 2006; Carli 2002; Kittler 1999; Stanganelli 2000) with much smaller effects in others (Benelli 1999; Bono 2002). The accuracy of dermoscopy depends on the experience of the examiner (Kittler 2001), with accuracy when used by untrained or less‐experienced examiners potentially no better than clinical inspection alone (Binder 1997; Kittler 2002).

The diagnostic accuracy, and comparative accuracy, of visual inspection and dermoscopy have been evaluated in a further three reviews in this series (Dinnes 2018a; Dinnes 2018b; Dinnes 2018c).

Role of index test(s)

The use of teledermatology by primary care or by other generalist clinicians has the potential to ensure that people with suspicious lesions are appropriately referred for examination by a specialist clinician, and people with non‐suspicious lesions are appropriately reassured and managed in primary care. If an accurate triage is made, the proportion of people who are referred unnecessarily will be minimised and lesions requiring urgent referral and treatment correctly identified. By creating an environment where there is facilitated access to more specialist services, selective dermatology referral could ultimately reduce costs while enabling a faster, more reliable and more efficient service (Piccolo 2002). Increased information flow between primary care physicians and dermatologists also has the potential effect of increasing knowledge and reducing isolated decision‐making (Bashshur 2015).

When diagnosing potentially life‐threatening conditions such as melanoma, the consequences of falsely reassuring a person that they do not have skin cancer can be potentially fatal, as the delay to diagnosis means that the window for successful early treatment may be missed. To minimise these false‐negative diagnoses, a good diagnostic test for melanoma demonstrates high sensitivity and high negative predictive value (i.e. very few of those with a negative test result will actually have a melanoma). Giving false‐positive test results (meaning the test has poor specificity and a high false‐positive rate) resulting in the removal of lesions that are benign, is arguably less of an error than missing a potentially fatal melanoma, but does have implications for patient welfare and costs. False‐positive diagnoses cause unnecessary scarring from the biopsy or excision procedure, and increase patient anxiety while they await the definite histology results and increase healthcare costs as the number needed to remove to yield one melanoma diagnosis increases.

Delay in diagnosis of a BCC as a result of a false‐negative result is not as serious as for melanoma because BCCs are usually slow growing and unlikely to metastasise. However, delayed diagnosis can result in a larger and more complex excision with consequent greater morbidity. Very sensitive diagnostic tests for BCC may compromise on lower specificity leading to a higher false‐positive rate, and an enormous burden of skin surgery, such that a balance between sensitivity and specificity is needed. The situation for cSCC is more similar to melanoma in that the consequences of falsely reassuring a person that they do not have skin cancer can be serious and potentially fatal. Thus, a good diagnostic test for cSCC should demonstrate high sensitivity and a corresponding high negative predictive value. In summary, a test that can reduce false‐positive clinical diagnoses without missing true cases of disease has patient and resource benefits. False‐positive clinical diagnoses cause unnecessary morbidity from the biopsy, and could lead to initiation of inappropriate therapies and increase patient anxiety.

Alternative test(s)

Teledermatology provides an alternative means for primary care clinicians (and therefore patients) to access specialist opinion compared to the standard referral process from primary to secondary care. Although the general public can also seek advice on skin lesions, they may be concerned about doing so via smartphone applications or from services provided within community or 'high street' pharmacies. These are not considered direct alternatives to teledermatology services.

Several other tests that may have a role in diagnosis of skin cancer have been reviewed as part of our series of systematic reviews, including visual inspection and dermoscopy (Dinnes 2018a; Dinnes 2018b; Dinnes 2018c), smartphone applications (Chuchu 2018). Reflectance confocal microscopy (Dinnes 2018d; Dinnes 2018e), optical coherence tomography (Ferrante di Ruffano 2018a), and computer‐assisted diagnosis techniques applied to various types of images including those generated by dermoscopy, diffuse reflectance spectrophotometry and electrical impedance spectroscopy (Ferrante di Ruffano 2018b), and high‐frequency ultrasound (Dinnes 2018f). Evidence permitting, the accuracy of available tests will be compared in an overview review, exploiting within‐study comparisons of tests and allowing the analysis and comparison of commonly used diagnostic strategies where tests may be used singly or in combination.

Rationale

Our series of reviews of diagnostic tests used to assist clinical diagnosis of melanoma aimed to identify the most accurate approaches to diagnosis and provide clinical and policy decision‐makers with the highest possible standard of evidence on which to base decisions. With increasing rates of skin cancer and the push towards the use of dermoscopy and other high‐resolution image analysis in primary care, the anxiety around missing early cases needs to be balanced against the risk of over referrals, to avoid sending too many people with benign lesions for a specialist opinion. It is questionable whether all skin cancers detected by sophisticated techniques, even in specialist settings, help to reduce morbidity and mortality, or whether newer technologies run the risk of increasing false‐positive diagnoses. It is also possible that use of some technologies (e.g. widespread use of dermoscopy in primary care with no training), could actually result in harm by missing melanomas if they are used as replacement technologies for traditional history‐taking and clinical examination of the entire skin. Many branches of medicine have noted the danger of such "gizmo idolatry" among doctors (Leff 2008).

Although teledermatology is increasingly used, the accuracy of different approaches to providing teledermatology services (e.g. store‐and‐forward versus live‐link modalities, and use of clinical versus dermoscopic images) has yet to be fully established. A review by Warshaw 2011 suggested that both store‐and‐forward and live‐link teledermatology had acceptable diagnostic accuracy and concordance when compared with clinical face‐to‐face diagnosis; however, clinic‐based dermatology had superior diagnostic accuracy (i.e. in comparison to store‐and‐forward teledermatology consultations). As with any technology requiring significant investment, a full understanding of the benefits including patient acceptability and cost‐effectiveness in comparison to usual practice should be obtained before such an approach can be recommended; establishing the accuracy of diagnosis and referral accuracy is one of the key components. Given the rapidly changing evidence base in skin cancer diagnosis, there is a need for an up‐to‐date analysis of the accuracy of teledermatology for skin cancer diagnosis.

This review followed a generic protocol which covered the full series of Cochrane DTA reviews for the diagnosis of melanoma (Dinnes 2015a); aspects of this review which relate to the diagnosis of BCC and cSCC follow the generic protocol that was written to cover the reviews in the series for the diagnosis of keratinocyte skin cancers (Dinnes 2015b). The 'Background' and 'Methods' sections of this review therefore use some text that was originally published in the protocols (Dinnes 2015a; Dinnes 2015b), and text that overlaps some of our other reviews (Chuchu 2018; Dinnes 2018a; Dinnes 2018b).

Objectives

To determine the diagnostic accuracy of teledermatology for the detection of any skin cancer (melanoma, BCC or cSCC) in adults, and to compare its accuracy with that of in‐person diagnosis.

Accuracy was estimated separately according to the type of teledermatology images used:

  • photographic images;

  • dermoscopic images;

  • photographic and dermoscopic images.

Secondary objectives

  • To determine the diagnostic accuracy of teledermatology for the detection of invasive melanoma or atypical intraepidermal melanocytic variants in adults, and to compare its accuracy with that of in‐person diagnosis.

  • To determine the diagnostic accuracy of teledermatology for the detection of invasive melanoma only, in adults, and to compare its accuracy with that of in‐person diagnosis.

  • To determine the diagnostic accuracy of teledermatology for the detection of BCC in adults, and to compare its accuracy with that of in‐person diagnosis.

  • To determine the diagnostic accuracy of teledermatology for the detection of cSCC in adults, and to compare its accuracy with that of in‐person diagnosis.

  • To determine the referral accuracy of teledermatology (i.e. to compare diagnostic decision making based on teledermatology images with that of an in‐person consultation).

Investigation of sources of heterogeneity

We set out to address a range of potential sources of heterogeneity for investigation across our series of reviews, as outlined in our generic protocols (Dinnes 2015a; Dinnes 2015b), and described in Appendix 4; however, our ability to investigate these was necessarily limited by the available data on each individual test reviewed.

Methods

Criteria for considering studies for this review

Types of studies

We included test accuracy studies that allow comparison of the result of the index test with that of a reference standard, including the following:

  • studies where all participants receive a single index test and a reference standard;

  • studies where all participants receive more than one index test(s) and reference standard;

  • studies where participants are allocated (by any method) to receive different index tests or combinations of index tests and all receive a reference standard (between‐person comparative (BPC) studies);

  • studies that recruit series of participants unselected by true disease status;

  • diagnostic case‐control studies that separately recruit diseased and non‐diseased groups (see Rutjes 2005); however, we did not include studies that compared results for malignant lesions to those for healthy skin (i.e. with no lesion present);

  • both prospective and retrospective studies; and

  • studies where previously acquired clinical or dermoscopic images were retrieved and prospectively interpreted for study purposes.

We excluded studies from which we could not extract 2×2 contingency data or if they included fewer than five cases of melanoma, BCC or cSCC or fewer than five benign lesions. For studies of referral accuracy where a lesion's final diagnosis was not reported, we required at least five 'positive' cases as identified by the expert diagnosis reference standard. The size threshold of five was arbitrary. However, such small studies are unlikely to add precision to estimate of accuracy.

Studies available only as conference abstracts were excluded; however, attempts were made to identify full papers for potentially relevant conference abstracts (Searching other resources).

Participants

We included studies in adults with lesions suspicious for skin cancer or adults at high risk of developing skin cancer. We excluded studies that recruited only participants with malignant or benign final diagnoses.

We excluded studies conducted in children, or which clearly reported inclusion of more than 50% of participants aged 16 and under.

Index tests

We included studies evaluating teledermatology alone, or teledermatology in comparison with face‐to‐face diagnosis.

The following index tests were eligible for inclusion:

  • store‐and‐forward teledermatology;

  • real‐time 'live link' teledermatology.

Data for face‐to‐face clinical diagnosis against a histological reference standard was also included where reported, to allow a direct comparison with teledermatology to be made.

Although primary care clinicians can in practice be specialists in skin cancer, we considered primary care physicians as generalist practitioners and dermatologists as specialists. Within each group, we extracted any reporting of special interest or accreditation in skin cancer.

Target conditions

We defined the primary target condition as the detection of any skin cancer, primarily cutaneous melanoma, BCC or cSCC.

We considered four additional definitions of the target condition in secondary analyses, namely, the detection of:

  • invasive cutaneous melanoma alone;

  • invasive cutaneous melanoma or atypical intraepidermal melanocytic variants (including melanoma in situ, lentigo maligna);

  • BCC;

  • cSCC.

We also considered referral accuracy, comparing decision making from teledermatology compared with the face‐to‐face decisions for the same lesions. These decisions could have included the decision to excise a lesion, follow‐up a lesion or refer a lesion for face‐to‐face assessment.

Reference standards

Teledermatology can be assessed in terms of diagnostic accuracy in comparison to the final lesion diagnosis and referral accuracy in comparison to a face‐to‐face expert management decision.

To establish diagnostic accuracy, the ideal reference standard was histopathological diagnosis in all eligible lesions. A qualified pathologist or dermatopathologist should have performed histopathology. Ideally, reporting should have been standardised detailing a minimum dataset to include the histopathological features of melanoma to determine the AJCC Staging System (e.g. Slater 2014). We did not apply this minimum dataset requirement as a necessary inclusion criterion, but extracted any pertinent information.

Partial verification (applying the reference test only to a subset of those undergoing the index test) was of concern given that lesion excision or biopsy were unlikely to be carried out for all benign‐appearing lesions within a representative population sample. Therefore, to reflect what happens in reality, we accepted clinical follow‐up of benign‐appearing lesions as an eligible reference standard, while recognising the risk of differential verification bias (as misclassification rates of histopathology and follow‐up will differ). Additional eligible reference standards included cancer registry follow‐up and 'expert opinion' with no histology or clinical follow‐up.

All of the above were considered eligible reference standards for establishing lesion final diagnoses (diagnostic accuracy) with the following caveats:

  • all study participants with a final diagnosis of the target disorder must have had a histological diagnosis, either subsequent to the application of the index test or after a period of clinical follow‐up, and

  • at least 50% of all participants with benign lesions must have had either a histological diagnosis or clinical follow‐up to confirm benignity.

To establish referral accuracy of teledermatology (i.e. the ability of the remote observer to approximate an in‐person diagnosis), the action recommended by the remote observer was compared with an in‐person 'expert opinion' reference standard (i.e. the diagnosis or management recommendation of an appropriately qualified clinician made face‐to‐face with the study participant).

Search methods for identification of studies

Electronic searches

The Information Specialist (SB) carried out a comprehensive search for published and unpublished studies. A single large literature search was conducted to cover all topics in the programme grant (see Appendix 1 for a summary of reviews included in the programme grant). This allowed for the screening of search results for potentially relevant papers for all reviews at the same time. A search combining disease related terms with terms related to the test names, using both text words and subject headings was formulated. The search strategy was designed to capture studies evaluating tests for the diagnosis or staging of skin cancer. As the majority of records were related to the searches for tests for staging of disease, a filter using terms related to cancer staging and to accuracy indices was applied to the staging test search, to try to eliminate irrelevant studies, for example, those using imaging tests to assess treatment effectiveness. A sample of 300 records that would be missed by applying this filter was screened and the filter adjusted to include potentially relevant studies. When piloted on MEDLINE, inclusion of the filter for the staging tests reduced the overall numbers by around 6000. The final search strategy, incorporating the filter, was subsequently applied to all bibliographic databases as listed below (Appendix 5). The final search result was cross‐checked against the list of studies included in five systematic reviews; our search identified all but one of the studies, and this study was not indexed on MEDLINE. The Information Specialist devised the search strategy, with input from the Information Specialist from Cochrane Skin. No additional limits were used.

We searched the following bibliographic databases to 29 August 2016 for relevant published studies:

  • MEDLINE via Ovid (from 1946);

  • MEDLINE In‐Process & Other Non‐Indexed Citations via Ovid;

  • Embase via Ovid (from 1980).

We searched the following bibliographic databases to 30 August 2016 for relevant published studies:

  • the Cochrane Central Register of Controlled Trials (CENTRAL) 2016, Issue 7, in the Cochrane Library;

  • the Cochrane Database of Systematic Reviews (CDSR) 2016, Issue 8, in the Cochrane Library;

  • Cochrane Database of Abstracts of Reviews of Effects (DARE) 2015, Issue 2;

  • CRD HTA (Health Technology Assessment) database 2016, Issue 3; and

  • CINAHL (Cumulative Index to Nursing and Allied Health Literature via EBSCO from 1960).

We searched the following databases for relevant unpublished studies using a strategy based on the MEDLINE search:

  • CPCI (Conference Proceedings Citation Index), via Web of Science™ (from 1990; searched 28 August 2016); and

  • SCI Science Citation Index Expanded™ via Web of Science™ (from 1900, using the 'Proceedings and Meetings Abstracts' Limit function; searched 29 August 2016).

We searched the following trials registers using the search terms 'melanoma', 'squamous cell', 'basal cell' and 'skin cancer' combined with 'diagnosis':

We aimed to identify all relevant studies regardless of language or publication status (published, unpublished, in press, or in progress). We applied no date limits.

Searching other resources

We screened relevant systematic reviews identified by the searches for their included primary studies, and included any missed by our searches. We checked the reference lists of all included papers, and subject experts within the author team reviewed the final list of included studies. There was no electronic citation searching.

Data collection and analysis

Selection of studies

At least one review author (JDi or NC) screened titles and abstracts, and discussed and resolved any queries by consensus. A pilot screen of 539 MEDLINE references showed good agreement (89% with a kappa of 0.77) between screeners. Primary test accuracy studies and test accuracy reviews (for scanning of reference lists) of any test used to investigate suspected melanoma, BCC or cSCC were included at initial screening. Both a clinical reviewer (from one of a team of 12 clinician reviewers) and a methodologist reviewer (JDi or NC) independently applied inclusion criteria to all full‐text articles (Appendix 6), and resolved disagreements by consensus or by a third party (JDe, CD, HW and RM). We contacted authors of eligible studies when insufficient data were presented to allow for the construction of 2×2 contingency tables.

Data extraction and management

One clinical (as detailed above) and one methodological reviewer (JDi, NC or LFR) independently extracted data concerning details of the study design, participants, index test(s) or test combinations, and criteria for index test positivity, reference standards and data required to complete a 2×2 diagnostic contingency table for each index test using a piloted data extraction form. Data were extracted at all available index test thresholds. We resolved disagreements by consensus or by a third party (JDe, CD, HW, and RM). We entered data into Review Manager 5 (Review Manager 2014).

We contacted authors of included studies where information related to the target condition (in particular to allow the differentiation of invasive cancers from 'in situ' variants) or there were missing diagnostic threshold. We contacted authors of conference abstracts published from 2013 to 2015 to ask whether full data were available. If there was no full paper, we marked conference abstracts as 'pending' and will revisit them in a future review update.

Dealing with multiple publications and companion papers

Where we identified multiple reports of a primary study, we maximised yield of information by collating all available data. Where there were inconsistencies in reporting or overlapping study populations, we contacted study authors for clarification in the first instance. If this contact with authors was unsuccessful, we used the most complete and up‐to‐date data source where possible.

Assessment of methodological quality

We assessed risk of bias and applicability of included studies using the QUADAS‐2 checklist (Whiting 2011), tailored to the review topic (see Appendix 7). We piloted the modified QUADAS‐2 tool on five included full‐text articles. One clinical reviewer (as detailed above) and one methodological reviewer (JDi, NC or LFR) independently assessed quality for the remaining studies; we resolved disagreements by consensus or by a third party (JDe, CD, HW and RM).

Statistical analysis and data synthesis

Our unit of analysis was the lesion rather than the participant. This is because firstly, in skin cancer initial treatment is directed to the lesion rather than systemically (thus it is important to be able to correctly identify cancerous lesions for each person), and secondly, it is the most common way in which the primary studies reported data. Although there is a theoretical possibility of correlations of test errors when the same people contribute data for multiple lesions, most studies included very few people with multiple lesions and any potential impact on findings was likely to be very small, particularly in comparison with other concerns regarding risk of bias and applicability. For each analysis, we included only one dataset per study to avoid multiple counting of lesions.

For the diagnosis of melanoma, any BCCs or invasive cSCCs that were correctly identified by teledermatology but that were identified as melanomas in the 'disease‐negative' group were considered as true‐negative test results rather than as false positives, on the basis that excision of such lesions would be a positive outcome for the participants concerned. However, for the diagnosis of BCC, we considered any melanomas or cSCCs that were mistaken for BCCs as false‐positive results. This decision was taken on the basis that the clinical management of a lesion considered to be a BCC might be quite different to that for a melanoma or cSCC and could potentially lead to a negative outcome for the participants concerned, for example if a treatment other than excision was initiated.

For preliminary investigations of the data, we plotted estimates of sensitivity and specificity on coupled forest plots and in receiver operating characteristic (ROC) space for each index test, target condition and reference standard combination. When meta‐analysis was possible and there were at least four studies, we used a bivariate model to obtain summary estimates of sensitivity and specificity (Chu 2006; Reitsma 2005). When there were fewer than four studies and little or no heterogeneity in ROC space, we pooled sensitivity and specificity using fixed‐effect logistic regression (Takwoingi 2017). We included data for face‐to‐face diagnosis only if reported in comparison to teledermatology diagnosis. Using these direct (head‐to‐head) comparisons, a comparative meta‐analysis to compare the accuracy of teledermatology with face‐to‐face diagnosis was not possible because there were too few studies. However, we tabulated results from the studies and estimated differences in sensitivity and specificity. Since these comparative studies did not report the cross‐classified results of the two index tests in participants with and without a particular form of skin cancer, we were unable to compute CIs for the differences using methods that accounted for the paired nature of the data. Therefore, we assumed independence between the sensitivities and between the specificities of the two tests, and calculated 95% CIs for the differences using the Newcombe‐Wilson method without continuity correction (Newcombe 1998). We performed analyses using Stata version 15 (Stata 2017).

Investigations of heterogeneity

We examined heterogeneity by visually inspecting forest plots of sensitivity and specificity, and summary ROC (SROC) plots. We were unable to perform meta‐regression to investigate potential sources of heterogeneity due to insufficient numbers of studies.

Sensitivity analyses

There were too few data to perform sensitivity analyses.

Assessment of reporting bias

Because of uncertainty about the determinants of publication bias for diagnostic accuracy studies and the inadequacy of tests for detecting funnel plot asymmetry (Deeks 2005), we did not perform tests to detect publication bias.

Results

Results of the search

The search identified 34,517 unique references and screened them for inclusion after reading the title and abstract. Of these, we reviewed 1051 full‐text papers for eligibility for any one of the suite of reviews of tests to assist in the diagnosis of melanoma or keratinocyte skin cancer; 203 publications were included in at least one review in our series and 848 publications were excluded (see Figure 3; PRISMA flow diagram of search and eligibility results).


PRISMA flow diagram.

PRISMA flow diagram.

Of the 125 studies tagged as potentially eligible for this review of teledermatology, we included 22 publications. Exclusions from the review were primarily due to: lack of test accuracy data to complete a 2×2 contingency table (31 studies), ineligible populations (38 studies) or target conditions (19 studies), not accuracy studies (23 studies) and inadequate sample size (fewer than five cases of skin cancer, fewer than five benign lesions, or for referral accuracy, fewer than five 'positive' cases identified by the expert diagnosis reference standard) (seven studies) or reference standards (11 studies). Of the 31 studies for which a 2×2 table could not be constructed, 11 reported only agreement between observers or between teledermatology and the reference standard and 14 had at least one piece of missing data. A list of the 103 studies with reasons for exclusion is provided in the Characteristics of excluded studies table, with a list of all studies excluded from the full series of reviews available as a separate pdf (please contact skin.cochrane.org for a copy of the pdf).

We contacted the corresponding authors of 12 studies and asked them to supply further information to allow study inclusion or to clarify diagnostic thresholds or target condition definition. We received responses from four authors allowing inclusion of all four studies in this review (Borve 2015;Mahendran 2005; Warshaw 2010b; Wolf 2013). One of the four authors was unable to provide all of the information requested such that the data presented in the paper could only be partially included in this review (Warshaw 2010b).

The 22 included study publications reported on 22 cohorts of lesions and provided 96 datasets (individual 2×2 contingency tables). Sixteen studies (73%) including 4057 lesions and 879 malignant cases reported data for the diagnostic accuracy of teledermatology (Arzberger 2016; Borve 2015; Bowns 2006; Congalton 2015; Coras 2003; Ferrara 2004; Grimaldi 2009; Jolliffe 2001a; Kroemer 2011; Massone 2014; Moreno Ramirez 2005; Piccolo 2000; Piccolo 2004; Silveira 2014; Warshaw 2010b; Wolf 2013), five of which also reported data for the diagnostic accuracy of expert face‐to‐face clinical diagnosis (Coras 2003; Jolliffe 2001a; Kroemer 2011; Piccolo 2000; Warshaw 2010b). Six studies (27%) including 1449 lesions reported data for the referral accuracy of teledermatology (i.e. teledermatology diagnosis or management action as the index test versus expert face‐to‐face diagnosis or management action as the reference standard) (Jolliffe 2001b; Mahendran 2005; Manahan 2015; Oliveira 2002; Phillips 1998; Shapiro 2004); these studies included 270 'positive' cases as determined by the reference standard face‐to‐face decision. A cross‐tabulation of studies by reported comparisons, target conditions and types of image used is provided in Table 1 and summary study details is presented in Appendix 8.

Open in table viewer
Table 1. Cross‐tabulation of included studies against target condition assessed, by type of images used for teledermatology

Studies of diagnostic accuracy –TD vs histology

FTF dxa

vs histology

TD

vs expert FTF

Study

Any skin cancer

Melanomab

BCC

cSCC

Action

Arzberger 2016

Photo/Derm (excise or not)

Borve 2015

Photo/Derm (dx as malignant/possibly malignant)

Bowns 2006

Photo/Derm (dx as MM/MiS)

Photo/Derm (dx as BCC)

Congalton 2015

Photo/Derm (excise or not)

Coras 2003

Photo/Dermc (dx as MM/MiS)

dx as MM

Ferrara 2004

Derm only (dx as MM/MiS)

Grimaldi 2009

Photo/Dermc (excise or not)

(GP dx as suspicious for malignancy; data not included)

Jolliffe 2001a

Photo onlyc (dx as any SC)

< 5 MM

Photo onlyc (dx as BCC)

dx as any SC

dx as BCC

Kroemer 2011

Photo onlyc

Derm onlyc (dx as malignant)

Photo onlyc

Derm onlyc (dx as MM/MiS)

Photo onlyc

Derm onlyc (dx as BCC)

Photo onlyc

Derm onlyc (dx as cSCC)

dx as malignant

dx as MM/MiS

dx as BCC

dx as cSCC

Massone 2014

Photo/Derm (dx as malignant)

Moreno Ramirez 2005

Photol only (dx as malignant)

Photo only (dx as MM/MiS)

Photo only (dx as BCC)

Piccolo 2000

Photo/Dermc (dx as MM)

dx as MM

Piccolo 2004

Derm only (dx as MM)

Silveira 2014

Photo only (dx as malignant)

Warshaw 2010b

Photo onlyc (dx as MM/MiS)

dx of MM/MiS

Wolf 2013

Photo only (dx as atypical)

Studies of referral accuracy – TD vs expert FTF decision

Jolliffe 2001b

Clin only (refer or not)

Mahendran 2005

Clin only (excise or not)

Manahan 2015

Clin/Derm (see FTF)

Oliveira 2002

Clin only (dx as malignant)

Phillips 1998

Live link

(dx as any SC;

definitely/probably malignant/

excise or not)

Shapiro 2004

Clin only (excise or not)

BCC: basal cell carcinoma; clin: clinical; cSCC: cutaneous squamous cell carcinoma; Derm: dermoscopic images; dx: diagnosis; FTF: face‐to‐face; GP: general practitioner; MiS: melanoma in situ (atypical intraepidermal melanocytic variants); MM: invasive melanoma; Photo: photographic images; SC: skin cancer; TD: teledermatology.

aFace‐to‐face diagnosis by an expert/dermatologist unless otherwise stated.

bAll include melanoma in situ as disease positive apart from Piccolo 2000 and Piccolo 2004, which reported detection of invasive melanoma only.

cIncluded a direct comparison with expert face‐to‐face diagnosis versus histology.

Studies were primarily prospective case series (18 studies; 82%), with two retrospective case series (9%) (Moreno Ramirez 2005; Piccolo 2004), and two case‐control studies (9%) (Ferrara 2004; Wolf 2013), three of which retrospectively selected previously acquired images for prospective evaluation in the study (Ferrara 2004; Piccolo 2004; Wolf 2013). Studies were conducted in: Europe (14 (64%) studies), including five studies from Austria (Arzberger 2016; Kroemer 2011; Massone 2014; Piccolo 2000; Piccolo 2004), and four from the UK (Bowns 2006; Jolliffe 2001a; Jolliffe 2001b; Mahendran 2005); North America (four (18%) studies; Phillips 1998; Shapiro 2004; Warshaw 2010b; Wolf 2013); or South America (two studies; 9%; Oliveira 2002; Shapiro 2004); or in Australia (Manahan 2015); or New Zealand (Congalton 2015) (9%). Eight (36.4%) studies included only pigmented (Coras 2003; Grimaldi 2009; Jolliffe 2001a; Moreno Ramirez 2005; Piccolo 2000; Wolf 2013) or melanocytic (Ferrara 2004; Piccolo 2004) lesions (Piccolo 2004 restricting to acral lesions only); the remainder included any suspicious lesion.

Ten studies were based in primary care or community‐based settings. Seven studies acquired lesion images in primary care (Borve 2015; Grimaldi 2009; Mahendran 2005; Massone 2014; Moreno Ramirez 2005; Oliveira 2002; Shapiro 2004), two studies were in a community skin cancer screening outreach programme with image acquisition for 'remote' assessment by specialists in a secondary care setting (Silveira 2014) or using live transmission video‐conferencing (Phillips 1998), and one study recruited participants at high risk of melanoma who took images of their own lesions that they 'did not like the look of' using a smartphone (Manahan 2015). There were no studies conducted in high street pharmacy‐type settings. Five of the 10 studies reported the diagnostic accuracy of the teledermatology image‐based assessment (Borve 2015; Grimaldi 2009; Massone 2014; Moreno Ramirez 2005; Silveira 2014), and five examined referral accuracy, comparing the teledermatology assessment against specialist in‐person assessment (Mahendran 2005; Manahan 2015;Oliveira 2002; Phillips 1998; Shapiro 2004). Of these studies, nine were prospective in design and one was retrospective (Moreno Ramirez 2005).

Twelve studies acquired lesion images at a secondary care dermatology or pigmented lesion clinic (Arzberger 2016; Congalton 2015; Ferrara 2004; Jolliffe 2001a; Jolliffe 2001b; Kroemer 2011; Piccolo 2000; Piccolo 2004; Warshaw 2010b; Wolf 2013), from a private dermatology practice (Coras 2003), or in a medical imaging unit (Bowns 2006), at the time of the patient consultation, and a second dermatologist made remote image‐based diagnoses. Of these studies, nine were prospective in design, two studies retrieved routinely collected lesion images for prospective 'teledermatology' examination (Piccolo 2004;Wolf 2013), and one did not clearly report how lesion images were acquired (Ferrara 2004). Six studies reported data only for the diagnostic accuracy of specialist image‐based assessment (Arzberger 2016; Bowns 2006; Congalton 2015; Ferrara 2004; Piccolo 2004; Wolf 2013), five compared the diagnostic accuracy of image‐based diagnosis to that of a dermatologist's face‐to‐face diagnosis (Coras 2003; Jolliffe 2001a; Kroemer 2011; Piccolo 2000; Warshaw 2010b), and one examined referral accuracy, comparing specialist image‐based assessment against specialist in‐person assessment (Jolliffe 2001b).

Ten evaluations reported data using photographic images for teledermatology consultations (Jolliffe 2001a; Jolliffe 2001b; Kroemer 2011; Mahendran 2005; Moreno Ramirez 2005; Oliveira 2002; Shapiro 2004; Silveira 2014; Warshaw 2010b; Wolf 2013), nine for a combination of clinical and dermoscopic images (Arzberger 2016; Borve 2015; Bowns 2006; Congalton 2015; Coras 2003; Grimaldi 2009; Manahan 2015; Massone 2014; Piccolo 2000), and three reported data for diagnosis using dermoscopic images only (Ferrara 2004; Kroemer 2011; Piccolo 2004). The final study reported referral accuracy data for live‐link teledermatology using three cameras: a full‐body camera, a lens for close‐up views and a magnifying lens to allow magnified examination and examination with polarised light (Phillips 1998). The Warshaw 2010b paper also reported accuracy data for diagnosis using a combination of photographic and dermoscopic images; however, we were unable to obtain underlying 2×2 data to allow the inclusion of this aspect of the study.

Images for store and forward teledermatology were obtained using a mobile phone camera alone (Kroemer 2011) or coupled with a dermatoscope (Borve 2015; Kroemer 2011; Manahan 2015); using still images from a video camera (Jolliffe 2001a; Jolliffe 2001b); using a combination of film and digital images (Ferrara 2004); or using a digital camera (13 studies) to acquire photographs (Mahendran 2005; Moreno Ramirez 2005; Oliveira 2002; Shapiro 2004; Silveira 2014; Warshaw 2010b; Wolf 2013), dermoscopic images (Ferrara 2004; Piccolo 2004), or both (Arzberger 2016; Congalton 2015; Coras 2003; Grimaldi 2009; Massone 2014; Piccolo 2000). Bowns 2006 acquired photographic and dermoscopic images in a medical photography unit (equipment not described). See the Characteristics of included studies table for details of the digital cameras and mobile phones used. Eighteen (82%) studies provided observers with additional clinical information along with the digital image of the lesion, three did not clearly report whether additional clinical information was provided (Arzberger 2016; Grimaldi 2009; Manahan 2015), and one did not provide any further participant‐related information (Wolf 2013). The remote observers were dermatologists in 19 (86%) studies, oncologists in one, and a mixture of dermatologists and other healthcare professionals (e.g. oncologists, plastic surgeons) in two (9%).

The five studies providing a direct comparison of teledermatology with in‐person evaluation by a dermatologist based diagnosis on visual inspection of the lesion with dermoscopy used for some (proportions not reported) (Coras 2003; Jolliffe 2001a; Kroemer 2011; Warshaw 2010b), or for all lesions (Piccolo 2000). One study reported using pattern analysis (Coras 2003), and the remaining three (50%) studies did not specify any algorithm used to aid diagnosis. The direct comparison of teledermatology with in‐person evaluation by a GP provided in Grimaldi 2009 was not included in this review as it is not directly relevant to teledermatology in this context.

For the 16 studies reporting diagnostic accuracy,11 (69%) used histology alone as the reference standard (Arzberger 2016; Congalton 2015; Coras 2003; Ferrara 2004; Jolliffe 2001a; Kroemer 2011; Piccolo 2000; Piccolo 2004; Silveira 2014; Warshaw 2010b; Wolf 2013), three (19%) used both histology and expert opinion for some benign lesions (Borve 2015; Bowns 2006; Massone 2014), and two (12%) used histology and follow‐up of clinically benign‐appearing lesions (Grimaldi 2009; Moreno Ramirez 2005). The median number of study participants was 77 (interquartile range (IQR) 4p to 182) (reported in 10 studies), and median number of lesions was 116 (IQR 45 to 240). The median prevalence of skin cancer was 30% (IQR 21% to 45%) and median percentage of men was 46% (IQR 35% to 47%) (reported in nine studies). Where reported (eight studies), the median age was 52 years (IQR 43 to 65) (reported in eight studies). Four of the 16 diagnostic accuracy studies did not report the range in age of included participants (Bowns 2006; Coras 2003; Grimaldi 2009; Wolf 2013), four restricted inclusion to adults only (Borve 2015; Massone 2014; Silveira 2014; Warshaw 2010b), and eight included participants under the age of 16 years (Arzberger 2016; Congalton 2015; Ferrara 2004; Jolliffe 2001a; Kroemer 2011; Moreno Ramirez 2005; Piccolo 2002; Piccolo 2004); however, data were not presented to allow data to be extracted excluding children.

For the six studies reporting referral accuracy, the same dermatologist undertaking the teledermatology assessment made the face‐to‐face reference standard diagnosis, either within one to two weeks (Mahendran 2005; Oliveira 2002), or with a gap of several months between assessments (Jolliffe 2001b), by a dermatology trainee under the supervision of the teledermatologist (Manahan 2015), or a different dermatologist (Phillips 1998; Shapiro 2004). The median number of study participants was 50 (IQR 49 to 72) (reported in all six studies), and median number of lesions was 107 (IQR 94 to 253). The reported decisions of the face‐to‐face expert included diagnosis of malignancy (Oliveira 2002; Phillips 1998), decision to excise (Mahendran 2005; Phillips 1998; Shapiro 2004), or decision to see face‐to‐face (Jolliffe 2001b; Manahan 2015); the median overall percentage of 'positive' expert diagnoses was 15% (IQR 10% to 41%). One study reported the median age of included participants was 46.7 years (Phillips 1998); two studies reported age ranges of 8 to 94 years (Jolliffe 2001b), and 50 to 64 years (Manahan 2015); and four did not report the range in ages (Mahendran 2005; Oliveira 2002; Phillips 1998; Shapiro 2004). The percentage of men ranged from 15.7% (Phillips 1998) to 49% (Manahan 2015) (reported in three studies).

Methodological quality of included studies

The overall methodological quality of included studies is summarised in Figure 4 and Figure 5. At least half of studies were at high or unclear risk of bias for participant selection, reference standard, and flow and timing domains, while the majority were at low risk for the index test. The applicability of study findings were of high or unclear concern for the majority of studies in all domains assessed.


Risk of bias and applicability concerns graph: review authors' judgements about each domain presented as percentages across included studies.

Risk of bias and applicability concerns graph: review authors' judgements about each domain presented as percentages across included studies.


Risk of bias and applicability concerns summary: review authors' judgements about each domain for each included study.

Risk of bias and applicability concerns summary: review authors' judgements about each domain for each included study.

For participant selection, seven (32%) studies were at low risk of bias (Borve 2015; Bowns 2006; Grimaldi 2009; Jolliffe 2001b; Mahendran 2005; Oliveira 2002; Warshaw 2010b), and five (23%) studies were at high risk (Congalton 2015; Ferrara 2004; Massone 2014; Moreno Ramirez 2005; Wolf 2013). Four studies applied inappropriate participant exclusions (such as excluding poor‐quality images, or difficult to diagnose lesions) (Congalton 2015; Massone 2014; Moreno Ramirez 2005; Wolf 2013); and two studies used a case‐control type study design with separate selection of malignant cases and lesions with benign diagnosis (Ferrara 2004; Wolf 2013). Fourteen studies were at high concern for applicability of participants due to recruitment from secondary care or specialist clinics rather than from the primary setting in which teledermatology is more likely to be used (12/22; Arzberger 2016; Bowns 2006; Congalton 2015; Coras 2003; Jolliffe 2001a; Jolliffe 2001b; Kroemer 2011; Moreno Ramirez 2005; Piccolo 2000; Piccolo 2004; Warshaw 2010b; Wolf 2013), and/or due to inclusion of multiple lesions per participant (6/22; Congalton 2015; Grimaldi 2009; Jolliffe 2001b; Kroemer 2011; Manahan 2015; Warshaw 2010b). Two studies were at low concern for participant applicability (Oliveira 2002; Phillips 1998).

For the index test, all of the teledermatology assessments were at low risk of bias for the index test apart from Phillips 1998, in which it was unclear whether the teledermatology diagnoses were made blinded to the decision to the face‐to‐face clinician (reference standard). One study was judged of low concern for applicability of teledermatology (Massone 2014), and 12 (45%) studies were of high concern (Arzberger 2016; Bowns 2006; Congalton 2015; Coras 2003; Ferrara 2004; Jolliffe 2001a; Jolliffe 2001b; Kroemer 2011; Piccolo 2000; Piccolo 2004; Warshaw 2010b; Wolf 2013), due to images acquired by dermatologists based in secondary care settings or from pigmented lesion clinic databases rather than images acquired in primary care. For the five studies reporting direct comparisons of teledermatology with face‐to‐face expert clinical diagnoses, two (40%) were at low risk of bias (Coras 2003; Warshaw 2010b) and three (60%) were judged unclear, as the thresholds used were not clearly prespecified (Jolliffe 2001a; Kroemer 2011; Piccolo 2000). For the comparison between tests, there was no blinding between store‐and‐forward teledermatology and face‐to‐face clinical diagnosis in Jolliffe 2001a; the remaining four comparative studies did not describe any blinding (Coras 2003; Kroemer 2011; Piccolo 2000; Warshaw 2010b). One (20%) study was of low concern for applicability of the face‐to face diagnosis (Coras 2003), and four of unclear concern of applicability as the thresholds used for diagnosis were not clearly reported to allow replication of methods (Jolliffe 2001a; Kroemer 2011; Piccolo 2000; Warshaw 2010b).

For the reference standard domain, 10 (45%) studies were at low risk of bias (Arzberger 2016; Congalton 2015; Coras 2003; Ferrara 2004; Jolliffe 2001a; Piccolo 2000; Piccolo 2004; Silveira 2014; Warshaw 2010b; Wolf 2013), and 12 (55%) were at high risk either because they were referral accuracy studies using only expert face‐to‐face diagnosis as the reference standard (Jolliffe 2001b; Mahendran 2005; Manahan 2015; Oliveira 2002; Phillips 1998; Shapiro 2004), or because they were diagnostic accuracy studies that did not meet our criteria for an adequate reference standard (i.e. greater than 80% of lesions with histology and up to 20% with clinical follow‐up; see Appendix 7) (Borve 2015; Bowns 2006; Grimaldi 2009; Kroemer 2011; Massone 2014; Moreno Ramirez 2005). Only two studies clearly reported blinding of the reference standard diagnosis to the teledermatology assessment (Phillips 1998; Shapiro 2004), and two did not implement any blinding (Mahendran 2005; Oliveira 2002); these were all referral accuracy studies. Six studies were of low concern for the applicability of the reference standard, including three of the 16 diagnostic accuracy studies that clearly reported the level of experience of the histopathologist (Ferrara 2004; Piccolo 2004; Warshaw 2010b), and three of the six referral accuracy studies that reported the expertise of the face‐to‐face reference standard diagnosis (Jolliffe 2001b; Mahendran 2005; Shapiro 2004).

For flow and timing of participants, two referral accuracy studies were at low risk of bias (Jolliffe 2001b; Phillips 1998), and 17 (77%) were at high risk of bias; either because they did not use the same reference standard for all participants (Borve 2015; Bowns 2006; Grimaldi 2009; Massone 2014; Moreno Ramirez 2005), or they did not include all study participants in the final analysis (Arzberger 2016; Congalton 2015; Coras 2003; Jolliffe 2001a; Kroemer 2011; Mahendran 2005; Manahan 2015; Massone 2014; Moreno Ramirez 2005; Oliveira 2002; Shapiro 2004; Silveira 2014; Warshaw 2010b; Wolf 2013). Fourteen (64%) studies did not clearly report the interval between reference standard and index test (Arzberger 2016; Congalton 2015; Coras 2003; Ferrara 2004; Grimaldi 2009; Jolliffe 2001a; Kroemer 2011; Manahan 2015; Massone 2014; Moreno Ramirez 2005; Piccolo 2000; Piccolo 2004; Warshaw 2010b; Wolf 2013).

Findings

Study results are summarised below according to target condition with results of meta‐analyses in Table 2 with summary details in Appendix 8. Lack of data and between‐study variation in populations, approaches to teledermatology and target conditions considered, limited the pooled analyses that could be undertaken.

Open in table viewer
Table 2. Summary estimates of sensitivity and specificity for teledermatology

Index test,a target condition

Studies

Cases/number of participants

Summary sensitivity (95% CI) %

Summary specificity (95% CI) %

Teledermatology photographic image, any

4

452/717

94.9

(90.1 to 97.4)

84.3

(48.5 to 96.8)

Teledermatology photographic/dermoscopic image, MM + MiS

4

93/664

85.4

(68.3 to 94.1)

91.6

(81.1 to 96.5)

Teledermatology photographic image vs histology, BCC

3

62/301

93.5

(84.0 to 97.6)

95.8

(92.4 to 97.7)

BCC: basal cell carcinoma; CI: confidence interval; MM + MiS: invasive melanoma and atypical intraepidermal melanocytic variants.

aReference standard was histology for all comparisons.

1. Target condition: detection of any skin cancer

Seven studies with 1588 lesions and 638 cases of skin cancer reported the diagnostic accuracy of teledermatology assessment for the detection of the primary target condition of any skin cancer. Forest plots of study data are provided in Figure 6 with results of meta‐analysis in Figure 7.


Forest plot of tests for the detection of any skin cancer (any). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Forest plot of tests for the detection of any skin cancer (any). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.


Summary receiver operating characteristic plot of teledermatology using photographic images versus histology for the detection of any skin cancer (any).

Summary receiver operating characteristic plot of teledermatology using photographic images versus histology for the detection of any skin cancer (any).

Four studies compared diagnosis based on photographic images to histology (Figure 6). In three studies, sensitivities for the correct diagnosis of malignancy ranged from 93% (95% CI 77% to 99%) to 100% (95% CI 75% to 100%) and specificities from 88% (95% CI 77% to 95%) to 96% (95% CI 82% to 100%) (Jolliffe 2001a; Kroemer 2011; Moreno Ramirez 2005). All three studies reported a cross‐tabulation of lesion final diagnoses against the diagnosis on teledermatology such that data could be extracted for the detection of any malignancy, regardless of any misclassification of one skin cancer for another (e.g. a BCC diagnosed as a melanoma or vice versa). Similarly there was high sensitivity in Silveira 2014 (96%, 95% CI 94% to 98%) but specificity was outlying at 25% (95% CI 14% to 39%); data here were presented for the detection of 'malignant' versus benign cases with no breakdown of individual lesion diagnoses given. The low specificity in Silveira 2014 result was likely due to the recruitment of participants with lesions deemed to be highly clinically suspicious after visual inspection of the lesion during a community screening programme; the prevalence of malignancy was 87.5% with only 52 benign cases in a sample of 416 lesions.

The pooled result across the four studies indicated a summary sensitivity of 94.9% (95% CI 90.1% to 97.4%) and summary specificity of 84.3% (95% CI 48.5% to 96.8%) based on 717 lesions and 452 cases of skin cancer (Table 2; Figure 7).

One study also reported diagnosis based on dermoscopic images only versus histology (Kroemer 2011). Sensitivity was lower than for assessment using photographic images (85%, 95% CI 71% to 94% with histology versus 93%, 95% CI 82% to 99% with dermoscopic images) with similar specificities (Figure 6).

Three studies with 928 lesions and 215 cases of skin cancer compared image‐based diagnosis based on both clinical and dermoscopic images to histology (Figure 6). Each study used a slightly different threshold to decide test positivity: the correct classification of lesions as malignant versus benign (Massone 2014), lesions considered malignant or possibly malignant (Borve 2015), or lesion recommended for excision (Arzberger 2016). Sensitivities were 100% in all three studies. Specificities ranged from 25% (95% CI 5% to 57%) (Arzberger 2016) to 92% (95% CI 74% to 99%) (Massone 2014). Massone 2014 was a primary care‐based study for which accuracy data could only be estimated for a subgroup of 32 of the original 962 lesions. Arzberger 2016 included participants at particularly high risk for melanoma, while Borve 2015 included large proportions of seborrhoeic keratosis (19%) and actinic keratosis (9%) in the disease‐negative group. These factors may have made differentiating a malignant case from a benign one more challenging.

We undertook no statistical pooling for these studies due to variation in threshold and heterogeneity in specificities.

Two studies compared remote image‐based assessment with in‐person diagnosis by a dermatologist for the detection of any skin cancer using the same diagnostic thresholds as for the teledermatology decision (Jolliffe 2001a; Kroemer 2011). Jolliffe 2001a reported 100% (95% CI 75% to 100%) sensitivity for both assessments, while Kroemer 2011 reported 100% (95% CI 92% to 100%) sensitivity for in‐person diagnosis compared to 93% (95% CI 82% to 99%) for diagnosis using photographic images and 85% (95% CI 71% to 94%) for dermoscopic image‐based assessment (Table 3; Figure 6). Both studies reported only marginal differences in specificity between approaches.

Open in table viewer
Table 3. Direct comparisons of teledermatology with face‐to‐face diagnosis of melanoma and other types of skin cancer

Study

Sensitivity (true positives/cases) %

Difference (95% CI)

Specificity (true negatives/non cases) %

Difference (95% CI)

Any skin cancer

Teledermatology photographic image

Expert face‐to‐face

Teledermatology

photographic image

Expert face‐to‐face

Jolliffe 2001a

100 (13/13)

100 (13/13)

0.00

(‐2.28 to 2.28)

92.9 (118/127)

94.7 (124/131)

‐1.74

(‐8.18 to 4.49)

Kroemer 2011

93 (43/46)

100 (46/46)

‐0.06

(‐0.17 to 0.02)

88 (51/58)

90 (52/58)

‐0.02

(‐0.14 to 0.10)

Teledermatology dermoscopic image

Expert face‐to‐face

Teledermatology dermoscopic image

Expert face‐to‐face

Kroemer 2011

85 (39/46)

100 (46/46)

‐0.15

(‐0.28 to ‐0.04)

91 (53/58)

90 (52/58)

0.02

(‐0.13 to 0.09)

Invasive melanoma or atypical intraepidermal melanocytic variants

Teledermatology photographic + dermoscopic image

Expert face‐to‐face

Teledermatology photographic + dermoscopic image

Expert face‐to‐face

Coras 2003

81 (13/16)

88 (14/16)

‐0.06

(‐0.32 to 0.20)

93 (27/29)

93 (27/29)

0.00

(‐0.16 to 0.16)

Teledermatology dermoscopic image

Expert face‐to‐face

Teledermatology dermoscopic image

Expert face‐to‐face

Kroemer 2011

100 (5/5)

100 (5/5)

0.00

(‐0.43 to 0.43)

98 (97/99)

99 (98/99)

‐0.01

(‐0.06 to 0.04)

Warshaw 2010b

59 (24/41)

73 (30/41)

‐0.15

(‐0.33 to 0.06)

41 (604/1473)

63 (930/1473)

‐0.22

(‐0.26 to ‐0.19)a

Invasive melanoma

Teledermatology photographic/dermoscopic image

Expert face‐to‐face

Teledermatology photographic/dermoscopic image

Expert face‐to‐face

Piccolo 2000

81.8 (9/11)

72.7 (8/11)

9.10

(‐25.2 to 41.2)

100 (32/32)

96.9 (31/32)

3.13

(‐7.90 to 15.7)

Basal cell carcinoma

Teledermatology photographic image

Expert face‐to‐face

Teledermatology photographic image

Expert face‐to‐face

Jolliffe 2001a

100 (9/9)

100 (9/9)

0.00

(‐29.9 to 29.9)

97.0 (127/131)

97.8 (132/135)

‐0.83

(‐5.60 to 3.68)

CI: confidence interval.

adenotes statistically significant difference.

2. Target condition: invasive melanoma or atypical intraepidermal melanocytic variants

Nine studies with 2510 lesions and 206 melanoma reported the diagnostic accuracy of teledermatology assessment for the detection of invasive cutaneous melanoma or atypical intraepidermal variants (Figure 8), only two of which also reported data for the detection of any skin cancer (Table 2).


Forest plot of tests for the detection of invasive melanoma or atypical intraepidermal melanocytic variants. CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.

Forest plot of tests for the detection of invasive melanoma or atypical intraepidermal melanocytic variants. CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.

In four studies with 1834 lesions and 106 melanomas comparing diagnosis based on photographic images to histology, sensitivities ranged from 59% (95% CI 42% to 74%) to 100% (95% CI 48% to 100%) and specificities from 30% (95% CI 22% to 40%) to 100% (95% CI 93% to 100%) (Kroemer 2011; Moreno Ramirez 2005; Warshaw 2010b; Wolf 2013). In three studies, the data extracted were for the correct diagnosis of melanoma whereas in Wolf 2013, lesions were classified as 'atypical' or 'typical' as opposed to reporting the correct diagnosis of melanoma. The difference in the diagnostic decision recorded in Wolf 2013 is likely to account for the low specificity observed.

The relatively low sensitivity and specificity observed in Warshaw 2010b is difficult to explain but may be related to differences in population characteristics between studies. Two studies restricted inclusion to pigmented lesions considered clinically atypical by at least one dermatologist (Wolf 2013), or meeting explicit criteria that might suggest a higher risk for melanoma (Moreno Ramirez 2005). Wolf 2013 selecting specific lesion types and excluding those with an equivocal diagnosis. Kroemer 2011 included participants with any lesion type who were either self‐referred or referred by a local doctor to a general dermatology clinic. Warshaw 2010b also included any lesion type (pigmented or non‐pigmented) from participants who required or requested removal of one or more skin lesions (denoted 'high risk' by the study authors) or from people who were referred by non‐dermatology healthcare providers for specialist assessment (denoted as 'lower risk'); furthermore, lesions in 30 histopathological categories with fewer than 25 lesions were excluded (171 lesions).

Two studies compared dermoscopic image‐based diagnosis to histology (Ferrara 2004; Kroemer 2011). Sensitivities were 71% (95% CI 29% to 96%; Ferrara 2004) and 100% (95% CI 48% to 100%; Kroemer 2011) and specificities were 60% (95% CI 15% to 95%; Ferrara 2004) to 97% (95% CI 91% to 99%; Kroemer 2011). Ferrara 2004 reported sensitivity of 71% (95% CI 29% to 96%) and specificity 60% (95% CI 15% to 95%) for the correct differentiation of seven melanomas from five benign lesions. Kroemer 2011 reported very similar sensitivity and specificity for the correct diagnosis of melanoma using only dermoscopic images to their result using non‐magnified photographic images: sensitivities with both approaches were 100% (95% CI 48% to 100%) while specificities were 98% (95% CI 96% to 100%) using photographic images compared to 97% (95% CI 91% to 99%) for dermoscopic images. Data were not presented for diagnosis using both clinical and dermoscopic images; however, the study reported that "re‐evaluation of both image types in (discordant cases) did not improve the diagnostic accuracy of teleconsultations" (Kroemer 2011).

Four studies with 664 lesions and 93 cases of melanoma or atypical intraepidermal melanocytic variants compared teledermatology based on both clinical and dermoscopic images to histology (Bowns 2006; Congalton 2015; Coras 2003; Grimaldi 2009). Summary estimates of sensitivity were 85.4% (95% CI 68.3% to 94.1%) and specificity 91.6% (95% CI 81.1% to 96.5%) (Table 2; Figure 9).


Summary receiver operating characteristic plot of teledermatology using clinical and dermoscopic images for the detection of invasive melanoma or atypical intraepidermal melanocytic variants.

Summary receiver operating characteristic plot of teledermatology using clinical and dermoscopic images for the detection of invasive melanoma or atypical intraepidermal melanocytic variants.

Across the 10 teledermatology datasets, the number of melanomas missed ranged from 0 (two datasets, both from Kroemer 2011) to 17 (Warshaw 2010b).

Three studies in this group compared teledermatology assessment of images to face‐to‐face diagnosis by a dermatologist (Coras 2003; Kroemer 2011; Warshaw 2010b) (Table 3). In Warshaw 2010b, the diagnostic accuracy of teledermatology diagnosis of melanoma using photographic images and patient history was considerably lower compared to an in‐person dermatologist diagnosis (using visual inspection with or without the use of dermoscopy as determined by the individual clinician); data from the author showed that sensitivity was 15% lower for teledermatology assessment (95% CI ‐33% to 6%) and specificity was 22% lower (95% CI ‐26% to ‐19%) (Table 3). The accuracy of the expert face‐to‐face diagnosis was nevertheless relatively low, with sensitivity of 73% (95% CI 57% to 86%) and specificity of 63% (95% CI 61% to 66%) (Figure 8). The two studies comparing teledermatology diagnosis using both macro and dermoscopic images demonstrated only marginal differences between the two approaches (Coras 2003; Kroemer 2011) (Table 3).

3. Target condition: invasive cutaneous melanoma

Piccolo 2000 with 43 lesions reported the diagnostic accuracy of teledermatology assessment for the detection of invasive melanoma selected for their 'diagnostic difficulty (11 cases of melanoma) and Piccolo 2004 for the differentiation of six acral melanoma from 71 benign acral lesions (Figure 10). In Piccolo 2000, the sensitivity for store‐and‐forward teledermatology assessment was 82% (95% CI 48% to 98%) and specificity was 100% (95% CI 89% to 100%) in comparison to in‐person dermatologist assessment of the same lesions where sensitivity was 73% (95% CI 39% to 94%) and specificity 97% (95% CI 84% to 100%) (one invasive melanoma missed in the face‐to‐face encounter was identified using teledermatology). Similar teledermatology accuracy (based on a consensus' of six out of 11 observers) was obtained for acral lesions in Piccolo 2004 using only dermoscopic images: observed sensitivity was 83% (95% CI 36% to 100%) and specificity was 96% (95% CI 88% to 99%).


Forest plot of tests for the detection of invasive melanoma. CI: confidence interval; FN: false negative; FP: false positive; MM: malignant melanoma; TN: true negative; TP: true positive; VI: visual inspection.

Forest plot of tests for the detection of invasive melanoma. CI: confidence interval; FN: false negative; FP: false positive; MM: malignant melanoma; TN: true negative; TP: true positive; VI: visual inspection.

4. Target condition: BCC

Four studies reported the diagnostic accuracy of teledermatology assessment for the detection BCC (Figure 11; Figure 12).


Forest plot of tests for the detection of basal cell carcinoma (BCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Forest plot of tests for the detection of basal cell carcinoma (BCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.


Receiver operating characteristic plot teledermatology using photographic images alone versus histology for the detection of basal cell carcinoma.

Receiver operating characteristic plot teledermatology using photographic images alone versus histology for the detection of basal cell carcinoma.

Three evaluations of photographic images for 301 lesions with 62 cases of BCC (Jolliffe 2001a; Kroemer 2011; Moreno Ramirez 2005), produced summary estimates of sensitivity of 93.5% (95% CI 84.0% to 97.6%) and specificity 95.8% (95% CI 92.4% to 97.7%) (Table 2; Figure 12). Four BCCs were missed in two studies (Kroemer 2011; Moreno Ramirez 2005).

One study reported lower sensitivity using dermoscopic images (80%, 95% CI 61% to 92%) compared to photographic images (90%, 95% CI 73% to 98%), due to an additional two BCCs being mistaken for actinic keratosis (Kroemer 2011). A further study reporting data only for teledermatology using clinical and dermoscopic images also reported low sensitivity for BCC of 66% (95% CI 46% to 82%) (Bowns 2006). Both studies reported specificities of 93% and over (Figure 11).

Two studies provided a comparison with expert face‐to‐face assessment (Table 3). Kroemer 2011 reported higher sensitivity and specificity in the face‐to‐face assessments and Jolliffe 2001a reported almost identical sensitivity and specificity estimates from the two approaches.

5. Target condition: cutaneous squamous cell carcinoma

Kroemer 2011 reported accuracy for the detection of cSCC in 104 lesions with 10 cases of cSCC. There was a sensitivity of 90% (95% CI 55% to 100%) (one cSCC missed) for diagnosis based on photographic images and for the face‐to‐face assessments, with a sensitivity of 60% (95% CI 26% to 88%) (four cSCCs missed) for remote assessment based on dermoscopic images (Figure 13). Specificities were over 98% for all three approaches to diagnosis.


Forest plot of tests for the detection of cutaneous squamous cell carcinoma (cSCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.

Forest plot of tests for the detection of cutaneous squamous cell carcinoma (cSCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.

6. Referral accuracy

Six studies gave information on diagnostic decision making by teledermatology consultants compared to expert face‐to‐face decisions (as the reference standard) (Figure 14).


Forest plot of tests for referral accuracy (expert in‐person diagnosis as reference standard). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Forest plot of tests for referral accuracy (expert in‐person diagnosis as reference standard). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Four studies reported data for store‐and‐forward teledermatology using photographic images, for the diagnosis of malignancy (Oliveira 2002), for the decision to excise a lesion (Mahendran 2005; Shapiro 2004), for the decision to refer versus not refer (Jolliffe 2001b), or to excise or follow‐up at a later date (Mahendran 2005). Jolliffe 2001b reported data both for teledermatology by a dermatologist and by a dermatology registrar. It was not possible to pool results across these studies due to heterogeneity in the teledermatology.

Two studies found perfect or almost perfect agreement between teledermatology and face‐to‐face consultation (sensitivities 100% and specificities 98% to 100%; Oliveira 2002; Shapiro 2004), while Mahendran 2005 also reported 100% sensitivity both for the decision to excise a lesion and the decision to excise or follow‐up (Figure 14). In Jolliffe 2001b, the sensitivity of teledermatology by the dermatologist was 69% (95% CI 61% to 77%), with 44 lesions recommended for face‐to‐face consultation 'missed' by the remote observer, compared to 92% for the registrar's teledermatology assessment (12 lesions 'missed').

Specificities were more variable with 69% (95% CI 55% to 81%) and 57% (95% CI 39% to 73%) reported in Mahendran 2005 for the decision to excise a lesion and the decision to excise or follow‐up a lesion and 85% (95% CI 82% to 87%) and 67% (95% CI 63% to 70%) reported in Jolliffe 2001b for the dermatologist and dermatology registrar for the decision to refer a lesion. The number of lesions recommended for some action by the teledermatologist that were not recommended for action by the face‐to‐face expert in these studies ranged from 16 (Mahendran 2005) to 217 (Jolliffe 2001b).

Manahan 2015 reported the sensitivity and specificity of the teledermatologist's decision to recommend a lesion for face‐to‐face consultation based on macro and dermoscopic images compared to the same recommendation by the face‐to‐face dermatologist (301 lesions; 35 recommended for face‐to‐face consultation): the resulting sensitivity was 91% (95% CI 77% to 98%) with three lesions 'missed' and specificity was 89% (95% CI 85% to 93%) with 28 lesions recommended for a face‐to‐face visit that were not selected by the in‐person dermatologist.

Finally, Phillips 1998 reported data for the accuracy of live‐link teledermatology using videoconferencing compared to a dermatologist's face‐to‐face decision (as the reference standard) for 107 lesions. Data were reported at three different thresholds: for the correct diagnosis of a skin cancer (melanoma, BCC or cSCC), for the classification of a lesion as definitely or probably malignant and for the decision to biopsy a lesion. Sensitivities were 67% (95% CI 22% to 96%) for correct diagnosis, 60% (95% CI 15 to 95%) for definitely or probably malignant and 82% (95% CI 48 to 98%) for decision to biopsy a lesion Specificities were 96% (95% CI 90 to 99%) for both the correct diagnosis of a skin cancer and lesions definitely or probably malignant and 86% (95% CI 78 to 93%) for decision to biopsy a lesion.

Investigations of heterogeneity

We were unable to undertake planned formal investigations of heterogeneity due to insufficient number of studies.

Discussion

Although in some countries teledermatology services may provide recommendations to allow skin cancer management (including biopsy or excision) in a primary care setting, in the UK teledermatology consultations for the most part ensure that people with potentially malignant skin lesions are appropriately referred from a generalist (usually primary care) setting for specialist assessment and treatment. Therefore, the primary objective of this review was to assess the accuracy of teledermatology for the detection of any skin cancer in adults, comparing its accuracy with that of an in‐person specialist diagnosis.

Summary of main results

We included 22 studies: 16 considering the diagnostic accuracy of image‐based teledermatology (five in comparison to an in‐person assessment), and six examining the referral accuracy of teledermatology assessment. Key results are presented in the summary of findings Table. The overall risk of bias was rated as high or unclear for participant selection, reference standard, and participant flow and timing in at least half of all studies; the majority were considered at low risk of bias for the index test. The applicability of study findings were of high or unclear concern for most studies in all domains assessed due to the recruitment of study participants from secondary care settings or specialist clinics rather than from the primary or community‐based settings in which teledermatology is more likely to be used and due to the acquisition of lesion images by dermatologists or in specialist imaging units rather than by primary care clinicians.

Seven studies addressed our primary objective of the detection of any skin cancer. For the correct diagnosis of lesions as malignant, summary sensitivity from four studies using photographic images was 94.9% (95% CI 90.1% to 97.4%) and summary specificity 84.3% (95% CI 48.5% to 96.8%). Individual study estimates using dermoscopic images or a combination of photographic and dermoscopic images generally suggested similarly high sensitivities with highly variable specificities. Limited comparative data suggested similar diagnostic accuracy between teledermatology assessment and in‐person diagnosis by a dermatologist; however, data were too scarce to draw firm conclusions.

For the detection of invasive melanoma or atypical intraepidermal melanocytic variants, both sensitivities and specificities were very variable, with reported diagnostic thresholds including the correct diagnosis of melanoma, classification of lesions as 'atypical' or 'typical', and the decision to refer or to excise a lesion. For teledermatology using photographic images, sensitivities ranged from 59% (95% CI 42% to 74%) to 100% (95% CI 48% to 100%) and specificities from 30% (95% CI 22% to 40%) to 100% (95% CI 93% to 100%) in four studies. For teledermatology using both photographic and dermoscopic images summary estimates for another four studies were 85.4% (95% CI 68.3% to 94.1%) for sensitivity and 91.6% (95% CI 81.1% to 96.5%) for specificity. The number of melanomas missed ranged from 0 to 17.

Referral accuracy data comparing teledermatology against a face‐to‐face reference standard was based on a number of different diagnostic decisions including the diagnosis of malignancy, the decision to excise a lesion, the decision to refer versus not refer, or to excise or follow‐up at a later date. Agreement was generally good for lesions considered to require some positive action by face‐to‐face assessment (sensitivities of over 90%). For lesions considered of less concern when assessed face‐to‐face (e.g. for those not recommended for excision or referral), agreement was more variable with teledermatology specificities ranging from 57% (95% CI 39% to 73%) to 100% (95% CI 86% to 100%), suggesting that remote assessment is more likely recommend excision, referral or follow‐up compared to in‐person decisions.

Across all studies, there were wide variations in sensitivity and specificity for all definitions of the target condition. Studies were generally small with varying approaches to teledermatology assessment, including the use of clinical or dermoscopic images (or both); use of mobile phone cameras, digital cameras or video images; and varying thresholds for deciding test positivity. The definition of the target condition also varied such that data for the primary objective could only be extracted from seven of the 16 studies assessing diagnostic accuracy, nine studies reporting data only for the detection of individual skin cancers. These factors somewhat limited our ability to pool results across studies and further, to draw conclusions regarding the accuracy of teledermatology.

Overall, there were four key limitations of the studies.

  • The spectrum (or case mix) of different lesion types varied across studies, with a relatively high prevalence of malignant lesions.

Study participants were largely recruited from secondary care settings or from pigmented lesion clinic databases rather than from primary care or other limited prior testing settings and 50% of the studies assessing diagnostic accuracy relied on a histological reference standard (i.e. all included participants underwent lesion excision). In others, accuracy data could be extracted for less than 25% of lesions assessed at a virtual lesion clinic (Congalton 2015), or as part of a preventive medical screening programme (Massone 2014). Therefore, recruited participants were more likely to have lesions with a higher index of suspicion of malignancy compared to those for whom a GP might have considered a teledermatology assessment in practice, thereby limiting the generalisability of study results. These 'spectrum effects' are an increasingly recognised concept for medical tests, often leading to lower sensitivity and higher specificity when applied in settings with participants with limited prior testing compared to participants further down the referral pathway (Usher‐Smith 2016). However, the direction of effect is not consistent across tests and diseases (Leeflang 2013), the mechanisms in action often being complex and sometimes difficult to identify.

  • Study definitions of 'malignancy' varied and teledermatology results were often not provided according to lesion type.

In four included studies, the reported definition of malignancy did not accord with our protocol‐defined definition, with studies including melanoma metastases, Bowens disease or 'in situ cSCC', actinic keratosis or severely dysplastic naevi as 'malignant.' In some cases, data were reported to allow reclassification of these lesions as disease negative for at least some of the reported thresholds for test positivity (Borve 2015; Massone 2014); however, for others, reclassification of these lesions as disease positive was not possible and these studies could only be included in our analyses for the detection of individual skin cancers (melanoma or BCC) (Bowns 2006; Congalton 2015). Other studies had to be excluded from the review altogether due to varying definitions of 'malignant' (e.g. Borve 2013; Tandjung 2015). The lack of teledermatology results according to lesion type further limited our ability to comment on the implications of missed malignancies; the failure to pick up a melanoma or cSCC potentially carrying more severe consequences in comparison to a missed BCC.

  • The definition of a positive teledermatology result varied and was not always relevant to decision making in practice.

Of the 16 diagnostic accuracy studies, only three studies reported data for the decision to excise a lesion (one for any skin cancer and two for the detection of melanoma), and five reported data for teledermatologists' classification of lesions as malignant (or probably malignant). Nine of the 16 studies focused on teledermatologists' ability to correctly diagnose lesions as melanomas (eight lesions) or as BCCs (four lesions) which, although of interest, is not the primary factor driving teledermatology decisions in practice where the key judgement in most circumstances is whether or not a lesion should be referred for a face‐to‐face consultation.

  • Insufficient comparisons were available for the diagnostic accuracy of teledermatology‐based diagnosis and diagnosis based on a face‐to‐face dermatology clinic visit.

Only five of the 16 studies assessing diagnostic accuracy included a comparison of teledermatology‐based diagnosis with diagnosis based on in‐person diagnosis by a dermatologist; two for the detection of any skin cancer (Jolliffe 2001a; Kroemer 2011), and three for the detection of melanoma (Jolliffe 2001a; Kroemer 2011; Piccolo 2000). Therefore, we were unable to adequately assess whether a teledermatology diagnosis of malignancy accurately reflects a diagnosis made in‐person. The six studies of referral accuracy suggested that diagnosis of individual lesions using store‐and‐forward teledermatology could miss around 10% of lesions recommended for clinical action (e.g. surgical excision) during a face‐to‐face consultation (even up to 31% in one study) and is also likely to recommend an action is required for lesions that are considered of less or no concern when seen in‐person. In practice, a face‐to‐face consultation also allows a total body skin examination which may lead to incidental skin cancers being picked up which could also be missed by the teledermatology referral of only one or two lesions (Hanson 2016).

Our systematic review of dermoscopy as an addition to visual inspection of a lesion for the diagnosis of melanoma found in‐person dermoscopy (including 26 studies) to be substantially more accurate compared to diagnosis based on dermoscopic images (including 60 studies) (relative diagnostic odds ratio 4.6, 95% CI 2.4 to 9.0; P < 0.001) (Dinnes 2018a). Despite a number of contributing factors, including differences in study populations, different algorithms to assist test interpretation and differences in observer experience, we concluded that remote test interpretation cannot approximate a physical, face‐to‐face patient‐to‐clinician interaction. In particular, total body skin examination is likely to have a significant impact on the decision to excise a lesion suspected to be melanoma (Aldridge 2013; Argenziano 2012; Grob 1998). Only two of the 22 included teledermatology studies mentioned the use of total body photography (Arzberger 2016; Phillips 1998). It is notable also, that across the 60 image‐based evaluations in the review of dermoscopy, 30 (50%) were blinded to all other participant information and only 17 (28%) provided observers with the photographic image of the same lesion to assist test interpretation (Dinnes 2018a). It is conceivable that an image‐based assessment with full patient information provided in the context of a proper teledermatology consultation would provide a closer approximation of the diagnostic decision that would be made with the patient present.

Strengths and weaknesses of the review

The strengths of this review include an indepth and comprehensive electronic literature search, systematic review methods including double extraction of papers by both clinicians and methodologists, and contact with authors to allow study inclusion or clarify data. A clear analysis structure was planned to allow test accuracy to be estimated according to varying definitions of the target condition and a detailed and replicable analysis of methodological quality was undertaken.

The main concerns for the review were the lack of studies, small sample sizes, heterogeneity in teledermatology assessments, inadequate reporting of primary studies to allow quality to be fully judged and importantly, the lack of clinical applicability of the findings due to participant recruitment from, and image acquisition in, referral settings.

In comparison to other available systematic (Ndegwa 2010; Warshaw 2011) and non‐systematic (Bashshur 2015; Whited 2006; Whited 2016) reviews, our review provides a focus on the triaging or diagnosis of skin cancer as opposed to the evaluation of teledermatology for any dermatological disorder. The two most recently published reviews either found evidence generally in support of teledermatology (Bashshur 2015), or suggesting inferior accuracy compared to in‐person assessments for pigmented lesions (Whited 2016). In contrast we were unable to identify sufficient evidence to establish the diagnostic accuracy of teledermatology in comparison to a face‐to‐face clinical assessment. Our reviews of the diagnostic accuracy of visual inspection of suspicious skin lesions for the detection of melanoma (Dinnes 2018b), and of dermoscopy in comparison to visual inspection of a suspicious skin lesion (Dinnes 2018a), suggest that image‐based assessment may not be equivalent to a face‐to‐face patient:clinician interaction. However, both reviews focused on the correct diagnosis of melanoma as opposed to any skin cancer, neither included studies that were specifically designed to evaluate teledermatology programmes and did not examine any effect on accuracy from potential improvements in image quality over time.

Our a priori decision to exclude studies with fewer than five malignant cases could be construed as a weakness of the review; however, of the seven studies excluded on this basis, five were also excluded due other reasons such as inability to construct a 2×2 contingency table or ineligible study populations and one used multiple images of only four lesions to examine the effect of the positioning of a lesion within an image on clinicians ability to detect it (Chen 2002). The final study which was excluded only on the basis of sample size reported the effect of adding dermoscopic images to an existing teledermatology consultation system for 63 lesions fulfilling the criteria for teleconsultation (Moreno‐Ramirez 2006); all three malignant lesions were correctly picked up using both photographic images and with the addition of dermoscopic images (100% sensitivities) while specificities were 65% with photographic images and 78% with photographic and dermoscopic images.

We were also unsuccessful in our attempts to contact the authors of eight of 12 studies which could have been eligible for inclusion in the review, especially those studies which reported only agreement between observers or agreement with final lesion diagnoses rather than providing results in a 2×2 contingency table format. Furthermore, only partial data were provided to allow the inclusion of one study in the review (Warshaw 2010b); the review would be considerably strengthened if data for teledermatology using dermoscopic images could have been included. Finally, the review was limited to the identification of skin cancer rather than assessing the potential additional benefits of teledermatology such as positive identification of benign lesions such as actinic keratosis (Janda 2015).

Ongoing technological advances are continually improving the quality, clarity and colour‐rendition of digital images taken with cameras and mobile phones. The ability to zoom in on new larger image files potentially provides even more detailed information compared to observation of a lesion with the naked eye alone. Such advances could be of particular help in the triage of lesions with no distinguishing features (e.g. for the identification of amelanotic melanomas), the additional magnification potentially giving subtle clues to aid diagnosis. Although we have documented the equipment used to obtain images in the included studies, we were unable to identify any clear effects from changes in technology.

Our review of the diagnostic accuracy of teledermatology was also unable to evaluate a number of other pertinent factors.

  • The archiving and auditable trail provided by a teledermatology consultation. With conventional face‐to‐face consultations, much of the interchange is verbal and unrecorded; for teleconsultation however, every aspect of the referral and diagnostic opinion are recorded such that it can be reviewed and audited at a later date.

  • The possibility of 'crowd‐review' of lesion images. In‐person clinical assessment is often conducted by a single clinician, consultation with other qualified clinicians reserved for more difficult lesions or to support more junior clinicians. The nature of teledermatology diagnosis lends itself to lesion review by multiple clinicians and to have virtual multidisciplinary review of cases if necessary.

  • Changing the referral behaviour of primary care clinicians. The availability of a teledermatology service could result in a specialist opinion being sought for much earlier presentations of conditions than would normally be the case, which not only changes the spectrum of lesion types observed by teledermatologists, potentially impacting on their accuracy, but could ultimately result in overdiagnosis and treatment. For example, some lesions identified and either excised or treated non‐surgically following a teleconsultation might have resolved spontaneously if monitored for longer in primary care, or if referred for a standard face‐to‐face consultation. The effect on referral behaviour is likely to be exacerbated where ease of access to a teledermatology service for a specialist opinion may be preferred by GP trainees to seeking the opinion of a more experienced GP.

  • Possible over‐reliance by GPs and reassurance for patients on a benign diagnosis from a teledermatology consultation. As with all clinical consultations, a diagnostic opinion from a teledermatology consultation is limited to the quality of the clinical information provided and the circumstances at the time of consultation. If those circumstances change, for example if a pigmented lesion diagnosed as 'benign' evolves or changes its nature in some way, then that lesion should be reviewed. 'Safety‐netting' such lesions is an important part of management and monitoring in primary care.

Applicability of findings to the review question

The data included in this review are unlikely to be generally applicable to the intended setting. Most studies recruited participants from secondary care or referral settings rather than from primary care settings where patients are far less likely to have skin cancer, and potentially suspicious lesions are likely to be earlier in their development and evolution. Lesion images were often acquired in secondary care rather than being acquired in primary care and transmitted for a specialist opinion using teledermatology. Considerable heterogeneity in approaches to teledermatology were also observed limiting generalisability.

Sample photograph of superficial spreading melanoma(left), basal cell carcinoma (centre) and squamous cell carcinoma (right). Copyright © 2012 Dr Rubeta Matin: reproduced with permission.
Figures and Tables -
Figure 1

Sample photograph of superficial spreading melanoma(left), basal cell carcinoma (centre) and squamous cell carcinoma (right). Copyright © 2012 Dr Rubeta Matin: reproduced with permission.

Current clinical pathway for people with skin lesions.
Figures and Tables -
Figure 2

Current clinical pathway for people with skin lesions.

PRISMA flow diagram.
Figures and Tables -
Figure 3

PRISMA flow diagram.

Risk of bias and applicability concerns graph: review authors' judgements about each domain presented as percentages across included studies.
Figures and Tables -
Figure 4

Risk of bias and applicability concerns graph: review authors' judgements about each domain presented as percentages across included studies.

Risk of bias and applicability concerns summary: review authors' judgements about each domain for each included study.
Figures and Tables -
Figure 5

Risk of bias and applicability concerns summary: review authors' judgements about each domain for each included study.

Forest plot of tests for the detection of any skin cancer (any). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.
Figures and Tables -
Figure 6

Forest plot of tests for the detection of any skin cancer (any). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Summary receiver operating characteristic plot of teledermatology using photographic images versus histology for the detection of any skin cancer (any).
Figures and Tables -
Figure 7

Summary receiver operating characteristic plot of teledermatology using photographic images versus histology for the detection of any skin cancer (any).

Forest plot of tests for the detection of invasive melanoma or atypical intraepidermal melanocytic variants. CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.
Figures and Tables -
Figure 8

Forest plot of tests for the detection of invasive melanoma or atypical intraepidermal melanocytic variants. CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.

Summary receiver operating characteristic plot of teledermatology using clinical and dermoscopic images for the detection of invasive melanoma or atypical intraepidermal melanocytic variants.
Figures and Tables -
Figure 9

Summary receiver operating characteristic plot of teledermatology using clinical and dermoscopic images for the detection of invasive melanoma or atypical intraepidermal melanocytic variants.

Forest plot of tests for the detection of invasive melanoma. CI: confidence interval; FN: false negative; FP: false positive; MM: malignant melanoma; TN: true negative; TP: true positive; VI: visual inspection.
Figures and Tables -
Figure 10

Forest plot of tests for the detection of invasive melanoma. CI: confidence interval; FN: false negative; FP: false positive; MM: malignant melanoma; TN: true negative; TP: true positive; VI: visual inspection.

Forest plot of tests for the detection of basal cell carcinoma (BCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.
Figures and Tables -
Figure 11

Forest plot of tests for the detection of basal cell carcinoma (BCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Receiver operating characteristic plot teledermatology using photographic images alone versus histology for the detection of basal cell carcinoma.
Figures and Tables -
Figure 12

Receiver operating characteristic plot teledermatology using photographic images alone versus histology for the detection of basal cell carcinoma.

Forest plot of tests for the detection of cutaneous squamous cell carcinoma (cSCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.
Figures and Tables -
Figure 13

Forest plot of tests for the detection of cutaneous squamous cell carcinoma (cSCC). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive; VI: visual inspection.

Forest plot of tests for referral accuracy (expert in‐person diagnosis as reference standard). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.
Figures and Tables -
Figure 14

Forest plot of tests for referral accuracy (expert in‐person diagnosis as reference standard). CI: confidence interval; FN: false negative; FP: false positive; TN: true negative; TP: true positive.

Teledermatology – photographic image vs histology (any).
Figures and Tables -
Test 1

Teledermatology – photographic image vs histology (any).

Teledermatology – dermoscopic image vs histology (any).
Figures and Tables -
Test 2

Teledermatology – dermoscopic image vs histology (any).

Teledermatology – photographic + dermoscopic image vs histology (any).
Figures and Tables -
Test 3

Teledermatology – photographic + dermoscopic image vs histology (any).

Expert face‐to‐face (visual inspection (VI) + dermoscopy) vs histology (any).
Figures and Tables -
Test 4

Expert face‐to‐face (visual inspection (VI) + dermoscopy) vs histology (any).

Teledermatology – photographic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variantsi).
Figures and Tables -
Test 5

Teledermatology – photographic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variantsi).

Teledermatology – dermoscopic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variants).
Figures and Tables -
Test 6

Teledermatology – dermoscopic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variants).

Teledermatology photographic + dermoscopic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variants).
Figures and Tables -
Test 7

Teledermatology photographic + dermoscopic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variants).

Expert face‐to‐face (VI + dermoscopy) (invasive melanoma or atypical intraepidermal melanocytic variants).
Figures and Tables -
Test 8

Expert face‐to‐face (VI + dermoscopy) (invasive melanoma or atypical intraepidermal melanocytic variants).

Teledermatology – photographic + dermoscopic image vs histology (invasive melanoma).
Figures and Tables -
Test 9

Teledermatology – photographic + dermoscopic image vs histology (invasive melanoma).

Expert face‐to‐face (VI + dermoscopy) vs histology (invasive melanoma).
Figures and Tables -
Test 10

Expert face‐to‐face (VI + dermoscopy) vs histology (invasive melanoma).

Teledermatology – dermoscopic image vs histology (acral melanoma).
Figures and Tables -
Test 11

Teledermatology – dermoscopic image vs histology (acral melanoma).

Teledermatology – photographic image vs histology (basal cell carcinoma (BCC)).
Figures and Tables -
Test 12

Teledermatology – photographic image vs histology (basal cell carcinoma (BCC)).

Teledermatology – dermoscopic image vs histology (BCC).
Figures and Tables -
Test 13

Teledermatology – dermoscopic image vs histology (BCC).

Teledermatology – photographic + dermoscopic image vs histology (BCC).
Figures and Tables -
Test 14

Teledermatology – photographic + dermoscopic image vs histology (BCC).

Expert face‐to‐face (VI + dermoscopy) vs histology (BCC).
Figures and Tables -
Test 15

Expert face‐to‐face (VI + dermoscopy) vs histology (BCC).

Teledermatology – photographic image vs histology (cutaneous squamous cell carcinoma (cSCC)).
Figures and Tables -
Test 16

Teledermatology – photographic image vs histology (cutaneous squamous cell carcinoma (cSCC)).

Teledermatology – dermoscopic image vs histology (cSCC).
Figures and Tables -
Test 17

Teledermatology – dermoscopic image vs histology (cSCC).

Expert face‐to‐face (VI + dermoscopy) vs histology (cSCC).
Figures and Tables -
Test 18

Expert face‐to‐face (VI + dermoscopy) vs histology (cSCC).

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action).
Figures and Tables -
Test 19

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action).

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (of malignancy).
Figures and Tables -
Test 20

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (of malignancy).

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action or follow‐up).
Figures and Tables -
Test 21

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action or follow‐up).

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action; registrar data).
Figures and Tables -
Test 22

Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action; registrar data).

Teledermatology – photographic/dermoscopic image vs face‐to‐face clinical diagnosis (immediate action).
Figures and Tables -
Test 23

Teledermatology – photographic/dermoscopic image vs face‐to‐face clinical diagnosis (immediate action).

Teledermatology – live link vs face‐to‐face clinical diagnosis (diagnosis of any skin cancer).
Figures and Tables -
Test 24

Teledermatology – live link vs face‐to‐face clinical diagnosis (diagnosis of any skin cancer).

Teledermatology – live link vs face‐to‐face clinical diagnosis (definitely/probably malignant).
Figures and Tables -
Test 25

Teledermatology – live link vs face‐to‐face clinical diagnosis (definitely/probably malignant).

Teledermatology – live link vs face‐to‐face clinical diagnosis (biopsy decision).
Figures and Tables -
Test 26

Teledermatology – live link vs face‐to‐face clinical diagnosis (biopsy decision).

Summary of findings Summary of findings table

Question:

What is the diagnostic accuracy of teledermatology for the detection of skin cancer in adults?

Population:

Adults with lesions suspicious for skin cancer

Index test:

TD using photographic or dermoscopic (or both) images

Comparator test:

Face‐to‐face diagnosis using visual inspection or dermoscopy (or both)

Target condition:

Any skin cancer, including invasive melanoma and atypical intraepidermal melanocytic variants, BCC and cSCC

Reference standard:

Histology with or without long‐term follow‐up (diagnostic accuracy); expert face‐to‐face diagnosis (for referral accuracy)

Action:

If accurate, positive results ensure the malignant lesions are not missed but are appropriately referred for specialist assessment or are treated appropriately in a non‐referred setting, and those with negative results can be safely reassured and discharged.

Quantity of evidence

Number of studies

Total lesions

Total cases

Diagnostic accuracy

16

4057

879

Referral accuracy

6

1449

270

Limitations

Risk of bias:

Low risk for participant selection in 7 studies; high risk (5) from case‐control design (2) or inappropriate exclusion criteria (4). Low risk for teledermatology assessments (22). Low risk for comparison with face‐to‐face diagnosis (2/5); unclear (3). Low risk for reference standard (10/22); high risk from use of expert diagnosis alone – referral accuracy (6) or inadequate reference standard (6). High risk for participant flow (17) due to differential verification (5), and exclusions following recruitment (14); timing of tests not mentioned in 14 studies.

Applicability of evidence to question:

High concern (14/22) for applicability of participants due to recruitment from secondary care or specialist clinics (12) or inclusion of multiple lesions per participant (6). High concern for applicability of teledermatology assessments (12/22) due to images acquired by dermatologists secondary care settings or in medical imaging units rather than images acquired in primary care. Low concern for reference standard (6/22); unclear concern due to lack of information concerning the expertise of the histopathologist (13) or expert face‐to‐face diagnosis (3).

Findings:

7 studies reported diagnostic accuracy data for the primary target condition of any skin cancer; 9 studies for the detection of invasive melanoma or atypical intraepidermal melanocytic variants; 2 studies for invasive melanoma alone; and 4 studies for BCC alone. 6 studies reported only referral accuracy data (teledermatology decisions versus face‐to‐face decisions). The findings presented are based on results for the primary target condition of any skin cancer and for the detection of invasive melanoma or atypical intraepidermal melanocytic variants.

Diagnostic accuracy data

Number of datasets

Total lesions

Total malignant

Test: TD using photographic images for any skin cancer

4

717

452

3 studies reported a cross‐tabulation of lesion final diagnoses against the diagnosis on teledermatology such that data could be extracted for the detection of any malignancy, regardless of any misclassification of 1 skin cancer for another, e.g. a BCC diagnosed as a melanoma or vice versa; and 1 study presented data for the detection of 'malignant' versus benign cases with no breakdown of individual lesion diagnoses given. Summary sensitivity was 94.9% (95% CI 90.1% to 97.4%) and summary specificity 84.3% (95% CI 48.5% to 96.8%). 2 studies providing a direct comparison between TD assessment and in‐person diagnosis by a dermatologist the data suggested similar accuracy between approaches; however, data were too scarce to draw firm conclusions.

Test: TD using clinical and dermoscopic images for any skin cancer

3

928

215

Sensitivities were 100% in all 3 studies. Specificities ranged from 25% (95% CI 5% to 57%) to 92% (95% CI 74% to 99%). Studies used varying thresholds to decide test positivity and included highly selected populations. No statistical pooling was undertaken.

Test: TD using photographic images for invasive melanoma or atypical intraepidermal melanocytic variants

4

1834

106

Sensitivities ranged from 59% (95% CI 42% to 74%) to 100% (95% CI 48% to 100%) and specificities from 30% (95% CI 22% to 40%) to 100% (95% CI 93% to 100%). Diagnostic thresholds were correct diagnosis of melanoma (3) or classification as 'atypical' or 'typical.' Populations also varied, some including only atypical or higher‐risk pigmented lesions and excluding equivocal lesions and others including both pigmented and non‐pigmented lesions who were either self‐referred or were deemed to require lesion excision. The number of melanomas missed ranged from 0 to 17.

Test: TD using photographic and dermoscopic images for invasive melanoma or atypical intraepidermal melanocytic variants

4

664

93

Summary estimates were 85.4% (95% CI 68.3% to 94.1%) for sensitivity and 91.6% (95% CI 81.1% to 96.5%) for specificity. Sensitivities were lower for the correct diagnosis of melanoma (71% to 81%) compared to the decision to refer or to excise a lesion (96% to 100%). The number of melanomas missed ranged from 0 to 7.

All data (referral accuracy)

6

1449

270

TD diagnoses were reported based on photographic images alone (4), photographic and dermoscopic images (1) and using live‐link TD (1). Diagnostic decisions on TD varied, including the diagnosis of malignancy, the decision to excise a lesion, the decision to refer versus not refer, or to excise or follow‐up at a later date. For store‐and‐forward TD sensitivities were generally above 90% indicating good agreement between remote image‐based decisions with the face‐to‐face reference standard for lesions considered to require some positive action by face‐to‐face assessment. Specificities were more variable ranging from 57% (95% CI 39% to 73%) to 100% (95% CI 86% to 100%) suggesting that remote assessment is more likely to recommend excision, referral or follow‐up for lesions considered of less concern when assessed face‐to‐face.

BCC: basal cell carcinoma; CI: confidence interval; cSCC: cutaneous squamous cell carcinoma; TD: teledermatology.

Figures and Tables -
Summary of findings Summary of findings table
Table 1. Cross‐tabulation of included studies against target condition assessed, by type of images used for teledermatology

Studies of diagnostic accuracy –TD vs histology

FTF dxa

vs histology

TD

vs expert FTF

Study

Any skin cancer

Melanomab

BCC

cSCC

Action

Arzberger 2016

Photo/Derm (excise or not)

Borve 2015

Photo/Derm (dx as malignant/possibly malignant)

Bowns 2006

Photo/Derm (dx as MM/MiS)

Photo/Derm (dx as BCC)

Congalton 2015

Photo/Derm (excise or not)

Coras 2003

Photo/Dermc (dx as MM/MiS)

dx as MM

Ferrara 2004

Derm only (dx as MM/MiS)

Grimaldi 2009

Photo/Dermc (excise or not)

(GP dx as suspicious for malignancy; data not included)

Jolliffe 2001a

Photo onlyc (dx as any SC)

< 5 MM

Photo onlyc (dx as BCC)

dx as any SC

dx as BCC

Kroemer 2011

Photo onlyc

Derm onlyc (dx as malignant)

Photo onlyc

Derm onlyc (dx as MM/MiS)

Photo onlyc

Derm onlyc (dx as BCC)

Photo onlyc

Derm onlyc (dx as cSCC)

dx as malignant

dx as MM/MiS

dx as BCC

dx as cSCC

Massone 2014

Photo/Derm (dx as malignant)

Moreno Ramirez 2005

Photol only (dx as malignant)

Photo only (dx as MM/MiS)

Photo only (dx as BCC)

Piccolo 2000

Photo/Dermc (dx as MM)

dx as MM

Piccolo 2004

Derm only (dx as MM)

Silveira 2014

Photo only (dx as malignant)

Warshaw 2010b

Photo onlyc (dx as MM/MiS)

dx of MM/MiS

Wolf 2013

Photo only (dx as atypical)

Studies of referral accuracy – TD vs expert FTF decision

Jolliffe 2001b

Clin only (refer or not)

Mahendran 2005

Clin only (excise or not)

Manahan 2015

Clin/Derm (see FTF)

Oliveira 2002

Clin only (dx as malignant)

Phillips 1998

Live link

(dx as any SC;

definitely/probably malignant/

excise or not)

Shapiro 2004

Clin only (excise or not)

BCC: basal cell carcinoma; clin: clinical; cSCC: cutaneous squamous cell carcinoma; Derm: dermoscopic images; dx: diagnosis; FTF: face‐to‐face; GP: general practitioner; MiS: melanoma in situ (atypical intraepidermal melanocytic variants); MM: invasive melanoma; Photo: photographic images; SC: skin cancer; TD: teledermatology.

aFace‐to‐face diagnosis by an expert/dermatologist unless otherwise stated.

bAll include melanoma in situ as disease positive apart from Piccolo 2000 and Piccolo 2004, which reported detection of invasive melanoma only.

cIncluded a direct comparison with expert face‐to‐face diagnosis versus histology.

Figures and Tables -
Table 1. Cross‐tabulation of included studies against target condition assessed, by type of images used for teledermatology
Table 2. Summary estimates of sensitivity and specificity for teledermatology

Index test,a target condition

Studies

Cases/number of participants

Summary sensitivity (95% CI) %

Summary specificity (95% CI) %

Teledermatology photographic image, any

4

452/717

94.9

(90.1 to 97.4)

84.3

(48.5 to 96.8)

Teledermatology photographic/dermoscopic image, MM + MiS

4

93/664

85.4

(68.3 to 94.1)

91.6

(81.1 to 96.5)

Teledermatology photographic image vs histology, BCC

3

62/301

93.5

(84.0 to 97.6)

95.8

(92.4 to 97.7)

BCC: basal cell carcinoma; CI: confidence interval; MM + MiS: invasive melanoma and atypical intraepidermal melanocytic variants.

aReference standard was histology for all comparisons.

Figures and Tables -
Table 2. Summary estimates of sensitivity and specificity for teledermatology
Table 3. Direct comparisons of teledermatology with face‐to‐face diagnosis of melanoma and other types of skin cancer

Study

Sensitivity (true positives/cases) %

Difference (95% CI)

Specificity (true negatives/non cases) %

Difference (95% CI)

Any skin cancer

Teledermatology photographic image

Expert face‐to‐face

Teledermatology

photographic image

Expert face‐to‐face

Jolliffe 2001a

100 (13/13)

100 (13/13)

0.00

(‐2.28 to 2.28)

92.9 (118/127)

94.7 (124/131)

‐1.74

(‐8.18 to 4.49)

Kroemer 2011

93 (43/46)

100 (46/46)

‐0.06

(‐0.17 to 0.02)

88 (51/58)

90 (52/58)

‐0.02

(‐0.14 to 0.10)

Teledermatology dermoscopic image

Expert face‐to‐face

Teledermatology dermoscopic image

Expert face‐to‐face

Kroemer 2011

85 (39/46)

100 (46/46)

‐0.15

(‐0.28 to ‐0.04)

91 (53/58)

90 (52/58)

0.02

(‐0.13 to 0.09)

Invasive melanoma or atypical intraepidermal melanocytic variants

Teledermatology photographic + dermoscopic image

Expert face‐to‐face

Teledermatology photographic + dermoscopic image

Expert face‐to‐face

Coras 2003

81 (13/16)

88 (14/16)

‐0.06

(‐0.32 to 0.20)

93 (27/29)

93 (27/29)

0.00

(‐0.16 to 0.16)

Teledermatology dermoscopic image

Expert face‐to‐face

Teledermatology dermoscopic image

Expert face‐to‐face

Kroemer 2011

100 (5/5)

100 (5/5)

0.00

(‐0.43 to 0.43)

98 (97/99)

99 (98/99)

‐0.01

(‐0.06 to 0.04)

Warshaw 2010b

59 (24/41)

73 (30/41)

‐0.15

(‐0.33 to 0.06)

41 (604/1473)

63 (930/1473)

‐0.22

(‐0.26 to ‐0.19)a

Invasive melanoma

Teledermatology photographic/dermoscopic image

Expert face‐to‐face

Teledermatology photographic/dermoscopic image

Expert face‐to‐face

Piccolo 2000

81.8 (9/11)

72.7 (8/11)

9.10

(‐25.2 to 41.2)

100 (32/32)

96.9 (31/32)

3.13

(‐7.90 to 15.7)

Basal cell carcinoma

Teledermatology photographic image

Expert face‐to‐face

Teledermatology photographic image

Expert face‐to‐face

Jolliffe 2001a

100 (9/9)

100 (9/9)

0.00

(‐29.9 to 29.9)

97.0 (127/131)

97.8 (132/135)

‐0.83

(‐5.60 to 3.68)

CI: confidence interval.

adenotes statistically significant difference.

Figures and Tables -
Table 3. Direct comparisons of teledermatology with face‐to‐face diagnosis of melanoma and other types of skin cancer
Table Tests. Data tables by test

Test

No. of studies

No. of participants

1 Teledermatology – photographic image vs histology (any) Show forest plot

4

717

2 Teledermatology – dermoscopic image vs histology (any) Show forest plot

1

104

3 Teledermatology – photographic + dermoscopic image vs histology (any) Show forest plot

3

871

4 Expert face‐to‐face (visual inspection (VI) + dermoscopy) vs histology (any) Show forest plot

2

248

5 Teledermatology – photographic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variantsi) Show forest plot

4

1834

6 Teledermatology – dermoscopic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variants) Show forest plot

2

116

7 Teledermatology photographic + dermoscopic image vs histology (invasive melanoma or atypical intraepidermal melanocytic variants) Show forest plot

4

664

8 Expert face‐to‐face (VI + dermoscopy) (invasive melanoma or atypical intraepidermal melanocytic variants) Show forest plot

3

1663

9 Teledermatology – photographic + dermoscopic image vs histology (invasive melanoma) Show forest plot

1

43

10 Expert face‐to‐face (VI + dermoscopy) vs histology (invasive melanoma) Show forest plot

1

43

11 Teledermatology – dermoscopic image vs histology (acral melanoma) Show forest plot

1

77

12 Teledermatology – photographic image vs histology (basal cell carcinoma (BCC)) Show forest plot

3

301

13 Teledermatology – dermoscopic image vs histology (BCC) Show forest plot

1

104

14 Teledermatology – photographic + dermoscopic image vs histology (BCC) Show forest plot

1

256

15 Expert face‐to‐face (VI + dermoscopy) vs histology (BCC) Show forest plot

2

248

16 Teledermatology – photographic image vs histology (cutaneous squamous cell carcinoma (cSCC)) Show forest plot

1

104

17 Teledermatology – dermoscopic image vs histology (cSCC) Show forest plot

1

104

18 Expert face‐to‐face (VI + dermoscopy) vs histology (cSCC) Show forest plot

1

104

19 Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action) Show forest plot

3

951

20 Teledermatology – photographic image vs face‐to‐face clinical diagnosis (of malignancy) Show forest plot

1

90

21 Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action or follow‐up) Show forest plot

1

106

22 Teledermatology – photographic image vs face‐to‐face clinical diagnosis (immediate action; registrar data) Show forest plot

1

796

23 Teledermatology – photographic/dermoscopic image vs face‐to‐face clinical diagnosis (immediate action) Show forest plot

1

301

24 Teledermatology – live link vs face‐to‐face clinical diagnosis (diagnosis of any skin cancer) Show forest plot

1

107

25 Teledermatology – live link vs face‐to‐face clinical diagnosis (definitely/probably malignant) Show forest plot

1

107

26 Teledermatology – live link vs face‐to‐face clinical diagnosis (biopsy decision) Show forest plot

1

107

Figures and Tables -
Table Tests. Data tables by test