Scolaris Content Display Scolaris Content Display

Personal protective equipment for preventing highly infectious diseases due to exposure to contaminated body fluids in healthcare staff

This is not the most recent version

Collapse all Expand all

Background

In epidemics of highly infectious diseases, such as Ebola Virus Disease (EVD) or SARS, healthcare workers (HCW) are at much greater risk of infection than the general population, due to their contact with patients' contaminated body fluids. Contact precautions by means of personal protective equipment (PPE) can reduce the risk. It is unclear which type of PPE protects best, what is the best way to remove PPE, and how to make sure HCWs use PPE as instructed.

Objectives

To evaluate which type or component of full‐body PPE and which method of donning or removing (doffing) PPE have the least risk of self‐contamination or infection for HCWs, and which training methods most increase compliance with PPE protocols.

Search methods

We searched MEDLINE (PubMed up to 8 January 2016), Cochrane Central Register of Trials (CENTRAL up to 20 January 2016), EMBASE (embase.com up to 8 January 2016), CINAHL (EBSCOhost up to 20 January 2016), and OSH‐Update up to 8 January 2016. We also screened reference lists of included trials and relevant reviews, and contacted NGOs and manufacturers of PPE.

Selection criteria

We included all eligible controlled studies that compared the effect of types or components of PPE in HCWs exposed to highly infectious diseases with serious consequences, such as EVD and SARS, on the risk of infection, contamination, or noncompliance with protocols. This included studies that simulated contamination with fluorescent markers or a non‐pathogenic virus.

We also included studies that compared the effect of various ways of donning or removing PPE, and the effects of various types of training in PPE use on the same outcomes.

Data collection and analysis

Two authors independently selected studies, extracted data and assessed risk of bias in included trials. We intended to perform meta‐analyses but we did not find sufficiently similar studies to combine their results.

Main results

We included nine studies with 1200 participants evaluating ten interventions. Of these, eight trials simulated the exposure with a fluorescent marker or virus or bacteria containing fluids. Five studies evaluated different types of PPE against each other but two did not report sufficient data. Another two studies compared different types of donning and doffing and three studies evaluated the effect of different types of training.

None of the included studies reported a standardised classification of the protective properties against viral penetration of the PPE, and only one reported the brand of PPE used. None of the studies were conducted with HCWs exposed to EVD but in one study participants were exposed to SARS.

Different types of PPE versus each other

In simulation studies, contamination rates varied from 25% to 100% of participants for all types of PPE. In one study, PPE made of more breathable material did not lead to a statistically significantly different number of spots with contamination but did have greater user satisfaction (Mean Difference (MD) ‐0.46 (95% Confidence Interval (CI) ‐0.84 to ‐0.08, range 1 to 5, very low quality evidence). In another study, gowns protected better than aprons. In yet another study, the use of a powered air‐purifying respirator protected better than a now outdated form of PPE. There were no studies on goggles versus face shields, on long‐ versus short‐sleeved gloves, or on the use of taping PPE parts together.

Different methods of donning and doffing procedures versus each other

Two cross‐over simulation studies (one RCT, one CCT) compared different methods for donning and doffing against each other. Double gloving led to less contamination compared to single gloving (Relative Risk (RR) 0.36; 95% CI 0.16 to 0.78, very low quality evidence) in one simulation study, but not to more noncompliance with guidance (RR 1.08; 95% CI 0.70 to 1.67, very low quality evidence). Following CDC recommendations for doffing led to less contamination in another study (very low quality evidence). There were no studies on the use of disinfectants while doffing.

Different types of training versus each other

In one study, the use of additional computer simulation led to less errors in doffing (MD ‐1.2, 95% CI ‐1.6 to ‐0.7) and in another study additional spoken instruction led to less errors (MD ‐0.9, 95% CI ‐1.4 to ‐0.4). One retrospective cohort study assessed the effect of active training ‐ defined as face‐to‐face instruction ‐ versus passive training ‐ defined as folders or videos ‐ on noncompliance with PPE use and on noncompliance with doffing guidance. Active training did not considerably reduce noncompliance in PPE use (Odds Ratio (OR) 0.63; 95% CI 0.31 to 1.30) but reduced noncompliance with doffing procedures (OR 0.45; 95% CI 0.21 to 0.98, very low quality evidence). There were no studies on how to retain the results of training in the long term or on resource use.

The quality of the evidence was very low for all comparisons because of high risk of bias in studies, indirectness of evidence, and small numbers of participants. This means that it is likely that the true effect can be substantially different from the one reported here.

Authors' conclusions

We found very low quality evidence that more breathable types of PPE may not lead to more contamination, but may have greater user satisfaction. We also found very low quality evidence that double gloving and CDC doffing guidance appear to decrease the risk of contamination and that more active training in PPE use may reduce PPE and doffing errors more than passive training. However, the data all come from single studies with high risk of bias and we are uncertain about the estimates of effects.

We need simulation studies conducted with several dozens of participants, preferably using a non‐pathogenic virus, to find out which type and combination of PPE protects best, and what is the best way to remove PPE. We also need randomised controlled studies of the effects of one type of training versus another to find out which training works best in the long term. HCWs exposed to highly infectious diseases should have their use of PPE registered and should be prospectively followed for their risk of infection.

PICOs

Population
Intervention
Comparison
Outcome

The PICO model is widely used and taught in evidence-based health care as a strategy for formulating questions and search strategies and for characterizing clinical studies or meta-analyses. PICO stands for four different potential components of a clinical question: Patient, Population or Problem; Intervention; Comparison; Outcome.

See more on using PICO in the Cochrane Handbook.

Clothes and equipment for healthcare staff to prevent Ebola and other highly infective diseases

Healthcare staff are at much greater risk of infections such as Ebola Virus Disease or SARS than people in general. One way of preventing infection is to use personal protective equipment, such as protective clothing, gloves, masks, and goggles to prevent contamination of the worker. It is unclear which type of equipment protects best and how it can best be removed after use. It is also unclear what is the best way to train workers to comply with guidance for this equipment.

Studies found

We found six studies with 295 participants in which workers' protective clothing was sprayed with a fluorescent marker or a harmless virus to simulate what happens in hospitals. Four of these compared different types of protective clothing. Two studies compared different ways of putting clothing on and taking it off. Three studies with 905 participants compared the effect of active training on the use of protective equipment to passive training. All studies had a high risk of bias.

Various types of clothing compared

In spite of protective clothing, the marker was found on the skin of 25% to 100% of workers. In one study, more breathable clothing did not lead to more contamination than non‐breathable clothing, but users were more satisfied. Gowns led to less contamination than aprons in another study. Two studies did not report enough data to enable conclusions. This evidence was of very low quality.

Various types of removal of clothing compared

In one study, two pairs of gloves led to less contamination than only one pair of gloves. The outer gloves were immediately removed after the task was finished. In another study, following CDC guidance for apron or gown removal led to less contamination. This evidence was also of very low quality.

Active training

Active training, including computer simulation and spoken instructions, led to less errors with guidance on which protection to use and how to remove it among healthcare staff compared to passive training.

Quality of the evidence

We judged the quality of the evidence to be very low because of limitations in the studies, indirectness and small numbers of participants.

What do we still need to find out?

There were no studies on the effects of goggles, face shields, long‐sleeved gloves or taping on the risk of contamination. We need simulation studies with several dozens of participants, preferably using exposure to a harmless virus, to find out which type and combination is most protective. The best way to remove protective clothing after use is also unclear. We need studies that use chance to assign workers to different types of training to find out which training works best. Healthcare staff exposed to highly infectious diseases should have their protective equipment registered and be followed for their risk of infection. We urge WHO and NGOs to organise more studies.

Authors' conclusions

available in

Implications for practice

In addition to other infection control measures, consistent use of full‐body PPE can diminish the risk of infection for HCWs. EN and ISO standards for chemical protective clothing and fabric permeability for viruses are helpful to determine which PPE should technically protect sufficiently against highly infectious diseases. However the risk of contamination depends on more than just these technical factors. In simulation studies, contamination happened in almost all intervention and control arms.

There is very low quality evidence, based on single exposure simulation studies, that more breathable fabric may still lead to similar levels of contamination protection as less breathable fabric, and is preferred by users. The lack of use of standards in the included studies prevents the extrapolation of results of studies comparing PPE types to PPE in current use. There were no studies, and thus no evidence, of the effectiveness of taping parts of PPE together, or the effects of goggles compared to visors.

For different procedures of donning and doffing, there is very low quality evidence based on a single study each that double gloves, as part of PPE and following CDC guidelines, may reduce the risk of contamination. There are no studies on the use of disinfectant during doffing.

For various training procedures there is very low quality evidence that more active training, including computer simulation or spoken instructions may reduce the risk of infection compared to passive training or to not giving such instructions or to simulation. There are no studies that have compared methods to retain PPE skills needed for proper donning and doffing in the long term.

The quality of the evidence is very low for all comparisons because of high risk of bias in studies, indirectness of evidence, and small numbers of participants. This means that we are uncertain about the estimates of effects, and it is therefore likely that the true effects may be substantially different from the ones reported in this review.

Implications for research

We call on WHO and NGOs in medical relief work to organise studies and to raise awareness for the lack of evidence for the effect of specific PPE. We also call upon them to develop a more transparent and uniform labeling of infection control measures and the protection level of PPE for HCWs. We believe that this is an important prerequisite for the universal implementation of infection control measures for HCWs.

Simulation studies are a feasible and relatively simple way to compare different types of PPE such as gowns versus coveralls, goggles versus visors, combinations of hoods and goggles, taping versus no taping, spraying disinfectant versus not spraying and various procedures after breach of PPE to find out which protects best against contamination. It is a prerequisite for a reliable answer that methods of simulation studies are standardised in terms of exposure and outcome measurement. Viral marker Bacteriophage MS2 seems to be the most sensitive marker and we would advocate to use this. Studies should have sufficient power. To be able to detect a RR of 0.5 with a control group rate of contamination of 0.7, assuming α = 0.05 and β = 0.80, a sample size of 62 would be needed.

To find out how PPE behaves under real exposure, we need prospective follow‐up of HCWs involved in the treatment of patients with highly infectious diseases, with careful registration of PPE and risk of infection. Because different NGOs use different PPE guidance, cohorts of workers would be relatively simple to establish if there would be sufficient political will. Here, the effect sizes would be smaller and thus the sample size should be bigger than 60.

In addition, case‐control studies comparing PPE use among infected HCWs and matched healthy controls, using rigorous collection of exposure data, can provide information about the effects of PPE on the risk of infection. The sample sizes should be much bigger than the current case studies because we would like to detect small but important differences in effect between various combinations of PPE such as gowns versus coveralls. There is a need for collaboration between organisations serving epidemic areas to carry out this important research within limited resources, and during the throes of an outbreak.

We also need more randomised controlled studies of the effects of one type of training versus another, to find out which training works best, especially at long‐term follow‐up of one year or more. Also here, the effect size seems to be quite large and thus a sample size of around 60 seems to provide adequate power.

Summary of findings

Open in table viewer
Summary of findings for the main comparison. Comparison 1: One type of PPE versus another – PAPR versus E‐RCP attire

PAPR versus E‐RCP Attire for preventing contact with contaminated body fluids in healthcare staff

Patient or population: healthcare staff volunteers
Settings: simulation study
Intervention: PPE with Powered Air Purifying Respirator (PAPR) Attire

Control: Enhanced respiratory and contact precautions (E‐RCP) attire according to 2005 CDC recommendation

Outcomes

Illustrative comparative risks* (95% CI)

Relative effect
(95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

E‐RCP attire

PAPR Attire

Any contamination
fluorescent marker

Follow‐up: post intervention

960 per 1000

259 per 1000
(163 to 413)

RR 0.27
(0.17 to 0.43)

50
(1 cross‐over RCT)

⊕⊕⊝⊝
very low1,2,3

Analyses presented in this table are unadjusted for the paired nature of the cross‐over design but similar to the results that the authors presented while taking the cross‐over into account

Compliance with guidance ‐ Noncompliance

with donning guidance

Follow‐up: post intervention

40 per 1000

300 per 1000
(72 to 1000)

RR 7.5
(1.81 to 31.1)

50
(1 cross‐over RCT)

⊕⊕⊝⊝
very low1,2,3

Compliance with guidance ‐ Noncompliance

with doffing guidance

Follow‐up: post intervention

240 per 1000

120 per 1000
(48 to 295)

RR 0.5
(0.2 to 1.23)

50
(1 cross‐over RCT)

⊕⊕⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

Not estimable

0
(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).
CI: Confidence interval; RR: Risk ratio;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Simulation study, downgraded for indirectness
2 One cross‐over study with 50 participants, downgraded for imprecision

3 HIgh risk of bias, downgraded for study limitations

Open in table viewer
Summary of findings 2. Comparison 1: One type of PPE versus another – Three types of PPE attire

Three types of PPE attire compared by number of contaminated spots

Patient or population: healthcare worker volunteers
Settings: simulation study
Intervention: more protective attire, not permeable not breathable (A)
Comparison: less protective attire: permeable but breathable (B); fairly permeable, not breathable (D)

Outcomes

Illustrative comparative risks* (95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Less protective type of PPE (B or D)

Most protective type of PPE attire (A)

Number of contaminated spots ‐ Neck

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in control group B was
0.12 spots

The mean number of contaminated spots in the intervention group was
0.7 higher
(0.26 lower to 1.66 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Foot

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group B was
2.86 spots

The mean number of contaminated spots in the intervention group was
0.96 lower
(2.35 lower to 0.43 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Palm

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group B was
17.83

The mean number of contaminated spots in the intervention group was
7.72 lower
(15.65 lower to 0.21 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Foot

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group D was
4.96

The mean number of contaminated spots in the intervention group was
4.1 lower
(6.94 to 1.26 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Palm

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group D was
20.49

The mean number of contaminated spots in the intervention group was
12.76 lower
(21.62 to 3.9 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

Compliance with guidance

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on compliance with guidance.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group.
CI: Confidence interval;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Simulation study, downgraded for indirectness
2 One study 100 participants, 25 participants per arm, downgraded for imprecision

3 Unclear risk of bias in the study, downgraded one level

Open in table viewer
Summary of findings 3. Comparison 1: One type of PPE versus another – Gowns versus aprons

Gowns versus aprons for preventing highly infectious diseases due to contact with contaminated body fluids in healthcare staff

Patient or population: healthcare worker volunteers
Settings: simulation study
Intervention: gowns versus aprons

Outcomes

Illustrative comparative risks* (95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Aprons

Gowns

Contamination with marker; individual type of doffing

Follow‐up: post intervention

The mean contamination with marker in the control groups was
16.98 small spots

The mean contamination with marker in the intervention groups was 10.28 lower (14.77 to 5.79 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Cross‐over study; the analyses were unadjusted for the paired nature of the data but similar to the analysis of the authors who took this into account

Contamination with marker; CDC recommended doffing

Follow‐up: post intervention

The mean contamination with marker in the control groups was
1.88 small spots

The mean contamination with marker in the intervention groups was
0.62 lower (1.75 lower to 0.51 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

Compliance with guidance

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on compliance with guidance.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group.
CI: Confidence interval;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Randomisation method unclear, downgraded one level
2 Simulation study, downgraded for indirectness
3 Single cross‐over study with 50 participants, downgraded for imprecision

Open in table viewer
Summary of findings 4. Comparison 2: One procedure for donning/doffing versus another – Doffing with double gloves compared to doffing with single gloves

Doffing with double gloves compared to doffing with single gloves for preventing contact with contaminated body fluids in healthcare staff

Patient or population: healthcare staff volunteers
Settings: Simulation study
Intervention: Doffing with double gloves
Comparison: Doffing with single gloves

Outcomes

Illustrative comparative risks* (95% CI)

Relative effect
(95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Doffing with single gloves

Doffing with double gloves

Contamination: virus detected ‐ All body parts

Follow‐up: post intervention

778 per 1000

280 per 1000
(124 to 607)

RR 0.36
(0.16 to 0.78)

18
(1 cross‐over study)

⊕⊝⊝⊝
very low1,2

Non‐randomised cross‐over study; the analyses were unadjusted for the paired nature of the data but the results are similar to those analysed taking into account the paired nature of the data

Compliance with guidance ‐ Noncompliance: any error

Follow‐up: post intervention

667 per 1000

720 per 1000
(467 to 1000)

RR 1.08
(0.7 to 1.67)

18
(1 cross‐over study)

⊕⊝⊝⊝
very low1,2

Infection with EVD

See comment

See comment

Not estimable

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).
CI: Confidence interval; RR: Risk ratio;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Similation study, downgraded one level
2 One cross over study with 18 participants, downgraded for imprecision

Open in table viewer
Summary of findings 5. Comparison 2: One procedure for donning/doffing versus another – CDC method versus individual doffing

CDC method versus individual doffing for preventing contact with contaminated body fluids in healthcare staff

Patient or population: healthcare staff volunteers
Settings: simulation study
Intervention: CDC method in doffing

Control: Individual method of doffing

Outcomes

Illustrative comparative risks* (95% CI)

Relative effect
(95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Individual doffing method

CDC recommended doffing method

Contamination with fluor marker when using gowns

Follow‐up: post intervention

The mean contamination with fluor marker in the control group was
6.7 small spots

The mean contamination with fluor marker in the intervention group was
5.44 lower
(7.43 to 3.45 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Cross‐over study; the analyses were unadjusted for the paired nature of the data but similar to the analysis of the authors who took this into account

Contamination with fluor marker when using aprons

Follow‐up: post intervention

The mean contamination with fluor marker in the control group was
16.98 small spots

The mean contamination with fluor marker in the intervention group was
15.1 lower
(19.28 to 10.92 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

Compliance with guidance

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on compliance with guidance.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group.
CI: Confidence interval;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Randomisation procedure unclear, downgraded one level
2 Simulation study, downgraded for indirectness
3 One cross‐over study with 50 participants

Background

available in

Description of the condition

Over 59 million people are employed in the healthcare sector worldwide (WHO 2006). Some of these healthcare workers (HCWs) are at risk of developing life‐threatening infectious diseases due to contact with patients’ blood or body fluids such as mucus or vomit. The risk of infection and its consequences vary, but the 2013 to 2015 Ebola Virus Disease (EVD) epidemic put healthcare workers at high risk of a disease with a very high fatality rate in the epidemic areas (Ebola 2014). Not only nurses and doctors are at risk, but also staff engaged in transportation, cleaning and burial of patients. Healthcare workers can also be at risk when seeing patients arriving from the epidemic areas (Forrester 2014). Due to the high risk of infection and the high fatality rate, hundreds of HCWs died in the epidemic areas (Kilmarx 2014). According to the most recent statistics from October 2015, there were 1049 registered cases of HCWs infected with 535 deaths (WHO 2015). Just a decade earlier, healthcare workers lost their lives due to the Severe Acute Respiratory Syndrome (SARS) epidemic (CDC 2003). Even though the transmission routes are different, EVD and SARS are both highly infectious and they can have fatal consequences and especially affect healthcare workers.

Healthcare workers can get infected through various routes of transmission, depending on the pathogen. Infection can occur through splashes and droplets of contaminated body fluids on non‐intact skin, or via needle‐stick injuries through intact skin. Infection can also occur when splashes or droplets of contaminated body fluids land on the mucous membranes in the eyes, mouth or nose, or when the same mucous membranes come into contact with contaminated skin, such as when rubbing the eyes with a hand carrying pathogens after shaking hands with a patient. For EVD, this is the main route of transmission, even though there is doubt about the transmission of virus particles through aerosols, or while performing patient care. For SARS, the highest risk of infection was due to inhalation of aerosols, but the disease was also transmitted through droplet infection. Another risk of HCW infection is that infected HCWs will infect patients or that they will act as a vector for the transfer of the disease between patients.

Here, we focus on highly infectious diseases which means that contamination with a small amount of infectious material can already lead to clinical disease. We also focus on those infections that have serious consequences such as a high case fatality rate because this has implications for the motivation of HCWs to protect oneself.

Description of the intervention

Exposure can be best controlled by organisational measures that minimise the exposure to contaminated body fluids or infected patients. One part of this comprehensive prevention strategy is that HCWs use proper personal protective equipment. The most important preventive measure is the proper organization of the hospital or health care unit to avoid unnecessary contact. Once this has been implemented, the main strategy for reducing physical exposure to highly infectious diseases is through personal protective equipment (PPE). Coveralls, gowns, hoods, masks, eye shields, and respirators, among others, are used to prevent skin and mucous membranes from becoming contaminated.

Personal protective equipment will only be effective if the equipment can form a barrier between the HCW and the exposure to contaminated body fluids. Therefore, standards have been developed that, once complied with, assure that PPE is of sufficient quality to protect against biohazards (Mäkelä 2014; NIOSH 2014). Even though the biohazard symbol (Figure 1) is widely used to indicate the presence of biohazards, it is not a label for protective clothing. For biohazards, these standards are based on laboratory tests that evaluate to what extent the fabric and the seams of protective clothing are leak‐tight, that is, they are impermeable for liquids, viruses, or both at certain pressure levels. The standards in Europe and the US are different. Personal protective equipment should contain a label that specifically indicates the standards against which it has been tested.


International symbol indicating biohazards

International symbol indicating biohazards

In Europe, there is standard EN 14126 for clothing, specifically coveralls that protect workers against biological hazards from microorganisms. Clothing compliant with the standard EN 14126 is classified with the same six clothing types as chemical protective clothing. Type one provides the most protection by complete encapsulation. Type three clothing protects against pressurised liquid splashes, but is also very leak‐tight, which makes it heavy to work in. Type four provides protection against non‐pressurised liquid splashes, and is more breathable. There is no requirement for the type of clothing, whether it be a coverall or a gown. In addition, the clothing material should be classified according to the ISO 16604 test against viral penetration. Again, materials can pass the test at six levels. Class six is the most protective, and indicates that the test bacteriophage particles do not pass through the fabric at a hydrostatic pressure of 20 kPa (2.9 psi), but for Class one, the fabric is protective only at a pressure of 0 kPa. There is a separate standard for surgical gowns, EN 13795, which is designed to protect the patient.

In the US, there is standard ANSI/AAMI PB70 2012 for surgical and isolation gowns to protect both patients and healthcare workers from becoming infected. The standard specifies four levels of protection, with the highest, level four, being tested for viral protection at a pressure of 2 psi. Level one is tested for water resistance, with less than 4.5 grams of water allowed to be absorbed during the test. There is also US standard NFPA 1999 for protective coveralls, which was specifically developed to address a range of different clothing items worn by emergency medical service first responders, and also applies to medical first receivers. NFPA 1999 lists many performance requirements for garments used by emergency medical personnel, including (but not limited to) viral penetration resistance, tensile strength, liquid integrity, and seam strength.

Thus, the qualities of garments certified by different standards are not fully comparable. Nonetheless they all aim to ensure that garments are of a quality that prohibits water and blood‐like fluids with virus particles, applied under a specified amount of pressure, to pass through. In addition, some standards have requirements that the whole garment, including the seams, must be non‐permeable to liquids (NFPA 1999).

For gowns to be used with EVD, WHO 2014 currently recommends EN 13795 high performance surgical gowns or ANSI/AAMI PB70 2012 level three (option one), level four (option two), or equivalent. As the first option for coveralls, WHO currently recommends protection equivalent to EN 14126, with clothing material that provides Class three protection against blood at 0.5 kPa, based on ISO 16603 (ISO 2004a), and Class two against viruses at a pressure of 1.75 kPa, based on ISO 16604 (ISO 2004).

Both in the EU and in the US, it is mandatory for employers to protect their workers against blood‐borne pathogens and other infections at work (EU 2010; OSHA 2012).

Clothing that is manufactured according to the standards mentioned above is impermeable to body fluids and viruses and will technically prevent skin contamination. However, this review does not deal with the technical physical standards of equipment, but rather if its use in practice will prevent contamination and infection.

There are several guidelines available for choosing proper PPE (Australian NHMRC 2010; CDC 2014; ECDC 2014; WHO 2014). Even though all guidelines propose using similar protective clothing, there are also noticeable differences. In 2014, guidelines differed widely. For example, WHO 2014 proposed double gloving only when carrying out strenuous tasks, or when in contact with body fluids whereas the other guidelines proposed always using double gloves. ECDC 2014 proposed taping gloves, boot covers and goggles onto the coveralls to prevent leaving any openings but the other guidelines did not recommend this. By March 2015, most guidelines had been updated and are now more in line with each other. However, differences still exist. WHO 2014 does not recommend taping, but ECDC 2014 does.

Overprotection can be a problem. Some propose using three layers of gloves, because according to their experience, this is best practice (Lowe 2014). However, it may make work more difficult, and eventually lead to an increased rather than a decreased risk of infection, especially during doffing (i.e. removing the PPE). For example, the combined use of several respirators probably does not lead to more protection, but considerably increases the burden on the worker (Roberge 2008; Roberge 2008a).

In spite of using proper PPE, probably the biggest risk of infection is associated with self‐contamination by HCWs inappropriately removing the PPE (Fischer 2014). Some types of PPE make donning and doffing more difficult, thereby increasing the risk of contamination (Zamora 2006). The highest risk time of doffing is usually managed by an assistant, who guides the worker through the process while watching for breaches, and spraying chlorine as each item is removed. There is evidence that when doffing PPE, the use of a double pair of gloves decreases the risk of contamination (Casanova 2012). How contamination of PPE occurs has also been clearly illustrated with a simulation study about cleaning up vomit (Makison 2014). The results of such simulation studies should increase HCWs' confidence in executing the donning and doffing procedures correctly, and thus can also be an incentive for their uptake and compliance with the guidelines. Therefore, specific guidance has been developed for donning and doffing PPE (CDC 2014; WHO 2014).

Compliance with guidance on correct PPE use in health care is historically poor. HCWs sometimes distrust infection control, and using PPE is stressful (Zelnick 2013). For respiratory protection such as masks and respirators, compliance has been reported to be around 50% on many occasions (Nichol 2008). Due to lack of proper fitting and incorrect use, real field conditions almost never match laboratory standards (Coia 2013; Howie 2005). Also, reports of hand hygiene show that there is still large room for improvement and guidelines recommend education and training in combination with other implementation measures (WHO 2009). From reports of HCWs, it is clear that most appropriate PPE is not user‐friendly in tropical conditions. It prevents heat loss through sweating because it is not made of breathable material. A common reason for a breach in the barrier of the PPE is the worker sweating and then instinctively wiping their face (Cherry 2006). Staff are being trained on arrival to the epidemic or treatment site by repeatedly practicing donning and doffing PPE and running through drills of what they should do if the protocol is breached while in the "red zone" (i.e. the Ebola patient area, also called hot zone).

In this review, we only concentrated on PPE for highly infectious diseases that have serious consequences for health, such as EVD. We excluded other highly infectious, but less serious viral infections, such as norovirus, as we expected the effect of PPE to be different. We included SARS as it was highly infectious to HCWs, sometimes fatal, and had similar recommendations on PPE use and training as EVD.

We did not specifically study the effects of hand hygiene or of respiratory protection. Hand hygiene is also crucial in preventing skin contamination, but this has already been covered in another review (Gould 2010). The protective effect of different types of respiratory protection, and effects of interventions to increase their uptake are covered in two other reviews (Jefferson 2011; Sakunkoo 2012).

How the intervention might work

First, HCWs, their supervisors, or occupational health professionals should choose the proper type of PPE, as indicated in the guidance described above. Then, a HCW needs to know how to don and doff PPE according to the guidelines provided. Next, a HCW needs to comply with established procedures for correctly using, donning and doffing PPE. Education and training is used to increase compliance. The emphasis in teaching correct use of PPE is on doing everything slowly and carefully to minimise the risk of making a mistake. Often an assistant or buddy, sometimes coupled with a mirror, is used while donning PPE, while a hygienist supervises doffing.

Compliance can be increased by personal supervision and instruction, checklists, audits of performance, by providing feedback, and by allowing sufficient time for donning and doffing. Education and training on uptake and compliance with PPE should have an effect in both the short term and the long term (Northington 2007; Ward 2011). Education and training can be seen as one method to increase compliance (Gershon 2009; Hon 2008). Compliance with PPE can also be improved by providing sufficient, comfortable, well‐fitting, and more user‐ and patient‐friendly PPE. Compliance with guidelines has been studied for hand hygiene. There is some evidence that multifaceted interventions and staff involvement are important, but altogether, there is little evidence that allows firm conclusions (Gould 2010).

Why it is important to do this review

There is still uncertainty about the optimal type, composition, amount, and way of using full‐body PPE to prevent skin and mucous membrane contamination of HCWs, while treating patients infected with highly infectious diseases. This is also reflected in the different ways guidelines for PPE are implemented in Europe (De Iaco 2012), and acknowledged in current WHO guidelines (WHO 2014).

Since full‐body protective suits have mainly evolved as a direct result of experiences gained from the recent outbreaks of deadly viruses, there are still many types available with varying types of components. The comparative effectiveness of one type against another is still unknown. Recent cases of HCWs becoming infected, despite wearing seemingly appropriate PPE when treating EVD patients, have raised the question of what really works and what does not when using a full‐body protective suit. There are ongoing debates about types of PPE, with individuals from various nations and roles having different opinions. Factors to consider when choosing PPE for a healthcare facility may include: availability of supply, because large numbers of disposable suits will be used; standardisation of equipment, which will avoid mistakes; worker preferences (e.g. goggles steam up, which could be avoided by visors); and costs of making equipment affordable for low income countries.

We have learned from those who have faced the Ebola epidemic that breaches in PPE use are common. One possible explanation for this could be the lack of a common language between HCWs representing international aid organisations and local staff. Another explanation is the often poor literacy of local staff. This means that checklists are often unhelpful for training and implementation of PPE. Yet another problem with PPE is that HCWs walking around residential areas wearing what might be described as "astronaut suits" can be scary and may impact on community engagement and service avoidance.

HCWs working with Ebola patients and occupational health professionals still have uncertainty about which types of equipment to choose, the best procedures for doffing and how to deal with breaches of the barrier of the PPE, and with gross contamination.

Regarding the equipment, there is uncertainty if visors protect as well as goggles, especially when goggles are combined with a hood. It is not immediately obvious if the strap of the goggles should go over or under the hood. There is uncertainty if triple or quadruple gloves would be more protective than double gloves. Regarding suits, it is unclear if gowns are as protective as coveralls, and how breathable and impermeable for liquids or viruses they should be. Some argue that using more breathable material would decrease the risk of contamination (Kuklane 2015).

When it comes to donning and doffing procedures, there is uncertainty about the effect of integrity checks of gloves and other equipment, and if gloves should be changed when highly contaminated. With doffing especially, it is unclear if this should be done in pairs with a helper buddy removing part of the PPE, or if this can be done alone or in pairs. Another element of the doffing procedure that is uncertain is if spraying with a disinfectant chlorine spray is more protective than not using spray. It is not clear which disinfectant is the best anti‐viral: chlorine solution or alcohol gel, and at which concentration.

The complexity of the drill and the procedures for updating skills, retraining, and responding to individual training needs after a potential or realised breach are also important.

Objectives

available in

To summarise and critically appraise current evidence of the effectiveness of PPE for preventing nosocomial infection in healthcare staff exposed to body fluids contaminated with viral haemorrhagic fevers such as EVD, Lassa, Marburg, Congo‐Crimean Haemorrhagic Fever, or comparable highly infectious diseases with serious consequences, such as SARS. In particular, we addressed questions identified from the West Africa EVD epidemic, that include the evaluation of the effect of:

  • one type or component of PPE as part of full‐body protection PPE versus another on contamination and infection rates;

  • one procedure in donning and doffing full‐body PPE versus another on contamination and infection rates; and

  • one intervention to improve compliance with guidelines for full‐body PPE, including education and training, versus another on compliance, contamination and infection rates.

Methods

available in

Criteria for considering studies for this review

Types of studies

Since the circumstances for evaluation studies are difficult during epidemics, we anticipated including a broad range of study designs.

We included simulation studies of different full‐body PPE types for donning and doffing procedures that use marker chemicals that make contamination visible. We included any study that compared different types of PPE or different procedures of donning and doffing, or different types of education and training.

We included field studies that compared outcomes between hospitals or treatment centres that used different types of PPE, different procedures, or different types of education and training. These studies were observational and retrospective, and measured infection rates and compliance rates. This included cohort studies, defined as studies that followed HCWs over time and compared the effect of PPE, procedures, or training on infection or compliance rates. This also included case‐control studies that compared PPE, procedures, or training retrospectively between cases that had become infected and comparable controls that did not get infected. We also included randomised controlled trials that compared different types of PPE, procedures, or training.

We intended to also include uncontrolled audit reports or case reports of PPE failure for descriptive purposes, but we did not find any. If we find any such reports in future updates of this review, we will not use them for drawing conclusions, but only to compare with findings produced by the above study types.

Types of participants

For simulation studies, we included participants using PPE designed for EVD or comparable highly infectious diseases with serious consequences.

For field studies, we included studies conducted with HCWs and ancillary staff exposed to body fluids in the form of splashes, droplets or aerosols contaminated with particles of highly infectious diseases that have serious consequences for health such as EVD or SARS. We excluded studies conducted with laboratory staff because there preventive measures would be more detailed and easier to comply with.

Types of interventions

1. We included studies that evaluated the effectiveness of different types of whole‐body protection, comparing different types, compositions, or amounts of the following:

  • body protection such as gowns, coveralls or hazmat suits;

  • eye and face protection such as glasses, goggles, face shields or visors, or masks or hoods that cover the entire head;

  • hand protection: gloves; and

  • foot protection: overshoes or boots.

We defined PPE as any of the above equipment designed or intended to protect healthcare staff from contamination with infected patients' body fluids.

We especially sought to include studies that had compared the use of gowns with coveralls, different types of fabrics, such as less breathable fabrics with more breathable fabrics, goggles versus visors, various forms of hoods in combination with goggles, single versus multiple layers of gloves, and taping versus no taping of separate elements of PPE.

2. We included studies that evaluated the effectiveness of different procedures for donning and doffing of the PPE.

We especially sought to include studies that had compared a single person or two person procedure, procedures with and without spraying disinfectants, procedures for changing gloves or PPE after gross contamination or breach of barrier protection versus no change.

3. We included any intervention to increase compliance with guidance for selection and proper use of PPE, including but not limited to:

  • education,

  • training,

  • supervision during donning and doffing,

  • information only such as posters, guideline leaflets, etc.,

  • audit and feedback, or

  • monetary or organisational incentives.

Types of outcome measures

Primary outcomes

We included all studies that had measured the effectiveness of interventions as:

  • contamination of skin or clothing, measured with any type of test material to visualise contamination (e.g. stains made visible with UV‐light);

  • infection with EVD, another viral haemorraghic fever, or comparable highly infectious disease with serious consequences such as SARS; or

  • compliance with guidance on selection of type and use of PPE measured, for example, with an observation checklist.

Secondary outcomes

  1. User‐reported assessment of comfort and convenience

  2. Costs or resource use

  3. Time to don and doff the PPE

For future updates of this review we will also look for other outcomes that appear relevant to the questions being addressed.

Search methods for identification of studies

Electronic searches

We conducted a systematic literature search to identify all published and unpublished trials that could be considered eligible for inclusion in this review. We adapted the search strategy we developed for PubMed (see Appendix 1) for use in the other electronic databases. The literature search identified potential studies in all languages. We asked native speakers to assess the papers in Russian (AP) and in Chinese (CCC) for potential inclusion in the review.

We searched the following electronic databases from inception to present for identifying potential studies:

  • Cochrane Central Register of Controlled Trials (CENTRAL) (Wiley Online Library) until 20 January 2016;

  • MEDLINE (PubMed) (Appendix 1) until 8 January 2016;

  • EMBASE (embase.com) to 8 January 2016;

  • CINAHL (EBSCOhost) to 20 January 2016;

  • NIOSHTIC (OSH‐UPDATE) to 8 January 2016;

  • NIOSHTIC‐2 (OSH‐UPDATE) to 8 January 2016;

  • HSELINE (OSH‐UPDATE) to 8 January 2016;

  • CISDOC (OSH‐UPDATE) to 8 January 2016;

We also conducted a search of ClinicalTrials.gov (www.ClinicalTrials.gov) and the WHO trials portal (www.who.int/ictrp/en/) which includes the Pan African Registry for potential studies on EVD. We searched all databases from their inception to the present and we did not impose a restriction on language of publication.

Searching other resources

We checked reference lists of all primary studies and reviewed articles for additional references. We contacted non‐governmental organisations involved in medical relief operations in the high risk EVD areas to identify additional unpublished materials:

  • Médécins Sans Frontières (MSF)

  • Save the Children

We also used Twitter to ask for unpublished reports from people in the field. Evidence Aid helped in locating relevant organisations and asking them for unpublished reports.

We further contacted the following manufacturers of PPE to request unpublished studies:

  • DuPont

  • 3M

  • Alpha Pro Tech

Data collection and analysis

Selection of studies

Pairs of two review authors (JV, CM, SI, JR, KN and RS) independently screened titles and abstracts of all the potential studies that we identified with our systematic search, to identify studies for inclusion. The same authors coded them as 'retrieve' (eligible or potentially eligible/unclear) or 'do not retrieve'. We retrieved the full‐text study reports/publication and pairs of two review authors (JV, CM, SI, JR, KN and RS) independently screened the full‐text and identified studies for inclusion, and identified and recorded reasons for exclusion of the ineligible studies. We used the computer programme Covidence for the selection of references and full‐text studies. We resolved any disagreement through discussion so there was no need to consult a third review author. We identified and excluded duplicates and collated multiple reports of the same study so that each study rather than each report is the unit of interest in the review. We recorded the selection process in sufficient detail and completed a PRISMA flow diagram (see Figure 2) and a 'Characteristics of excluded studies' table.


PRISMA study flow diagram

PRISMA study flow diagram

Data extraction and management

We used a data collection form for study characteristics and outcome data which had been piloted on one included study. Two review authors (JV, CM, JR, SI, ME, KN, RS) independently extracted the following study characteristics from included studies:

  1. Methods: study design, total duration of study, study location, study setting, withdrawals, and date of study.

  2. Participants: N, mean age or age range, gender, severity of condition, diagnostic criteria if applicable, inclusion criteria, and exclusion criteria.

  3. Interventions: description of intervention, comparison, duration, intensity, content of both intervention and control condition, and co‐interventions.

  4. Outcomes: description of primary and secondary outcomes specified and collected, and at which time points reported.

  5. Notes: funding for trial, and notable conflicts of interest of trial authors, country where trial was conducted.

Pairs of two review authors (JV, CM, SI, JR, KN, ME, RS) independently extracted outcome data from included studies. We noted in the 'Characteristics of included studies' table if outcome data were not reported in a usable way. We resolved disagreements by consensus so there was no need to involve a third review author. One review author (JV) transferred the data into Review Manager (RevMan 2014). We double‐checked that data were entered correctly by comparing the data presented in the systematic review with the study reports. A second review author (CM) spot‐checked study characteristics for accuracy against the trial report.

Assessment of risk of bias in included studies

Pairs of two review authors (JV, CM, SI, JR, KN, ME, RS) independently assessed risk of bias for each randomised study using the criteria outlined in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011). We resolved any disagreements by discussion so there was no need to involve another author. We assessed the risk of bias according to the following domains in all randomised controlled trials.

  1. Random sequence generation,

  2. Allocation concealment,

  3. Blinding of participants and personnel,

  4. Blinding of outcome assessment,

  5. Incomplete outcome data,

  6. Selective outcome reporting, and

  7. Other bias

We rated each potential source of bias as high, low or unclear and provided a quote from the study report or author together with a justification for our judgment in the 'Risk of bias' table. We summarised the risk of bias judgements across different studies for each of the domains listed. For compliance, we considered blinding to PPE type significant for the outcome assessor only. Where information on risk of bias relates to unpublished data or correspondence with a trial author, we noted this in the 'Risk of bias' table.

We considered randomised studies to have a low overall risk of bias when we judged random sequence generation and blinded outcome assessment to have a low risk of bias and none of the other domains to have a high risk of bias.

We used domains three to seven listed above for all non‐randomised studies. Instead of the domains one and two ‐ random sequence generation and allocation concealment ‐ we used the following items as suggested in the Cochrane Acrobat tool for the assessment of risk of bias in non‐randomised intervention studies:

  1. Bias due to confounding. We made an overall assessment of risk of bias based on the following questions if the signalling question "Is confounding of the effect of intervention unlikely in this study?" was answered with no.

    • Did the authors use an appropriate analysis method that adjusted for all the critically important confounding domains?

    • Were confounding domains that were adjusted for measured validly and reliably by the variables available in this study? For this review question, we considered baseline differences between compared groups in the following factors significant: prior experience with PPE, healthcare qualification or education of HCWs, age and sex, ambient temperatures, and stressful activities.

  2. Bias due to selection of participants into the study. We made an overall assessment of this risk of bias based on the following questions if the signalling questions "Was selection into the study unrelated to intervention or unrelated to outcome?" and "Do start of follow‐up and start of intervention coincide for most subjects?" were answered with no.

    • Were adjustment techniques used that are likely to correct for the presence of selection biases?

    • For case‐control studies: Were the controls sampled from the population that gave rise to the cases, or using another method that avoids selection bias?

We considered the domains of confounding and selection of participants to yield high, low or unclear risk of bias. For a non‐randomised study as a whole, we considered the study to have a low risk of bias if all domains received a judgment of low risk of bias comparable to an RCT. This means receiving a low risk of bias judgment on the two domains listed above as well as domains three to seven in the previous section.

When considering treatment effects, we took into account the risk of bias for the studies that contributed to that outcome.

Assesment of bias in conducting the systematic review

We conducted the review according to the published protocol and where there were deviations from it, we reported these in the 'Differences between protocol and review' section of the systematic review.

Measures of treatment effect

We entered the outcome data for each study into the data tables in RevMan 2014 to calculate the treatment effects. We used risk ratios (RRs) for dichotomous outcomes, and mean differences (MDs) or standardised mean differences (SMDs) for continuous outcomes. When only effect estimates and their 95% confidence intervals or standard errors were reported in studies, we entered these data into RevMan 2014 using the generic inverse variance method. When authors used multivariate analyses, we used the most adjusted OR (Odds Ratios) or RRs. We ensured that higher scores for continuous outcomes had the same meaning for the particular outcome, explained the direction and reported where the directions were reversed, if this was necessary. If in future updates of this review we come across studies reporting results that we cannot enter in either way, we will describe them in the 'Characteristics of included studies' table, or we will enter the data into Additional tables. For cohort studies that compare an exposed to a non‐exposed population we intended to report both the RR for the intervention versus the control at baseline and at follow‐up for dichotomous outcomes to indicate the change brought about by the intervention but we did not find any such studies.

Unit of analysis issues

If in future updates of this review we come across studies that employ a cluster‐randomised design and that report sufficient data to be included in the meta‐analysis but do not make an allowance for the design effect, we will calculate the design effect based on a fairly large assumed intra‐cluster correlation of 0.10. We based this assumption of 0.10 being a realistic estimate by analogy on studies about implementation research (Campbell 2001). We will follow the methods stated in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011) for the calculations.

We intended to take the paired nature of the cross‐over design in the included studies into account in our data analysis. However, the included studies did not present suffidient data to do so and the results presented here are based on the unpaired test that is implemented in RevMan 2014 which resulted in wider confidence intervals than with the use of a paired t‐test.

Dealing with missing data

We contacted investigators in order to verify key study characteristics and obtain missing numerical outcome data where possible (e.g. when a study was identified as abstract only). If in future updates of this review we come across studies where this is not possible, and the missing data are thought to introduce serious bias, we will explore the impact of including such studies in the overall assessment of results by a sensitivity analysis.

Similarly, If in future updates of this review we come across studies where numerical outcome data are missing, such as SDs or correlation coefficients and they cannot be obtained from the authors, we will calculate them from other available statistics such as P values, according to the methods described in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011).

Assessment of heterogeneity

We assessed the clinical homogeneity of the results of included studies based on similarity of population, intervention, outcome and follow‐up. We considered populations as similar when they were HCWs directly engaged in patient treatment (nurses, doctors, paramedics) versus those who were not involved in patient therapy directly (cleaning and transport staff).

We considered interventions as similar when they fell into one of the intervention categories as stated in Types of interventions.

We considered any assessment of contamination of the skin or mucous membranes as similar enough to combine.

We considered the following follow‐up times as similar: from immediately following a procedure up until the end of the work shift (short term), and any time after the incubation time (long‐term).

We used the I² statistic to measure heterogeneity among the trials in each analysis. Where we identified substantial heterogeneity, we reported it and explored possible causes by prespecified subgroup analysis. We regarded an I² value above 50% as substantial heterogeneity.

We made a distinction between any kind of protection and protection that had been certified based on testing for biological hazards like CEN (European Committee for Standardization) or comparable standards.

Data synthesis

We pooled data from studies we judged to be clinically homogeneous using Review Manager 5.3 software (RevMan 2014). Where more than one study provided usable data in any single comparison, we performed a meta‐analysis. We used a random‐effects model when I² was above 40%; otherwise we used a fixed‐effect model. Where I² was higher than 75% we did not pool results of studies in meta‐analyses. We included a 95% confidence interval (CI) for all estimates.

If in future updates of this review, we come across studies with skewed data reported as medians and interquartile ranges, we will simply describe the data.

If in future updates of this review, we come across studies reporting multiple trial arms in a single trial, we will include only the relevant arms. If we need to combine two comparisons in the same meta‐analysis, we will halve the control group to avoid double‐counting.

Summary of findings table

We created a series of 'Summary of findings' tables to present the primary outcomes for Comparison 1 (One type of PPE versus another) and Comparison 2 (One procedure for donning/doffing versus another) only, as we felt these findings were the most useful to present as Summary of Findings tables. We used the five GRADE considerations (study limitations, consistency of effect, imprecision, indirectness and publication bias) to assess the quality of a body of evidence as it related to the studies that contributed data to the meta‐analyses for the prespecified outcomes. We used methods and recommendations described in Section 8.5 and Chapter 12 of the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011), using GRADEpro software (GRADEpro 2008). We justified all decisions to down‐ or upgrade the quality of studies using footnotes and we made comments to aid reader's understanding of the review where necessary. For non‐randomised studies, we started at the low‐quality level and for randomised studies at a high‐quality level. For future updates of this review, if the outcomes are measured in many different ways, we will prioritise the reporting of outcomes as follows: infection rates, contamination rates, compliance rates.

Subgroup analysis and investigation of heterogeneity

If future updates of this review find a sufficient number of studies, we will carry out the following subgroup analyses: high income versus low and middle income countries. We will also use our primary outcomes in subgroup analyses, and we will use the X² test, as implemented in RevMan 2014, to test for subgroup interactions. At this time, we did not identify enough studies to allow for such a subgroup analysis.

Sensitivity analysis

If future updates of this review find a sufficient number of studies, we will perform sensitivity analyses defined a priori to assess the robustness of our conclusions. This involves including only studies we judge to have a low risk of bias. At this time we did not identify enough studies to allow such a sensitivity analysis. However, we did check if using a fixed‐effect model instead of a random‐effects model had an influence on our conclusions when heterogeneity was between 10% and 50%.

Reaching conclusions

We based our conclusions only on findings from the quantitative or narrative synthesis of included studies we judged to have the lowest risk of bias. Consequently, we used findings from non‐randomised studies when we did not find evidence from randomised trials. We avoided making recommendations for practice based on more than just the evidence, such as values and available resources. Our implications for research suggest priorities for future research and outline what the remaining uncertainties are in the area.

Results

Description of studies

Results of the search

Alltogether, we screened 10234 references (see Figure 2) in three separate batches. The first search was ran in February‐April 2015, the second in October 2015 and the third in January 2016. We report the results of these searches here together. The search in PubMed yielded 6898 references. When we excluded references already identified in MEDLINE, EMBASE yielded 1023 references. In CENTRAL, the search yielded 152 references. From CINAHL, we retrieved 888 references and from OSH‐update 1283. From these references, we selected 205 articles for full‐text assessment. Through checking the references of included articles we found 18 additional articles, by using Google another five and through contacting NGOs one (Tomas 2015). Our contacts with the manufacturers did not yield any answers. Most of the studies that we did not locate with our electronic searches were studies of PPE use during the SARS epidemic that did not make reference to any type of PPE in the title or abstract. The same happened during the EBV epidemic where we could not locate Nyenswah 2015 because there was no reference to PPE. By using Google search, we found one additional article (Bell 2015) that wasn't indexed in any of the databases that we searched. We did not locate Tomas 2015 with our search strategy because the authors did not use any words referring to infection, disease or decontamination. Therefore we checked if there would be any other studies that only used the word contamination in addition to PPE. We did not find any other additional studies that we missed with our search strategy.

Based on a request of one of the peer referees we also searched the African Index Medicus which yielded 24 references but no new studies to include. For the next update of the review we will search also this database.

This added up to 205 papers that we checked full‐text for inclusion. Of these, we excluded 196. This resulted in nine included studies.

Included studies

Study Types

We located eight simulation studies of which six simulated exposure to contaminated body fluids and two studies simulated donning and doffing procedures.

Of these simulation studies five were randomised studies (three parallel group (Bell 2015; Hung 2015; Wong 2004) and two cross‐over (Guo 2014; Zamora 2006)) and three were non‐randomised controlled studies (one cross‐over (Casanova 2012; and two parallel group (Buianov 2004; Casalino 2015)).

In addition, we found one retrospective cohort study that evaluated the effect of PPE training on SARS infection rates and noncompliance with the doffing protocol (Shigayeva 2007). In this study, the authors located all HCWs that had been exposed to SARS patients and assessed, by questionnaire, compliance with PPE guidelines and PPE doffing guidelines.

Participants

In the simulation studies, researchers included 266 intervention and 139 control participants, taking into account that four used a cross‐over design and thus all participants were intervention participants. In the cohort study, there were 563 intervention and 232 control participants. Altogether there were 1031 participants.

The participants in all studies were healthcare workers with a mixture of occupations, but mainly physicians, nurses and respiratory technicians. There was one study that included medical students during their internships (Casalino 2015). There were no studies that included other healthcare staff such as persons working in emergency services or cleaning staff.

Countries

Two studies were performed in Canada, three in China and Hong Kong, two studies in the US, one in Russia and one was performed in three countries at the same time: France, Peru and Mexico (Casalino 2015). There were no studies that were carried out in countries dealing with an EVD epidemic.

Time period

All studies had been conducted after the year 2000, with four before and five after 2010.

Interventions and comparisons

The nine studies evaluated ten interventions. Five studies compared one type of PPE to another. Four studies compared two different ways of doffing. One study evaluated the effect of instruction and training.

Comparison 1: One type of PPE versus another

Five simulation studies compared different types of PPE outfits, but all in a different way. None of them were similar enough to be combined. None of the included studies used a standardized classification of the properties of the PPE that protect against viral penetration such as the EN 14126.

Buianov 2004 compared two different types of Powered Air‐Purifying Respirator (PAPR) that were escpecially developed for this project in Russia to protect health care personel against Ebola and similar viruses. Buianov 2004 also compared the effect of different airflow rates that varied from 50 to 300 liters per minute. The intervention participants were rquired to carry out a step test that lasted for four hours. The authors did not describe the equipment they tested in sufficient detail to be able to judge their technical qualities.

Zamora 2006 compared the use of a PAPR in use at the study hospital with the PPE for Enhanced Respiratory and Contact Precautions (E‐RCP)CDC recommended at the time of the study.

Wong 2004 compared four types of PPE according to their material properties. First, they tested the material according to the American Association of Textile Chemists and Colorists standards 22 and 127. We excluded the surgical gowns only category since it had no water repellency and insufficient viral barrier properties. Type A had good water repellency and water penetration resistance, but at the cost of poor air permeability. Type B had good water repellency and good air permeability, but poor water penetration resistance. Type C was the surgical gown with both poor water repellency and water penetration resistance. Type D, Barrierman ®, was made of Tyvek ® and had good water repellency, poor air permeability and fair water resistance.

Bell 2015 compared commercially available PPE compliant with CDC recommendations with locally available clothing, such as rain coats that were thought to be as protective as the commercially available ones.

Guo 2014 compared three types of PPE: disposable water resistant non‐woven gown, reusable woven cotton gown, and disposable non‐woven plastic apron. The second one was a cotton, water permeable, gown like a surgical gown. We left this arm out of the analysis because surgical gowns alone are not used for EVD. They tested the fabrics for water repellency and liquid penetration according to the American Association of Textile Chemists and Colorists standard 22. The gown and the apron received ratings of four and five respectively on a scale of zero to five for water repellency.

Contamination rates are not only determined by the type of PPE but also by the donning and doffing procedures. All studies had a priori determined donning and doffing procedures. It should be noted that these studies evaluated the totality of the type of PPE with the donning and doffing procedure. We have described the procedures in the 'Characteristics of included studies' table.

Comparison 2: One procedure for donning/doffing versus another

Two studies specifically evaluated different ways of donning and doffing.

Casanova 2012 compared the effect of wearing two pairs of gloves with wearing one pair of gloves on contamination rates. The study was classified under methods of doffing because the intention of the double gloving was to decrease the contamination during doffing. Doffing was done as per CDC recommendation, which describes how to do both single gloving and double gloving.

Guo 2014 compared the effect of doffing a gown or an apron according to an individual's own views versus the procedure recommended by CDC in the US in 2007: The following instruction were given: "Gown front and sleeves are contaminated! Unfasten neck, then waist ties. Remove gown using a peeling motion; pull gown from each shoulder toward the same hand. Gown will turn inside out. Hold removed gown away from body, roll into a bundle and discard into waste or linen receptacle".

Comparison 3: One intervention to improve compliance versus another

Three studies evaluated different training methods for donning and doffing procedures.

Casalino 2015 compared what they called reinforced training versus training that was not reinforced. The reinforcement consisted of an instructor saying out loud what the next step of donning and doffing was. They used the reinforcement with both basic PPE and enhanced PPE where the enhancement consisted of a full‐body suit and hood instead of a impermeable apron without a hood.

Hung 2015 compared a conventional training session for donning and doffing procedures to a procedure in which the conventional session was complemented with a computer simulation later in time.

Shigayeva 2007 evaluated the effect of active and passive training versus no training on compliance rates. Active training was defined as training that involved any group or face‐to‐face interaction. Passive training was defined as watching a video or receiving written instructions. This allowed us to make an indirect comparison between the effect of active and passive training. We calculated the effect of active training compared to passive training by subtracting the OR for passive training from the OR for active training, as outlined in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011). We calculated the variance of this indirect comparison by summing the variances of both direct comparisons. Then we calculated the SE by taking the square root of the combined variance. We used this as input for the generic inverse variance method in RevMan.

Outcomes
Infection rates

No studies evaluated the effect of the interventions on infection rates.

Contamination outcomes

Simulation studies simulated and measured contamination by using either a fluorescent marker that lit up with the use of a UV‐lamp, a non‐pathogenic bacteriophage MS2 virus, or 'aerosol containing microbes' to contaminate PPE, a manikin and its surroundings, or both.

Study authors developed the fluorescent markers in various, non‐standardised ways as follows:

  • For exposure, Bell 2015 used a base mixture of 500 ml liquid detergent, 500 ml water and three fluorescent tablets (Fluorescent powder (GloGerm ®), liquid clothes detergent with bleach alternative (Tide ®) and dissolvable fluorescent tablets (Bright Dyes Orange Dye ®)). This was combined with oatmeal, chocolate powder and crushed cereal to simulate different body fluids. For measurement of contamination after doffing, the researchers assessed the participants with an LED black light panel (Chauvet ®) and took photographs. Participants were counted as contaminated or not contaminated.

  • For exposure Wong 2004 used 0.2 mg of fluorescein (25% diluted in 100 ml of water). After doffing, participants were assessed with an UV‐scan and photographed. Researchers measured the number of stains that lit up.

  • For exposure, Guo 2014 used a fluorescent powder (GloGermCo, Moab, UT), developed for determining hand hygiene compliance. The GloGerm ® powder was mixed with light olive oil and water to resemble human aerosol as closely as possible. For measuring contamination, the authors used a UV scan in dim light and measured fluorescent patches as small and large patches.

  • For exposure, Zamora 2006 used 1 ml of a 25% solution in 100 ml of sterile water to spray on the torso and the face of the participants. Then 'invisible' Detection Paste ® 15 ml was applied to the forearms up to the elbows, and the palms of the hands. After doffing, an observer assessed contamination with an UV‐lamp. An evaluator assessed and measured all areas of contamination.

Casanova 2012 used bacteriophage MS2 in the following way to detect contamination: PPE was contaminated with bacteriophage MS2 suspended in 0.01 mol/L phosphate‐buffered saline. Sites of contamination were the front of the shoulder, right side of the N95 respirator, right front of the eye‐protection, and the palm of the dominant hand. Each side was contaminated with 105 plaque forming units (PFU) in 5 drops of 5 µl each. After doffing, the researchers took swabs from the face and sampled the hands with the 'glove juice‐method' and put the scrub shirt and pants in eluant liquid for sampling.

Buianov 2004 used an aerosol containing microbes with a concentration of 108 colony forming units (CFU)/m3. There were no further specifications of the aerosol and what it contained. We could not reach the authors for further clarification.

Compliance with guidance: Noncompliance rates with donning and doffing procedures

Five studies evaluated the effect on noncompliance.

Two contamination simulation studies (Casanova 2012; Zamora 2006) also measured the effect of different attire on noncompliance with the donning and doffing protocol because the researchers thought that a more complicated PPE composition would lead to more errors. Noncompliance was measured as the number of participants that did not follow the correct order of the protocol, omitted elements, or did not use the correct equipment.

Noncompliance in one training study (Shigayeva 2007) was measured as the number of violations against protocol as recorded from interviews. There were two different compliance outcomes. One was called consistent adherence and was calculated as the proportion of exposure episodes with full compliance with PPE. The other one was called unsafe doffing, measured if one or more of the elements of the doffing procedure were violated. We recalculated outcomes in such a way that they represented the frequency of noncompliance.

Hung 2015 measured compliance as a score on a 16‐item checklist for donning and 20‐item checklist for doffing. To get results comparable to the other studies we substracted the mean compliance values from the maximum score and used these as noncompliance values.

Casalino 2015 measured noncompliance as the number of errors per person for donning and for doffing and the number of persons with one or more errors as measured by the specialist trainer/instructor who also gave the spoken instructions in case of reinforcement. The authors also measured critical errors, which were those where there was contact between skin and potentially contaminated PPE, but we did not consider this a valid measure of contamination and disregarded this. We took measurement of the errors at the last training session as the effect of the intervention. We disregarded the error measurements at earlier training sessions.

Costs and economic outcomes

No studies reported on costs or other economic outcomes such as resource use.

Other relevant outcomes

Buianov 2004 measured heart rate and body temperature. We chose to report the results of this outcome as well, as we identified it as an additional outcome that appeared relevant to the questions being addressed.

Excluded studies

Description of case series or outbreak

One reason for excluding important studies was that the researchers only described a case‐series of HCW cases' use of PPE for EVD (Muyembe‐Tamfum 1999), Marburg Haemorraghic Fever infection (MHF) (Borchert 2007; Colebunders 2004; Jeffs 2007; Kerstiens 1999), Congo Crimean Haemorraghic Fever (CCHF) (Gozel 2013) or for SARS (Christian 2004; Ho 2003; Ofner 2003; Ofner‐Agostini 2006). None of these studies described the use of PPE by the cases in such detail that they could be replicated. In combination with the lack of a control condition, it is difficult to conclude how much PPE, or the lack thereof, contributed to the infection. The only different study of a series of cases during an outbreak was the study by Dunn 2015 that contained proper descriptions of PPE.

Description of PPE use only

We excluded studies if they only described how and what PPE was used without relation to an outcome (Lowe 2014; Marklund 2002; Minnich 2003).

One type of PPE only

Luo 2011 and Tomas 2015 evaluated only one type of PPE without a comparison in a simulation study.

No infection rates or compliance outcomes

Some studies measured only performance with PPE compared to no PPE use and not infection rates or compliance (Castle 2009; Coates 2000; Hendler 2000).

Comparison with no PPE only

We excluded studies that only compared PPE use with no PPE and not with alternative PPE use (Lu 2006; Schumacher 2010; Teleman 2004).

Studies that evaluated only one type of PPE and not part of full body PPE

Ogendo 2008 measured eye protection only. Bearman 2007 measured universal glove use only. Chughtai 2013, Lindsey 2012 and Lindsley 2014 measured masks or face shields only. Even though these studies yield valuable information, it is unclear how well the results also cover the use of these items as part of full‐body protection and therefore we excluded these studies.

Participants not exposed to highly infectious diseases with serious consequences

Many studies evaluated PPE use for other diseases than EVD and related haemorraghic fevers, such as HIV or other nosocomial infections that were not considered highly infectious and/or having serious consequences and we excluded these studies (Malik 2006; Ransjo 1979; Sorensen 2008).

Training or simulation studies without a control group

There were a number of studies that evaluated training but that did not use a control group. This makes it difficult to draw inferences about the effect of one type of training compared to another (Abrahamson 2006; Beam 2014; Hon 2008; Northington 2007; Tomas 2015).

Inconsistent use of PPE during the SARS epidemic

After intensive discussion, we excluded 11 studies that measured the use of PPE (mask, gloves, gowns, goggles) during the SARS outbreak and related that to the risk of SARS infection. One line of thinking was that these studies did not fulfil the inclusion criteria because the comparison here was not clearly one type of PPE versus another type of PPE. Another line of thinking was that the studies compared different types of PPE composition and thus would fulfil the inclusion criteria. We finally decided to deal with these studies in the discussion section only (Ho 2004; Lau 2004; Le 2004; Liu 2009; Loeb 2004; Nishiura 2005; Park 2004; Pei 2006; Scales 2003; Seto 2003; Teleman 2004).

Risk of bias in included studies

See Figure 3 for an overview of our judgment of the risk of bias per study. Since the table contains the risk of bias assessments for both randomised and non‐randomised studies, not all cells are applicable to both study types and those that aren't applicable remain empty.


Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Allocation

Allocation was random in five studies but only two of them stated what method they had used for generating the random sequence (Wong 2004; Zamora 2006). Consequently we assessed these two as having a low risk of bias. Allocation concealment was not reported in the included randomised studies and therefore we rated it as unclear.

Blinding

In the simulation studies, the participants could not be blinded for the type of attire they were wearing or the type of donning or doffing procedure they were following. It is unclear if they could have contaminated themselves more with attire that they thought was not good, or they did not like, and we rated the risk of bias as unclear for all these studies. For one study, Casalino 2015, we rated the risk of performance bias as high, because the instructors who provided the intervention were very much aware if instruction was given or not and they were the assessors at the same time.

For the non‐randomised SARS study (Shigayeva 2007), we considered the risk of performance bias low because the study was retrospective and the participants did not know they were part of a study.

The risk of detection bias was unclear in most studies, as it not reported whether outcome assessors were blinded. We considered the risk to be high in one study (Casalino 2015) as providers of the intervention are also the assessors of compliance, and in a second study (Shigayeva 2007) because the intervention and the outcome were assessed with the same questionnaire at the same time. We judged the risk to be low in the remaining two studies because the authors stated that assessors were blind to group status (Hung 2015; Zamora 2006).

Incomplete outcome data

We judged the risk of attrition bias low in four studies (Bell 2015; Guo 2014; Shigayeva 2007; Zamora 2006) and unclear in five studies (Buianov 2004; Casalino 2015; Casanova 2012; Hung 2015; Wong 2004).

Selective reporting

Selective reporting was a risk of bias that was difficult to judge because none of the included studies had published a protocol. We judged Bell 2015 to be at risk of reporting bias because they did not report separate outcomes for high or normal exposure. We also judged Hung 2015 to have a high risk of reporting bias, as the authors did not fully report the results of the computer usability questionnaire. We judged two studies (Casalino 2015; Guo 2014) to have a low risk of reporting bias, as the authors appeared to have reported all relevant data.

Other potential sources of bias

We did not consider any other sources of bias.

Bias due to confounding (Non‐randomised studies)

We judged there to be a low risk of bias due to confounding in two non‐randomised studies (Casanova 2012; Shigayeva 2007), unclear risk in one non‐randomised study (Casalino 2015), and a high risk in one non‐randomised study (Buianov 2004).

Bias due to selection of participants into the study (Non‐randomised studies)

We judged there to be a low risk of bias due to selection of participants into the study for all four non‐randomised studies (Buianov 2004; Casalino 2015; Casanova 2012; Shigayeva 2007).

Overall Risk of Bias per study

We judged studies to have a low overall risk of bias if we judged them to have a low risk of bias in the following domains: both random allocation and allocation concealment, or both confounding and selection bias, and incomplete outcome data and selective reporting. We considered the blinding of participants and outcome assessors less important because the outcomes were objective or we could not imagine that participants would have an interest in a certain type of attire and outcome. This left us with no studies at low risk of bias but all at either unclear or high risk of bias.

Effects of interventions

See: Summary of findings for the main comparison Comparison 1: One type of PPE versus another – PAPR versus E‐RCP attire; Summary of findings 2 Comparison 1: One type of PPE versus another – Three types of PPE attire; Summary of findings 3 Comparison 1: One type of PPE versus another – Gowns versus aprons; Summary of findings 4 Comparison 2: One procedure for donning/doffing versus another – Doffing with double gloves compared to doffing with single gloves; Summary of findings 5 Comparison 2: One procedure for donning/doffing versus another – CDC method versus individual doffing

PPE types

1. Comparison 1: One type of PPE versus another
1.1 Powered air‐purifying respirator (PAPR) versus Enhanced respiratory and contact precautions (E‐RCP)

1.1.1 Outcome: Contamination with fluorescent marker

Zamora 2006 found that the PAPR system in use in their hospital led to less contamination than using the E‐RCP system (Relative Risk (RR) 0.27; 95% Confidence Interval (CI) 0.17 to 0.43, Analysis 1.1). Other ways of measuring contamination also led to less contamination with the PAPR system: contamination more than 1 cm (RR 0.21; 95% CI 0.12 to 0.36). The total contaminated area was also less with a Mean Difference (MD) of ‐81.10 cm² (95% CI ‐96.07 to ‐66.13). This was mainly due to a lack of protection of the neck in the E‐RCP system.

1.1.2. Outcomes: Compliance with guidance ‐ Donning and doffing noncompliance

Noncompliance with donning guidelines occurred more with the PAPR system as this consists of more elements (RR 7.50; 95% CI 1.81 to 31.10; Analysis 1.4; Zamora 2006). Noncompliance with doffing guidelines was more frequent with the E‐RCP system, but this was not statistically significant (RR 0.50; 95% CI 0.20 to 1.23; Analysis 1.5).

1.1.3. Outcomes: Donning and doffing time

The donning (MD = 259 seconds) and doffing time (MD = 337 seconds) were considerably longer with the PAPR system (Analysis 1.6; Analysis 1.7; Zamora 2006).

1.2 One type of PAPR versus another and different airflow rates

1.2.1 Outcome: Contamination with microbial aerosol

Buianov 2004 found that the suit that had the hood attached to the suit (СКБ‐I) had a lower 'contamination penetration rate' than the suits that had separate hoods and coveralls with a percentage of 8.10‐8 for the suit and 2.10‐1 for the coveralls. However, we could not understand the meaning of the penetration rate and we decided that we would not use these results for our conclusions (their results are not shown in data tables).

1.2.2 Outcomes: Heart rate and body temperature

Buianov 2004 also found that beyond 250L/min airflow rates there was no contamination anymore. Body temperature and heart rates were also lower at these airflow rates.

1.3 Four types of PPE versus another

Wong 2004 compared four types of PPE according to their material properties. Type A had good water repellency and water penetration resistance but at the cost of poor air permeability. Type B had good water repellency and good air permeability but poor water penetration resistance. Type C was the surgical gown with both poor water repellency and water penetration resistance. Type D, Barrierman ®, was made of Tyvek ® and had good water repellency, poor air permeability, and fair water resistance.

1.2.1 Outcomes: Contamination, User‐reported assessment of comfort and convenience ‐ usability, donning and doffing times

There were no considerable differences in contamination (Analysis 2.1) between Type A and Type B for face, neck, trunk, foot, or hand, but Type B scored about 10% higher on usability with MD ‐0.46 (95% CI ‐0.84 to ‐0.08; Analysis 2.2); this was due especially to better breathability of the fabric. There were no considerable differences in donning and doffing times (Analysis 2.3; Analysis 2.4).

There were considerable differences in contamination of the foot (MD ‐4.1 spots; 95% CI ‐6.94 to ‐1.26) and the hand (MD ‐12.76 spots; 95% CI ‐21.62 to ‐3.9) between Type A and Type D (Analysis 2.5); donning (MD 33 seconds, Analysis 2.7) and doffing (MD 17 seconds, Analysis 2.8) times were also much worse for Type D. Usability was not rated considerably differently (MD 0.25; 95% CI ‐0.12 to 0.62, Analysis 2.6).

It was unclear how many participants had no contamination. On average, all types of PPE had some contamination.

1.4 Formal PPE versus locally available PPE

Bell 2015 compared contamination in four subjects with formal PPE with four subjects with locally available protective gear, such as raincoats. They found contamination in one participant in both study arms. The study was so small that it is difficult to draw conclusions (Analysis 3.1).

1.5 Gown versus apron

Guo 2014 compared a gown with an apron and found that the gown left less contamination than an apron, regardless of the way of doffing (Analysis 4.1; Analysis 4.2).

2. Comparison 2: One procedure for donning/doffing versus another
2.1 Double gloving versus single gloving

2.1 Outcomes: Contamination with MS2 virus and Compliance with guidance ‐ compliance errors

Casanova 2012 found that contamination with the use of double gloves was less than with single gloves, if all contaminated sites were taken together (RR 0.36; 95% CI 0.16 to 0.78; Analysis 5.1). However, all participants had some level of contamination. Measured as the quantity of virus found, the hands were less contaminated after degloving when participants used double gloves but due to missing data this could not be tested (Analysis 5.2). There were no more errors in compliance with the donning or doffing protocol (RR 1.08; 95% CI 0.70 to 1.67; Analysis 5.3).

2.2 CDC's recommended versus individual doffing

Guo 2014 found that the CDC's recommended way of doffing a gown or an apron led to a different decrease in contamination than individually chosen doffing. When doffing the gown, there were 5.4 fewer smaller contamination patches (95% CI ‐7.4 to ‐3.4) and 5.2 fewer stains in the environment (95% CI ‐7.3 to ‐3.3), but no difference in small contamination patches on the hands, shoes or underwear. For doffing the apron, there were fewer smaller stains, stains on the hands, shoes, and environment, but more large stains and a similar number of stains on the underwear (Analysis 6.1; Analysis 6.2).

Comparison 3: One intervention to improve compliance versus another
Comparison 3a. Training and instruction for proper and complete PPE use
3a.1 Active training versus passive training

3a.1.1 Outcome: Compliance with guidance ‐ Noncompliance with PPE guidance

Shigayeva 2007 defined consistent adherence as always wearing gloves, gown, mask, and eye‐protection. We transformed this to inconsistent use as being non‐compliant with the guidance. The study found that active training led to less noncompliance than no training (OR 0.37; 95% CI 0.2 to 0.58; Analysis 7.1). For passive training, they found a lower risk of noncompliance compared to no training (OR 0.58; 95% CI 0.33 to 1.00). For the indirect comparison, active versus passive training, the OR was 0.63 (95% CI 0.31 to 1.30).

Comparison 3b. Training and instruction for PPE donning and doffing
3b.1. Active versus passive instruction

3b.1.2. Outcome: Compliance with guidance ‐ Noncompliance with doffing procedures

Shigayeva 2007 did not find a considerable effect of active (OR 0.70; 95% CI 0.45 to 1.11) or passive training (OR 1.56; 95% CI 0.83 to 2.94) compared to no training (Analysis 8.1), on the number of errors in compliance with the doffing protocol. For the indirect comparison, active versus passable training, the OR was 0.45 (95% CI 0.21 to 0.98).

3b.2 Additional spoken personal instructions versus no such instructions

4.1.1. Outcome: Compliance with guidance ‐ Noncompliance

Casalino 2015 found that there were substantially less noncompliance (persons with one or more errors) after additional spoken instruction compared to no instructions with RR = 0.31 (95% CI 0.11 to 0.93) and also that the mean number of errors fell with on average almost one (MD ‐0.89 95% CI ‐1.36 to ‐0.41) in the group with spoken instructions (Analysis 9.1; Analysis 9.2).

3b.3 Additional computer simulation versus no additional computer simulation

3b.3.1. Outcome: Compliance with guidance ‐ Noncompliance

Even though the number of errors was low already, Hung 2015 found that adding computer simulation reduced the number of errors with on average half an error for donning (MD = ‐0.52, 95% CI ‐0.90 to ‐0.14, Analysis 10.1) and with more then one error for doffing (MD = ‐1.16, 95% CI ‐1.63, ‐0.69) doffing (Analysis 10.2).

4. Subgroup and sensitivity analysis

We planned a subgroup analysis of high versus low and middle income countries. However, there were not enough studies for such a subgroup analysis to be meaningful.

We also planned a sensitivity analysis including only studies we judged to have a low risk of bias. As none of the included studies fulfilled this criterion, we could not perform this analysis.

5. Quality of the evidence

We judged if there was a reason to downgrade the quality of the evidence for each domain of GRADE. Since we judged all studies to have a high risk of bias, we downgraded all comparisons by one level. We considered simulation studies to be indirect evidence, and downgraded the evidence yielded by these studies by one level as well. In addition, when there was only one small study, we downgraded because of imprecision. All in all, the quality of the evidence was very low for all comparisons.

Discussion

available in

Summary of main results

We found no studies conducted among HCWs exposed to EVD. We found one study that was conducted among HCWs exposed to SARS. We found eight simulation studies that either simulated the exposure or the donning and doffing procedure.

There were five studies that compared various types of PPE to one another but we could not combine their results due to clinical differences and thus all conclusions are based on single studies. There is very low quality evidence that it seems possible to improve breathability of protective suits without increasing the risk of contamination. Improved breathability of protective suits also increases user satisfaction ratings. For doffing, there is low quality evidence that double gloving as part of full‐body PPE possibly reduces the risk of contamination and reduces the viral load on the hands without increasing the frequency of noncompliance with the doffing protocol. Following CDC recommendations for doffing gowns and aprons compared to individually chosen ways seems to decrease the risk of contamination. In all simulation studies, contamination happened both in the intervention arm and in the control arm for most participants.

For training, there is very low quality evidence from one SARS‐related study and two simulation studies that more active training in PPE use decreases noncompliance with donning and doffing guidance more than passive training. Active training comprised of, for example, spoken instruction and computer simulation. The effect of training on noncompliance with PPE guidance was not significant in one study.

We found no audit reports or other unpublished reports, even though we put considerable effort into asking PPE producers and NGOs involved in relief aid to provide them.

Overall completeness and applicability of evidence

We did not find evidence on a number of important comparisons for types of PPE: gowns versus coveralls, goggles versus visors, hoods in combination with goggles, or taping versus no taping. In the studies, there were no data on breach of PPE. Most studies provided sparse descriptions of the PPE they used, except for Zamora 2006. They gave a detailed description of the name and brand of each PPE item used, including details of the manufacturer. Five studies were over ten years old, and it is unclear if the attire used in those studies would still be obtainable today. This makes the evidence on absolute level of protection difficult to apply.

None of the studies used the ISO or EN classifications for chemical protection (ISO 2013) or for viral protection (ISO 2004; EN 14126). We believe that the standards are very helpful to determine the protective potential of PPE, and we don't see a reason to doubt the technical protection that these materials provide when properly implemented. A working group led by Médecins Sans Frontières (MSF) has suggested that a new standard should be developed for the ability of fabric to prevent penetration with EVD (Sprecher 2015). To us, this does not seem like the first research priority, given the high level of contamination in the included studies while donning and doffing, even after instruction and training.

Even though doffing procedures are fairly easy to evaluate in simulation studies, we found only two studies that evaluated the effect of double gloving and CDC recommendations for gowns and aprons. Important questions that we could not answer were: single versus two‐person procedures, with or without spraying of disinfectants, and what to do in a case of breach of barrier protection. It seems that it would not be difficult to perform simulation studies to find out how important these procedures are. For the evaluation of the use of disinfectants, bacteriophage MS2 could be a viable option for simulated exposure with the assumption that the disinfectant would work equally well for EVD and the MS2 virus. Using MS2 has the advantage of being able to quantify the contamination as the amount of virus that is found on the skin. For example Casanova 2012 found that the amount of virus on the hands after doffing ranged from 1.4 to 316 PFU (measured as MPN). According to Judson 2015 infectious doses of less than 10 PFU of EVD have been reported to cause viremia in non‐human primates. This means that in spite of protection, in the Casanova 2012 study there was still a risk of infection.

Because studies seem feasible and because we searched exhaustively, there must be other reasons why there is so little evidence. One of these is probably the highly politicized context in which such a study has to be performed during an epidemic. For example, during the epidemic in 2014, we weren't able to get in touch with WHO and the researchers of the systematic review commissioned by WHO did not want to communicate with us, presumably because of the political implications. During the Ebola epidemic, 28 HCWs who worked for MSF became infected and 14 of them died (MSF 2015).Two of those that have been infected were not locals, but international HCWs (French and Norwegian). There were a total of 250 international HCWs working during the Ebola epidemic which yields an infection risk of 0.8 %. As 26 out of 3000 national HCWs died their risk was also 0.8%. MSF assumes in their report that both international HCWs had an occupational infection and that all national HCWs were infected outside work. In their report, MSF also promises to provide more information. Even though we asked MSF several times for more explanation about their HCWs that became infected, we did not get a response. This stresses the need for a joint initiative, possibly coordinated by WHO to overcome the political pressures and to organise more studies.

While the included studies showed that more active training prevented errors, it is not clear how long the effects of training last. Northington 2007 showed that at six months after training, only 14% of participants were able to correctly don and doff PPE. It is unclear from the included studies, if fit‐testing of masks is part of training. This is a prerequisite for proper functioning of respiratory protection.

There were no studies conducted in a low‐ and middle‐income (LMI) country. Since most serious haemorrhagic fever epidemics occur in Africa, this is a serious disadvantage of the current evidence. Training on the correct way to use appropriate PPE is a challenge in these countries. We still hope that there are unpublished reports from the 2014 to 2015 Ebola epidemic that we can use for an update of this review.

From the SARS epidemic, it seems that the consistent use of PPE rather than the the type of PPE was most important (Appendix 6). At the start of the epidemic, SARS patients were not appropriately diagnosed, and the importance of PPE was not immediately clear. Personal protective equipment compliance was higher in the later stages, and infections occurred less frequently (Nishiura 2005). During the 1995 Ebola epidemic in Kiwit, a study also reported that once PPE and other control measures were used, there were very few HCW infections (Kerstiens 1999).

We set out to include and describe case studies in the hope that they would provide information on the effects of PPE. Only one (Dunn 2015) of the case studies we found provided systematic information on the use of PPE and therefore we weren't able to draw any conclusions from these. This is a similar experience as Hersi 2015 had as they performed a systematic review of case studies and case series but had to conclude that they did not provide useful information. We reanalysed the case study by Dunn 2015 as a cohort of exposed HCWs. The relative risk of contracting Ebola infection for HCWs using gloves only versus those not using PPE was 0.16 (0.04 to 0.71) indicating that using gloves already provides a lot of protection. For using gloves or a gown or more compared to no PPE, the RR was 0.03 (0.00 to 0.57) (Verbeek 2016). This is very similar to the findings of the SARS studies mentioned above. It is also, to a certain extent, reassuring for those situations in low‐ and middle income countries that do not have sufficient PPE available (Levy 2015) that some PPE already decreases the risk of infection considerably.

Quality of the evidence

We rated the quality of the evidence as very low for all comparisons, mainly because all the included studies had a high risk of bias. The SARS study had a high risk of recall bias because participants had to recall their use of PPE after the epidemic occurred. Half of the simulation studies had a very small sample size and did not report their outcomes by intervention or control group.

One of the major problems was that most of the studies did not indicate if the PPE that they used complied with one or more of the international standards for protective clothing and whether they used viral barrier fabrics. The lack of attention to the designation of PPE as being protective for viruses is problematic also in practice.

The many different labels and standards in use to designate protection against contamination with viral diseases such as EBV make it almost impossible to make the right choice for a HCW in practice. The confusing language of infection control has also been reported for isolation practices in general. Therefore, Landers 2010 called for the adoption of internationally accepted and standardized category terms for isolation precautions. Others have tried to improve the standardisation by providing HCWs with a summary card of the various types of precautions that have to be taken and indicated that this increased the implementation of precautionary measures (Russell 2015).

Zamora 2006 used attire recommended by the CDC in 2005, but since then, the recommendation has been superseded by much more stringent protection.

In simulation studies, it is not clear how well the exposure represents real life exposure. Some studies used 'high volume exposure to simulate splash' (Bell 2015), whereas other studies only used a powdered fluorescent marker spread in the room (Beam 2011). It is also not clear how well the fluorescent marker can indicate that there is no viral contamination. Casanova 2008 showed that in spite of no fluorescent marker being detected, there could still be viral contamination with bacteriophage MS2. Therefore, in simulation studies, the objective should be to reach zero contamination.

Only one of the case studies that we collected (Dunn 2015) properly described the use of PPE. Better description would enable better analysis.

Potential biases in the review process

We excluded all studies that evaluated only one piece of PPE, such as goggles or masks. However, none of these excluded studies would have answered the questions that in our current review remained unanswered. From Casanova 2012, it became clear that using double gloves as part of full‐body PPE is important, because it facilitates the removal of the other pieces of PPE without contaminating the hands. This shows that it is important to consider the effect of one piece of PPE as part of the full‐body PPE. In addition, seldom is there only one clear transmission route. Even with SARS which, as a respiratory infection, was spread by droplets and aerosols, consistent use of other pieces of PPE besides respiratory protection was still important. Therefore, we don't think that these strict inclusion criteria biased the results of our review.

We assumed that adherence to PPE use and training would work in a similar way between SARS, EVD and simulation studies. However, there is an important difference; at the start of the SARS epidemic, the causal virus and transmission were unclear and workers were probably not instructed well enough to protect themselves. On the other hand, it has been known for years that EVD is a highly contagious disease with a very high fatality rate. Thus, compliance and effectiveness of training concerning EVD might be higher than we concluded from the SARS study.

In the SARS studies that we excluded, there was high heterogeneity in the effects of consistently wearing PPE that we could not explain. The heterogeneity in effect is also underpinned by studies that did not find any SARS infections in spite of imperfect protection with PPE. This means that the effectiveness of PPE at best, is not fully understood.

Four of the simulation studies were cross‐over studies where the authors analysed the data with tests that took into account the paired nature of the data: Zamora 2006 used the Mailand‐Gart test, Guo 2014 used repeated measures, and Casanova 2012 the paired t‐test. We could not use the results of these tests in our analyses in RevMan which resulted in wider confidence intervals than if a paired analysis had been used. There were insufficient data in the studies to properly adjust for the cross‐over effect in our analyses. However, all results that were reported as being statistically significant were also statistically significant in our analyses. Therfore, we don't think that this has biased our results.

For the simulation studies, the way exposure was simulated is an important element to consider. This varied highly between the studies. However, most studies used a worst case scenario, spraying fluorescent marker over large parts of the body. For future studies, it would be good to have consensus on how exposure can be best simulated.

For the included non‐randomised studies, we assessed risk of bias with a hybrid version of the Cochrane risk of bias tool and the recently developed Acrobat tool. This might not have been the most optimal way to assess risk of bias. However, we believe that the limitations of the available studies are profound and a more rigorous bias assessment could not have lowered our confidence in the evidence.

Agreements and disagreements with other studies or reviews

We found two other reviews that have evaluated the effect of PPE for highly infectious diseases with serious consequences in HCWs: Hersi 2015 and Fischer 2015. Hersi 2015 was commissioned by WHO to underpin the PPE guidelines issued for HCWs exposed to EVD. The authors originally included only controlled studies of interventions to protect HCWs against EVD and similar hemorrhagic fever infections with infection rates as outcomes. During the review process the authors decided to also include case studies and case series but they were not able to draw conclusions from these studies because the PPE use was not well described. Fischer 2015 took a more pragmatic but unsystematic approach and included all articles pertaining to filovirus transmission and PPE and in addition articles that evaluated donning and doffing strategies. They conclude that there is a lack of evidence but that simulation studies could provide evidence for guidelines.

Heat stress and breathability is an important issue in PPE especially for Ebola. Kuklane 2015 argued that using other materials would substantially reduce the heat stress but these come at a tenfold higher price. Other researchers that have looked into this problem have found inconsistent results. Coca 2015 found that PPE on manikins led to a critical body core temperature of 38.4ºC in one hour. On the other hand, Grélot 2015 found that HCWs caring for Ebola patients had only a 0.46ºC rise in core body temperature after being at work for one hour. Of the 25 workers studied only four reached a core body temperature over 38.5ºC.

An independent panel of experts that evaluated the Ebola response (Moon 2015) concluded, among many other things, that a coordinated research effort is needed to build a better global system for infectious disease outbreak and response. Their recommendation is that research funders should establish a worldwide research and development financing facility for outbreak‐relevant drugs, vaccines, diagnostics, and non‐pharmaceutical supplies (such as PPE). This is very much in line with what we experienced and found in this review.

Missair 2014 reviewed implications of EVD patient management for anaesthetists based on a literature review of all types of studies on EVD. This is why their inclusion criteria were very broad and non‐specific. Finally the authors relied on PPE guidelines as provided by WHO and MSF to make recommendations with no evidence of their comparability. This makes their results difficult to compare to ours.

Moore 2005 reviewed all measures to prevent healthcare workers from SARS and other respiratory pathogens in a narrative format, from 168 publications. They concluded that a positive safety climate is the most important factor for adherence to universal precautions. They recommend using adequate PPE, but they do not define 'adequate'. Their inclusion criteria were much broader and the results are difficult to compare with ours. The same research group formulated valuable advice about research gaps based on this review but focused only on respiratory protection (Yassi 2005). They corroborate the findings of Jefferson 2011, that N95 respirators may not be all that superior, citing the early containment of the SARS epidemic without these in Hanoi.

The Cochrane review by Jefferson 2008, updated in Jefferson 2011, evaluated the effect of physical interventions to interrupt the spread of respiratory viruses for all populations. Even though they only included studies on respiratory infections and any type of protection for any person at risk, 10 studies in their review are about SARS and protecting healthcare workers. The authors did not conduct a subgroup or additional analysis of these HCW studies. Because the infection risk for HCW is substantially different from the populations they protect, the Jefferson 2011 results are not applicable to HCWs.

International symbol indicating biohazards
Figures and Tables -
Figure 1

International symbol indicating biohazards

PRISMA study flow diagram
Figures and Tables -
Figure 2

PRISMA study flow diagram

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.
Figures and Tables -
Figure 3

Risk of bias summary: review authors' judgements about each risk of bias item for each included study.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 1 Any contamination.
Figures and Tables -
Analysis 1.1

Comparison 1 PAPR versus E‐RCP Attire, Outcome 1 Any contamination.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 2 Contamination > 1 cm.
Figures and Tables -
Analysis 1.2

Comparison 1 PAPR versus E‐RCP Attire, Outcome 2 Contamination > 1 cm.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 3 Contamination area.
Figures and Tables -
Analysis 1.3

Comparison 1 PAPR versus E‐RCP Attire, Outcome 3 Contamination area.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 4 Donning noncompliance.
Figures and Tables -
Analysis 1.4

Comparison 1 PAPR versus E‐RCP Attire, Outcome 4 Donning noncompliance.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 5 Doffing noncompliance.
Figures and Tables -
Analysis 1.5

Comparison 1 PAPR versus E‐RCP Attire, Outcome 5 Doffing noncompliance.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 6 Donning time.
Figures and Tables -
Analysis 1.6

Comparison 1 PAPR versus E‐RCP Attire, Outcome 6 Donning time.

Comparison 1 PAPR versus E‐RCP Attire, Outcome 7 Doffing time.
Figures and Tables -
Analysis 1.7

Comparison 1 PAPR versus E‐RCP Attire, Outcome 7 Doffing time.

Comparison 2 Four types of PPE attire compared, Outcome 1 A vs B Contamination, mean number of spots.
Figures and Tables -
Analysis 2.1

Comparison 2 Four types of PPE attire compared, Outcome 1 A vs B Contamination, mean number of spots.

Comparison 2 Four types of PPE attire compared, Outcome 2 A vs B Usability score (1‐5).
Figures and Tables -
Analysis 2.2

Comparison 2 Four types of PPE attire compared, Outcome 2 A vs B Usability score (1‐5).

Comparison 2 Four types of PPE attire compared, Outcome 3 A vs B Donning time.
Figures and Tables -
Analysis 2.3

Comparison 2 Four types of PPE attire compared, Outcome 3 A vs B Donning time.

Comparison 2 Four types of PPE attire compared, Outcome 4 A vs B Doffing time.
Figures and Tables -
Analysis 2.4

Comparison 2 Four types of PPE attire compared, Outcome 4 A vs B Doffing time.

Comparison 2 Four types of PPE attire compared, Outcome 5 A vs D Contamination, mean number of spots.
Figures and Tables -
Analysis 2.5

Comparison 2 Four types of PPE attire compared, Outcome 5 A vs D Contamination, mean number of spots.

Comparison 2 Four types of PPE attire compared, Outcome 6 A vs D Usability score (1‐5).
Figures and Tables -
Analysis 2.6

Comparison 2 Four types of PPE attire compared, Outcome 6 A vs D Usability score (1‐5).

Comparison 2 Four types of PPE attire compared, Outcome 7 A vs D Donning time.
Figures and Tables -
Analysis 2.7

Comparison 2 Four types of PPE attire compared, Outcome 7 A vs D Donning time.

Comparison 2 Four types of PPE attire compared, Outcome 8 A vs D Doffing time.
Figures and Tables -
Analysis 2.8

Comparison 2 Four types of PPE attire compared, Outcome 8 A vs D Doffing time.

Comparison 3 Formal versus local available attire, Outcome 1 Contamination.
Figures and Tables -
Analysis 3.1

Comparison 3 Formal versus local available attire, Outcome 1 Contamination.

Comparison 4 Gown versus apron, Outcome 1 Contamination with marker; individual doffing.
Figures and Tables -
Analysis 4.1

Comparison 4 Gown versus apron, Outcome 1 Contamination with marker; individual doffing.

Comparison 4 Gown versus apron, Outcome 2 Contamination with marker; CDC doffing.
Figures and Tables -
Analysis 4.2

Comparison 4 Gown versus apron, Outcome 2 Contamination with marker; CDC doffing.

Comparison 5 Doffing with double gloves vs doffing with single gloves, Outcome 1 Contamination: virus detected.
Figures and Tables -
Analysis 5.1

Comparison 5 Doffing with double gloves vs doffing with single gloves, Outcome 1 Contamination: virus detected.

Comparison 5 Doffing with double gloves vs doffing with single gloves, Outcome 2 Contamination: virus quantity.
Figures and Tables -
Analysis 5.2

Comparison 5 Doffing with double gloves vs doffing with single gloves, Outcome 2 Contamination: virus quantity.

Comparison 5 Doffing with double gloves vs doffing with single gloves, Outcome 3 Non‐compliance: any error.
Figures and Tables -
Analysis 5.3

Comparison 5 Doffing with double gloves vs doffing with single gloves, Outcome 3 Non‐compliance: any error.

Comparison 6 CDC versus individual doffing, Outcome 1 Gown; Contamination with fluor marker.
Figures and Tables -
Analysis 6.1

Comparison 6 CDC versus individual doffing, Outcome 1 Gown; Contamination with fluor marker.

Comparison 6 CDC versus individual doffing, Outcome 2 Apron; Contamination with fluor marker.
Figures and Tables -
Analysis 6.2

Comparison 6 CDC versus individual doffing, Outcome 2 Apron; Contamination with fluor marker.

Comparison 7 Active training in PPE use versus passive training, Outcome 1 Noncompliance with PPE.
Figures and Tables -
Analysis 7.1

Comparison 7 Active training in PPE use versus passive training, Outcome 1 Noncompliance with PPE.

Comparison 8 Active training in PPE doffing versus passive training, Outcome 1 Noncompliance doffing protocol.
Figures and Tables -
Analysis 8.1

Comparison 8 Active training in PPE doffing versus passive training, Outcome 1 Noncompliance doffing protocol.

Comparison 9 Donning and doffing with instructions versus without instructions, Outcome 1 Persons with one or more errors.
Figures and Tables -
Analysis 9.1

Comparison 9 Donning and doffing with instructions versus without instructions, Outcome 1 Persons with one or more errors.

Comparison 9 Donning and doffing with instructions versus without instructions, Outcome 2 Mean errors.
Figures and Tables -
Analysis 9.2

Comparison 9 Donning and doffing with instructions versus without instructions, Outcome 2 Mean errors.

Comparison 10 Computer simulation vs no simulation, Outcome 1 Number of errors while donning.
Figures and Tables -
Analysis 10.1

Comparison 10 Computer simulation vs no simulation, Outcome 1 Number of errors while donning.

Comparison 10 Computer simulation vs no simulation, Outcome 2 Number of errors while doffing.
Figures and Tables -
Analysis 10.2

Comparison 10 Computer simulation vs no simulation, Outcome 2 Number of errors while doffing.

Summary of findings for the main comparison. Comparison 1: One type of PPE versus another – PAPR versus E‐RCP attire

PAPR versus E‐RCP Attire for preventing contact with contaminated body fluids in healthcare staff

Patient or population: healthcare staff volunteers
Settings: simulation study
Intervention: PPE with Powered Air Purifying Respirator (PAPR) Attire

Control: Enhanced respiratory and contact precautions (E‐RCP) attire according to 2005 CDC recommendation

Outcomes

Illustrative comparative risks* (95% CI)

Relative effect
(95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

E‐RCP attire

PAPR Attire

Any contamination
fluorescent marker

Follow‐up: post intervention

960 per 1000

259 per 1000
(163 to 413)

RR 0.27
(0.17 to 0.43)

50
(1 cross‐over RCT)

⊕⊕⊝⊝
very low1,2,3

Analyses presented in this table are unadjusted for the paired nature of the cross‐over design but similar to the results that the authors presented while taking the cross‐over into account

Compliance with guidance ‐ Noncompliance

with donning guidance

Follow‐up: post intervention

40 per 1000

300 per 1000
(72 to 1000)

RR 7.5
(1.81 to 31.1)

50
(1 cross‐over RCT)

⊕⊕⊝⊝
very low1,2,3

Compliance with guidance ‐ Noncompliance

with doffing guidance

Follow‐up: post intervention

240 per 1000

120 per 1000
(48 to 295)

RR 0.5
(0.2 to 1.23)

50
(1 cross‐over RCT)

⊕⊕⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

Not estimable

0
(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).
CI: Confidence interval; RR: Risk ratio;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Simulation study, downgraded for indirectness
2 One cross‐over study with 50 participants, downgraded for imprecision

3 HIgh risk of bias, downgraded for study limitations

Figures and Tables -
Summary of findings for the main comparison. Comparison 1: One type of PPE versus another – PAPR versus E‐RCP attire
Summary of findings 2. Comparison 1: One type of PPE versus another – Three types of PPE attire

Three types of PPE attire compared by number of contaminated spots

Patient or population: healthcare worker volunteers
Settings: simulation study
Intervention: more protective attire, not permeable not breathable (A)
Comparison: less protective attire: permeable but breathable (B); fairly permeable, not breathable (D)

Outcomes

Illustrative comparative risks* (95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Less protective type of PPE (B or D)

Most protective type of PPE attire (A)

Number of contaminated spots ‐ Neck

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in control group B was
0.12 spots

The mean number of contaminated spots in the intervention group was
0.7 higher
(0.26 lower to 1.66 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Foot

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group B was
2.86 spots

The mean number of contaminated spots in the intervention group was
0.96 lower
(2.35 lower to 0.43 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Palm

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group B was
17.83

The mean number of contaminated spots in the intervention group was
7.72 lower
(15.65 lower to 0.21 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Foot

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group D was
4.96

The mean number of contaminated spots in the intervention group was
4.1 lower
(6.94 to 1.26 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Number of contaminated spots ‐ Palm

fluorescent marker

Follow‐up: post intervention

The mean number of contaminated spots in the control group D was
20.49

The mean number of contaminated spots in the intervention group was
12.76 lower
(21.62 to 3.9 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

Compliance with guidance

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on compliance with guidance.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group.
CI: Confidence interval;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Simulation study, downgraded for indirectness
2 One study 100 participants, 25 participants per arm, downgraded for imprecision

3 Unclear risk of bias in the study, downgraded one level

Figures and Tables -
Summary of findings 2. Comparison 1: One type of PPE versus another – Three types of PPE attire
Summary of findings 3. Comparison 1: One type of PPE versus another – Gowns versus aprons

Gowns versus aprons for preventing highly infectious diseases due to contact with contaminated body fluids in healthcare staff

Patient or population: healthcare worker volunteers
Settings: simulation study
Intervention: gowns versus aprons

Outcomes

Illustrative comparative risks* (95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Aprons

Gowns

Contamination with marker; individual type of doffing

Follow‐up: post intervention

The mean contamination with marker in the control groups was
16.98 small spots

The mean contamination with marker in the intervention groups was 10.28 lower (14.77 to 5.79 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Cross‐over study; the analyses were unadjusted for the paired nature of the data but similar to the analysis of the authors who took this into account

Contamination with marker; CDC recommended doffing

Follow‐up: post intervention

The mean contamination with marker in the control groups was
1.88 small spots

The mean contamination with marker in the intervention groups was
0.62 lower (1.75 lower to 0.51 higher)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

Compliance with guidance

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on compliance with guidance.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group.
CI: Confidence interval;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Randomisation method unclear, downgraded one level
2 Simulation study, downgraded for indirectness
3 Single cross‐over study with 50 participants, downgraded for imprecision

Figures and Tables -
Summary of findings 3. Comparison 1: One type of PPE versus another – Gowns versus aprons
Summary of findings 4. Comparison 2: One procedure for donning/doffing versus another – Doffing with double gloves compared to doffing with single gloves

Doffing with double gloves compared to doffing with single gloves for preventing contact with contaminated body fluids in healthcare staff

Patient or population: healthcare staff volunteers
Settings: Simulation study
Intervention: Doffing with double gloves
Comparison: Doffing with single gloves

Outcomes

Illustrative comparative risks* (95% CI)

Relative effect
(95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Doffing with single gloves

Doffing with double gloves

Contamination: virus detected ‐ All body parts

Follow‐up: post intervention

778 per 1000

280 per 1000
(124 to 607)

RR 0.36
(0.16 to 0.78)

18
(1 cross‐over study)

⊕⊝⊝⊝
very low1,2

Non‐randomised cross‐over study; the analyses were unadjusted for the paired nature of the data but the results are similar to those analysed taking into account the paired nature of the data

Compliance with guidance ‐ Noncompliance: any error

Follow‐up: post intervention

667 per 1000

720 per 1000
(467 to 1000)

RR 1.08
(0.7 to 1.67)

18
(1 cross‐over study)

⊕⊝⊝⊝
very low1,2

Infection with EVD

See comment

See comment

Not estimable

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group and the relative effect of the intervention (and its 95% CI).
CI: Confidence interval; RR: Risk ratio;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Similation study, downgraded one level
2 One cross over study with 18 participants, downgraded for imprecision

Figures and Tables -
Summary of findings 4. Comparison 2: One procedure for donning/doffing versus another – Doffing with double gloves compared to doffing with single gloves
Summary of findings 5. Comparison 2: One procedure for donning/doffing versus another – CDC method versus individual doffing

CDC method versus individual doffing for preventing contact with contaminated body fluids in healthcare staff

Patient or population: healthcare staff volunteers
Settings: simulation study
Intervention: CDC method in doffing

Control: Individual method of doffing

Outcomes

Illustrative comparative risks* (95% CI)

Relative effect
(95% CI)

No of Participants
(studies)

Quality of the evidence
(GRADE)

Comments

Assumed risk

Corresponding risk

Individual doffing method

CDC recommended doffing method

Contamination with fluor marker when using gowns

Follow‐up: post intervention

The mean contamination with fluor marker in the control group was
6.7 small spots

The mean contamination with fluor marker in the intervention group was
5.44 lower
(7.43 to 3.45 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Cross‐over study; the analyses were unadjusted for the paired nature of the data but similar to the analysis of the authors who took this into account

Contamination with fluor marker when using aprons

Follow‐up: post intervention

The mean contamination with fluor marker in the control group was
16.98 small spots

The mean contamination with fluor marker in the intervention group was
15.1 lower
(19.28 to 10.92 lower)

50
(1 study)

⊕⊝⊝⊝
very low1,2,3

Infection with EVD

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on infection rates.

Compliance with guidance

See comment

See comment

0

(0 studies)

See comment

No studies evaluated the effect of the interventions on compliance with guidance.

*The basis for the assumed risk is the control group risk. The corresponding risk (and its 95% confidence interval) is based on the assumed risk in the comparison group.
CI: Confidence interval;

GRADE Working Group grades of evidence
High quality: Further research is very unlikely to change our confidence in the estimate of effect.
Moderate quality: Further research is likely to have an important impact on our confidence in the estimate of effect and may change the estimate.
Low quality: Further research is very likely to have an important impact on our confidence in the estimate of effect and is likely to change the estimate.
Very low quality: We are very uncertain about the estimate.

1 Randomisation procedure unclear, downgraded one level
2 Simulation study, downgraded for indirectness
3 One cross‐over study with 50 participants

Figures and Tables -
Summary of findings 5. Comparison 2: One procedure for donning/doffing versus another – CDC method versus individual doffing
Comparison 1. PAPR versus E‐RCP Attire

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Any contamination Show forest plot

1

Risk Ratio (M‐H, Random, 95% CI)

Totals not selected

2 Contamination > 1 cm Show forest plot

1

Risk Ratio (M‐H, Random, 95% CI)

Totals not selected

3 Contamination area Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

4 Donning noncompliance Show forest plot

1

Risk Ratio (M‐H, Random, 95% CI)

Totals not selected

5 Doffing noncompliance Show forest plot

1

Risk Ratio (M‐H, Random, 95% CI)

Totals not selected

6 Donning time Show forest plot

1

100

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

7 Doffing time Show forest plot

1

100

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 1. PAPR versus E‐RCP Attire
Comparison 2. Four types of PPE attire compared

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 A vs B Contamination, mean number of spots Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

1.1 Face type A vs type B

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.2 Trunk type A vs type B

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.3 Neck type A vs type B

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.4 Foot type A vs type B

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.5 Palm type A vs type B

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 A vs B Usability score (1‐5) Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

3 A vs B Donning time Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

4 A vs B Doffing time Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

5 A vs D Contamination, mean number of spots Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

5.1 Face type A vs type D

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

5.2 Trunk type A vs type D

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

5.3 Neck type A vs type D

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

5.4 Foot type A vs type D

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

5.5 Palm type A vs type D

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

6 A vs D Usability score (1‐5) Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

7 A vs D Donning time Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

8 A vs D Doffing time Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

Figures and Tables -
Comparison 2. Four types of PPE attire compared
Comparison 3. Formal versus local available attire

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Contamination Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

Figures and Tables -
Comparison 3. Formal versus local available attire
Comparison 4. Gown versus apron

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Contamination with marker; individual doffing Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

1.1 small patches

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.2 large patches

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.3 hand

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.4 shoe

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.5 underwear

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.6 environment

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 Contamination with marker; CDC doffing Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

2.1 small patches

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.2 large patches

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.3 hand

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.4 shoe

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.5 underwear

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.6 environment

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 4. Gown versus apron
Comparison 5. Doffing with double gloves vs doffing with single gloves

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Contamination: virus detected Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

1.1 All body parts

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.2 Face

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.3 Shirt

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.4 Pants

1

Risk Ratio (M‐H, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 Contamination: virus quantity Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

2.1 Dominant hand

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.2 Non‐dominant hand

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.3 Face

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.4 Shirt

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.5 Pants

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

3 Non‐compliance: any error Show forest plot

1

Risk Ratio (M‐H, Fixed, 95% CI)

Totals not selected

Figures and Tables -
Comparison 5. Doffing with double gloves vs doffing with single gloves
Comparison 6. CDC versus individual doffing

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Gown; Contamination with fluor marker Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

1.1 small patch

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.2 large patch

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.3 hand

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.4 shoe

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.5 underwear

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

1.6 environment

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2 Apron; Contamination with fluor marker Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

2.1 small patch

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.2 large patch

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.3 hand

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.4 shoe

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.5 underwear

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

2.6 environment

1

Mean Difference (IV, Fixed, 95% CI)

0.0 [0.0, 0.0]

Figures and Tables -
Comparison 6. CDC versus individual doffing
Comparison 7. Active training in PPE use versus passive training

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Noncompliance with PPE Show forest plot

1

Odds Ratio (Fixed, 95% CI)

Totals not selected

Figures and Tables -
Comparison 7. Active training in PPE use versus passive training
Comparison 8. Active training in PPE doffing versus passive training

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Noncompliance doffing protocol Show forest plot

1

Odds Ratio (Fixed, 95% CI)

Totals not selected

Figures and Tables -
Comparison 8. Active training in PPE doffing versus passive training
Comparison 9. Donning and doffing with instructions versus without instructions

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Persons with one or more errors Show forest plot

1

120

Risk Ratio (M‐H, Random, 95% CI)

0.31 [0.11, 0.93]

1.1 Basic PPE

1

60

Risk Ratio (M‐H, Random, 95% CI)

0.15 [0.04, 0.62]

1.2 Enhanced PPE

1

60

Risk Ratio (M‐H, Random, 95% CI)

0.47 [0.22, 0.98]

2 Mean errors Show forest plot

1

120

Mean Difference (IV, Random, 95% CI)

‐0.89 [‐1.36, ‐0.41]

2.1 Basic PPE

1

60

Mean Difference (IV, Random, 95% CI)

‐0.70 [‐1.15, ‐0.25]

2.2 Enhanced PPE

1

60

Mean Difference (IV, Random, 95% CI)

‐1.2 [‐1.87, ‐0.53]

Figures and Tables -
Comparison 9. Donning and doffing with instructions versus without instructions
Comparison 10. Computer simulation vs no simulation

Outcome or subgroup title

No. of studies

No. of participants

Statistical method

Effect size

1 Number of errors while donning Show forest plot

1

Mean Difference (IV, Random, 95% CI)

Totals not selected

2 Number of errors while doffing Show forest plot

1

Mean Difference (IV, Fixed, 95% CI)

Totals not selected

Figures and Tables -
Comparison 10. Computer simulation vs no simulation