Hodgkin lymphoma (HL) is one of the most common haematological malignancies in young adults and, with cure rates of 90%, has become curable for the majority of individuals. Positron emission tomography (PET) is an imaging tool used to monitor a tumour’s metabolic activity, stage and progression. Interim PET during chemotherapy has been posited as a prognostic factor in individuals with HL to distinguish between those with a poor prognosis and those with a better prognosis. This distinction is important to inform decision‐making on the clinical pathway of individuals with HL.
ObjectivesTo determine whether in previously untreated adults with HL receiving first‐line therapy, interim PET scan results can distinguish between those with a poor prognosis and those with a better prognosis, and thereby predict survival outcomes in each group.
Search methodsWe searched MEDLINE, Embase, CENTRAL and conference proceedings up until April 2019. We also searched one trial registry (ClinicalTrials.gov).
Selection criteriaWe included retrospective and prospective studies evaluating interim PET scans in a minimum of 10 individuals with HL (all stages) undergoing first‐line therapy. Interim PET was defined as conducted during therapy (after one, two, three or four treatment cycles). The minimum follow‐up period was at least 12 months. We excluded studies if the trial design allowed treatment modification based on the interim PET scan results.
Data collection and analysisWe developed a data extraction form according to the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS). Two teams of two review authors independently screened the studies, extracted data on overall survival (OS), progression‐free survival (PFS) and PET‐associated adverse events (AEs), assessed risk of bias (per outcome) according to the Quality in Prognosis Studies (QUIPS) tool, and assessed the certainty of the evidence (GRADE). We contacted investigators to obtain missing information and data.
Main resultsOur literature search yielded 11,277 results. In total, we included 23 studies (99 references) with 7335 newly‐diagnosed individuals with classic HL (all stages).
Participants in 16 studies underwent (interim) PET combined with computed tomography (PET‐CT), compared to PET only in the remaining seven studies. The standard chemotherapy regimen included ABVD (16) studies, compared to BEACOPP or other regimens (seven studies). Most studies (N = 21) conducted interim PET scans after two cycles (PET2) of chemotherapy, although PET1, PET3 and PET4 were also reported in some studies. In the meta‐analyses, we used PET2 data if available as we wanted to ensure homogeneity between studies. In most studies interim PET scan results were evaluated according to the Deauville 5‐point scale (N = 12).
Eight studies were not included in meta‐analyses due to missing information and/or data; results were reported narratively. For the remaining studies, we pooled the unadjusted hazard ratio (HR). The timing of the outcome measurement was after two or three years (the median follow‐up time ranged from 22 to 65 months) in the pooled studies.
Eight studies explored the independent prognostic ability of interim PET by adjusting for other established prognostic factors (e.g. disease stage, B symptoms). We did not pool the results because the multivariable analyses adjusted for a different set of factors in each study.
Overall survival
Twelve (out of 23) studies reported OS. Six of these were assessed as low risk of bias in all of the first four domains of QUIPS (study participation, study attrition, prognostic factor measurement and outcome measurement). The other six studies were assessed as unclear, moderate or high risk of bias in at least one of these four domains. Four studies were assessed as low risk, and eight studies as high risk of bias for the domain other prognostic factors (covariates). Nine studies were assessed as low risk, and three studies as high risk of bias for the domain 'statistical analysis and reporting'.
We pooled nine studies with 1802 participants. Participants with HL who have a negative interim PET scan result probably have a large advantage in OS compared to those with a positive interim PET scan result (unadjusted HR 5.09, 95% confidence interval (CI) 2.64 to 9.81, I² = 44%, moderate‐certainty evidence). In absolute values, this means that 900 out of 1000 participants with a negative interim PET scan result will probably survive longer than three years compared to 585 (95% CI 356 to 757) out of 1000 participants with a positive result.
Adjusted results from two studies also indicate an independent prognostic value of interim PET scan results (moderate‐certainty evidence).
Progression‐free survival
Twenty‐one studies reported PFS. Eleven out of 21 were assessed as low risk of bias in the first four domains. The remaining were assessed as unclear, moderate or high risk of bias in at least one of the four domains. Eleven studies were assessed as low risk, and ten studies as high risk of bias for the domain other prognostic factors (covariates). Eight studies were assessed as high risk, thirteen as low risk of bias for statistical analysis and reporting.
We pooled 14 studies with 2079 participants. Participants who have a negative interim PET scan result may have an advantage in PFS compared to those with a positive interim PET scan result, but the evidence is very uncertain (unadjusted HR 4.90, 95% CI 3.47 to 6.90, I² = 45%, very low‐certainty evidence). This means that 850 out of 1000 participants with a negative interim PET scan result may be progression‐free longer than three years compared to 451 (95% CI 326 to 569) out of 1000 participants with a positive result.
Adjusted results (not pooled) from eight studies also indicate that there may be an independent prognostic value of interim PET scan results (low‐certainty evidence).
PET‐associated adverse events
No study measured PET‐associated AEs.
Authors' conclusionsThis review provides moderate‐certainty evidence that interim PET scan results predict OS, and very low‐certainty evidence that interim PET scan results predict progression‐free survival in treated individuals with HL. This evidence is primarily based on unadjusted data. More studies are needed to test the adjusted prognostic ability of interim PET against established prognostic factors.
Plain language summaryImaging with positron emission tomography (PET) during chemotherapy to predict outcome in adults with Hodgkin lymphoma
Review question
This Cochrane Review aimed to find out whether the results of a positron emission tomography (PET) during therapy in people with Hodgkin lymphoma (HL) can help to distinguish between those with a poor prognosis and those with a better prognosis, and predict survival outcomes in each group.
Background
Hodgkin lymphoma is a cancer which affects the lymphoid system of the body. It is considered a relatively rare disease (two to three cases per 100,000 people every year in Western countries), that is most common in young adults in their twenties, but it can also occur in children and elderly people. As treatment options have improved, most people with HL can now be cured. It is important that individuals receive the treatment with the greatest efficacy and least toxicity possible. PET is an imaging tool for assessing the disease stage of an individual, and monitoring tumour activity. It has been suggested that PET performed during therapy (so‐called interim PET, e.g. after two cycles of chemotherapy) can distinguish between people who respond well to therapy and those who do not respond well. The aim of this review was to demonstrate the prognostic ability to distinguish between these groups, and predict survival outcomes in each group, to help clinicians make an informed decision on the treatment pathway to improve long‐term outcomes and safety for people with HL.
Study characteristics
We included 23 studies to explore the association between interim PET scan results after one to four cycles of chemotherapy and survival outcomes in adults with HL (all stages). We contacted 10 authors, and six provided us with relevant information and/or data.
Key results
In 16 included studies, participants received either ABVD chemotherapy or BEACOPP chemotherapy (four studies) only, with or without radiotherapy. In 16 studies, participants underwent an interim PET scan in combination with a computed tomography (CT) (PET‐CT), which have higher accuracy in detecting primary and secondary cancers than a PET scan alone. In the remaining seven studies, PET‐only was conducted. Twenty‐one studies conducted interim PET scans after two cycles (PET2) of chemotherapy.
Eight studies did not report enough data on our outcomes or population of interest, so we reported the results from these studies narratively. We combined individual study results in meta‐analyses to provide robust evidence for our outcomes of interest overall survival and progression‐free survival. No study measured PET‐associated adverse events (harms).
For overall survival, combined results from nine studies (1802 participants) show that there is probably a large advantage in overall survival for people with a negative interim PET scan compared to people with a positive interim PET scan. For progression‐free survival, combined results from 14 studies (2079 participants) show that interim PET‐negative people may have an advantage for progression‐free survival, compared to interim PET‐positive people, but we are uncertain about this result. These are unadjusted results, where interim PET was tested as the only prognostic factor.
Eight studies reported adjusted results, where the independent prognostic ability of interim PET was assessed against other established prognostic factors (e.g. disease stage, B symptoms). We could not combine individual study results because the studies did not include identical sets of covariates. Nevertheless, their results indicate a probable independent prognostic ability of interim PET to predict both outcomes.
Certainty of the evidence
Regarding the unadjusted results, we rated our certainty of the evidence as 'moderate' for overall survival. This means that the true effect is likely to be close to the estimated effect, but there is a possibility that it is substantially different. For progression‐free survival, we rated our certainty of the evidence as 'very low', meaning that we have little confidence in the effect estimate, and that the true effect is likely to be substantially different from the estimated effect.
Regarding the adjusted results, we rated our certainty of the evidence as 'moderate' for overall survival, and 'low' for progression‐free survival.
How up‐to‐date is this review?
We searched data bases up until 2 April 2019, and one trial registry on 25 January 2019.
Summary of findings Summary of findings 1. Comparison of interim PET‐negative and interim PET‐positive individuals with Hodgkin Lymphoma. Comparison of interim PET‐positive and interim PET‐negative participants with Hodgkin lymphoma Population: Individuals with Hodgkin lymphoma Setting: Eleven studies recruited participants from a total of 28 haemato‐oncology treatment centres/hospitals in Brazil (N = 1), China (N = 1), Denmark (N = 4), France (N = 4), Italy (N = 3), Poland (N = 11), UK (N = 2) and the USA (N = 2). One study (Straus 2011) included participants from 29 institutions, but did not report the countries. One study (Simon 2016) reported the country (Hungary) but not the number of centres. One multi‐centre study (Hutchings 2014) recruited participants from four countries (USA, Italy, Poland and Denmark). One RCT (Kobe 2018) included participants from 301 hospitals and private practices in Germany, Switzerland, Austria, the Netherlands, and the Czech Republic. Outcomes Anticipated absolute effects* (95% CI) Relative effect (95% CI) № of participants (studies) Certainty of the evidence (GRADE) Comments Risk with Interim PET‐negative Risk with Interim PET‐positive Overall survivalHodgkin lymphoma (HL) is a cancer of the lymph nodes and the lymphoid system with possible involvement of other organs such as the liver, lung, bone or bone marrow (Lister 1989). With an annual incidence of approximately two to three per 100,000 inhabitants in Western countries, HL is a comparatively rare disease, but it is one of the most common malignancies in young adults (Howlader 2015). In industrialised countries, the age distribution of HL shows a first peak in the third decade and a second peak after the age of 50 (Thomas 2002).
The World Health Organization (WHO) Classification of Tumours of Haematopoietic and Lymphoid Tissues distinguishes between two types of HL: classical HL, representing about 95% of all HL; and lymphocyte‐predominant HL, representing about 5% of all HL (Swerdlow 2008). Both types differ in morphology, phenotype and molecular features, and therefore in clinical behaviour and presentation (Re 2005).
The Ann Arbor Classification is used for staging and distinguishes between four different tumour stages. Stages one to three indicate the degree of lymph node and localised extranodal organ involvement, or both, and stage four includes disseminated organ involvement, which can be found in 20% of cases. Factors associated with a poor prognosis include a large mediastinal mass, three or more involved lymph node areas, a high erythrocyte sedimentation rate, extranodal lesions, B symptoms (weight loss > 10%, fever, drenching night sweats) and advanced age, but the factors considered as significant vary slightly between different study groups (German Study Hodgkin Lymphoma Study Group (GHSG); European Organization for Research and Treatment of Cancer (EORTC); National Cancer Institute of Canada (NCIC)). The Cotswold modification of the Ann Arbor Classification also takes into consideration the occurrence of bulky disease (largest tumour diameter greater than 10 cm) (Lister 1989). Hodgkin lymphoma is classified into early favourable, early unfavourable and advanced stage (Engert 2007; Klimm 2005). In Europe, the early favourable‐stage group usually comprises Ann Arbor stages I and II without risk factors. The early unfavourable‐stage group includes individuals with Ann Arbor stages I or II and one or more risk factors. Most individuals with stages IIB, III or IV disease are included in the advanced‐stage risk group (Engert 2003).
With cure rates of up to 90%, HL is one of the most curable cancers worldwide (Engert 2010; Engert 2012; Rancea 2013a; von Tresckow 2012). A combination of adriamycin, bleomycin, vinblastine and dacarbazine (ABVD) is widely accepted as the standard chemotherapy regimen in early‐stage HL (Bröckelmann 2018, Canellos 1992; Engert 2010). Individuals in this stage usually receive a combination of chemotherapy and involved‐field radiation therapy (IF‐RT) (Engert 2010; von Tresckow 2012), whereas those with advanced‐stage disease receive an intensified regimen, such as BEACOPP (bleomycin, etoposide, doxorubicin, cyclophosphamide, vincristine, procarbazine and prednisone) (Skoetz 2017a; Borchmann 2011; Engert 2012; Skoetz 2013), or ABVD. A large randomised study showed that two cycles of ABVD followed by 20 Gy of IF‐RT is sufficient for the treatment of early‐favourable HL (Engert 2010), which is implemented into current standard treatment, whereas four cycles of chemotherapy followed by 30 Gy IF‐RT is more suitable for individuals with early‐unfavourable HL. Approximately 10% of people with HL will be refractory to initial treatment or will relapse; this is more common in people with advanced stage or bulky disease. These individuals can be treated with high‐dose chemotherapy and autologous stem cell transplantation (Rancea 2013). Immunotherapy for relapsed HL as another possible approach is under active investigation (Moskowitz 2018).
The current treatment approach for HL aims to maximise progression‐free and OS and to minimise acute and long‐term toxicities like cardiac and pulmonary damage, infertility and secondary cancers. Development of a secondary cancer is one of the major causes of morbidity and mortality once the risk of progression and relapse of HL is over, i.e. from about five years after first‐line treatment onwards. In a large systematic review based on individual patient data in people with HL, Franklin and colleagues demonstrated that treatment de‐intensification by avoiding additional radiotherapy reduces the risk of a secondary cancer (Franklin 2005).
Description of the index (prognostic) factorA prognostic factor is a characteristic of a patient or the disease (e.g. age, sex, co‐morbidities, disease stage, blood or imaging results) that is likely to predict patient outcomes or health events, often related to OS and disease‐free survival (Moons 2009; Riley 2013). Prognostic information ultimately provides a basis for the determination of treatment and also helps to stratify individuals for treatment according to their risk of future outcomes (Riley 2013). Established prognostic factors in HL include age, gender, B symptoms, Ann Arbor disease stage, bulky disease, albumin level, anaemia and white blood cell count, amongst others (Cuccaro 2014; Josting 2010; Kılıçkap 2013). Particularly male gender, advanced disease stage or age, and a low level of albumin, for example, are associated with worse prognosis and survival outcomes (Cuccaro 2014; Josting 2010).
The prognostic factor to be examined in this review is the tumour's metabolic activity, its stage, and progression as captured by [18F]‐fluorodeoxy‐D‐glucose (FDG)‐positron emission tomography (PET, also called PET scanning), which is an imaging tool. The principle of FDG‐PET is based on a radio‐labelled glucose analogue being a good indicator of the glucose metabolism of a tissue. It comprises two parts: a vector (2‐deoxy‐D‐glucose) taken up by cells with a high metabolic rate, and 18F, a positron‐emitting nuclide, which is detected by scintigraphy. FDG‐PET scanning provides the opportunity to identify the state and degree of progression of FDG‐avid tumours and has therefore become a standard imaging tool for various cancers (Boellaard 2010). Hodgkin lymphoma is a FDG‐avid tumour; in a study of 233 people with HL, 100% were FDG‐avid (Weigler‐Sagie 2010). However, as the field of imaging continuously evolves, it is now widely accepted to use PET in combination with a computed tomography (CT), known as PET‐CT (Barrington 2014). The combination of PET‐CT is argued to provide clearer imaging and a more accurate measurement of nodal size (Cheson 2014). Nevertheless, in the studies included in this review, the use of PET or PET‐CT varied.
Over the last few decades FDG‐PET has been used more and more for staging, prognosis, treatment planning and response evaluation in individuals with HL, and is a widely accepted procedure (Barrington 2017a; Cheson 2014; Fitzgerald 2019; Kobe 2010a; Markova 2009; Meignan 2009; Radford 2015; Specht 2007). FDG‐PET is primarily used for the pretreatment assessment in order to determine the stage of the disease of an individual and thereby to decide on the appropriate treatment regimen (Cheson 2014; Meignan 2009). However, it is now argued that PET should also be conducted during first‐line chemotherapy in individuals with HL, namely interim PET after a few cycles of chemotherapy (Barrington 2017a; Bröckelmann 2018; Meignan 2009). The result of the interim PET scan (positive or negative) is believed to be a good predictor of outcome, aiding the distinction between individuals with a poor prognosis from those with a better prognosis, while undergoing early treatment (Gallamini 2007; Kobe 2010; Markova 2012). Therapy adaptation based on interim PET results was introduced after detailed exploration of the FDG‐PET procedure (Engert 2012; Kobe 2008a), the idea being to achieve maximum efficacy in terms of OS and progression‐free survival (PFS).We will refer to the prognostic factor henceforth as 'interim PET'.
Why it is important to do this reviewThere is a need to systematically explore the prognostic ability of the factor (interim PET) in conditions where there is no treatment adaptation. The 'no treatment adaptation' clause is a rather important point in the prognostic exploration as adapting treatment based on interim PET results in daily practice when its prognostic ability is not yet proven is not desired. There is one systematic review on the prognostic value of interim PET without treatment adaptation in individuals with HL (Adams 2015a). However, this review looked at 'treatment failure' as an outcome of the interim PET scan, which is different to the outcomes the current review explored. Moreover, and despite the fact that it is entitled as a review of prognosis studies, the methodology used is akin to diagnostic test evaluation (with calculations of diagnostic odds ratio, specificity and sensitivity), rather than using established prognostic methodology and crucially, the confidence in the calculated estimates was not rated. Moreover, the review included studies published before December 2014 and, therefore, important research published since that time is not included.
One Cochrane Review on the role of PET‐adapted treatment modification for people with HL found some evidence that PFS was decreased in people with early‐stage HL and a negative PET scan receiving only chemotherapy (PET‐adapted therapy) compared to those receiving radiotherapy in addition to chemotherapy (which is the standard therapy regimen) (Sickinger 2015). A similar result was found in another Cochrane Review (Blank 2017). The authors compared the effects of chemotherapy alone versus chemotherapy plus radiotherapy on outcome and safety for adults with early stage HL. They found moderate evidence that when individuals receive the same number of chemotherapy cycles, the addition of radiotherapy can improve PFS. However, both reviews were not able to give definite conclusions on the effect on OS. Another systematic review suggests the change of therapy after interim PET in advanced‐stage individuals only (Amitai 2018). In the current German guideline for the treatment of HL, for example, it is recommended that patients with advanced HL receive an interim PET scan after two cycles of chemotherapy. The result of the interim PET scan can then be used to guide further treatment for patients in advanced stages of HL (Bröckelmann 2018). Hence, the disease stage is an additional key prognostic factor for patients with HL. Several randomised controlled trials (RCTs) have recently been published that investigated the consequences of treatment adaptation based on interim PET scan results on outcome and safety for individuals with HL (Andre 2017; Casasnovas 2019; Kobe 2018; Johnson 2016; Radford 2015).
Hence, the prognostic role of interim PET in individuals with HL undergoing first‐line chemotherapy is very important and will strongly influence decision‐making particularly regarding the choice of subsequent treatments. Therefore, we have summarised all available data from identified studies and included these in a meta‐analysis when they were sufficiently homogeneous. Our aim was to produce robust evidence based on the improved power that a meta‐analysis provides over the limitations of individual primary studies, and grade the evidence. A reliable answer to the question of the prognostic value of interim PET scan to predict survival outcomes in individuals with HL will strongly influence decision‐making at a crucial point of an individual’s treatment pathway. Moreover, grading the evidence on the prognostic value of interim PET will provide readers with an estimate of how much they can rely on the calculated results.
The aim of this systematic review was to determine whether in previously untreated adults with HL receiving first‐line therapy, interim PET scan results can distinguish between those with a poor prognosis and those with a better prognosis, and whether it can predict survival outcomes in each group. Thereby, we assessed the prognostic value of interim PET scan results. Meta‐analyses and grading of the evidence allow a conclusion of whether interim PET is a prognostic factor. This comprehensive overview will have a great impact on international guidelines and clinical pathways, and will contribute to a high‐grade support in clinical decision‐making for effective, supportive strategies for the individual patient.
ObjectivesTo determine whether in previously untreated adults with Hodgkin lymphoma (HL) receiving first‐line therapy, interim positron emission tomography (PET) scan results can distinguish between those with a poor prognosis and those with a better prognosis, and thereby predict survival outcomes in each group.
Primary objectiveTo identify all studies evaluating interim PET scan results as a prognostic factor, describe the characteristics and risk of bias of included studies and meta‐analyse results on the association between PET scan results and overall survival (OS), progression‐free survival (PFS) and PET‐associated adverse events.
PICOTSWe used the PICOTS (population, index, comparator, outcome(s), timing, setting) system to describe the key items for framing this review and its objective and methodology (Table 1) (Debray 2017; Riley 2019).
Table 1. PICOTS system
Population Index (prognostic) factor Comparator Outcome(s) Timing SettingPeople with classic HL, at any stage of the disease
Newly diagnosed individuals undergoing first‐line therapy
Adults, as defined in the studies
Overall survival (OS)
Progression‐free survival (PFS)
PET‐associated adverse events (AEs)
Interim PET scan should be conducted during chemotherapy (after one, two, three or four cycles of chemotherapy)
This is a systematic review of prognostic factor studies.
Criteria for considering studies for this review Types of studiesWe included retrospective and prospective studies evaluating interim PET scan results in a minimum of 10 individuals with Hodgkin lymphoma (HL) undergoing first‐line therapy.
We excluded studies that modified the treatment regimen based on the interim PET scan results in order to draw an unbiased conclusion of the ability of interim PET to predict the outcomes under study.
ParticipantsWe included studies on adults with newly diagnosed classic HL receiving first‐line therapy. If in a study a percentage of the included participants were adolescents but received adult treatment regimen and dosage, and the study considered them as adults, then we also accepted this 'adult' definition.
All participants received an interim PET scan during chemotherapy (e.g. after one, two, three and/or four cycles of chemotherapy), and continued with the planned chemotherapy regimen, without treatment adaptation due to the interim PET scan result.
Index (prognostic) factorWe included studies that assessed interim PET scan results as the index (prognostic) factor to predict survival outcomes. We expected the interim PET scan to be conducted during first‐line treatment of adults with HL, and without interim PET‐guided treatment adaptation, meaning participants should be treated in the same way regardless of the interim PET scan result. We accepted all studies that conducted a PET or PET‐CT (see Background 'Description of index (prognostic) factor').
In the literature, it is generally recommended to use a five‐point scale to assess the grade of uptake and report the PET scan result (Meignan 2009). Generally, scores 1‐3 indicate PET‐negativity, while scores 4‐5 indicate PET‐positivity (Barrington 2014). Most of the included studies used a validated scale, such as the 5‐PS Deauville criteria (Meignan 2009), the Lugano classification (Cheson 2014), the Imaging Subcommittee of International Harmonization Project in Lymphoma criteria (Juweid 2007) or the joint Italian‐Danish study criteria (Gallamini 2007).
Type of outcome measures Primary outcomeOverall survival (OS), defined as the time to death due to any cause.
We chose OS as our primary outcome because it has the greatest clinical relevance and is most important for individuals with HL. Furthermore, death due to any cause is an objective endpoint not susceptible to bias by the outcome assessor.
Secondary outcomesProgression‐free survival (PFS), defined as the time to disease progression, relapse, death due to any cause or last follow‐up.
Adverse events (AEs), defined as any event associated with the index factor (e.g. radiation safety).
To report meaningful findings, the required minimum follow‐up period was 12 months for each outcome.
Search methods for identification of studies Electronic searchesReporting and therefore retrieval of prognostic factor studies is very poor, as evaluation of guidelines on reporting of prognostic markers in cancer have shown (Altman 2012; Mallett 2010; McShane 2005). Moreover, no specific search filter exists for this new methodological approach, therefore published filters have to be combined for a sensitive search strategy (Geersing 2012). However, as PET scans often are not reported as a prognostic factor, we did not combine our search strategy with a filter for prognosis research. Therefore, the search strategy was not very specific and the results were screened independently and in detail by two teams of two review authors. Furthermore, we did not apply a language restriction in order to reduce the language bias, according to chapter six of the Cochrane Handbook for Systematic Reviews of Interventions (Lefebvre 2011).
We searched the following databases.
Databases of medical literature
Cochrane Central Register of Controlled Trials (CENTRAL; 2 April 2019, Issue 11) (Appendix 1)
MEDLINE Ovid SP (1946 until 2 April 2019) (Appendix 2)
Embase (1990 until 2 April 2019) (Appendix 2)
Conference proceedings of annual meetings of the following societies for abstracts (2000 to 2019)
American Society of Hematology
European Hematology Association
International Symposium on Hodgkin Lymphoma
We searched ClinicalTrials.gov (on 25 January 2019 using the query PET and Hodgkin lymphoma) to identify clinical trials.
Handsearching of references
We searched the references of all identified studies, relevant review articles and current treatment guidelines for further literature to find other relevant studies and to identify associated articles.
Personal contacts
We contacted 10 principal investigators of included studies for further information, of whom six replied and answered our questions for clarification. Two out of these six provided us also with relevant data to conduct our analyses.
Two teams of two review authors (AA, LE, MHT, NS) independently screened the results of the search strategies to identify eligible studies by reading the titles and abstracts in Covidence (Covidence). In case of disagreements, consensus between the two review authors was reached by discussion of the full‐text publication. When consensus could not be reached, a third review author was consulted for final decision (Higgins 2011).
We documented the study selection process in a flow chart as recommended in the Preferred Reporting Items for Systematic Reviews and Meta‐Analyses (PRISMA) statement (Moher 2009), showing the total numbers of retrieved references and the numbers of included and excluded studies (Figure 1).
1.Study flow diagram according to PRISMA
Data extraction and managementWe developed a data extraction form specific to studies of prognostic factors based on the Checklist for Critical Appraisal and Data Extraction for Systematic Reviews of Prediction Modelling Studies (CHARMS) (Moons 2014). The form was piloted using four of the included studies, and then further assessed during several teleconferences between the review authors to discuss required changes. After several amendments of the form, two teams of two review authors (AA, LE, MHT, NS) independently extracted all relevant data from the included studies. After data extraction, we contacted 10 principal investigators of included studies to request additional information.
Our form included the following items (in short).
General information
i.e. Author, title, source, publication date, country, language, duplicate publications
Source of data
i.e. Cohort, prospective planned study, randomised study participants, or registry data
Participants
Participant eligibility and recruitment method (e.g. consecutive participants, location, number of centres, setting, inclusion and exclusion criteria)
Participant description (e.g. age, gender, stage of disease)
Details of treatments received
Study dates
Prognostic factor
Definition and method for measurement of prognostic factor
Timing of prognostic factor measurement (number of chemotherapy cycles before and after measurement of the prognostic factor)
Outcomes to be predicted
Definition and method for measurement of outcome
Was the same outcome definition (and method for measurement) used in all individuals?
Was the outcome assessed without knowledge of the prognostic factor (i.e. blinded)?
Time of outcome occurrence or summary of duration of follow‐up
Sample size
Number of participants and number of outcomes/events
Missing data
Number of participants with any missing value (include predictors and outcomes)
Handling of missing data (e.g. complete‐case analysis, imputation, or other methods)
Reported results
Overall survival (OS) (including duration of follow‐up)
Progression‐free survival (PFS) (including duration of follow‐up)
Adverse events (AEs) (including duration of follow‐up)
In the protocol for this review we prespecified that we will use the Quality in Prognostic Studies (QUIPS) tool (Hayden 2013) for the risk of bias assessment. However, recent methodological developments for the systematic review of prognostic factor studies (Riley 2019; Riley 2019b) led us to consider amending this tool. In the light of this we consulted the primary author (Hayden 2013) of the QUIPS tool and following discussions decided to add to the three bias ratings ('low', 'moderate' and 'high' risk of bias) a fourth 'unclear' option. This was necessary due to the inconsistent reporting of the included studies, when information was clearly missing, and hence, without an 'unclear' category, risk of bias assessment would not be feasible.
Following further discussions, we additionally decided to rename the fifth domain 'study confounding' to 'other prognostic factors (covariates)' in order to highlight the important distinction between confounding (the preferred term when seeking estimates of causal effect of a specific etiologic factor) and adjusting for other important prognostic factors, namely covariates (advocated when seeking the independent prognostic ability of index prognostic factors). As said, in the context of our review (adults with Hodgkin lymphoma), the disease stage is a key factor that is taken into account together with the interim PET scan result when decisions about treatment adaptation are made in daily clinical practice (Bröckelmann 2018). Hence, we assessed studies that only included participants within one disease stage (e.g. only early stages or only advanced stages of HL) as 'low' risk of bias, as such patient sampling can be considered as accounting for disease stage as another prognostic factor. Studies that included participants within all disease stages, but offered adjusted results including disease stage as another prognostic factor, were also assessed as 'low' risk of bias. Studies with participants of all disease stages, not accounting for disease stage, were assessed as 'high' risk of bias in this domain. This latter modification is also reflected in the GRADE assessment. Regardless of whether meta‐analysis of adjusted or unadjusted (crude) effects of the prognostic factor of interest (interim PET scan results) was possible, we included this domain's risk of bias assessment in our GRADE judgement.
Two teams of two review authors (AA, LE, MHT, NS) independently assessed the risk of bias of the included studies according to the domains of the QUIPS tool. We judged each domain by taking into account the criteria listed for each domain in the QUIPS tool (Hayden 2013), and also provided a brief statement supporting our judgement.
We made the following judgements.
Low risk of bias: the relationship between the prognostic factor and outcome is unlikely to be different for participants and eligible non‐participants.
Moderate risk of bias: the relationship between the prognostic factor and outcome may be different for participants and eligible non‐participants.
High risk of bias: the relationship between the prognostic factor and outcome is very likely to be different for participants and eligible non‐participants.
Unclear risk of bias: the study does not provide sufficient information that allows a clear judgement for this domain.
Furthermore, we decided to assess the risk of bias per outcome in each study because not all studies reported all of our outcomes of interest, and even studies reporting at least two of our outcomes showed differences in their outcome reporting.
We judged the following domains and criteria.
Study participation
Adequate participation in the study by eligible persons
Description of the source population or population of interest
Description of the baseline study sample
Adequate description of the sampling frame and recruitment
Adequate description of the period and place of recruitment
Adequate description of inclusion and exclusion criteria
Study attrition
Adequate response rate for study participants
Description of attempts to collect information on participants who dropped out
Reasons for loss to follow‐up are provided
Adequate description of participants lost to follow‐up
There are no important differences between participants who completed the study and those who did not
Prognostic factor measurement
A clear definition or description of the prognostic factor is provided
Method of prognostic factor measurement is adequately valid and reliable
Continuous variables are reported or appropriate cut points are used
The method and setting of measurement of prognostic factor is the same for all study participants
Adequate proportion of the study sample has complete data for the prognostic factor
Appropriate methods of imputation are used for missing prognostic factor data
Outcome measurement
A clear definition of the outcome is provided
Method of outcome measurement used is adequately valid and reliable
The method and setting of outcome measurement is the same for all study participants
Other prognostic factors (covariates)
Other prognostic factors (covariates) are measured
Clear definitions of the important prognostic factors (covariates) measured are provided
Measurement of all important prognostic factors (covariates) is adequately valid and reliable
The method and setting of prognostic factor measurement are the same for all study participants
Appropriate methods are used if imputation is used for missing data
Important potential prognostic factors (covariates) are accounted for in the study design
Important potential prognostic factors (covariates) are accounted for in the analysis
Statistical analysis and reporting
Sufficient presentation of data to assess the adequacy of the analytic strategy
Strategy for model building is appropriate and is based on a conceptual framework or model
The selected statistical model is adequate for the design of the study
There is no selective reporting of results
Methods and reporting in prognostic research often do not follow current methodological recommendations, limiting retrieval, reliability and applicability of these publications (Bouwmeester 2012; Peat 2014). There is evidence suggesting that prognosis research in cancer is cluttered with false‐positive studies, which would not have been published if the results were negative (Kyzas 2005; Kyzas 2007; Sauerbrei 2005). Moreover, studies evaluating prognostic factors are usually not prospectively registered and no protocol is published (Peat 2014; Riley 2013), resulting in difficulties to identify all studies and to assess potential risks of publication bias. We used sensitive search filters for the disease (HL) and the prognostic factor (interim PET scan results) without any specific filter for research on prognosis in order to increase retrieval.
Due to the expected large effect of hazard ratios (HRs), tests for funnel plot asymmetry could result in publication bias being incorrectly indicated by the test (Macaskill 2010). Therefore, we decided not to evaluate the risk of publication bias by funnel plot asymmetry and describe reporting deficiencies instead.
Data synthesisWe performed analyses according to the recommendations of Cochrane, and the Cochrane Prognosis Methods Group in particular, and used the Cochrane statistical package Review Manager 5 (Deeks 2011; Review Manager 2014). We are aware that since the protocol development, the methodology on assessing studies of prognosis has evolved; hence, some differences between the published protocol and this full review may exist to account for the updated guidance. We have listed these in Differences between protocol and review.
We pooled unadjusted (crude) HRs for OS and PFS by applying meta‐analysis using the RevMan's generic inverse variance methods random‐effects model. Due to reporting inefficiencies and the expected heterogeneity between studies, we only combined studies that were sufficiently similar (e.g. most studies used ABVD as the main therapy regimen, or most studies conducted interim PET after two cycles of chemotherapy). Studies did not always provide an HR and associated standard error (SE), which are the parameters needed for meta‐analysis. Where these values were not available, we estimated them from other available data where possible using an in‐house calculator based on published methods for recovering survival data (Altman 1999; Parmar 1998; Tierney 2007). Recovered data included information and results reported in the text, tables, and Kaplan‐Meier (K‐M) curves. We also contacted 10 principal investigators of included studies to either ask for additional data, or to clarify issues regarding the studies.
As prespecified in the protocol, we would have also pooled adjusted HRs of the interim PET scan‐result (the index prognostic factor) from multivariable analyses of the included studies as adjusted prognostic effects (e.g. HRs) indicate the independent prognostic value of the prognostic factor over and above other clinically relevant prognostic factors (Riley 2019). However, pooling of adjusted estimates is recommended only if the same (largely) prognostic factors (covariates) are adjusted for in multivariable analyses (Riley 2019; Riley 2019b). As, said clinically relevant prognostic factors in individuals with HL particularly include the disease stage, as well as age, gender, and B symptoms (Cuccaro 2014). Regardless of whether pooling of adjusted or unadjusted effects of interim PET scan results was possible, we always assessed the risk of bias for all studies using the QUIPS tool, including the fifth domain 'other prognostic factors (covariates)', where we considered the disease stage as an important covariate to be taken into account.
Detailed description of the estimation of hazard ratios (HRs) and standard errors (SEs)We used unadjusted HRs as the effect measure for OS and PFS. In cases where the HR and SE were not reported, we estimated them from available data using an in‐house calculator (Trivella 2006), based on methods reported by Tierney 2007, Altman 1999 and Parmar 1998, or contacted authors to request additional data (Higgins 2011b). Recovered data included sample size, number of events, results such as the logrank P‐value and confidence intervals (CIs), which were reported in the text, tables, and K‐M curves. We kept detailed records of how the HR and SEs were calculated for each outcome in each included study. We identified the following six categories of HR precision.
HR was provided in the study, and the SE was either provided or easily estimated from reported CIs, and/or using the RevMan inbuilt calculator.
HR was provided but on checking while attempting to obtain the SE, there were errors and/or discrepancies with related provided data and we re‐estimated the HR.
HR and SE were not provided but all necessary data for their estimation were available in the study.
HR and SE were not provided. Other necessary data were available but not an exact logrank P value, hence the nearest value was used in the estimation. For example, if they reported P < 0.001, then the nearest exact value was used, in this case P = 0.0009.
HR and SE were not provided. Other necessary data were available but the number of events was estimated from the K‐M curves.
IPD data were available and HR and SE were accurately calculated.
We are aware that categories four and five are likely to over‐ or under‐estimate the HR and associated SE. However, they were the best estimates we could obtain. We consider the remaining categories as precise. We explored the precision of the estimates in a post‐hoc sensitivity analysis where the imprecise studies were temporarily removed to examine the robustness of the pooled result.
Grading the evidenceAccording to the recommendations of the GRADE working group, we rated and described the confidence in estimates for each outcome by assessing potential risk of bias, inconsistency, imprecision, indirectness and publication bias. We applied an approach that has been proposed for prognosis studies by the GRADE working group, suggesting that the starting point is one of high certainty of the evidence for observational studies (Iorio 2015).
Dealing with missing dataWe dealt with missing data as suggested in Chapter 16 of the Cochrane Handbook for Systematic Reviews of Interventions (Higgins 2011b). We contacted ten principal investigators of included studies to answer our questions regarding the studies and/or to provide us with additional data. Six principal investigators replied and answered our questions, of which two also provided us with additional data necessary to perform our analyses. One investigator kindly provided us with individual participant data for the whole data set. In some studies, the description of the methodology was rather unclear or relevant information was missing. In addition, some studies did not fully report their statistical analyses and data were missing, which complicated a full assessment of the study. We performed sensitivity analysis to assess how sensitive the results were to reasonable changes in the assumptions that were made, and addressed the potential impact of missing data on the findings of this review in the Discussion.
Furthermore, we noticed that most studies applied exclusion criteria on the baseline population (such as unavailability of interim PET or descriptive information) without providing a description of the size of this population and/or reasons for missing information. We treated this as a potential source of selection bias in the domain study participation of the QUIPS tool.
Investigation of heterogeneityWe investigated and discussed clinical and statistical heterogeneity and design aspects of included studies as mentioned in the section 'Data extraction and data management'. We assessed between‐study heterogeneity using the I2 statistic (an I2 greater than 50% = moderate heterogeneity; an I2 greater than 80% = considerable heterogeneity) (Deeks 2011). As most studies of prognosis are observational in nature, we are aware that they are prone to higher and/or inflated heterogeneity. Hence, we also assessed the Tau2 values from the meta‐analyses to be able to make a more robust judgment on the degree of statistical heterogeneity.
As specified in the protocol, we explored potential causes of heterogeneity by subgroup analysis. We considered the following parameters.
Study design (e.g. prospective versus retrospective)
Disease stage (e.g. early versus advanced stages)
Type of chemotherapy (e.g. ABVD versus BEACOPP)
Type of radiotherapy (e.g. involved field versus involved site)
Type of PET measurement (e.g. PET versus PET‐CT) (post‐hoc)
In addition, we conducted a post hoc sensitivity analysis for the timing of the interim PET, as well as the availability/estimation of HR and SE to explore the robustness of the pooled results.
Results Results of the searchOur literature search in CENTRAL, MEDLINE and Embase (until 2 April 2019, see Appendix 1, Appendix 2 and Appendix 3, respectively) and one trial registry (ClinicalTrials.gov on 25 January 2019), identified 11,277 potentially relevant publications. After removal of 358 duplicates, we screened titles and abstracts of 10,919 references using inclusion and exclusion criteria defined at the protocol stage. These criteria led to the exclusion of 10,651 references, and 268 references were then included for full‐text screening. Before starting full‐text screening, we discussed and determined exclusion reasons. Full‐text screening led to the exclusion of 133 references. Thirty‐four references that were identified are still awaiting assessment (see Studies awaiting classification), and one study is still ongoing (see Ongoing studies). Hence, we finally included 23 studies (from 99 references) in this review. The overall number of publications screened, identified, selected and included in this review is shown in Figure 1
Description of studies Included studiesSee also Characteristics of included studies.
We included 23 studies in this review (Andre 2017; Annunziata 2016; Barnes 2011; Casasnovas 2019; Cerci 2010; Gallamini 2014; Gandikota 2015; Hutchings 2005; Hutchings 2006; Hutchings 2014; Kobe 2018; Markova 2012; Mesguich 2016; Oki 2014; Okosun 2012; Orlacchio 2012; Rossi 2014; Simon 2016; Straus 2011; Touati 2014; Ying 2014; Zaucha 2017; Zinzani 2012), which added up to a total of 99 references when secondary citations were included. To avoid duplication and overlapping of participant data in our analyses, we grouped those publications that assessed the same population (or groups from the same population). In such cases, we chose the publication with the greatest number of participants and/or most information as the primary publication. Duplicate or overlapping study populations were found for eight studies (Andre 2017; Barnes 2011; Gallamini 2014; Kobe 2018; Markova 2012; Simon 2016; Straus 2011; Zinzani 2012). Four studies did not report the duration of follow‐up (Andre 2017; Annunziata 2016; Orlacchio 2012; Straus 2011). The earliest study recruited participants between 1993 and 2004 (Hutchings 2005), and the most recent between 2007 and 2014 (Annunziata 2016).
There was considerable heterogeneity between the included studies, particularly with regard to: stages of disease; treatment regimens; and the timing and criteria for evaluation of the interim PET scans, which are described in detail in the sections below. For meta‐analyses, we only grouped studies that were homogenous enough in order to ensure comparability, and conducted subgroup analyses to explore the potential impact of heterogeneity on our results (see Methods 'Investigation of heterogeneity').
Study designOf the 23 included studies, seven studies were retrospective single‐centre studies (Annunziata 2016; Markova 2012; Oki 2014; Orlacchio 2012; Rossi 2014; Touati 2014; Ying 2014). Five studies were retrospective multi‐centre studies (ranging between two to 17 centres) (Barnes 2011; Gallamini 2014; Mesguich 2016; Okosun 2012; Zinzani 2012). Two retrospective studies did not report the number of centres from which participants were recruited (Gandikota 2015; Simon 2016). Out of eight studies with a prospective study design, one study was a single‐centre study (Cerci 2010), three were multi‐centre studies (including between four and 11 centres, with Hutchings 2014 not reporting the number of study centres) (Hutchings 2006; Hutchings 2014; Zaucha 2017), and four were clinical trials (Andre 2017; Casasnovas 2019; Kobe 2018; Straus 2011). One study did not report the study design (Hutchings 2005).
For more details see Characteristics of included studies.
Sample sizeThe smallest study included 23 participants (Okosun 2012) and the largest study included 1945 participants (Kobe 2018).
LocationThe included studies were conducted in a variety of countries, including Austria, Belgium, Brazil, Croatia, Czech Republic, Denmark, France, Germany, Hungary, Italy, the Netherlands, Poland, Slovakia, Switzerland, the United Kingdom (UK), the United States of America (USA), and the People's Republic of China. Four studies reported the country but not the study centre (Annunziata 2016; Hutchings 2014; Markova 2012; Simon 2016), and two studies reported neither country nor study centre (Gandikota 2015; Straus 2011).
ParticipantsThis review included a total of 7335 male and female consecutive participants who were newly diagnosed with classic HL and received first‐line therapy. Out of these, a total of 2205 participants were included in meta‐analyses.
Follow‐upThere were differences in the follow‐up time between studies. Three studies did not report follow‐up time (Annunziata 2016; Orlacchio 2012; Straus 2011). Two studies reported follow‐up time per subgroup, i.e. surviving participants only (Kobe 2018; Zaucha 2017). The median follow‐up time for the remaining 18 studies ranged from 23 to 66 months. The total raw range of follow‐up time was between two to 195 months.
Stages of diseaseFifteen studies included all stages of the disease. Four studies included only early stages (Andre 2017; Barnes 2011; Gandikota 2015, Straus 2011) and four studies only advanced stages (Casasnovas 2019; Kobe 2018; Markova 2012; Okosun 2012).
Treatment/therapyThe following chemotherapy regimens were administered.
ABVD (adriamycin/doxorubicin, bleomycin, vinblastine and dacarbazine) in 16 studies (Andre 2017; Annunziata 2016; Barnes 2011; Cerci 2010; Gallamini 2014; Hutchings 2005; Hutchings 2006; Hutchings 2014; Mesguich 2016; Oki 2014; Okosun 2012; Orlacchio 2012; Simon 2016; Touati 2014; Zaucha 2017; Zinzani 2012).
Either ABVD or BEACOPP in one study (Ying 2014).
BEACOPPescalated (bleomycin, etoposide, doxorubicin, cyclophosphamide, vincristine, procarbazine and prednisone in escalated doses) in one trial (Casasnovas 2019).
BEACOPPescalated or BEACOPPescalated with rituximab in one trial (Kobe 2018).
BEACOPPescalated or time‐condensed BEACOPP14baseline (BEACOPP in standard, non‐escalated doses repeated on day 15) in one study (Markova 2012).
AVG (doxorubicin, vinblastine and gemcitabine) in one trial (Straus 2011).
ABV/MOPP (adriamycin, bleomycin, vinblastine, mechlorethamine, vincristine, procarbazine and prednisone), ABVD/COPP (ABVD plus cyclophosphamide, vincristine, procarbazine and prednisone), eBEACOPP, or PVAG (prednisone, vinblastine, doxorubicin and gemcitabine) in subgroups of participants in three studies (Hutchings 2005; Hutchings 2006; Touati 2014).
Anthracycline‐based chemotherapy not further specified in one study (Rossi 2014).
The following number of chemotherapy cycles were administered.
Two, three, four, six or eight cycles of chemotherapy alone or combined with radiotherapy in 15 studies (Andre 2017; Annunziata 2016; Barnes 2011; Casasnovas 2019; Cerci 2010; Gallamini 2014; Hutchings 2014; Markova 2012; Mesguich 2016; Orlacchio 2012; Rossi 2014; Simon 2016; Straus 2011; Zaucha 2017; Zinzani 2012). The number of cycles usually depended on the stage of the disease.
Four, six or eight cycles of chemotherapy, depending on the interim PET scan results, in one trial (Kobe 2018). A protocol amendment during the trial introduced a reduction of standard therapy from eight to six cycles.
Six cycles of chemotherapy combined with antiretroviral therapy due to HIV‐positive study population in one study (Okosun 2012).
Six studies did not report the number of cycles (Gandikota 2015; Hutchings 2005; Hutchings 2006; Oki 2014; Touati 2014; Ying 2014).
The following radiotherapy techniques were used either in all or a subgroup of participants.
Involved‐field radiotherapy in eight studies (Barnes 2011; Gallamini 2014; Hutchings 2005; Hutchings 2006; Hutchings 2014; Mesguich 2016; Rossi 2014; Simon 2016), and either involved‐field radiotherapy or extended‐field radiotherapy in one study (Gandikota 2015).
Involved‐node radiotherapy in three studies (Andre 2017; Annunziata 2016; Zaucha 2017).
Involved‐site radiotherapy in two studies (Touati 2014; Zinzani 2012).
Radiotherapy without further specification in five studies (Cerci 2010; Kobe 2018; Markova 2012; Orlacchio 2012; Ying 2014).
No radiotherapy in three studies (Oki 2014; Okosun 2012; Straus 2011).
Stem cell transplantation was conducted in participants who relapsed after first‐line therapy despite treatment escalation or salvage therapy.
Autologous stem cell transplantation in eight studies (Cerci 2010; Gallamini 2014; Hutchings 2014; Mesguich 2016; Touati 2014; Ying 2014; Zaucha 2017).
Autologous and/or allogeneic stem cell transplantation in one study (Zinzani 2012).
Type of stem cell transplantation not specified in four studies (Hutchings 2005; Hutchings 2006; Markova 2012; Orlacchio 2012).
No stem cell transplantation reported in 10 studies (Andre 2017; Annunziata 2016; Barnes 2011; Gandikota 2015; Kobe 2018; Oki 2014; Okosun 2012; Rossi 2014; Simon 2016; Straus 2011).
Participants in 16 out of 23 studies underwent PET combined with computed tomography (CT), contrast enhanced CT, or multi detector CT (MDCT), compared to PET‐only for participants in the other studies. Participants in 13 studies underwent PET‐CT (Annunziata 2016; Cerci 2010; Gallamini 2014; Gandikota 2015; Hutchings 2014; Kobe 2018; Mesguich 2016; Okosun 2012; Rossi 2014; Simon 2016; Touati 2014; Ying 2014; Zaucha 2017). Participants in another study underwent either PET or PET‐CT (Barnes 2011); participants in one study underwent PET with contrast‐enhanced CT (Markova 2012); and participants in another study underwent PET/MDCT (Orlacchio 2012). In the remaining seven studies, participants underwent a PET scan only (Andre 2017; Casasnovas 2019; Hutchings 2005; Hutchings 2006; Oki 2014; Straus 2011; Zinzani 2012).
Timing of interim PETThe timing of interim PET imaging varied between studies. In most studies, participants underwent an interim PET scan after two cycles (PET2) of chemotherapy (Andre 2017; Casasnovas 2019; Cerci 2010; Gallamini 2014; Hutchings 2005; Hutchings 2006; Kobe 2018; Mesguich 2016; Oki 2014; Okosun 2012; Orlacchio 2012; Rossi 2014; Simon 2016; Straus 2011; Touati 2014; Zinzani 2012). In another study, participants underwent an interim PET scan after the first cycle (PET1) of chemotherapy only (Annunziata 2016). In one study, participants underwent interim PET scans after the first and second cycle of chemotherapy, but the study protocol was amended after interim analysis to limit PET2 scans to participants with positive results after PET1 (Zaucha 2017). In one multi‐centre study, participants from two centres underwent both PET1 and PET2, whereas participants from the remaining two centres underwent PET2 only if PET1 was positive (Hutchings 2014). Three retrospective studies included participants who underwent interim PET after two to four cycles of chemotherapy (Barnes 2011; Gandikota 2015; Ying 2014), and in another study participants underwent interim PET after four cycles (PET4) of chemotherapy (Markova 2012). For meta‐analyses, we used information at PET2 whenever available in order to ensure homogeneity across studies.
Evaluation of PET scansIn most studies, two nuclear medicine physicians evaluated the PET scans individually, and disagreements in scoring were solved in a consensus meeting (Annunziata 2016; Barnes 2011; Cerci 2010; Hutchings 2005; Hutchings 2006; Hutchings 2014; Mesguich 2016; Orlacchio 2012; Rossi 2014; Ying 2014; Zinzani 2012). Evaluation of PET scans was performed by only one expert in one study (Markova 2012); and by a panel consisting of three to six experts in eight studies (Andre 2017; Casasnovas 2019; Gallamini 2014; Kobe 2018; Oki 2014; Okosun 2012; Straus 2011; Zaucha 2017). Three studies did not report the number or qualification of persons who performed evaluation of PET scans (Gandikota 2015; Simon 2016; Touati 2014). Nine out of 13 multi‐centre studies reported that evaluation of PET scans took place centrally (Andre 2017; Gallamini 2014; Hutchings 2006; Kobe 2018; Mesguich 2016; Okosun 2012; Straus 2011; Zaucha 2017; Zinzani 2012), and two studies did not report how reviewing of PET scans was performed across centres (Barnes 2011; Hutchings 2014).
In 11 studies, outcome assessors were blinded to the outcome (Kobe 2018; Gallamini 2014; Gandikota 2015; Hutchings 2006; Hutchings 2014; Mesguich 2016; Oki 2014; Rossi 2014; Straus 2011; Zaucha 2017; Zinzani 2012). The remaining studies did not report blinding.
Criteria for evaluationMost studies reported the use of a standardised scale for the evaluation of the PET scans, but the scoring systems and cut‐off points between studies varied.
In 12 studies, the Deauville 5‐point scoring system for evaluation of PET scans was used: in nine studies, Deauville scores 1 ‐ 3 were considered as PET‐negative, and Deauville scores 4 ‐ 5 as PET‐positive (cut‐off ≥4) (Annunziata 2016; Casasnovas 2019; Gallamini 2014; Hutchings 2014; Oki 2014; Okosun 2012; Rossi 2014; Simon 2016; Zaucha 2017); in two studies, both cut‐off points for evaluation of the PET scans were used by scoring each image twice, and comparing performance of interim PET between both scales (Kobe 2018; Mesguich 2016); and in one study, it was reported that the PET scans were re‐interpreted retrospectively using the Deauville criteria, but it was not indicated which cut‐off points were used (Touati 2014).
In one study, the International Harmonization Project criteria were used: a PET scan was considered positive when the residual mass is ≥ 2 cm or, if less than 2 cm, positive if its activity is above that of the surrounding background (Andre 2017). A negative PET scan corresponds to Deauville score 1 (no uptake) and score 2 (uptake ≤ mediastinum).
In two studies, the scoring systems were not specified, but similar scales and cut‐off points as the Deauville scoring system were used: in one study, PET scans were reviewed using a 4‐point scale (Barnes 2011), and in another study using a 5‐point scale (Gandikota 2015).
In three studies, other standardised scales for the evaluation of PET scans were used: one study used the Juweid criteria (Zinzani 2012), and two studies used the International Harmonization Project guidelines (Orlacchio 2012; Straus 2011).
Two studies did not report how PET scans were evaluated (Hutchings 2005; Hutchings 2006); and four studies reported performance of visual evaluation but did not indicate the use of a standardised scoring system (Cerci 2010; Markova 2012; Touati 2014; Ying 2014).
Univariable analyses
Twelve out of 23 included studies reported unadjusted results for our primary outcome OS (Barnes 2011; Casasnovas 2019; Cerci 2010; Gallamini 2014; Hutchings 2005; Hutchings 2006; Hutchings 2014; Kobe 2018; Simon 2016; Touati 2014; Zaucha 2017; Zinzani 2012). Of these, nine provided sufficient information and data to be included in meta‐analysis. One study reported an HR that we used (Kobe 2018). Another study reported an HR, but we still re‐calculated it due to discrepancies in values between the graph and table (Simon 2016). For the other seven studies, we estimated the HR using other available data from the publications (Barnes 2011; Cerci 2010; Hutchings 2005; Hutchings 2014; Touati 2014; Zaucha 2017; Zinzani 2012).
Multivariable analyses
Two studies reported adjusted results for OS (Kobe 2018; Simon 2016). Two additional studies planned, but did not conduct the analysis for different reasons (Gallamini 2014; Hutchings 2005).
Secondary outcomes Progression‐free survival (PFS)Univariable analyses
Twenty‐one out of 23 studies reported unadjusted results for PFS (Andre 2017; Annunziata 2016; Barnes 2011; Casasnovas 2019; Cerci 2010; Gallamini 2014; Hutchings 2005; Hutchings 2006; Hutchings 2014; Kobe 2018; Markova 2012; Mesguich 2016; Oki 2014; Okosun 2012; Rossi 2014; Simon 2016; Straus 2011; Touati 2014; Ying 2014; Zaucha 2017; Zinzani 2012). Of these, 15 provided sufficient information and data to be included in meta‐analysis. Three studies provided an HR which we used (Annunziata 2016; Kobe 2018; Simon 2016). Another three studies reported an HR, but we still re‐calculated it due to unclear description of the statistical methods used (Hutchings 2006), reporting discrepancies between graphs and tables (Mesguich 2016) or general uncertainties in the reported values (Rossi 2014). For eight studies we estimated the HR using other available data (Barnes 2011; Cerci 2010; Hutchings 2005; Straus 2011; Touati 2014; Ying 2014; Zaucha 2017; Zinzani 2012).
Multivariable analyses
Eight studies reported adjusted results for PFS (Casasnovas 2019; Gallamini 2014; Hutchings 2005; Hutchings 2006; Kobe 2018; Mesguich 2016; Rossi 2014; Simon 2016). Three studies took the importance of adjustment into account, but did not actually conduct a multivariable analysis (Annunziata 2016; Hutchings 2014; Oki 2014).
Definitions of Progression‐free survival (PFS)The definition of the progression outcome varied between studies. Four studies that reported PFS did not provide a definition (Hutchings 2014; Simon 2016; Straus 2011; Zaucha 2017). One study analysed event‐free survival (Cerci 2010), which was identical with PFS and, therefore, included in the analysis. Table 2 presents an overview of definitions used for progression outcome. Studies with identical definitions were grouped.
Table 2. Definitions of progression outcomes
Study Definition of progression outcome Andre 2017 Progression‐free survival, defined – from the date of random assignment to date of progression – as experiencing relapse after previous complete remission or progression after reaching partial remission (50% decrease and resolution of B symptoms and no new lesions); progressive disease (50% increase from nadir of any previous partial remission lesions or appearance of new lesions) on CT scan measurements during protocol treatment; or death from any cause, whichever occurred first. Casasnovas 2019 Progression‐free survival defined as the time from randomisation to first progression, relapse, or death from any cause or last follow‐up. Annunziata 2016 The primary endpoint was PFS, with progression during treatment, lack of complete remission at the end of the first‐line treatment, and relapse counted as adverse events. Barnes 2011;None of the included studies measured PET‐associated AEs.
Conflict of interestTwo studies reported potential conflicts of interest (Andre 2017; Casasnovas 2019). Fourteen studies declared that the investigators had no conflict of interest (Annunziata 2016; Barnes 2011; Hutchings 2006; Hutchings 2014; Kobe 2018; Mesguich 2016; Oki 2014; Okosun 2012; Orlacchio 2012; Rossi 2014; Simon 2016; Straus 2011; Zaucha 2017; Zinzani 2012). Seven studies did not report investigators' disclosures of potential conflicts of interest (Cerci 2010; Gallamini 2014; Gandikota 2015; Hutchings 2005; Markova 2012; Touati 2014; Ying 2014).
Excluded studiesAfter screening titles and abstracts, we excluded 10651 references that did not match our inclusion criteria. In addition, we excluded a total of 133 references after full‐text screening for the following reasons.
Fifty‐six references had a study design or publication type that did not match our inclusion criteria, i.e. letters and commentaries, case studies with a small sample size or validation studies (Adams 2016; Adams 2017; Adams 2018; Adams 2018a; Adams 2018b; Adams 2019; Afanasyev 2017; Ansell 2016; Barrington 2017; Bar‐Shalom 2003; Basu 2009; Becherer 2002; Bednaruk‐Mlynski 2015; Biggi 2012; Bishop 2015; Bodet‐Milin 2009; Boisson 2007; Borchmann 2016; Bucerius 2006; Cremerius 1999; D'Urso 2018; Dann 2018; deAndres‐Galiana 2015; Diehl 2007; El‐Galaly 2012; Evens 2014; Fanti 2008; Friedberg 2002; Friedberg 2004; Gallamini 2008; Gallamini 2018a; Gallowitsch 2008; Guidez 2016; Hagtvedt 2015; Hartmann 2012; Hartridge‐Lambert 2013; Kobe 2008; Kobe 2014; Lowe 2002; Milgrom 2017; Mocikova 2010; NCT02292979; Pichler 2000; Reinhardt 2005; Rigacci 2002; Rigacci 2017; Rubello 2015; Sakr 2017; Specht 2007; Spinner 2018; Strigari 2016; Tirelli 2015; Xie 2018; Yasgur 2015; Zabrocka 2016; Zaucha 2009).
Thirty‐nine references adapted the treatment based on PET‐results (Albano 2017; Albano 2018; Biggi 2017; Carras 2018; Ciammella 2016; Cuccaro 2016; Damlaj 2017; Damlaj 2019; Danilov 2017; Dann 2009; Dann 2010; Dann 2010a; Dann 2012; Dann 2013; Dann 2016; Dann 2017; Fornecker 2017; Gallamini 2017; Gallamini 2018; Greil 2018; Illidge 2015; Johnson 2015; Johnson 2016; Kamran 2016; Kamran 2018; Moskowitz 2015; NCT00784537; NCT00795613; NCT01358747; NCT01652261; Nguyen 2017; Paolini 2007; Pavlovsky 2019; Simontacchi 2015; Straus 2018; Torizuka 2004; Trotman 2017; Villa 2018; Zinzani 2016).
Eighteen references also included participants with other types of lymphoma and did not report data for HL separately (Awan 2013; Blum 2002; Bodet‐Milin 2008; Cremerius 2001; Filmont 2003; Freudenberg 2004; Fruchart 2006; Goldschmidt 2011; Haioun 2005; Honda 2014; Iagaru 2008; Kostakoglu 2006; Li 2013; Slaby 2002; Tomita 2015; Torizuka 2004; Zinzani 1999; Zinzani 2002).
Ten references included participants who received treatment other than first‐line therapy, i.e. second‐line therapy for relapsed or refractory disease (Bjurberg 2006; Front 1999; Huic 2006; Mocikova 2010; Mocikova 2011; Schot 2007; Sucak 2011; Tseng 2012; Weidmann 1999; Yoshimi 2008).
Eight references reported only end‐of‐chemotherapy PET‐results (Advani 2007; Hueltenschmidt 2001; Hutchings 2007; Jerusalem 2003; Molnar 2010; Naumann 2001; Panizo 2004; Spaepen 2001).
Two were duplicates (Freudenberg 2004; Kobe 2014).
These publications are described in Characteristics of excluded studies.
Risk of bias in included studiesWe assessed the risk of bias at outcome level (OS and PFS) for each study using the QUIPS tool. No study reported PET‐associated AE. The detailed assessment can be found in the 'Risk of bias (QUIPS)' section in the Characteristics of included studies.
Risk of bias in studies included in meta‐analysesThe 'Risk of bias' summary (Figure 2) presents the combined judgement made by the review authors in a cross‐tabulation. Studies included in meta‐analysis are highlighted in bold.
2.'Risk of bias' assessment according to QUIPS (Quality in Prognostic Studies) by outcome.
Overall survival (OS)For our primary outcome OS, one out of nine studies included in meta‐analysis was assessed as 'low' in all risk of bias domains (Kobe 2018). Four studies were assessed as 'unclear' for the domain study participation (Barnes 2011; Hutchings 2005; Simon 2016; Touati 2014), mostly due to a lack of information about the baseline population from which the study sample originated. Most studies had defined exclusion criteria to sample participants from the baseline population (e.g. unavailability of interim PET2) without providing a description of the original population or reasons for missing information. Considering this a potential source of selection bias, we assessed this domain as ‘unclear’ when information about the baseline population was missing. For the domains study attrition, prognostic factor measurement and outcome measurement, risk of bias was assessed as 'low' in most studies. Two studies did not report the use of standardised criteria for prognostic factor measurement, therefore we assessed the risk of bias as 'moderate' (Barnes 2011; Touati 2014). One study was assessed as 'moderate' risk because PET2 availability was dependent on PET1 result (Zaucha 2017). Due to inconsistency in reporting of the timing of the interim PET measurement, the risk of bias for outcome measurement was assessed as 'high' in one study (Barnes 2011), while the remaining studies were all assessed as 'low'. Two studies were assessed as 'low' risk of bias in the domain other prognostic factors (covariates) because they only included participants within one disease stage (e.g. early or advanced stages) (Barnes 2011; Kobe 2018), while the remaining seven studies were assessed as 'high' risk of bias for this domain because they included all disease stages without adjusting for stage (Cerci 2010; Hutchings 2005; Hutchings 2014; Simon 2016; Touati 2014; Zaucha 2017; Zinzani 2012). Six studies provided sufficient information about the methods used for univariable analysis (Hutchings 2005; Hutchings 2014; Kobe 2018; Touati 2014; Zaucha 2017; Zinzani 2012), therefore we assessed the risk of bias for statistical analysis and reporting as 'low'. The same domain was assessed as 'high' in three studies due to discrepancies between text and figures and/or tables (Barnes 2011; Cerci 2010; Simon 2016).
Progression‐free survival (PFS)For our secondary outcome PFS, two out of 14 studies included in meta‐analysis were assessed as 'low' risk of bias in all domains (Casasnovas 2019; Mesguich 2016). Eight studies provided clear descriptions of study characteristics and participants (Cerci 2010; Kobe 2018; Mesguich 2016; Rossi 2014; Straus 2011; Ying 2014; Zaucha 2017; Zinzani 2012), so we assessed the risk of bias as 'low'. Five studies did not report inclusion and/or exclusion criteria (Annunziata 2016; Barnes 2011; Hutchings 2005; Simon 2016; Touati 2014), so we assessed the risk of bias for study participation as 'unclear'. One study reported a high number of participants with unavailable interim PET scans without further information (Hutchings 2006), so we assessed the risk of bias as 'high' in the same domain. Most studies had no loss to follow‐up to report or provided a clear description of how missing data were handled, so we assessed the risk of bias for study attrition as 'low' in the majority of studies. One study was assessed as 'unclear' due to a lack of information regarding loss to follow‐up (Annunziata 2016); another study was assessed as 'moderate' because no explanation was provided as to why some participants were lost to follow‐up (Hutchings 2005). The risk of bias for the domains prognostic factor measurement and outcome measurement was assessed as 'low' in most studies. Three studies did not report the use of standardised criteria for prognostic factor measurement, therefore we assessed the risk of bias as 'moderate' (Barnes 2011; Touati 2014; Ying 2014). A fourth study was assessed as 'moderate' risk because PET2 availability was dependent on PET1 result (Zaucha 2017). Due to lack of outcome definition or inconsistency in the reporting of the timing of the interim PET measurement, the risk of bias for outcome measurement was assessed as 'high' in one study (Barnes 2011). In another study, this domain was also assessed as 'high' because the outcome was not defined (Zaucha 2017). The remaining studies were all assessed as 'low' for the domain outcome measurement. For the domain other prognostic factors (covariates), six studies were assessed as 'low' risk of bias, because they either included participants within one disease stage only, or if all disease stages were included, the authors adjusted for disease stage (Barnes 2011; Hutchings 2005; Hutchings 2006; Kobe 2018; Mesguich 2016; Straus 2011). The remaining eight studies were assessed as 'high' risk of bias for this domain (Annunziata 2016; Cerci 2010; Rossi 2014; Simon 2016; Touati 2014; Ying 2014; Zaucha 2017; Zinzani 2012). Eight studies provided sufficient information about the methods used for univariable analysis (Hutchings 2005; Hutchings 2006; Kobe 2018; Mesguich 2016; Rossi 2014; Straus 2011; Touati 2014; Zinzani 2012), so we assessed the risk of bias for statistical analysis and reporting as 'low'. Five studies were assessed as 'high' for this domain because of the poor reporting of results (Annunziata 2016; Barnes 2011; Cerci 2010; Simon 2016; Ying 2014), including discrepancies between text and figures and/or tables in some studies. Another study was also assessed as 'high' because the method of analysis was not sufficiently described (Zaucha 2017).
Risk of bias in studies reported narrativelyThe risk of bias for all studies reported narratively is included in Figure 2.
Overall survival (OS)The results for OS from three studies are reported narratively in this review (Casasnovas 2019; Gallamini 2014; Hutchings 2006). For two studies (Casasnovas 2019; Gallamini 2014) we assessed the risk of bias as 'low' in all six domains of the QUIPS tool. For Hutchings 2006, the first four domains were assessed as 'low' risk of bias. For the domain study participation, the study was assessed as 'high' risk because a great number of participants initially included in the study did not undergo an early interim PET. The study was also assessed as 'high' risk for the domain other prognostic factors (covariates) because participants within all disease stages were included.
Progression‐free survival (PFS)For PFS, the results from seven studies are reported narratively (Andre 2017; Casasnovas 2019; Gallamini 2014; Hutchings 2014; Markova 2012; Oki 2014; Okosun 2012). Out of these, two studies (Casasnovas 2019; Gallamini 2014) were assessed as 'low' risk of bias in all six domains of the QUIPS tool. From the remaining five studies, all were assessed as 'low' risk of bias for the domain study participation. For the domains study attrition, prognostic factor measurement and outcome measurement, three studies were assessed as a 'low' risk of bias (Hutchings 2014; Oki 2014; Okosun 2012). For the other two studies (Andre 2017; Markova 2012), the domain prognostic factor measurement was assessed as 'moderate' risk because the prognostic factor was measured differently in some participants. For the domain other prognostic factors (covariates), five studies were assessed as 'low' risk of bias (Andre 2017; Casasnovas 2019; Gallamini 2014; Markova 2012; Okosun 2012). The other two studies were assessed as 'high' risk for this domain because they included all disease stages without adjusting for disease stage (Hutchings 2014; Oki 2014). Regarding the domain statistical reporting and analysis, five studies were assessed as 'low' risk because they used appropriate methods for the planned analysis (Andre 2017; Casasnovas 2019; Gallamini 2014; Hutchings 2014; Markova 2012). The remaining two studies were assessed as 'high' risk due to inconsistent conduct and reporting of the analyses (Oki 2014; Okosun 2012).
Other potential sources of bias Reporting deficiencies and selective reportingWe detected reporting deficiencies in some of the studies, particularly when not all analyses that were planned in the methods were actually conducted. In some cases, this was due to the low number of events (i.e. in PET‐negative participants) that did not allow for further analyses. In other cases, it was unclear why certain analyses were performed and others not. This was particularly the case with regard to multivariable analyses, when studies planned to assess the independent prognostic ability of the interim PET in a prognostic model including other clinically relevant prognostic factors (covariates). Studies either did not perform such an analysis even though they initially planned to, or they did not consider adjustment. None of the studies stated clearly their rationale for the choice of covariates; in some cases, the choice was based on their significance in univariable analysis. For example, in studies that only included two or less covariates in the model in addition to interim PET, the interim PET was always independent in its performance. However, how interim PET possibly performed in comparison to other covariates remains unclear. Hence, it is particularly important to state why certain covariates were taken into account. Thus, we cannot be sure that studies did not only report certain positive ('significant') results, which can be an issue of selective reporting.
In addition, we detected discrepancies in the reporting of results within the texts of some studies, or between text and the corresponding graph(s) (i.e. in the reporting of the HR or number of events). In these cases, we tried to contact the corresponding principal investigator(s) for clarification in order to have a better understanding of the results.
Blinding of prognostic factor assessorEleven studies reported that the clinicians evaluating the interim PET scans were blinded to the outcome (Kobe 2018; Gallamini 2014; Gandikota 2015; Hutchings 2006; Hutchings 2014; Mesguich 2016; Oki 2014; Rossi 2014; Straus 2011; Zaucha 2017; Zinzani 2012).
Results of the analysesTwenty‐three studies evaluated interim PET as a prognostic factor in individuals with HL. Two studies did not report data for our outcomes of interest (Gandikota 2015; Orlacchio 2012) and we have not been able to either obtain or estimate any relevant data. None of the included studies reported PET‐associated AEs. Fifteen studies were included in meta‐analyses. Another six of the included studies in this review reported results for OS and/or PFS, but we were not able to pool results because, despite our approaches for possible estimation of missing data items, there was a lack of accurate information or data to do so (Andre 2017; Casasnovas 2019; Gallamini 2014; Markova 2012; Oki 2014; Okosun 2012). For all studies that were not included in meta‐analyses, we reported the main results narratively in this review.
Overall survival (OS) Meta‐analysis of unadjusted resultsWe included nine studies with 1802 participants in meta‐analysis for OS (Barnes 2011; Cerci 2010; Hutchings 2005; Hutchings 2014; Kobe 2018; Simon 2016; Touati 2014; Zaucha 2017; Zinzani 2012). There were 475 interim PET‐positive and 1327 interim PET‐negative participants. Meta‐analysis shows a clear advantage in OS for participants with a negative interim PET scan compared to participants with a positive interim PET scan (HR 5.09, 95% CI 2.64 to 9.81, I2 44%, moderate certainty of evidence) (Analysis 1.1) (Figure 3).
1.1. Analysis.Comparison 1: Univariable comparison of PET+ve vs. PET‐ve, Outcome 1: Overall survival
3.Forest plot of comparison: 1 Univariable comparison of PET+ve vs. PET‐ve, outcome: 1.1 Overall survival
Subgroup analysisWe conducted subgroup analyses to explore the underlying clinical heterogeneity between the studies.
For subgroup analysis by radiotherapy, we found evidence on subgroup difference between the groups (P = 0.05, INRT/ISRT in three studies: N = 548, IFRT in four studies: N = 428, RT not further specified in two studies: N = 826). Results still show an advantage in OS for PET‐negative participants, irrespective of the type of radiotherapy they received (Analysis 2.1).
2.1. Analysis.Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 1: OS by radiotherapy
For the remaining subgroups, there was no evidence of subgroup differences.
Different study designs (P = 0.28; three prospective studies: N = 406, four retrospective studies: N = 589, one RCT: N = 722) (Analysis 2.2). One study (Hutchings 2005) was not included in this subgroup analysis because they did not explicitly state their study design.
Different chemotherapy regimens (P = 0.33; ABVD in five studies: N = 801, ABVD and other in three studies: N = 279, BEACOPP in one study: N = 722) (Analysis 2.3). Chemotherapy‐regimen in the included studies was mainly ABVD, with differentiating numbers of cycles, with or without radiotherapy (Barnes 2011; Cerci 2010; Hutchings 2014; Simon 2016; Zaucha 2017; Zinzani 2012). In Hutchings 2005, the majority of participants received ABVD, while the remaining received MOPP or MOPP/ABV, or another regimen which was not specified. Some participants also received additional radiotherapy. In Kobe 2018, all participants received eBEACOPP. In Touati 2014, the regimens included ABVD, MOPP/ABV hybrid or BEACOPP. If separate data had been available for each type of chemotherapy, we could have performed more specific subgroup analysis to test for differences between chemotherapies.
PET‐CT versus PET (P = 0.66; PET‐CT in five studies: N = 595, PET only in three studies: N = 1111) (Analysis 2.4). One study (Barnes 2011) was not included in this subgroup analysis because they conducted PET in some participants and PET‐CT in the other participants.
Different stages of disease (P = 0.33; early stages with A or B symptoms in one study: N = 96, all stages in seven studies: N = 984, advanced stages in one study: N = 722) (Analysis 2.5). One study included disease stages IA, IB, IIA and IIB (Barnes 2011) and another study included advanced‐stages only (Kobe 2018). The remaining seven studies included participants representing all disease stages of HL.
Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 2: OS by study design
2.3. Analysis.Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 3: OS by chemotherapy
2.4. Analysis.Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 4: OS for PET/CT vs PET
2.5. Analysis.Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 5: OS by disease stage
Sensitivity analysisWe conducted sensitivity analyses for the timing of interim PET (removing those that did not conduct a PET2), and the precision of the estimated HR and SE (removing the studies with imprecise HR and SE estimation).
Regarding the timing of the interim PET, interim PET2 was conducted in six studies (N = 1495 participants in total) (Cerci 2010; Kobe 2018; Simon 2016; Touati 2014; Zinzani 2012; Zaucha 2017). In three studies (N = 307 participants in total), interim PET was conducted at other timings: in Barnes 2011, 41 participants received PET2 while the rest of the participants received PET3; in Hutchings 2005, 55 participants received PET2 and 35 participants received PET3; and in Hutchings 2014, PET1 was conducted for all participants (N = 126). Although 89 out of 126 also received a PET2, we used the data for PET1 as the publication provided us with the most information on PET1. At sensitivity analysis, temporarily removing studies that did not perform a PET2 slightly affected the pooled OS (overall: HR 5.09, 95% CI 2.64 to 9.81; sensitivity: HR 3.53, 95% CI 1.97 to 6.32) (Analysis 2.6). It seems that there was an over‐estimation of the HR for the studies that did not perform a PET2. However, the direction of the effect is firm and unchanged. This difference may also be partly explained by the very wide follow‐up ranges within the studies. Hence, following the sensitivity analysis, we consider the overall OS to be robust.
2.6. Analysis.Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 6: Timing of interim PET
Regarding the precision of the HR estimation, we were able to either obtain or estimate a precise HR and SE for seven studies (N = 1638 participants in total) (Cerci 2010; Hutchings 2005; Hutchings 2014; Kobe 2018; Simon 2016; Zaucha 2017; Zinzani 2012). For two studies (N = 164 participants in total) (Barnes 2011; Touati 2014), we were only able to provide imprecise estimations of the HR and SE. Temporarily removing the imprecise studies during sensitivity analysis barely affected the pooled results for OS, indicating that the measurements obtained from our imprecise method were quite accurate after all (overall: HR 5.09, 95% CI 2.64 to 9.81; sensitivity: HR 5.70, 95% CI 2.60 to 12.48) (Analysis 2.7). Hence, we concluded that the overall pooled OS is robust.
2.7. Analysis.Comparison 2: Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve, Outcome 7: OS by HR type of estimation
Narrative reporting of results Univariable analysesThree studies (Casasnovas 2019; Gallamini 2014; Hutchings 2006) that reported results for OS were not included in meta‐analysis due to lack of adequate data for estimating the HR and associated SE (Table 3).
Table 3. Narrative reporting of results from univariable analysis for OS
Study No. of participants + stages Timing of interim PET scan Unadjusted results for interim PET scan Casasnovas 2019 Standard armTwo studies (Kobe 2018; Simon 2016) reported adjusted effect estimates to test the prognostic ability of PET2 in addition to other prognostic factors. Table 4 displays a list of established prognostic factors (Cuccaro 2014; Josting 2010; Kılıçkap 2013), and shows which were considered as covariates in the final multivariable model. The selection of prognostic factors (covariates) for the final model was either based on the literature (Simon 2016), or on their significance in univariable analysis (Kobe 2018). However, pooling of adjusted data was not possible. In Simon 2016, only the results of those covariates that remained independent prognostic markers in multivariable analysis, namely LMR and PET2‐positivity, were reported. It is unclear whether, or which other covariates were included in the final model. A full list of study‐specific, candidate covariates can be found in the respective table for each study in the Characteristics of included studies.
The statistical methods used were Cox proportional hazards regression model and logistic regression model, which are the appropriate methods for a multivariable analysis.
Table 4. Adjusted results from final multivariable model for OS
Study Prognostic factors Adjusted results for interim PET Interim PET Age Gender Disease stage B symptoms Bulky disease IPS Other study‐specific factors Kobe 2018 x ‐ ‐ ‐ ‐ ‐ x x Interim PET‐positivity (DS 4)We included 14 studies with 2079 participants in meta‐analysis for PFS (Annunziata 2016; Barnes 2011; Cerci 2010; Hutchings 2005; Hutchings 2006; Kobe 2018; Mesguich 2016; Rossi 2014; Simon 2016; Straus 2011; Touati 2014; Ying 2014; Zaucha 2017; Zinzani 2012). There were 529 interim PET‐positive and 1550 interim PET‐negative participants. Meta‐analysis shows a clear advantage in PFS for participants with a negative interim PET scan compared to participants with a positive interim PET scan (HR 4.90, 95% CI 3.47, 6.90, I² = 45%, very low certainty of evidence) (Analysis 1.2) (Figure 4).
1.2. Analysis.Comparison 1: Univariable comparison of PET+ve vs. PET‐ve, Outcome 2: Progression‐free survival
4.Forest plot of comparison: 1 Univariable comparison of PET+ve vs. PET‐ve, outcome: 1.2 Progression‐free survival
Subgroup analysisWe conducted subgroup analyses to explore the underlying clinical heterogeneity between the studies.
Regarding the disease stage, we detected a significant difference between the groups (P = 0.02, early stages with A or B symptoms in two studies: N = 184, all stages in eleven studies: N = 1173, advanced stages in one study: N = 722). Results still showed an advantage for PFS in PET‐negative participants in any stage of the disease (Analysis 3.4). Twelve studies included all disease stages, while one study included stages IA ‐ IIB (Barnes 2011), and another study included advanced‐stages only (Kobe 2018).
3.4. Analysis.Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 4: PFS by disease stage
For the remaining subgroups, there was no evidence of subgroup differences.
Different study designs (P = 0.29, three prospective studies: N = 357, eight retrospective studies: N = 827, two RCTs: N = 165) (Analysis 3.1). One study (Hutchings 2005) was not included in this subgroup analysis because they did not explicitly state their study design.
Different chemotherapy regimen (P = 0.43; ABVD in seven studies: N = 945, ABVD and other chemotherapy in four studies: N = 265, other chemotherapies in three studies: N = 869) (Analysis 3.2). Chemotherapy‐regimen was ABVD in seven studies, with or without radiotherapy (Annunziata 2016; Barnes 2011; Cerci 2010; Mesguich 2016; Simon 2016; Zaucha 2017; Zinzani 2012). In two studies, participants received either ABVD, ABV/MOPP, ABVD/COPP, BEACOPP esc., PVAG or radiotherapy only (Hutchings 2005; Hutchings 2006). In Touati 2014, the regimens included ABVD, MOPP/ABV hybrid or BEACOPP. In Ying 2014, participants received either ABVD or BEACOPP. In Kobe 2018, all participants received eBEACOPP. In Rossi 2014, all participants received anthracycline‐based chemotherapy, and in Straus 2011, all participants received AVG.
PET versus PET‐CT (P = 0.30; PET‐CT in eight studies: N = 707, PET only in five studies: N = 1276) (Analysis 3.3). One study (Barnes 2011) was not included in this analysis because they conducted PET in some participants and PET‐CT in the other participants.
Different radiotherapy (P = 0.29; INRT/ISRT in five studies: N = 651, IFRT in six studies: N = 514, RT not specified in two studies: N = 826, no RT given in one study: N = 88) (Analysis 3.5).
Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 1: PFS by study design
3.2. Analysis.Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 2: PFS by chemotherapy
3.3. Analysis.Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 3: PFS for PET/CT vs PET
3.5. Analysis.Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 5: PFS by radiotherapy
In addition, we detected variations between the studies with regard to the definition of PFS. However, all trials included in meta‐analysis reported some progression endpoint such as treatment failure, progression or relapse. We have provided the exact reported definitions in Table 2.
Sensitivity analysisRegarding the timing of interim PET, interim PET was conducted after two cycles of chemotherapy (PET2) in nine studies (N = 1677 participants in total) (Cerci 2010; Hutchings 2006; Kobe 2018; Rossi 2014; Simon 2016; Straus 2011; Touati 2014; Zaucha 2017; Zinzani 2012). In five studies (N = 402 participants in total), interim PET was conducted at other timings: in Annunziata 2016 all participants received PET1; in Barnes 2011 and Hutchings 2005 participants received either PET2 or PET3; and in Hutchings 2006 and Mesguich 2016 participants received either a PET2, PET3 or PET4. At sensitivity analysis, temporarily removing studies that did not perform a PET2 barely affected the results for PFS (overall: HR 4.90, 95% CI 3.47 to 6.90; sensitivity: HR 4.68, 95% CI 3.14 to 6.98) (Analysis 3.6). Hence, the timing of the interim PET measurement (when conducted at a time point other than PET2) did not affect the overall pooled result for PFS.
3.6. Analysis.Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 6: Timing of interim PET
Regarding the precision of the HR estimation, we were able to provide a precise estimation of the HR and SE for nine studies (N = 1450 participants in total) (Annunziata 2016; Barnes 2011; Hutchings 2005; Kobe 2018; Rossi 2014; Simon 2016; Straus 2011; Ying 2014; Zaucha 2017). For five studies (N = 629 participants in total) we were only able to provide a slightly imprecise estimation of the HR and SE (Cerci 2010; Hutchings 2006; Mesguich 2016; Touati 2014; Zinzani 2012). However, at sensitivity analysis we found that the imprecise HRs did not significantly affect the pooled results. Temporarily removing the imprecise studies during sensitivity analysis barely affected the pooled results (overall: HR 4.90, 95% CI 3.47 to 6.90; sensitivity: HR 4.69, 95% CI 2.84 to 7.73) (Analysis 3.7). Hence, we concluded that the overall pooled PFS is robust and was not affected by our slightly imprecise method of HR and SE estimation.
3.7. Analysis.Comparison 3: Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve, Outcome 7: PFS by HR type of estimation
Narrative reporting of results Univariable analysesSeven studies that reported results for PFS were not included in meta‐analysis (Andre 2017; Casasnovas 2019; Gallamini 2014; Hutchings 2014; Markova 2012; Oki 2014; Okosun 2012). Table 5 presents the results from these studies narratively. We extracted all data that were available and relevant to us (i.e. number of interim PET‐negative and interim PET‐positive participants, number of events and percentages for PFS). Due to strong differences in the reporting between studies, the table presents more information for some studies compared to others.
Table 5. Narrative reporting of results from univariable analysis for PFS
Study No. of participants analysed Timing of interim PET scan Unadjusted results for interim PET Andre 2017 Favourable:Eight studies reported adjusted effect estimates for PFS (Casasnovas 2019; Gallamini 2014; Hutchings 2005; Hutchings 2006; Kobe 2018; Mesguich 2016; Rossi 2014; Simon 2016). Table 6 shows which prognostic factors (covariates) were considered in the final multivariable model of the studies. In two studies, only the results of those covariates that remained independent prognostic factors in multivariable analysis were reported (Gallamini 2014; Simon 2016). It is unclear whether, or which other covariates were included in the final multivariable model. The selection of prognostic factors (covariates) for adjustment in the studies was either based on their significance in univariable analysis (Casasnovas 2019; Hutchings 2006; Kobe 2018), or on the literature (established prognostic factors) (Hutchings 2005; Rossi 2014; Simon 2016). In two studies, the rationale for the covariates was not clearly stated (Gallamini 2014; Mesguich 2016).
As there are no final models with an identical set of covariates, pooling of adjusted effect estimates was not feasible. A full list of study‐specific, candidate covariates can be found in the respective table for each study in the Characteristics of included studies.
The statistical methods used were Cox proportional hazards regression model and logistic regression model.
Table 6. Adjusted results from final multivariable model for PFS
Study Prognostic factors Adjusted results for interim PET Interim PET Age Gender Disease stage B symptoms Bulky disease IPS Other study‐specific factors Casasnovas 2019 x ‐ x x x x x x Multivariable analysis not reported separately for standard treatment group. Gallamini 2014 x ‐ ‐ ‐ ‐ ‐ ‐ x PET2None of the included studies measured PET‐associated AE.
Studies not reporting our outcomesTwo studies (Gandikota 2015; Orlacchio 2012) did not report data for our outcomes of interest, but were still included in this review as they fit our inclusion criteria. Their investigated outcomes were very close to our review outcomes and potentially the authors could have measured them, but did not report them in their publication. However, it has not been possible to obtain the relevant information; therefore, they are reported narratively in this review. Table 7 presents the results from these studies narratively.
Table 7. Narrative reporting of results from studies not reporting our outcomes of interest
Study No. of participants Outcomes/comparison Results Gandikota 2015 77 (stages IIA ‐ IIB)Analysis of imaging at different time points: Baseline imaging, imaging during (after two to four cycles of ABVD) and at the end of treatment, follow‐up imaging
Need for surveillance imaging
77 participants had baseline PET‐CT scans, 1 had only chest X‐ray due to pregnancy at baseline
77 participants had interim PET‐CT during chemotherapy (N = 34) or after chemotherapy before initiation of radiotherapy (N = 43)
Out of 77, 4 remained PET‐positive, scans after completion of radiotherapy showed a complete response in 2/4, inflammation in 1/4, resolution of all adenopathy in 1/4, 0/4 relapsed during follow‐up
Median follow‐up: 46 months (range 24 to 126)
Total of 466 scans in 78 participants (PET‐CT in N = 42)
No relapses occurred in the entire cohort, N = 3 were diagnosed with a second primary malignancy by either imaging or clinical presentation, N = 6 had false‐positive imaging findings (3/6 PET‐CT) requiring further supplementary imaging or biopsy/surgery
Negative interim PET2: 104
Positive interim PET2: 28
Negative final PET: 102/104
Positive final PET: 2/104
Negative final PET: 16/28
Positive final PET: 12/28
Quote: “Final PET confirmed the negative results in 102 cases (98%) and revealed pathological uptake in the remaining two cases (2%).”
Of the 28 interim PET‐positive participants, 19 showed a partial response and nine had disease stability or progression. Twelve of the 28 interim PET‐positive participants had a positive final PET. Hence, the remaining 16 had a negative final PET.
Quote: “Interim PET had a NPV of 98%, with 85.7% sensitivity, 86.4% specificity and 86.4% diagnostic accuracy.”
Quote: “[In univariable analysis] the only independent predictor is the result of interim PET. […] PET had a PPV of 42%."
In this systematic review, we summarised unadjusted data for interim positron emission tomography (PET) scan results as a prognostic factor in individuals with classic Hodgkin lymphoma (HL). The results of an interim PET scan during therapy, e.g. after two cycles of chemotherapy, has been suggested as a good predictor of outcome. Interim PET scan results have also been suggested as an indicator to guide further treatment in order to achieve the best possible outcome in those that have a poor prognosis and those that have a good prognosis, while also minimising adverse events due to the toxicity of the chemotherapy. The results of our review are summarised in the Table 1.
The findings emerging from meta‐analyses are as follows.
Unadjusted results for overall survival (OS) show a large advantage for participants with a negative interim PET scan result compared to participants with a positive interim PET scan result. We rated the certainty of the evidence as 'moderate'.
Unadjusted results for progression‐free survival (PFS) show an advantage for participants with a negative interim PET scan result compared to participants with a positive interim PET scan result, but the evidence is very uncertain. We rated the certainty of the evidence as 'very low'.
The findings of the adjusted results from multivariable analyses, reported narratively in this review, are as follows.
Adjusted results for OS indicate an independent prognostic ability of interim PET beyond other associated factors. We rated our certainty of the evidence as 'moderate'.
Adjusted results for PFS indicate that there may be an independent prognostic ability of interim PET beyond other associated factors. We rated our certainty of the evidence as 'low'.
No study measured adverse events (AEs) associated with PET.
Overall completeness and applicability of the evidenceThe evidence in this review mostly applies to adults who were newly diagnosed with classic HL, and who receive a PET scan in combination with CT (PET‐CT) after two cycles of chemotherapy (PET2). The studies included in this review addressed our research question in a total of 7335 male and female participants representing all stages of classic HL (Ann Arbor stages I ‐ IV with A or B symptoms). Nine studies included individuals aged 18 years or older, while the remaining studies also included adolescents and young adults (the youngest being 13 years of age, although most studies started from the age of 16 and onwards). Overall, the findings from this review support the statement that in this group of individuals, interim PET scan results can predict OS and PFS. Most participants in the included studies received ABVD (adriamycin/doxorubicin, bleomycin, vinblastine and dacarbazine) chemotherapy, which is the standard treatment regimen for early‐stage disease (Bröckelmann 2018; Engert 2010). However, as participants can have different therapy regimens, which is decided based on their disease stage and other clinical or individual characteristics, results should always be interpreted with caution for different patient groups, and this naturally restrains the applicability of the evidence for all people with classic HL. Twelve out of 23 studies reported our primary outcome of interest OS, while 21 studies reported PFS. No study reported PET‐associated AE. As the main aim of the review was to identify the prognostic value of interim PET results to predict survival outcomes, it is unlikely that studies on prognosis will measure or report AE.
Heterogeneity between the studies was also found with regard to the evaluation of the interim PET scan, as studies used different criteria for the interpretation of the results. Most studies used the Deauville five‐point scale (DS 1 ‐ 5) for the evaluation of the PET scans. However, different cut‐off values were used for PET‐positivity. Most studies considered scores one to three (DS 1‐3) for PET‐negativity, and scores four to five (DS 4‐5) for PET‐positivity. In some studies, however, DS3 was also considered (or tested) for PET‐positivity. Results from these studies should be interpreted with caution, as using a score of ≥3 can have an important impact on the results and possibly introduce bias. Firstly, using this cut‐off can lead to an increased number of false‐positive results for interim PET (Casasnovas 2019). This can have a relevant impact for the individual, if treatment would be modified based on the interim PET scan results (such as in the studies by Andre 2017; Casasnovas 2019; Kobe 2018). Furthermore, using this cut‐off can lead to an overestimation of the positive outcomes in the interim PET‐positive group. In the study by Kobe 2018, in which cut‐off DS3 and DS4 were tested for PET‐positivity, the results showed no significant difference in DS1‐2 compared to DS3, but a significant difference between DS1‐3 and 4. Thereby, the authors argue for DS4 as the cut‐off value for PET‐positivity, which is interpreted as an [18F]‐fluorodeoxy‐D‐glucose (FDG) uptake higher than in the liver, instead of an uptake higher than in the mediastinum (corresponding DS3) (Kobe 2018). Hence, the implementation of a commonly used cut‐off in clinical practice is important in order to improve interobserver reliability and agreements between central reviewers, and is also highly crucial for the individual (Kobe 2018; Meignan 2009a). In the remaining studies included in this review, either different criteria were used (e.g. International Harmonization Project in Lymphoma criteria (Juweid 2007)), or no specific scale was indicated. However, in most studies, at least two nuclear medicine physicians independently interpreted the interim PET scan results.
One of the greatest issues regarding the prognostic factor studies in this review relates to the difficult reporting of their statistical analyses. Even when the methods of the statistical analyses were appropriate for the study design, the data were insufficiently reported in many of the included studies. We used hazard ratios (HRs) as the effect measure for time‐to‐event data in this review. We were able to pool data from only 15 studies, either because the HR and associated standard error (SE) were not reported, or because we did not have separate data for our participants or outcomes of interest. Out of these 15 studies, six studies reported an HR, but we still re‐calculated the value for four of them for different reasons. For example, values were re‐calculated either when we detected discrepancies between the text and corresponding graph(s) and table(s), or when they were simply not reported, while other relevant data were, helping us to estimate the HR and SE. For the remaining studies, we estimated the HR using other available data where possible (Altman 2012; Parmar 1998; Tierney 2007; Trivella 2006). For this reason, we contacted 10 principal investigators to clarify our questions and provide us with additional information or data, or both. This step was particularly helpful for deciding which data to pool.
We prespecified in our protocol that we would only pool adjusted associations of the index prognostic factor if analyses were based on an identical set of covariates. Although this was not feasible for our review, we suggest that future authors of systematic reviews of prognostic factor studies consider pre‐specifying a core set of covariates (established prognostic factors) that are important to the disease under review, and should be investigated in the included studies (Riley 2019; Riley 2019b). In this way, authors may be able to pool adjusted effect estimates, if studies are homogenous enough in the adjustment set of the other prognostic factors. In addition, we have moderate between‐study heterogeneity, which is reflected in the I² and wide confidence intervals (CIs). We took these issues around the reporting in the studies into account when we assessed risk of bias and GRADE.
Furthermore, the pooled estimates of the prognostic effect of the interim PET scan result in our analyses are based on crude HRs (no adjustment for covariates), therefore the reported results are at risk of overestimating the prognostic ability of the interim PET scan result. Hence, in light of the absence of adjustment for other prognostic factors, and considering the risk of bias assessment for the fifth domain of the QUIPS tool, we downgraded the strength of the evidence in our GRADE assessment. This is because it is widely acknowledged that adjusting the predictive effect of a specific prognostic factor for the contribution of other prognostic factors strengthens the robustness of the evidence on the clinically relevant prognostic ability of that factor (Riley 2019; Riley 2019b).
Lastly, although we did not conduct a test for funnel plot asymmetry as this type of test is not necessarily recommended for survival data due to issues of censoring (Debray 2018), we cannot exclude potential publication bias and the presence of small‐study effects in our review (Riley 2019). Firstly, we assume that publication bias may be present in our review as most studies in our analyses have rather small sample sizes, of which all present positive results on the prognostic ability of interim PET scan results. Secondly, most studies included in this review are retrospective studies that have not been pre‐registered, for example, in trial registries. Studies are also not always labelled or indexed as prognosis studies, and search filters for studies on prognosis are still under development, which is the main reason as to why we conducted a broad search with the disease (HL) and prognostic factor (PET) of interest. This led to a high number of search results that had to be screened. Thirdly, we identified a great number of conference abstracts on studies for which we could not find full‐text publications (see Characteristics of studies awaiting classification). Hence, based on these experiences, we cannot preclude that more studies may exist that have either not been published, or not indexed properly.
Certainty of the evidenceOur certainty of the evidence is presented in the Table 1.
Unadjusted resultsFor our primary outcome OS, we judged the certainty of the evidence as 'moderate'. We included nine studies in the meta‐analysis, of which eight were observational studies and one was a clinical trial. We used the data of participants from the standard arm (no treatment adaptation) of this trial. We judged the certainty of the evidence as 'moderate' due to some methodological issues. We downgraded by one point for risk of bias due to a high risk of bias in seven studies for the domain other prognostic factors (covariates), as well as a high risk of bias in three studies for the domain statistical analysis and reporting. In addition, we downgraded by one point for imprecision because the HR had to be estimated in seven studies, and re‐calculated in one study. Hence, only one out of nine studies reported a HR that we used. Nevertheless, we upgraded by one point for a large effect showing the large difference in the OS between interim PET‐positive and interim PET‐negative participants (HR 5.09, CI 2.64 to 9.81).
For the outcome PFS, we judged the certainty of the evidence as 'very low'. We included 14 studies in the meta‐analysis, of which 12 were observational studies and two were clinical trials (participants from the standard arms). For this outcome, we downgraded by one point for inconsistency because the definition of PFS varied across the studies. We also downgraded by one point for imprecision because the HR had to be estimated in 10 studies and re‐calculated in one study. Hence, we were able to use a reported HR for only three out of 14 studies. In addition, we downgraded by one point for risk of bias, because of a high risk of bias in eight studies for the domain other prognostic factors (covariates), and high risk of bias in six studies for the domain 'statistical analysis and reporting'.
Adjusted resultsFor the outcome OS, two studies reported adjusted results from multivariable analyses including established prognostic factors (e.g. International Prognostic Score) in individuals with HL, and the results of both studies indicate the independent prognostic ability of interim PET to predict OS. We judged our certainty in the evidence as 'moderate' for this outcome due to some methodological issues. We downgraded by one point for risk of bias due to a high risk of bias in the domains other prognostic factors (covariates) and statistical analysis and reporting for one study.
For the outcome PFS, there were eight studies that reported adjusted results (adjusted for e.g. disease stage or B symptoms). All studies found that interim PET scan results have an independent prognostic ability to predict PFS. However, we rated our certainty in the evidence as 'low' for this outcome. We downgraded by one point for risk of bias due to a high risk of bias in the domain study participation in one study, as well as a high risk of bias in the domains other prognostic factors (covariates) and statistical analysis and reporting in a second study. Furthermore, we downgraded by one point for inconsistency because the studies included a heterogenous set of covariates in the multivariable analyses, which made the pooling of adjusted results not feasible.
Potential biases in the review processTo prevent bias in this review, two teams of two review authors independently performed all relevant processes (i.e. screening, data extraction, risk of bias and GRADE assessment). Due to the complexity of assessing bias in prognostic factor studies, as well as assessing the certainty of the evidence from these types of studies, we conducted several teleconferences with different experts in the field of prognosis to discuss our assessments. We consulted Jill Hayden (Hayden 2013) for the 'Risk of bias' assessment, and the GRADE for Prognosis working group for the GRADE assessment. In particular, the methods for grading the evidence from prognosis studies are still under development.
For the 'Risk of bias' assessment, we are aware that adding 'unclear' as a fourth possible rating, thereby setting an example for future authors, can lead to a potential bias in the assessment. However, for our assessment we only used 'unclear' when relevant information was evidently missing, thereby making it difficult to make a fair and transparent judgement for the respective study and domain. We felt that rating a domain as high risk of bias in such cases would be inappropriate. We clearly advise against the use of 'unclear' as a default option and want to recommend future authors of reviews of prognosis studies to use this fourth rating carefully (if the fourth rating will be included in an update of the QUIPS tool).
Our analyses included post‐hoc subgroup analyses on the type of PET measurement (PET versus PET‐CT), as well as post‐hoc sensitivity analyses on the timing of the interim PET and the type of estimation (see Methods) used to estimate missing values. These analyses were necessary due to the heterogeneity between the studies. Results should be interpreted in light of differences that can exist when participants receive a PET‐CT as compared to a PET scan only. Furthermore, the timing of the interim PET is crucial, as PET1 and PET2 may provide different results compared to PET3 and PET4.
Regarding the adjusted results, we refrained from pooling results because, although the studies looked at established prognostic factors, they did not include identical sets of covariates. As the studies are already very heterogeneous, pooling of the adjusted results was not feasible for our review, as the comparison and interpretation of these results may be problematic in this case. To avoid this in the future, we suggest pre‐defining a core set of covariates in order to enable pooling of adjusted results (Riley 2019).
Agreements and disagreements with other studies or reviewsIn our review, we included studies that have assessed the prognostic value of interim PET in HL participants without treatment modification. Overall, the findings from this review are in agreement with similar reviews and studies that have investigated the prognostic value of interim PET. Our results are also in agreement with the literature that interim PET can be used for disease and therapy monitoring (Barrington 2017a). Some reviews and studies have investigated this in participants in whom the treatment was changed based on the interim PET scan result, and have come to similar conclusions that interim PET can predict outcome in the different groups (PET‐negative and PET‐positive participants).
We are aware of three systematic reviews (Adams 2015a; Amitai 2018;Sickinger 2015) that have investigated interim PET as a prognostic factor. Adams 2015a included ten studies with limited‐, intermediate‐ and advanced‐stage HL participants in whom the treatment regimen was not modified based on the interim PET scan results. In fact, nine out of these 10 studies are also included in our review. One study was not included in our review because they only included children. The authors of this review concluded that a negative interim PET cannot exclude treatment failure, but that a positive interim PET can identify and predict treatment failure. The authors assessed the quality of the studies with the QUIPS tool (as we did in our review) and judged the overall methodological quality of the included studies as moderate. We have compared their QUIPS assessment with ours for each individual study, and identified that for the domains study participation and study attrition in particular, we found agreements between the authors and our review that there is a low risk of bias in the studies. Disagreement was found regarding the domain prognostic factor measurement, for which the authors judged the quality as moderate mainly due to the heterogeneity between the studies regarding the use of PET‐CT versus PET only, which is an issue that we have also addressed in our review by subgroup analysis.
Comparison of interim PET with end PETNine of the included studies compared the performances of interim PET and end‐of treatment PET (end PET) (Barnes 2011; Hutchings 2006; Hutchings 2014; Markova 2012; Mesguich 2016; Orlacchio 2012; Straus 2011; Ying 2014; Zinzani 2012), as omitting one of the two can have an impact on radiation safety for the patient. However, results between studies are rather contradictory. For example, in Barnes 2011 the authors could not detect a significant difference in OS and PFS between interim PET‐negative and interim PET‐positive participants. In their analyses, interim PET‐positive participants that were negative at end PET had the same good outcomes as participants who were negative both at interim and end PET. In addition, after end PET, the difference between end PET‐positive and end PET‐negative participants was fairly high, with a greater four‐year OS and PFS for end‐PET‐negative participants. In this study, 74 (end PET) out of 79 participants (interim PET) remained PET‐negative, while nine (end PET) out of 17 (interim PET) participants remained PET‐positive. The authors concluded that end‐PET (after six cycles of chemotherapy) predicts outcome, rather than interim PET (after two or four cycles of chemotherapy). In Hutchings 2006, interim PET was conducted after two and four cycles of chemotherapy (total number of cycles was six to eight). Results show that PET2 and PET4 were similarly successful in predicting outcome in participants, but the authors of the study still argue that treatment modifications should be indicated as early as possible (e.g. after PET2) in order to achieve the best possible outcome. In the study by Mesguich 2016, interim PET was also lower in its predictive ability compared to end PET. Out of 60 interim PET‐negative participants, seven converted to a positive end PET. Out of 16 interim PET‐positive participants, seven converted to a negative end PET. In addition, treatment failure was most common in participants with a positive end PET as compared to participants with a positive interim PET. The sensitivity of interim PET was measured as 47% compared to 80% of end PET (Mesguich 2016).
Contrastingly, Orlacchio 2012 detected a very high negative predictive value (NPV) of 98% for interim PET2, with an overall diagnostic accuracy of 86.4%. Out of 104 interim PET‐negative participants, 102 were still negative after end PET. Out of 28 interim PET‐positive participants, however, 16 converted to a negative end PET. A high NPV for interim PET was also found in Hutchings 2005 (interim PET2/3) as interim PET‐negative participants rarely relapsed. In Hutchings 2014, 89 participants had an interim PET1 and PET2, and both show a strong prognostic ability for predicting outcome. In this study, none of the participants in early stages that had a negative interim PET1 progressed or relapsed. Advanced‐stage participants with a negative interim PET1 had a long‐term PFS of more than 90%. The three‐year PFS of interim PET1‐positive participants was 30%. In total, 89 participants had both PET1 and PET2. Out of these, 62 were PET1‐negative, and after treatment, 60 were in complete remission. Twenty‐seven participants were PET1‐positive, of which 15 were in complete remission. To compare, 76 participants were PET2‐negative, of whom 70 were in complete remission. Thirteen participants were PET2‐positive, of which five were in complete remission. The negative predictive value of PET1 was reported as 96.8%, while the positive predictive value was 44.4%. Zinzani 2012 also reported that interim PET after two cycles is highly predictive of OS and PFS. In their study, 92% of the interim PET‐negative participants (n = 251) were in continuous complete remission as compared to 24.5% of the interim PET‐positive participants (n = 53). These conclusions are supported by Ying 2014, although their sample size (n = 35) is too small to provide definite answers. Straus 2011 supported these statements particularly for participants in early stages (as included in their study), as participants with a negative interim PET2 result had a PFS of about 90%, compared to 50% for interim PET‐positive participants, at two years. Markova 2012 reported similar findings for interim PET4, which had a high NPV of 98%. Out of 68 participants in total, 50 had a negative interim PET, but 59 a negative end PET. In other words, nine interim PET‐positive participants were end PET‐negative after chemotherapy. The other nine participants who were interim PET‐positive were also end PET‐positive. At both timings (PET4 and PET6/8) the authors found a significant difference in the survival between PET‐positive and PET‐negative participants. The high NPV of interim PET supports early de‐escalation of chemotherapy, or omitting radiotherapy, in order to reduce the risk of toxicity and adverse events related to the harsh treatment.
Treatment adaptation based on interim PETAlthough not an aim of our review, we considered it important to discuss some results from recently published randomised controlled trials (RCTs) in which the interim PET scan result was used to adapt the therapy for individuals with HL in order to improve outcomes (Andre 2017; Casasnovas 2019; Johnson 2016; Kobe 2018), based on the premise that interim PET scan results are indeed prognostic. For example, in the trial by Johnson 2016, the primary aim was to test the omission of bleomycin due to its toxic effects. All participants (N = 1214, advanced stages) started with ABVD chemotherapy. After interim PET2, PET‐positive (DS4‐5) participants (N = 182) were assigned to BEACOPP, and PET‐negative (DS1‐3) participants (N = 935) were randomised to receive either ABVD or AVD. Results show that three‐year PFS was slightly better in the ABVD group compared to the AVD group (85.7% versus 84.4%, respectively). Regarding three‐year OS, the ABVD group reached 97.2% compared to 97.6% in the AVD group. Hence, there were no significant subgroup differences. However, grade 3 and 4 AEs due to the chemotherapy were more common in the ABVD group. In the PET‐positive group, which was escalated to BEACOPP chemotherapy, 3‐year PFS was 67.5% and 3‐year OS was 87.8%.
In another example by Casasnovas 2019, 823 advanced‐stage HL participants were randomly assigned to standard treatment group or PET‐driven treatment group. All participants received two cycles of BEACOPPescalated as the initial therapy and interim PET was conducted thereafter. PET‐positive participants in both groups, as well as PET‐negative participants in the standard group continued with the initial therapy after PET2. PET‐negative participants in the experimental arm, however, were switched to two cycles of ABVD. Results of five‐year PFS show a similar survival of PET‐negative participants in the standard group and experimental group: 88.4% and 89.4%, respectively.
Several systematic reviews were also published that investigated treatment adaptation based on interim PET scan results. Amitai 2018 included 13 studies (of which four were RCTs) that investigated interim PET‐adapted treatment in advanced‐staged HL. Their findings support the statement that PET‐adapted treatment is an appropriate strategy and that it should be considered as standard care for advanced HL (Amitai 2018). This finding is supported by a Phase II RCT (Carras 2018), which assessed interim PET‐response adapted treatment strategy in advanced‐stage HL. The authors concluded that early salvage therapy and high‐dose chemotherapy or autologous stem cell transplant (ASCT) for PET2‐positive participants is safe and can lead to similar positive outcomes as in PET2‐negative participants (Carras 2018). To compare, Sickinger 2015 included studies in which the treatment was also modified, but concluded that PFS was shorter in individuals with early‐stage HL and a negative PET scan receiving chemotherapy only (PET‐adapted therapy) than in those receiving additional RT (standard therapy). This finding was confirmed in another review by Blank 2017, showing improved PFS in early‐stage participants receiving radiotherapy in addition to chemotherapy. However, the overall methodological quality of the included studies in both reviews was judged as moderate (for PFS) to very low (for OS). Constrasting evidence on the clinical and prognostic value of interim PET‐adapted treatment was also found in non‐systematic reviews, which particularly acknowledge the heterogeneity between available studies that makes it difficult to give definite conclusions (Adams 2016a; Berriolo‐Riedinger 2018).
Authors' conclusions Implications for practiceThis review provides moderate‐certainty evidence that interim positron emission tomography (PET) scan results predict overall survival (OS), and very low‐certainty evidence that interim PET scan results predict progression‐free survival (PFS) in individuals with Hodgkin lymphoma (HL) (evidence of the pooled, unadjusted results). The evidence on the ability of interim PET scan results to distinguish between individuals with a poor prognosis and individuals with a good prognosis can aid decision‐making for clinicians and diagnosed individuals, and the evidence may be used in international treatment guidelines for individuals with HL.
Implications for research Multivariable analyses and prognostic modelsThus far, the prognostic value of interim PET has mostly been assessed in univariable analyses, in which its prognostic ability of determining survival outcomes in individuals with HL has been shown. However, using one single factor is usually not sufficient to give a satisfactory prediction of an outcome, and clinicians, therefore, usually additional factors to give an accurate prediction of an individual's disease progression and health outcome (Moons 2009). Hence, it is important to assess the independent prognostic value of the prognostic factor of interest (in this case interim PET) against established prognostic factors such as disease stage, age, sex, B symptoms or other relevant clinical and individual factors in multivariable analyses as well (Moons 2009; Riley 2019). In such analyses, the independent prognostic ability of a factor, as well as its incremental value on top of other prognostic factors, can be assessed (Moons 2009). In a next step, prognostic models can be built that include multiple prognostic factors that have been proven to be predictive of outcome. Such models are built for risk adaptation and treatment stratification for participants who present those specific factors included in a prediction model for a specific disease, and thereby enables more individualised disease monitoring and treatment guidance. Using a combination of factors, rather than one factor only, allows for a more individual and accurate estimate of the risk of a patient to experience a certain health event (or outcome) within a specific period of time (Moons 2009; Steyerberg 2013).
With regard to our index prognostic factor, we could pool adjusted results in meta‐analyses in an update of this review if new studies would adjust for the same set of prognostic factors (covariates). There is a number of different established clinical and individual prognostic factors that can be used to predict survival outcomes in individuals with HL (Cuccaro 2014; Josting 2010; Kılıçkap 2013). In order to enable pooling of adjusted results, future authors of systematic reviews of prognostic factor studies could define a core set of covariates a priori (Riley 2019).
Study designThere is some evidence from retrospective studies that interim PET scan results can predict outcome in individuals during chemotherapy. However, it is commonly agreed that the true prognostic value of this factor can best be assessed in randomised controlled trials (RCTs), in which participants are randomly assigned to a standard or an experimental arm. In the standard arm, participants continue with the planned therapy regimen independent of the interim PET scan result. In the experimental arm, however, different treatments are given according to the interim PET scan result, e.g. de‐escalation of treatment in interim PET‐negative participants. Hence, RCTs are the most suitable study design, with results from experimental arms in which participants receive therapy adaptation based on the interim PET scan result providing the most robust evidence on whether outcome can be approved, while treatment can be safer, by this strategy. Although assessing therapy modification was not an aim of our review, we judged it important to present and discuss some results of published trials that evaluated the impact of PET‐adapted treatment on survival outcomes.
What's new Date Event Description 14 August 2020 Amended Following correspondence between the editorial base and the funding institution of one of the authors, the internal sources of support and the acknowledging statement was updated. HistoryProtocol first published: Issue 4, 2017 Review first published: Issue 9, 2019
Date Event Description 20 December 2019 New citation required but conclusions have not changed Following correspondence between the authors and one of the peer reviewers post‐publication, the authors have revised some of the risk of bias judgements. Some terminology around confounding has also been changed. 20 December 2019 Amended Following correspondence between the authors and one of the peer reviewers post‐publication, the authors have revised some of the risk of bias judgements. Some terminology around confounding has also been changed. AcknowledgementsThis review was published in collaboration with the Cochrane Fast‐Track Service. We particularly thank Helen Wakeford (Managing Editor) and Heather Maxwell (copy‐editor) for their support. The Fast‐Track Service team made comments on the review and managed the editorial process.
We thank Nicola Köhler of the Cochrane Haematological Malignancies (CHM) Editorial Base as well as the editors Dr. Julia Bohlius and PD Dr. Sebastian Theurich and the consumer editor Anne Lyddiatt for commenting on the protocol of this review.
We thank Ina Monsef of CHM for developing the search strategy and conducting the search for this review, and Marius Goldkuhle and Vanessa Piechotta for commenting on the first submitted draft of this review.
We thank Jill Hayden for her help with the QUIPS tool, and particularly for agreeing to our adaptation of the tool for the purpose of this review.
We thank Yu‐Tian Xiao for the translation of the Chinese publication (Ying 2014).
We thank all principal investigators who have replied to all of our questions and provided us with more information or data, thereby helping us to include as many studies as possible in our analysis. Hence, we want to thank Prof. Dr. Jan Maciej Zaucha, Prof. Dr. Andrea Gallamini, Prof. Dr. Peter Borchmann and his colleague Helen Görgen, Prof. Dr. Pier Luigi Zinzani, Dr. Martin Hutchings, Prof. Dr. Marc André and his colleague Dr. John Raemaekers.
We thank Kym Snell and Sarah Hodgkinson for commenting on the first submitted draft of this review, as well as Robin Featherstone for commenting on the search strategy.
We thank all external peer‐reviewers who read and commented on this review. We thank Robert Wolff, Bob Phillips, Michel Meignan, Tim Illidge, Julian Higgins and Ulrike Holtkamp who greatly helped to improve this review.
The research was supported by NHS Blood and Transplant and the National Institute for Health Research (NIHR) Oxford Biomedical Research Centre (BRC). The views expressed are those of the author(s) and not necessarily those of the NHS, the NIHR or the Department of Health.
Appendices Appendix 1. Cochrane Central Register of Controlled Trials search strategySearches until 07/02/2016
ID Search
#1 MeSH descriptor: [Lymphoma] this term only
#2 MeSH descriptor: [Hodgkin Disease] explode all trees
#3 Germinoblastom*
#4 Reticulolymphosarcom*
#5 Hodgkin*
#6 (malignan* near/2 (lymphogranulom* or granulom*))
#7 #1 or #2 or #3 or #4 or #5 or #6
#8 MeSH descriptor: [Positron‐Emission Tomography] explode all trees
#9 (pet* or petscan*)
#10 tomograph*
#11 emission*
#12 MeSH descriptor: [Deoxyglucose] explode all trees
#13 MeSH descriptor: [Fluorodeoxyglucose F18] explode all trees
#14 (deoxyglucose* or desoxyglucose* or deoxy‐glucose* or desoxy‐glucose* or deoxy‐d‐glucose* or desoxy‐d‐glucose* or 2deoxyglucose* or 2deoxy‐d‐glucose* or fluorodeoxyglucose* or fluorodesoxyglucose* or fludeoxyglucose* or fluordeoxyglucose* or fluordesoxyglucose* or 18fluorodeoxyglucose* or 18fluorodesoxyglucose* or 18fluordeoxyglucose* or fdg* or 18fdg* or 18f‐dg*)
#15 (fluor* or 2fluor* or fluoro* or fluorodeoxy* or fludeoxy* or fluorine* or 18f* or 18flu*)
#16 glucose*
#17 #8 or #9 or #10 or #11 or #12 or #13 or #14 or #15 or #16
#18 #7 and #17 in trials
Searches from 08/02/2016 ‐ 13/07/2017
ID Search
#1 MeSH descriptor: [Lymphoma] this term only
#2 MeSH descriptor: [Hodgkin Disease] explode all trees
#3 Germinoblastom*
#4 Reticulolymphosarcom*
#5 Hodgkin*
#6 (malignan* near/2 (lymphogranulom* or granulom*))
#7 #1 or #2 or #3 or #4 or #5 or #6
#8 MeSH descriptor: [Positron‐Emission Tomography] explode all trees
#9 (pet* or petscan*)
#10 tomograph*
#11 emission*
#12 MeSH descriptor: [Deoxyglucose] explode all trees
#13 MeSH descriptor: [Fluorodeoxyglucose F18] explode all trees
#14 (deoxyglucose* or desoxyglucose* or deoxy‐glucose* or desoxy‐glucose* or deoxy‐d‐glucose* or desoxy‐d‐glucose* or 2deoxyglucose* or 2deoxy‐d‐glucose* or fluorodeoxyglucose* or fluorodesoxyglucose* or fludeoxyglucose* or fluordeoxyglucose* or fluordesoxyglucose* or 18fluorodeoxyglucose* or 18fluorodesoxyglucose* or 18fluordeoxyglucose* or fdg* or 18fdg* or 18f‐dg*)
#15 (fluor* or 2fluor* or fluoro* or fluorodeoxy* or fludeoxy* or fluorine* or 18f* or 18flu*)
#16 glucose*
#17 #15 and #16
#18 #8 or #9 or #10 or #11 or #12 or #13 or #14 or #17
#19 #7 and #18 in Trials
#20 #19 Publication Year from 2016 to 2017
Searches from 12/07/2017 ‐ 12/11/2018
Cochrane Central Register of Controlled Trials (Cochrane Library Issue 11, 2018)
ID Search
#1 MeSH descriptor: [Lymphoma] this term only
#2 MeSH descriptor: [Hodgkin Disease] this term only
#3 germinoblastom*
#4 reticulolymphosarcom*
#5 hodgkin*
#6 (malignan* near/2 (lymphogranulom* or granulom*))
#7 #1 or #2 or #3 or #4 or #5 or #6
#8 MeSH descriptor: [Positron‐Emission Tomography] explode all trees
#9 (pet*)
#10 tomograph*
#11 emission*
#12 MeSH descriptor: [Deoxyglucose] explode all trees
#13 MeSH descriptor: [Fluorodeoxyglucose F18] this term only
#14 (deoxyglucose* or desoxyglucose* or deoxy‐glucose* or desoxy‐glucose* or deoxy‐d‐glucose* or desoxy‐d‐glucose* or 2deoxyglucose* or 2deoxy‐d‐glucose* or fluorodeoxyglucose* or fluorodesoxyglucose* or fludeoxyglucose* or fluordeoxyglucose* or fluordesoxyglucose* or 18fluorodeoxyglucose* or 18fluorodesoxyglucose* or 18fluordeoxyglucose* or fdg* or 18fdg* or 18f‐dg*)
#15 (glucose* and (fluor* or 2fluor* or fludeoxy* or 18f*))
#16 #8 or #9 or #10 or #11 or #12 or #13 or #14 or #15
#17 #7and #16 in Trials
key: *: truncation, near/#: adjacent within # number of words
Searches from 12/11/2018 ‐ 02/04/2019
ID Search
#1 MeSH descriptor: [Lymphoma] this term only
#2 MeSH descriptor: [Hodgkin Disease] explode all trees
#3 Germinoblastom*
#4 Reticulolymphosarcom*
#5 Hodgkin*
#6 (malignan* near/2 (lymphogranulom* or granulom*))
#7 #1 or #2 or #3 or #4 or #5 or #6
#8 MeSH descriptor: [Positron‐Emission Tomography] explode all trees
#9 (pet* or petscan*)
#10 tomograph*
#11 emission*
#12 MeSH descriptor: [Deoxyglucose] explode all trees
#13 MeSH descriptor: [Fluorodeoxyglucose F18] explode all trees
#14 (deoxyglucose* or desoxyglucose* or deoxy‐glucose* or desoxy‐glucose* or deoxy‐d‐glucose* or desoxy‐d‐glucose* or 2deoxyglucose* or 2deoxy‐d‐glucose* or fluorodeoxyglucose* or fluorodesoxyglucose* or fludeoxyglucose* or fluordeoxyglucose* or fluordesoxyglucose* or 18fluorodeoxyglucose* or 18fluorodesoxyglucose* or 18fluordeoxyglucose* or fdg* or 18fdg* or 18f‐dg*)
#15 (fluor* or 2fluor* or fluoro* or fluorodeoxy* or fludeoxy* or fluorine* or 18f* or 18flu*)
#16 glucose*
#17 #15 and #16
#18 #8 or #9 or #10 or #11 or #12 or #13 or #14 or #17
#19 #7 and #18 in Trials
Appendix 2. MEDLINE Ovid search strategy # Searches until 02/02/2016 1 *LYMPHOMA/ 2 exp HODGKIN DISEASE/ 3 Germinoblastom$.tw,kf,ot. 4 Reticulolymphosarcom$.tw,kf,ot. 5 Hodgkin$.tw,kf,ot. 6 (malignan$ adj2 (lymphogranulom$ or granulom$)).tw,kf,ot. 7 or/1‐6 8 POSITRON‐EMISSION TOMOGRAPHY/ 9 (pet$ or petscan$).tw,kf,ot. 10 tomograph$.tw,kf,ot. 11 emission$.tw,kf,ot. 12 exp DEOXYGLUCOSE/ 13 FLUORODEOXYGLUCOSE F18/ 14 (deoxyglucose$ or desoxyglucose$ or deoxy‐glucose$ or desoxy‐glucose$ or deoxy‐d‐glucose$ or desoxy‐d‐glucose$ or 2deoxyglucose$ or 2deoxy‐d‐glucose$ or fluorodeoxyglucose$ or fluorodesoxyglucose$ or fludeoxyglucose$ or fluordeoxyglucose$ or fluordesoxyglucose$ or 18fluorodeoxyglucose$ or 18fluorodesoxyglucose$ or 18fluordeoxyglucose$ or fdg$ or 18fdg$ or 18f‐dg$).tw. 15 (fluor$ or 2fluor$ or fluoro$ or fluorodeoxy$ or fludeoxy$ or fluorine$ or 18f$ or 18flu$).tw. 16 glucose$.tw. 17 or/8‐16 18 7 and 17 19 ANIMALS/ not HUMANS/ 20 18 not 19 # Searches from 03/02/2016 – 12/07/2017 1 *LYMPHOMA/ 2 exp HODGKIN DISEASE/ 3 Germinoblastom$.tw,kf,ot. 4 Reticulolymphosarcom$.tw,kf,ot. 5 Hodgkin$.tw,kf,ot. 6 (malignan$ adj2 (lymphogranulom$ or granulom$)).tw,kf,ot. 7 or/1‐6 8 exp POSITRON‐EMISSION TOMOGRAPHY/ 9 (pet$ or petscan$).tw,kf,ot. 10 tomograph$.tw,kf,ot. 11 emission$.tw,kf,ot. 12 exp DEOXYGLUCOSE/ 13 FLUORODEOXYGLUCOSE F18/ 14 (deoxyglucose$ or desoxyglucose$ or deoxy‐glucose$ or desoxy‐glucose$ or deoxy‐d‐glucose$ or desoxy‐d‐glucose$ or 2deoxyglucose$ or 2deoxy‐d‐glucose$ or fluorodeoxyglucose$ or fluorodesoxyglucose$ or fludeoxyglucose$ or fluordeoxyglucose$ or fluordesoxyglucose$ or 18fluorodeoxyglucose$ or 18fluorodesoxyglucose$ or 18fluordeoxyglucose$ or fdg$ or 18fdg$ or 18f‐dg$).tw. 15 (fluor$ or 2fluor$ or fluoro$ or fluorodeoxy$ or fludeoxy$ or fluorine$ or 18f$ or 18flu$).tw. 16 glucose$.tw. 17 15 and 16 18 or/8‐14 19 17 or 18 20 7 and 19 21 ANIMALS/ not HUMANS/ 22 20 not 21 23 limit 22 to ed=20160203‐20170712 # Searches from 12/07/2017 ‐ 12/11/2018 1 *LYMPHOMA/ 2 HODGKIN DISEASE/ 3 germinoblastom$.tw,kf,ot. 4 reticulolymphosarcom$.tw,kf,ot. 5 hodgkin$.tw,kf,ot. 6 (malignan$ adj2 (lymphogranulom$ or granulom$)).tw,kf,ot. 7 or/1‐6 8 exp POSITRON‐EMISSION TOMOGRAPHY/ 9 (pet$).tw,kf,ot. 10 tomograph$.tw,kf,ot. 11 emission$.tw,kf,ot. 12 exp DEOXYGLUCOSE/ 13 FLUORODEOXYGLUCOSE F18/ 14 (deoxyglucose$ or desoxyglucose$ or deoxy‐glucose$ or desoxy‐glucose$ or deoxy‐d‐glucose$ or desoxy‐d‐glucose$ or 2deoxyglucose$ or 2deoxy‐d‐glucose$ or fluorodeoxyglucose$ or fluorodesoxyglucose$ or fludeoxyglucose$ or fluordeoxyglucose$ or fluordesoxyglucose$ or 18fluorodeoxyglucose$ or 18fluorodesoxyglucose$ or 18fluordeoxyglucose$ or fdg$ or 18fdg$ or 18f‐dg$).tw. 15 (glucose$ and (fluor$ or 2fluor$ or fludeoxy$ or 18f$)).tw. 16 or/8‐15 17 7 and 16 18 exp ANIMALS/ not HUMANS/ 19 17 not 18 20 limit 19 to ed=20160203‐20170712 21 limit 19 to ed=20170712‐20181112 22 ANIMALS/ not HUMANS/ 23 21 not 22 24 limit 23 to ed=20160203‐20170712 25 limit 23 to ed=20170712‐20181112 # Searches from 12/11/2018 ‐ 02/04/2019 1 *LYMPHOMA/ 2 HODGKIN DISEASE/ 3 Germinoblastom$.tw,kf,ot. 4 Reticulolymphosarcom$.tw,kf,ot. 5 Hodgkin$.tw,kf,ot. 6 (malignan$ adj2 (lymphogranulom$ or granulom$)).tw,kf,ot. 7 or/1‐6 8 exp POSITRON‐EMISSION TOMOGRAPHY/ 9 (pet$ or petscan$).tw,kf,ot. 10 tomograph$.tw,kf,ot. 11 emission$.tw,kf,ot. 12 exp DEOXYGLUCOSE/ 13 Fluorodeoxyglucose F18/ 14 (deoxyglucose$ or desoxyglucose$ or deoxy‐glucose$ or desoxy‐glucose$ or deoxy‐d‐glucose$ or desoxy‐d‐glucose$ or 2deoxyglucose$ or 2deoxy‐d‐glucose$ or fluorodeoxyglucose$ or fluorodesoxyglucose$ or fludeoxyglucose$ or fluordeoxyglucose$ or fluordesoxyglucose$ or 18fluorodeoxyglucose$ or 18fluorodesoxyglucose$ or 18fluordeoxyglucose$ or fdg$ or 18fdg$ or 18f‐dg$).tw. 15 (glucose$ and (fluor$ or 2fluor$ or fludeoxy$ or 18f$)).tw. 16 or/8‐15 17 7 and 16 18 exp ANIMALS/ not HUMANS/ 19 17 not 18 20 limit 19 to ed=20160203‐20170712 21 limit 19 to ed=20170712‐20181112 22 limit 19 to ed=20181112‐20190402key: exp # /: explode # MeSH subject heading, tw: text word, kf: keyword heading word, ot: original title, ti: title, $: truncation, adj#: adjacent within # number of words
Appendix 3. Embase/Ovid search strategy # Searches 1 exp CLASSICAL HODGKIN LYMPHOMA/ 2 *HODGKIN DISEASE/ 3 germinoblastom*.tw,kw. 4 reticulolymphosarcom*.tw,kw. 5 hodgkin*.tw,kw. 6 (malignan* adj2 (lymphogranulom* or granulom*)).tw,kw. 7 or/1‐6 8 exp POSITRON EMISSION TOMOGRAPHY/ 9 pet*.tw,kw. 10 tomograph*.tw,kw. 11 emission*.tw,kw. 12 exp DEOXYGLUCOSE/ 13 FLUORODEOXYGLUCOSE F 18/ 14 (deoxyglucose* or desoxyglucose* or deoxy‐glucose* or desoxy‐glucose* or deoxy‐d‐glucose* or desoxy‐d‐glucose* or 2deoxyglucose* or 2deoxy‐d‐glucose* or fluorodeoxyglucose* or fluorodesoxyglucose* or fludeoxyglucose* or fluordeoxyglucose* or fluordesoxyglucose* or 18fluorodeoxyglucose* or 18fluorodesoxyglucose* or 18fluordeoxyglucose* or fdg* or 18fdg* or 18f‐dg*).tw. 15 (glucose* and (fluor* or 2fluor* or fludeoxy* or 18f*)).tw. 16 or/8‐15 17 7 and 16 18 exp ANIMAL/ not HUMAN/ 19 17 not 18 20 limit 19 to yr="1990 ‐Current" 21 meta‐analys:.mp. or search:.tw. or review.pt. 22 (child* or p?ediatric*).ti. 23 20 not (21 or 22) 24 limit 23 to embase 25 limit 23 to conference abstracts 26 24 or 25key: exp # /: explode # EMTREE term, * # /: focus # EMTREE term, /: EMTREE term, tw: text word, kw: keyword, ti: title, mp: multiple purpose, pt: publication type, *: truncation, ?: wildcard
search line #21: Review Embase search filter ‐ best balance of sensitivity and specificity https://hiru.mcmaster.ca/hiru/HIRU_Hedges_EMBASE_Strategies.aspx
Data and analyses Comparison 1. Univariable comparison of PET+ve vs. PET‐ve. Outcome or subgroup title No. of studies No. of participants Statistical method Effect size 1.1 Overall survival 9 1802 Hazard Ratio (IV, Random, 95% CI) 5.09 [2.64, 9.81] 1.2 Progression‐free survival 14 2079 Hazard Ratio (IV, Random, 95% CI) 4.90 [3.47, 6.90] Comparison 2. Subgroups in univariable comparison of OS: PET+ve vs. PET‐ve. Outcome or subgroup title No. of studies No. of participants Statistical method Effect size 2.1 OS by radiotherapy 9 1802 Hazard Ratio (IV, Random, 95% CI) 5.09 [2.64, 9.81] 2.1.1 Involved node and/or site 3 548 Hazard Ratio (IV, Random, 95% CI) 3.45 [1.22, 9.72] 2.1.2 involved field 4 428 Hazard Ratio (IV, Random, 95% CI) 12.75 [4.98, 32.68] 2.1.3 not specified 2 826 Hazard Ratio (IV, Random, 95% CI) 2.80 [1.17, 6.67] 2.2 OS by study design 8 1717 Hazard Ratio (IV, Random, 95% CI) 4.63 [2.43, 8.80] 2.2.1 Prospective 3 406 Hazard Ratio (IV, Random, 95% CI) 5.35 [1.07, 26.68] 2.2.2 Retrospective 4 589 Hazard Ratio (IV, Random, 95% CI) 7.12 [3.14, 16.14] 2.2.3 RCT 1 722 Hazard Ratio (IV, Random, 95% CI) 2.60 [1.03, 6.56] 2.3 OS by chemotherapy 9 1802 Hazard Ratio (IV, Random, 95% CI) 5.09 [2.64, 9.81] 2.3.1 ABVD 5 801 Hazard Ratio (IV, Random, 95% CI) 5.19 [2.11, 12.72] 2.3.2 ABVD and/or other 3 279 Hazard Ratio (IV, Random, 95% CI) 10.30 [1.71, 62.13] 2.3.3 BEACOPP 1 722 Hazard Ratio (IV, Random, 95% CI) 2.60 [1.03, 6.56] 2.4 OS for PET/CT vs PET 8 1706 Hazard Ratio (IV, Random, 95% CI) 5.01 [2.50, 10.02] 2.4.1 PET/CT 5 595 Hazard Ratio (IV, Random, 95% CI) 4.70 [1.86, 11.86] 2.4.2 PET only 3 1111 Hazard Ratio (IV, Random, 95% CI) 6.99 [1.58, 30.90] 2.5 OS by disease stage 9 1802 Odds Ratio (IV, Random, 95% CI) 5.09 [2.64, 9.81] 2.5.1 Stages I and II with A and B symptoms 1 96 Odds Ratio (IV, Random, 95% CI) 9.21 [0.71, 120.03] 2.5.2 All stages 7 984 Odds Ratio (IV, Random, 95% CI) 6.28 [2.62, 15.05] 2.5.3 Advanced 1 722 Odds Ratio (IV, Random, 95% CI) 2.60 [1.03, 6.56] 2.6 Timing of interim PET 9 1802 Hazard Ratio (IV, Random, 95% CI) 5.09 [2.64, 9.81] 2.6.1 PET2 6 1495 Hazard Ratio (IV, Random, 95% CI) 3.53 [1.97, 6.32] 2.6.2 Other (including mixed) 3 307 Hazard Ratio (IV, Random, 95% CI) 20.13 [5.04, 80.38] 2.7 OS by HR type of estimation 9 1802 Hazard Ratio (IV, Random, 95% CI) 5.09 [2.64, 9.81] 2.7.1 precise 7 1638 Hazard Ratio (IV, Random, 95% CI) 5.70 [2.60, 12.48] 2.7.2 Imprecise 2 164 Hazard Ratio (IV, Random, 95% CI) 3.60 [0.89, 14.64] Comparison 3. Subgroups in univariable comparison of PFS: PET+ve vs. PET‐ve. Outcome or subgroup title No. of studies No. of participants Statistical method Effect size 3.1 PFS by study design 13 1349 Hazard Ratio (IV, Random, 95% CI) 5.66 [4.02, 7.97] 3.1.1 prospective 3 357 Hazard Ratio (IV, Random, 95% CI) 3.95 [2.23, 7.00] 3.1.2 retrospective 8 827 Hazard Ratio (IV, Random, 95% CI) 6.85 [4.66, 10.08] 3.1.3 RCT 2 165 Hazard Ratio (IV, Random, 95% CI) 6.21 [2.87, 13.42] 3.2 PFS by chemotherapy 14 2079 Hazard Ratio (IV, Random, 95% CI) 4.90 [3.47, 6.90] 3.2.1 ABVD 7 945 Hazard Ratio (IV, Random, 95% CI) 5.13 [3.18, 8.27] 3.2.2 ABVD and/or other 4 265 Hazard Ratio (IV, Random, 95% CI) 7.07 [3.40, 14.70] 3.2.3 other NON‐ABVD chemo 3 869 Hazard Ratio (IV, Random, 95% CI) 3.64 [1.83, 7.24] 3.3 PFS for PET/CT vs PET 13 1983 Hazard Ratio (IV, Random, 95% CI) 5.08 [3.57, 7.21] 3.3.1 PET/CT 8 707 Hazard Ratio (IV, Random, 95% CI) 6.03 [3.68, 9.90] 3.3.2 PET only 5 1276 Hazard Ratio (IV, Random, 95% CI) 4.06 [2.33, 7.08] 3.4 PFS by disease stage 14 2079 Hazard Ratio (IV, Random, 95% CI) 4.90 [3.47, 6.90] 3.4.1 Stages I and II with A and B symptoms 2 184 Hazard Ratio (IV, Random, 95% CI) 3.88 [1.54, 9.83] 3.4.2 All stages 11 1173 Hazard Ratio (IV, Random, 95% CI) 5.81 [3.93, 8.57] 3.4.3 Advanced 1 722 Hazard Ratio (IV, Random, 95% CI) 2.27 [1.35, 3.82] 3.5 PFS by radiotherapy 14 2079 Hazard Ratio (IV, Random, 95% CI) 4.90 [3.47, 6.90] 3.5.1 Involved node and/or site 5 651 Hazard Ratio (IV, Random, 95% CI) 5.35 [2.94, 9.75] 3.5.2 Involved field 6 514 Hazard Ratio (IV, Random, 95% CI) 7.06 [4.15, 12.00] 3.5.3 Not specified 2 826 Hazard Ratio (IV, Random, 95% CI) 2.97 [1.48, 5.98] 3.5.4 None 1 88 Hazard Ratio (IV, Random, 95% CI) 5.09 [1.95, 13.29] 3.6 Timing of interim PET 14 2079 Hazard Ratio (IV, Random, 95% CI) 4.90 [3.47, 6.90] 3.6.1 PET2 9 1677 Hazard Ratio (IV, Random, 95% CI) 4.68 [3.14, 6.98] 3.6.2 Other (including mixed) 5 402 Hazard Ratio (IV, Random, 95% CI) 6.32 [3.40, 11.75] 3.7 PFS by HR type of estimation 14 2079 Hazard Ratio (IV, Random, 95% CI) 4.90 [3.47, 6.90] 3.7.1 precise 9 1450 Hazard Ratio (IV, Random, 95% CI) 4.69 [2.84, 7.73] 3.7.2 Imprecise 5 629 Hazard Ratio (IV, Random, 95% CI) 5.66 [3.65, 8.77] Characteristics of studies Characteristics of included studies [ordered by study ID] Andre 2017. Study characteristics Methods Secondary citation(s)Raemarkers 2014, Cottereau 2018
English
Prospective, multi‐centre, phase III randomised trial
Various
Belgium, Croatia, Denmark, France, Italy, the Netherlands, Slovakia, Switzerland
55 months
Total: 1925
Randomised to standard treatment without change in protocol because of interim PET: 954
Previously untreated
Classic supradiaphragmic stage I and II HL
Age 15 to 70 years
Previous laparotomy
Concomitant or previous cancer other than basal‐cell carcinoma of the skin or in situ carcinoma of the cervix
Concomitant severe illness that would reduce life expectancy
Social circumstances not allowing for proper treatment and follow‐up
Positivity for the human immunodeficiency virus
Yes; written informed consent
November 2006 to June 2011
Favourable, standard treatment group median: 31 (15‐49)
Unfavourable, standard treatment group median: 32 (15‐70)
Not reported
Early stages (I and II)
Not reported, except for the exclusion criteria
ABVD and radiotherapy depending on treatment arm, favourable/unfavourable disease, and early PET (ePET) positivity
Early PET (ePET)
Not reported
After 2 ABVD cycles
International Harmonization Project criteria. According to these criteria: PET‐negative corresponds to Deauville score 1 (no uptake) and score 2 (uptake ≤ mediastinum)
Central review performed online (up to 6 experts, and one local expert)
Central review started later for 2 centres in Italy due to technical difficulties, only 75% of ePET were centrally reviewed
Not reported
Progression‐free survival (PFS), defined as time from random assignment to date of progression (as experiencing relapse after previous complete remission, progressive disease, or death from any cause)
Overall survival (OS), not defined
At 5 years
Yes
Not reported
No
Not applicable
PFS: all
OS: all
Kaplan‐Meier method
HR (95% CI)
Randomised arms were compared using the log‐rank test stratified by Ann Arbor stage and availability of a baseline FDG‐PET scan
Binary
Low risk
Clear description of participants and study characteristics.
Low risk
Length of follow‐up reported. Exclusion of participants due to safety amendment during the study.
Moderate risk
Adequate measurement and description. Central review only for 75% of scans and delayed in the case of 2 centres due to technical difficulties.
Low risk
No definition of outcome. Outcome measured the same way for all participants.
Low risk
Low risk
Statistical method appropriate for the data.
Casasnovas O: honoraria received from Genentech, Takeda, Gilead Sciences, Sanofi; consulting or advisory role at Genentech, Takeda, Gilead Sciences; research funding received from Genentech; travel, accomodation, expenses received from Genentech, Takeda, Gilead Sciences
Brice P: research funding received from Merck Sharp & Dohme Oncology, Takeda; travel, accomodation, expenses received from Takeda
Specht L: consulting or advisory role at Takeda; research funding received from Varian Medical Systems; travel, accomodation, expenses received from Takeda
Delarue R: honoraria received from Servier, Gilead Sciences, Roche, Celgene, Takeda; consulting or advisory role at Gilead Sciences, Roche; Speakers' Bureau at Karyopharm Therapeutics; travel, accomodation, expenses received from Roche, Takeda, Celgene
Hutchings M: consulting or advisory role at Takeda, Genentech, Celgene, Bayer; research funding received from Takeda, Janssen‐Cilag, Genentech, Celgene; travel, accomodation, expenses received from Takeda, Bristol‐Myers, Squibb, Janssen‐Cilag
Supported by European Organisation for Research and Treatment of Cancer (Belgium), LYmphoma Study Association (France), Fondazione Italiana Limfomi (Italy), Fondation Belge Contre le Cancer (Belgium), Dutch Cancer Society (the Netherlands), Institut National du Cancer (France), Assistance Publique des Hopitaux de Paris (France), Societe Française de Medecine Nucleaire et Imagerie Moleculaire (France), Associazone Angela Serra (Italy), van Vlissingen Lymfoom Fonds (the Netherlands), and Chugai Pharmaceutical (Japan).
NA
English
Retrospective, single‐centre study
Not reported
Italy
Not reported
68
HL diagnosis
Not reported
Not reported
January 2007 to December 2014
39 (16‐72)
Not reported
All stages
Not reported
ABVD according to the presence of risk factors defined by the European Organisation for Research and Treatment of Cancer (EORTC)
Favourable group (age < 50 years with ≤ 3 involved nodal areas, absence of mediastinal bulk (mediastinum‐to‐thorax ratio < 0.35), and erythrocyte sedimentation rate (ESR) < 50 mm without B symptoms or ESR < 30 mm with B symptoms): 3 cycles ABVD followed by radiotherapy, or 4 cycles ABVD without radiotherapy
Unfavourable group (age ≥ 50 years, > 4 involved nodal areas, presence of mediastinal bulk (mediastinum to‐thorax ratio ≥ 0.35), or ESR ≥ 50 mm without B symptoms or ESR ≥ 30 mm with B symptoms): 4 cycles ABVD followed by radiotherapy, or 6 cycles ABVD without radiotherapy
Interim PET
Half‐body PET scan (base of the skull to mid‐thigh)
Around day 25 (mean, range 22‐27) after cycle 1 of ABVD
Deauville 5‐point scoring system
Scores of 1‐3 considered negative, scores of 4‐5 considered positive
2 nuclear medicine physicians interpreted all scans
Yes
Not reported
Progression‐free survival (PFS), with progression during treatment, lack of complete remission at the end of first‐line treatment, and relapse counted as adverse events (AE)
None
At 2 years
Yes
Not reported
No
Not applicable
PFS: all
Receiver operating characteristic (ROC) approach
Kaplan‐Meier (survival analysis)
Log‐rank (differences between groups)
Cox proportional hazards model
Binary
Unclear
Clear description of participants and study characteristics. No inclusion and exclusion criteria provided.
Unclear risk
No loss to follow‐up reported. No length of follow‐up reported.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Clear definition. Outcome measured the same way for all participants.
High risk
Stated in methods section that multiple factors were taken into account for analysis, but unclear which variables and how adjustment was conducted. Disease stage not accounted for.
High risk
Poorly reported. Unclear whether multivariable analysis was reported.
The authors declare that they have no conflict of interest.
Not reported
Sher 2009
English
Retrospective, multi‐centre study (2 centres)
Massachusetts General Hospital Cancer Center and Dana‐Farber Cancer Institute, Massachusetts, USA
USA
46 months
96
Diagnosed with classic, histology‐proven HL
Adults
Limited‐stage non‐bulky disease (mass < 10 cm)
ABVD chemotherapy
Availability of interim PET and end‐of‐treatment PET
Nodular lymphocyte predominant HL
Not reported
January 2000 to December 2008
34 (18‐77)
Not reported
Early stages (I to IIB)
Not reported
4 or 6 cycles of ABVD with or without IFRT
Interim PET
Whole‐body PET scan (base of the skull to mid‐thighs)
After 2 to 4 treatment cycles
2 nuclear medicine physicians interpreted all scans, final result based on consensus
Grading on a 4‐point scale with scores 0 or 1 considered negative and scores 2 to 4 considered positive
Yes
Not reported
Overall survival (OS), defined as the time from initial pathological diagnosis to death from any cause
Progression‐free survival (PFS), defined as time from diagnosis to progression or death from any cause
Overall response rate (ORR), defined as number of subjects with either complete response (CR) or partial response (PR)
Primary refractory disease, defined as progressive disease on treatment or relapse within 3 months of completing therapy
Unclear: 4 years reported in text, 10 years reported in figure
Yes
Not reported
Not reported
Not applicable
OS: all
PFS: all
Kaplan‐Meier (survival analysis)
Log‐rank test
Fisher’s exact test (CR)
Binary
Unclear risk
Description of participants provided. Missing interim and end‐of‐treatment PET was part of the exclusion criteria. No comparison of baseline study sample (n = 155) and participants (n = 96) included. No reasons for missing scans provided.
Low risk
No loss to follow‐up.
Moderate risk
Adequate measurement and description. No standardised criteria, but description of scoring system used. Prognostic factor measured the same way for all participants. Blinding not reported.
High risk
Clear definition. Reporting of timing inconsistent (4 vs. 10 years).
Low risk
High risk
Statistical method appropriate for the data, but discrepancies between text and graphs detected.
High risk
Clear definition. Reporting of timing inconsistent (4 vs. 10 years).
Low risk
High risk
Statistical method appropriate for the data, but discrepancies between text and graphs detected.
There are no relevant conflicts of interests to disclose.
Not reported
Casasnovas 2018
English
Open‐label, randomised phase 3 trial
Multicentre (90 centres)
Belgium, France
50.4 months (IQR: 42.9‐59.3) for all participants, not reported separately for standard treatment group
823 in total
413 in standard treatment group
Age 16‐60 years
Newly diagnosed HL
ECOG performance status score < 3
Minimum life expectancy of 3 months
Ann Arbor disease stage III, IV, or IIB with a mediastinum‐to‐thorax ratio of 0.33 or greater or extranodal localisation
No previous treatment for HL
Baseline PET (PET0) with at least one hypermetabolic lesion
Negative HIV, hepatitis C virus, and human T‐lymphotropic serology
Normal liver, renal, and haematological functions except for abnormalities related to HL
Nodular lymphocyte predominant subtype
Severe cardiopulmonary or metabolic disease
Written, informed consent
19 May 2011 to 29 April, 2014
31 (IQR; ranges 23 ‐ 41)
Not reported
II to IV, with B symptoms
Not reported
Standard treatment group: 4 cycles of BEACOPPescalated, irrespective of PET2 result. After PET4: If PET4‐negative: 2 further cycles of BEACOPPescalated, if PET4‐positive: salvage therapy.
Interim PET
Whole‐body PET scan (groin to head)
2 to 4 weeks after completion of cycles 2 and 4 of chemotherapy
Deauville criteria, with scores 1 to 3 considered negative, and scores 4 or 5 considered positive; Independent central review by 3 expert reviewers, final decision was based on at least two concordant responses
Yes; participants were scanned on the same camera for all PET scans
Presumably yes, but not explicitly mentioned
Progression‐free survival (PFS), defined as the time from randomisation to first progression, relapse, or death from any cause or last follow‐up
Safety, not defined
Overall response, not defined
Event‐free survival, defined as the time from randomisation to the first documented disease progression, relapse, start of a new anti‐lymphoma therapy, death from any cause, or last follow‐up
Disease‐free survival, defined as the time that complete response was recorded to the date of first documented disease progression, relapse or death related to lymphoma, toxicity from the study treatment (including treatment‐related secondary cancer), unknown cause or last follow‐up
Overall survival, defined as the time from randomisation to death from any cause or last follow‐up
PFS: at 5 years
Yes
Not reported
Yes; N = 11 stopped treatment before PET2, and further N = 14 stopped treatment before PET4
All 413 participants included in ITT analysis, N = 412 included in safety analysis, N = 372 included in per‐protocol analysis
PFS, OS: N = 413 in ITT analysis, N = 372 in per‐protocol analysis
Kaplan‐Meier (survival analysis)
Log‐rank test
Cox proportional hazard regression models
Binary
PFS: 759 (all participants that had reviewed PET2 and PET4 scans; not reported separately for standard group after PET2 without treatment modification)
OS: not reported
Cox proportional hazards regression model
Binary
8
PET assessment (PET2 and PET4)
Sex
Age
Eastern Cooperative Oncology Group score
B symptoms
Ann Arbor disease stage
Bulky disease
International Prognosis Score
Low risk
Adequate description of study population and recruitment. Detailed inclusion criteria.
Low risk
Reasons for loss to follow‐up provided for most participants with missing data.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Clear definition. Outcome measured the same way for all participants.
Low risk
Only advanced stages included.
Low risk
Statistical methods appropriate and analysis fully reported.
Low risk
Clear definition. Outcome determined based on investigator assessment.
Low risk
Only advanced stages included. Multivariable analysis conducted.
Low risk
Statistical methods appropriate and analysis fully reported.
R‐OC has received grants, personal fees and non‐financial support from Gilead, Roche, and Takeda, personal fees and non‐financial support from Bristol‐Myers Squibb, Celegne, and Merck Sharpe & Dohme, and personal fees from Abbvie. PB has received personal fees from Bristol‐Myers Squibb, Merck Sharpe & Dohme, and Takeda, grants from Takeda Millenium, and non‐financial support from Roche. AS has received personal fees from Takeda. EN‐V has received personal fees from Keocyt and Sanofi. FM has received personal fees from Celegne, Gilead, Janssen, and Roche/Genentech. RD has received personal fees from Bristol‐Myers Squibb, Celegne, Gilead, Janssen, Karyopharm, Roche, Sanofi, and Takeda. MM has received personal fees from Roche China. The other authors declare no competing interests.
Programme Hospitalier de Recherche Clinique
NA
English
Prospective, single‐centre study
São Paulo University Clinics Hospital, Brazil
Brazil
36 months (32‐40)
104
Newly diagnosed, biopsy‐proven, classic HL
Pregnancy
Yes; written
August 2005 to December 2007
28 (13‐82)
Not reported
All stages
Not reported
ABVD 4‐6 cycles (stage I and II), 6‐8 cycles (stage III), 8 cycles (stage IV)
Radiation therapy (stage I or II with no adverse risk factors and treated with 4 cycles ABVD; participants with bulky disease)
Interim PET
Whole‐body PET scan
After 2 cycles of ABVD, as late as possible within the week before start of cycle 3
No specific scale indicated
2 board‐certified nuclear medicine physicians interpreted all scans
PET‐negative defined as no pathologic 18F‐FDG uptake at any site; PET‐positive defined as presence of focal 18F‐FDG uptake not attributed to physiologic biodistribution
Yes
Not reported
3‐year event‐free survival (EFS), defined as the time from diagnosis to treatment failure (incomplete response after first‐line treatment, progression during therapy, relapse or death) or last follow‐up
3‐year overall survival (OS)
At 3 years
Yes
Not reported
No
Not applicable
EFS: all
OS: all
Log‐rank (probability of treatment failure)
Kaplan‐Meier (survival curves)
Binary
Low risk
Clear description of participants and study characteristics, consecutive sampling and no participants excluded based on interim‐PET availability.
Low risk
Loss to follow‐up reported.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
No definition. Outcome measured the same way for all participants.
High risk
Univariable analysis for multiple prognostic factors showed significance of factor of interest, but no multivariable analysis performed. Disease stage not accounted for.
High risk
Statistical method in univariable analysis appropriate for the data, but no figures, only table with prognostic values, sensitivity and specificity. Discrepancies detected between text and graphs.
Low risk
Clear definition. Outcome measured the same way for all participants.
High risk
Univariable analysis for multiple prognostic factors showed significance of factor of interest, but no multivariable analysis performed. Disease stage not accounted for.
High risk
Statistical method in univariable analysis appropriate for the data, but no figures, only table with prognostic values, sensitivity and specificity. Discrepancies detected between text and graphs.
Not reported
Not reported
Agostinelli 2016, Biggi 2013, Gallamini 2006, Gallamini 2007
English
Retrospective, international, multi‐centre study (17 centres)
17 academic institutions worldwide
Various
37 months (2‐110)
260
HL participants with early stage unfavourable disease (IIA with adverse prognostic factors) or advanced stage disease (IIB – IVB)
Staging with PET‐CT at baseline and after 2 courses of ABVD
No change of treatment according to PET2
Minimum follow‐up of 1 year after completion of first treatment
Missing CT data, baseline PET, interim PET, CT or PET slices; poor quality PET images; miscellaneous reasons (n=9)
No; due to retrospective study design
January 2002 to December 2009
37.3 (14‐82)
Not reported
Early stage unfavourable (IIA HL with adverse prognostic factors)
Advanced stages (IIB – IVB)
Not reported
4‐8 cycles ABVD with or without involved‐field radiotherapy or consolidation radiotherapy
Interim PET
Not reported
A median of 12.3 days (range, 7‐22) after cycle 2 of ABVD
Deauville 5‐point scoring system
PET negative defined as scores 1‐3, PET positive defined as scores 4 or 5
6 reviewers interpreted all scans independently
Yes
Yes
Disease progression, defined as new disease within 6 months of first‐line treatment
Relapse, defined as disease occurring 6 months or longer after achieving complete remission
Progression‐free survival (PFS), defined as time from diagnosis to either disease progression or relapse, or to death as a result of any cause, whichever occurred first
Overall survival (OS), defined as the time from diagnosis to death from any cause
Inter‐observer agreement using the 5‐PS for PET2 interpretation
At 3 years
Yes
Yes
No
Not applicable
PFS: 260
Kaplan Meier survival curves with Mantel‐Haenszel, log‐rank, Wilcoxon and Tarone‐Ware tests
Univariable regression analyses
Binary
PFS: 260
Cox proportional hazards regression model
Binary
9
Bulky disease
Lymphocyte
Albumin
White blood cells
IPS (0‐2 vs. ≥3)
Continued complete remission (CR) vs. no CR
Lactate dehydrogenase
Bone marrow involvement
PET2
Low risk
Clear description of participants and study characteristics.
Low risk
Length of follow‐up reported.
Low risk
Adequate measurement and description. Blinding not reported.
Low risk
Clear definition. Outcome measured the same way for all participants. Blinding not reported.
Low risk
Only unfavourable and advances stages included.
Low risk
Statistical method appropriate for the data.
Low risk
Clear definition. Outcome measured the same way for all participants. Blinding not reported.
Low risk
Only unfavourable and advances stages included.
Low risk
Statistical method appropriate for the data.
Not reported
Reporting incomplete
The authors would like to thank: ... Keosys company for providing the Positoscope (R) network to distribute images to reviewers.
NA
English
Retrospective study
Not reported
Not reported
46 months (24‐126)
78
Biopsy‐proven, early‐stage (IA to IIB) classic HL of any subtype with or without bulky disease
Age > 18 years
Completion of planned ABVD and radiation therapy
At least 24 months of follow‐up or until proven relapse if earlier
None
No
January 2000 to December 2012
43 (median; 22‐86)
Not reported
Early stages (IA to IIB)
Not reported
ABVD (number of cycles based on risk factors and institutional guidelines) followed by involved‐field or extended‐field radiotherapy
Interim PET
PET‐CT scan (from base of the skull to upper thigh)
After ABVD cycle 2 to 4 or at the end of chemotherapy
5‐point scale
PET negative defined as a score ≤ 3
Staff physicians who were unaware of patient outcomes reviewed all scans
Yes
Yes
Outcomes relevant to this review were not explored in the study
Not applicable
Not applicable
Not applicable
Not applicable
Yes: one patient without baseline PET due to pregnancy; one patient without detectable disease on the baseline scan (excision of single site disease)
One patient without detectable disease on the baseline scan did not receive follow‐up PET since not considered necessary
The authors made no disclosure.
No specific funding was disclosed.
NA
English
Not reported
Guy’s and St. Thomas’ Hospital, London
UK
40.2 months (6‐125)
85
Histologically‐confirmed HL
Early interim FDG‐PET scans
None
Not reported
May 1993 to January 2004
36.7 (15‐73)
Not reported
All stages
Not reported
According to departmental protocols: mainly ABVD, number of cycles not reported; additional radiotherapy depending on stage and site of HL
Alternative therapy for participants without satisfactory remission during initial chemotherapy
Interim PET
Half‐body PET scan (mid‐brain to upper thigh)
After 2 or 3 cycles of chemotherapy, within the second week of the interval between cycles or as late as possible before administration of the next cycle
No specific scale indicated
2 experienced nuclear medicine physicians interpreted all scans, differences decided by consensus
PET‐negative defined as no evidence of disease; PET‐positive defined as increased uptake suspicious for malignant disease; Minimal residual uptake (MRU) defined as low‐grade uptake not likely to represent malignancy
Yes
Not reported
Progression‐free survival (PFS), defined as the time from diagnosis to first evidence of progression or relapse, or to disease‐related death
Overall survival (OS), defined as the time from diagnosis to death from any cause
None
At 2 and 5 years
Yes
Not reported
No
NA
PFS: all
OS: not reported
Kaplan‐Meier (survival curves)
Log‐rank (differences between groups)
Proportional hazards Cox regression analysis
Binary
PFS: all
OS: none
Kaplan‐Meier (survival curves)
Log‐rank (differences between groups)
Proportional hazards Cox regression analysis
Binary
4
Early interim PET
Ann Arbor stage
PET‐MRU vs. PET‐negative
PET‐positive vs. PET‐negative
Unclear risk
All eligible participants included. Clear description of participants and study characteristics. No inclusion / exclusion criteria provided. No comparison to baseline population, and no explanation of missing scans provided.
Moderate risk
Loss to follow‐up (8 participants), but reasons not provided.
Low risk
Adequate description. PET results separated into negative, positive and low MRU, which sometimes was considered negative (clearly stated in these cases). No clear cut‐off in numbers.
Low risk
Clear definition. Outcome measured the same way for all participants. Participants lost to follow‐up were still included in analysis.
High risk
Disease stage not accounted for.
Low risk
Statistical method in univariable analysis appropriate for the data.
Low risk
Clear definition. Outcome measured the same way for all participants. Participants lost to follow‐up were still included in analysis.
Low risk
Adjusted for disease stage.
Low risk
Statistical method appropriate for the data.
Not reported
Not reported
NA
English
Prospective, multi‐centre study (4 centres)
Copenhagen University Hospital, Rigshospitalet, Herlev Hospital, Aarhus University Hospital
Denmark
22.8 months (6.1‐40.8)
Total: 99
With Interim‐PET: 77
Newly diagnosed HL
Adults (≥18 years of age)
Diabetes mellitus
Pregnancy
Yes; written
November 2001 to June 2004
36.2 (18.6 – 74.0)
Not reported
All stages
Not reported
Various therapy regimens: ABVD (91%), ABV/MOPP (3%), ABVD/COPP (3%), BEACOPPesc. (3%), PVAG (1%)
Interim PET
Half‐body PET scan (mid‐brain to upper thigh)
Within the last week before start of cycle 3 (PET2) and before cycle 5 (PET4)
No specific scale indicated
2 experienced nuclear medicine physicians interpreted all scans, differences in interpretation decided by consensus
Definitions for PET‐positive and PET–negative not reported
Yes
Yes; nuclear medicine physicians were blinded from all clinical information except diagnosis
Progression‐free survival (PFS), defined as the time from diagnosis to first evidence of progression or relapse, or to disease‐related death
Overall survival (OS), defined as the time from diagnosis to death from any cause
None
At 2 years
Yes
Yes; clinicians were blinded from the results of PET
No
Not applicable
PFS: all
OS: not reported
Kaplan‐Meier (survival curves)
Log‐rank (differences between groups)
Proportional hazards Cox regression analysis
Binary
PFS: all
OS: not reported
Kaplan‐Meier (survival curves)
Log‐rank (differences between groups)
Proportional hazards Cox regression analysis
Binary
3
Interim PET
Clinical stage
Extranodal disease
High risk
Significant number of participants without PET (n = 22 out of total n = 99). Imbalance between groups with or without PET scan regarding stage of disease.
Low risk
Lack of compliance in a small number of participants (n = 7 out of n = 99), but not in the subjects included in PET2 analysis.
Low risk
Adequate measurement and description. Significant number of participants without PET (n = 22 out of n = 99).
Low risk
Clear definition of outcome. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method appropriate for the data.
Low risk
Clear definition of outcome. Outcome measured the same way for all participants.
Low risk
Adjusted for disease stage.
Low risk
Statistical method appropriate for the data.
The authors have no financial interests in products studied in this work.
Not reported
NA
English
Prospective, multi‐centre study
Not reported
USA, Italy, Poland, Denmark
29 months
126*
Newly diagnosed classic HL
None
Yes; written
Not reported
34.1 (median, 16.8‐76.7)
Not reported
All stages
Not reported
Early‐stage disease: 2‐4 cycles ABVD followed by radiotherapy, or 6 cycles ABVD
Advanced‐stage disease: 6‐8 cycles ABVD with or without consolidation radiotherapy, with exceptions (5 Danish participants treated with BEACOPPesc)
Interim PET
Whole‐body PET scan
Within the last 5 days of cycle 1 (PET1) and cycle 2 (PET2) (US and Italian participants had PET2 only if PET1 was positive)
Deauville 5‐point scoring system
Scores of 1‐3 considered negative, scores of 4‐5 considered positive
Baseline interpretation by an expert with access to clinical information, second interpretation by an independent expert from another country blinded to clinical information
No; not all participants received PET2
Yes; experts in both stages blinded to clinical outcome, baseline experts also blinded to clinical information
Progression‐free survival (PFS), not defined
Overall survival (OS), not defined
None
At 2 and 3 years
Not reported; unclear due to multi‐national study design
Not reported
No
Not applicable
PFS: all
OS: all
Kaplan‐Meier (survival analysis)
Log‐rank (differences between groups)
Binary
PFS: all
OS: none
Kaplan‐Meier (survival analysis)
Log‐rank (differences between groups)
Binary
3
Interim PET (positive or negative)
Extranodal involvement
Disease stage (early or advanced stage)
Low risk
Description of participants and study characteristics given. No inclusion and exclusion criteria. Consecutive sampling and no exclusion based on interim PET availability. Detailed description of treatment regimen.
Low risk
No loss to follow‐up.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method in univariable analysis appropriate for the data.
Low risk
No definition of outcome. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method in univariable analysis appropriate for the data.
The author(s) indicated no potential conflicts of interest.
Not reported
Borchmann 2017
English
Open‐label, international, randomised phase 3 trial
301 hospitals and private practices in five European countries
Germany, Switzerland, Austria, the Netherlands, Czech Republic
Not reported for entire study population
Total: 2101
Qualified for randomisation: 1945
Histologically proven primary diagnosis of HL
Advanced stages: stage IIB with one or both of the risk factors large mediastinal mass and extranodal lesions, or stage III or IV
No previous treatment for HL
Age 18‐60 years at inclusion
Normal organ function, except for HL‐related impairments
Negative HIV test
Negative pregnancy test
Life expectancy > 3 months
Incomplete diagnosis of the disease stage
Prior or concurrent disease that prevents treatment according to protocol
HL as part of a composite lymphoma
Prior chemotherapy or radiation
Malignant disease within the last 5 years (exceptions: basalioma, carcinoma in situ of the cervix uteri, completely resected melanoma TNMpT1)
Pregnancy, lactation
Eastern Cooperative Oncology Group (ECOG) performance status > 2
Long‐term ingestion of corticosteroids or antineoplastic drugs
Patient's lack of accountability, inability to appreciate the nature, meaning and consequences of the trial and to formulate his/her own wishes correspondingly
Noncompliance: refusal of blood products during treatment, epilepsy, drug dependency, change of residence to abroad, prior cerebral injury or similar circumstances that appear to make protocol treatment or long‐term follow‐up impossible
Antiepileptic treatment
General intolerance of any protocol medication
Unsafe contraceptive methods
Relationship of dependence or employer‐employee relationship to the sponsor or the investigator
Commitment to an institution on judicial or official order
Participation in another interventional trial that could interact with this trial
Yes; written, including consent to participate in the trial and to storage of data and tissue samples
14 May, 2008 to 18 July 2014
Not reported for entire study population (Borchmann 2017)
Not reported
Advanced stages: stage III‐IV, or stage II with B symptoms and one or both risk factors of large mediastinal mass
None, due to exclusion criteria
6 or 8 cycles of eBEACOPP (standard arm)
4 cycles of eBEACOPP or 8 cycles of eBEACOPP with rituximab (experimental arm)
Interim PET
Not reported
Between day 17 and day 21 of cycle 2 of chemotherapy
Deauville 5‐point scoring system
PET negative defined as scores 1 or 2, PET positive defined as scores 3 to 5
A multidisciplinary panel of experts centrally interpreted all scans
Yes
No; assessors who were masked to local findings, centrally reviewed PET‐2 and CT scans as well as x‐rays and clinical information (Borchmann 2017)
Progression‐free survival (PFS), defined as the time from completion of staging until progression, relapse, or death from any cause
Overall survival (OS), defined as time from completion of staging until death from any cause
At 3 years
Yes
Not reported
Participants with progressive disease, denoted by DS5 (Deauville score 5), were taken off protocol
505 participants treated before the protocol amendment in June 2011 were excluded from survival analysis
Participants with missing data were excluded from analysis
OS: 722
PFS: 722
Kaplan‐Meier (survival analysis)
Cox regression analysis (hazard ratios)
Binary
OS: 722
PFS: 722
Kaplan‐Meier (survival analysis)
Cox regression analysis (hazard ratios)
Binary
9
Clinical stage
B symptoms
Large mediastinal mass
Extra‐nodal involvement
Involvement of 3 or more nodal areas
Elevated erythrocyte sedimentation rate
International Prognosis Score
HL subtype
PET positivity (DS4 vs. 1‐3)
Low risk
Clear description of participants and study characteristics.
Low risk
Length of follow‐up reported. Exclusion of participants due to safety amendment during the study.
Low risk
Adequate measurement and description.
Low risk
Clear definition. Outcome measured the same way for all participants.
Low risk
Only advanced stages included.
Low risk
Statistical method appropriate for the data.
Low risk
Clear definition. Outcome measured the same way for all participants.
Low risk
Only advanced stages included.
Low risk
Statistical method appropriate for the data.
We declare no competing interests.
The HD18 trial was funded by the Deutsche Krebshilfe (No. 107957 and 110617) and the Swiss State Secretariat for Education, Research and Innovation (SERI), and supported by Roche Pharma AG (No. ML‐21683).
Markova 2009
English
Retrospective, single‐centre study
Prague, institution not reported
Czech Republic
52 months
69
Newly diagnosed, histologically proven HL
Clinical stage IIB with large mediastinal mass and/or extranodal disease, stage III or IV
Age 18‐60 years
Presence of any concurrent disease precluding protocol treatment
Composite lymphoma
Previous malignancy
Previous chemo‐ or radiotherapy
Pregnancy or lactation
Diabetes mellitus and elevated fasting blood sugar level >130 mg/dl (exclusion from PET)
Not reported
January 2004 to February 2008
30.7 (± 8.4)
Not reported
IIB to IVB
None, due to exclusion criteria
Treatment according to the HD15 trial of the German Hodgkin Study Group (GHSG) randomly assigned to either 8 cycles of BEACOPPescalated, 6 cycles of BEACOPPescalated or 8 cycles of time‐condensed BEACOPP14baseline
Local radiotherapy for participants with partial remission with residual mass ≥2.5cm and positive PET scan after chemotherapy
Interim PET
Not reported
After cycle 4 of chemotherapy, as close as possible to cycle 5
A local nuclear medicine physician interpreted all interim‐PET scans
PET‐positive defined as focal or diffuse uptake above background in a location incompatible with normal anatomy or physiology, without a specific standardised uptake cut‐off value; PET‐negative defined as no uptake, or increased uptake at the site of residual mass with an intensity lower or equal to the mediastinal blood pool
Yes
Not reported
Progression‐free survival (PFS), defined as the time from diagnosis to the first evidence of progression or relapse, or death from any cause
None
After cycle 4, 6/8 and 3 months after completion of chemotherapy
Yes
Not reported
No
Not applicable
PFS: all
Kaplan‐Meier (survival analysis)
Log‐rank test (comparison between groups)
Binary
Low risk
All eligible participants included. Clear description of participants and study characteristics. Consecutive sampling. Inclusion and exclusion criteria provided.
Low risk
No loss to follow‐up.
Moderate risk
Prognostic factor measured differently: PET4 scans reviewed locally (at the centre) by one physician, whereas PET6/8 assessment included central review.
Low risk
Outcome measured the same way for all participants.
Low risk
Only advanced stages included.
Low risk
Statistical method in univariable analysis appropriate for the data.
Not reported
Not reported
NA
English
Retrospective, multi‐centre study (2 centres)
Haut‐Lévêque Hospital and Bergonié Institute, Bordeaux, France
France
58.9 months
76
Biopsy‐proven, classic HL
Availability of baseline, interim and end‐of‐treatment PET‐CT
Treatment with chemotherapy different than ABVD
Planned treatment modification following int‐PET results
End‐PET performance > 6 months after end of treatment
No; waived because of retrospective design
December 2005 to April 2011
37 (median; 14‐67)
Not reported
All stages
Not reported
Various therapy regimens: 3, 4, 6 or 8 cycles of ABVD with or without radiotherapy
Interim PET
Not reported
After 2, 3 or 4 treatment cycles
Deauville 5‐point scoring system
Consensual reading of two nuclear medicine physicians
Two cut‐offs for interim PET positivity tested and compared: either scores 4 to 5 considered PET positive, or scores 3 to 5 considered PET positive
Yes
Yes
Progression‐free survival (PFS), defined as the time from diagnosis to either failure of first‐line treatment, relapse or death
None
At 5 years
Yes
Not reported
No
NA
PFS: all
Kaplan Meier analysis curve
Log‐rank test
Binary
PFS: all
Cox proportional hazard models
Binary
3
Interim PET
Disease stage*
Bulky disease*
Low risk
Clear description of participants and study characteristics.
Low risk
No loss to follow‐up.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Clear definition. Outcome measured the same way for all participants. Blinding not reported.
Low risk
Adjusted for disease stage.
Low risk
Statistical method appropriate for the data.
None declared.
No funding was sought or received for this study.
NA
English
Retrospective, single‐centre study
MD Anderson Cancer Center, Houston, Texas, USA
USA
45 months
Total: 325
229 participants with PET2 analysed
96 participants with PET3 excluded post‐hoc
Classic HL
Treatment with ABVD
Availability of interim PET scan
Additional treatment (e.g. with brentuximab vedotin or rituximab) except for radiotherapy
Not reported
January 2001 to May 2011
Group I (early‐stage non‐bulky): 32 (median, 18‐77)
Group II (stage II bulky): 36 (20‐60)
Group III (advanced stage IPS ≤ 2): 30 (19‐79)
Group IV (advanced stage IPS ≥ 3): 49 (19‐84)
Not reported
All stages
Not reported
ABVD with or without radiotherapy
Interim PET
Not reported
After 2 or 3 cycles of ABVD
Deauville 5‐point scoring system
Scores of 1‐3 considered negative, scores of 4‐5 considered positive
Independent assessment by 3 nuclear medicine physicians
No, 10 participants had only PET without CT scan
Yes
Progression‐free survival (PFS), defined as the time from diagnosis to disease progression, relapse or death from any cause
None
At 3 years
Yes
No
No
Not applicable
PFS: all
Kaplan‐Meier survival curves with log‐rank test per subgroup
Univariable Cox proportional hazard models
Binary
Low risk
Clear description of participants and study characteristics.
Low risk
No loss to follow‐up.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Clear definition. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
High risk
Exclusion of participants with PET3 during analysis due to lack of prognostic value. Stratification according to disease stage resulted in small sample sizes per subgroup.
No conflict of interest to disclose for the study.
Not reported
NA
English
Retrospective, multi‐centre study (6 centres)
6 centres in London, UK
UK
27 months
23
Newly diagnosed, histologically confirmed classic HL
Advanced stage
HIV positivity
None
Not reported
June 2007 to August 2010
42 (median, 32‐60)
Not reported
Advanced stages: stage III –IV or stage IIB with at least one adverse prognostic factor
HIV positive participants only
Treatment for HL: standard ABVD therapy
Treatment for HIV: HAART (two NRTIs in combination with either a non‐NRTI or a boosted protease inhibitor) antiretroviral therapy; G‐CSF per centre protocol; prophylaxis for Pneumocystis jiroveci
Interim PET
Half‐body PET‐CT scan
After 2‐3 cycles of ABVD, within the week before start of the next cycle
Deauville 5‐point scoring system
Scores 1‐3 considered negative, scores 4‐5 considered positive
Assessed at 3 established PET centres by own nuclear medicine physician and central review by nuclear medicine expert
Yes
Not reported
Progression‐free survival (PFS), defined as the time from diagnosis to disease progression or relapse or last follow‐up
Overall survival (OS), defined as the time from diagnosis to death from any cause
Complete remission, defined as the disappearance of all disease manifestations at the end of therapy
At 2 years
Yes
Not reported
No
Not applicable
PFS: all
OS: not applicable, since no participants died
Kaplan‐Meier survival curves with log‐rank test
Binary
Low risk
Clear description of participants and study characteristics. Three participants did not have a staging PET, no reasons for missing PET provided.
Low risk
No loss to follow‐up. Length of follow‐up reported. Participants without interim PET (n = 11) excluded.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Clear definition. Outcome measured the same way for all participants.
Low risk
Only unfavourable and advanced stages included.
High risk
Small sample size for some events (only two participants with positive interim PET result).
All authors have no conflicts of interest or disclaimers to declare.
Not reported
NA
English
Retrospective, single‐centre study
Policlinico Universitario TorVergata, Rome, Italy
Italy
Not reported
132
HL diagnosis based on biochemical tests and bone marrow biopsy
PET‐MDCT staging examination, interim PET‐MDCT and end of treatment PET‐MDCT performed
None
Not reported
January 2005 to June 2010
34 (mean, 16‐74)
Not reported
All stages
Not reported
ABVD dose dependent on disease stage: stages I‐IIA 4x ABVD with radiotherapy; stages IIB‐IV 6‐8x ABVD with radiotherapy
Interim PET
PET scan from pelvis to head
At the end of the second ABVD cycle
International Harmonization Project guidelines
Rated by a radiologist and nuclear medicine specialist, confirmation by semi‐quantitative analysis
Yes
Not reported
Complete remission, defined as the disappearance of symptoms and metabolic activity at any nodal or extranodal site with negative bone marrow biopsy
Partial remission, defined as persistence of significant metabolic activity at one site only, with at least 50% reduction in volume of the nodal masses or parenchymal nodular formations and persistence of disease at bone marrow level
Stable disease, defined as unchanged metabolic findings
Disease progression, defined as the appearance of new sites of pathological uptake and as a 50% increase in volume of nodal masses or previously detected parenchymal localisations
None
At the end of treatment
Yes
Not reported
No
NA
Outcomes selected for univariable analysis unclear
Sensitivity, specificity, PPV, NPV
Binary
None
Not reported
NA
English
Retrospective, single‐centre study
Hospital of Dijon, France
France
50 months (22‐71)
59
First diagnosis of classic HL
Positive serology for HIV
Yes; written informed consent
January 2007 to January 2010
35.5 (16‐76)
Not reported
All stages
Not reported, except for exclusion of HIV positive participants
Anthracycline‐based chemotherapy dependent on disease stage: stages I‐II 4‐6x chemotherapy with radiotherapy; stages III‐IV 8x chemotherapy
Interim PET
Whole‐body PET‐CT scan
After 2 cycles of chemotherapy
Deauville 5‐point scoring system
Scores 1‐3 considered negative, scores 4‐5 considered positive
ΔSUVmax (PET0‐PET2) dichotomized by applying the ROC approach
Independent review by 2 nuclear medicine physicians
Different scanner used for 4 participants
Yes
Progression‐free survival (PFS), defined as the time from the beginning of treatment until progression, relapse, or death from any cause or the date of last follow‐up
Time to progression (TTP), defined as time from the date of the first course of chemotherapy to any treatment failure, including progression, relapse, or death related to lymphoma, or the date of last follow‐up (participants with death from other cause were censored at the time of death)
None
At 4 years
Yes
Not reported
No
Not applicable
PFS: all
Kaplan‐Meier product limit method with log‐rank test
Binary
PFS: all
Cox proportional hazards regression models per outcome
Binary
2
ΔSUVmax (PET0‐PET2)
International prognosis score (IPS)
Low risk
Clear description of participants and study characteristics.
Low risk
No loss to follow‐up.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Clear definition. Outcome measured the same way for all participants. Blinding not reported.
High risk
Disease stage not accounted for.
Low risk
Statistical method appropriate for the data.
The costs of publication of this article were defrayed in part by the payment of page charges. Therefore, and solely to indicate this fact, this article is hereby marked “advertisement” in accordance with 18 USC section 1734. No potential conflict of interest relevant to this article was reported.
Not reported
Miltenyi 2015
English
Retrospective study
Not reported
Hungary
47.52 months (11‐80)
121
Newly diagnosed HL
No previous treatment
Immunosuppressive medications
Immunodeficiency
No
2007 to 2013
36.7 (mean, 17‐79)
Not reported
All stages
None due to exclusion criteria
ABVD dependent on disease stage: 6 or 8 cycles of ABVD, or 4 or 6 cycles of ABVD combined with radiotherapy
Interim PET
Not reported
After cycle 2 of ABVD between days 11 and 14
Deauville 5‐point scoring system
Scores 1‐3 considered negative, scores 4‐5 considered positive
Person(s) interpreting the scans not reported
Yes
Not reported
Overall survival (OS), not defined
Progression‐free survival (PFS), not defined
None
At 5 years after diagnosis
Yes
Not reported
No
NA
OS: all
PFS: all
Kaplan‐Meier (survival analysis)
Log‐rank test (comparison between groups)
Cox proportional hazard model (effect of variants on survival)
Binary
OS: all
PFS: all
Kaplan‐Meier (survival analysis)
Log‐rank test (comparison between groups)
Cox proportional hazard model (effect of variants on survival
Binary
8
Age
Disease stage
Gender
B symptoms
Bulky disease
Treatment
PET2 positivity
Lymphocyte/monocyte ratio (LMR)
Unclear risk
Description of participants provided, but no in‐ and exclusion criteria provided. Not clear how many participants were sampled and included from the baseline sample.
Low risk
No dropouts.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants.
Low risk
Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
High risk
Statistical analysis appropriate for the data. All primary outcomes reported, but discrepancies between text and graphs/tables detected.
Low risk
Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
High risk
Statistical analysis appropriate for the data and all primary outcomes reported. However, discrepancies between text and graphs/tables detected.
None of the authors have any competing interest in the manuscript.
Not reported
Kostakoglu 2012
English
Prospective phase 2, multi‐centre (29 centres), clinical trial
29 Cancer and Leukemia Group B (CALGB) institutions
Not reported
Not reported
Total: 99
With interim‐PET: 88
Previously untreated, histologically confirmed, classic HL with clinical stages I or II, measurable through physical examination or imaging studies
Bulky disease
Yes; written
15 May 2004 to 29 September 2006
37 (18‐80)
Not reported
Stages I ‐ IIB
Not reported
6 cycles of AVG administered on days 1 and 15 per cycle
Interim PET
Not reported
1 to 2 weeks after completion of cycle 2 of AVG
Visual assessment was performed using International Harmonization Project criteria
Central review by 2 independent reviewers and an adjudicator
Yes
Yes
Complete response, defined as complete remission or complete remission unconfirmed after 6 cycles of chemotherapy
Progression‐free survival (PFS), measured from study entry until relapse
Adverse events (AEs), defined as toxicity including grade 3 or greater myelosuppression
At 3 years
Yes
Not reported
No
None
Complete response: none
PFS: 88
Kaplan‐Meier (survival analysis)
Log‐rank test (comparison between groups)
Binary
Low risk
Clear description of participants and study characteristics.
Low risk
Loss to follow‐up reported (n = 2).
Low risk
Adequate measurement and description. PET2 available for n = 88 out of a total of n = 99 participants.
Low risk
Clear definition. Outcome measured the same way for all participants.
Low risk
Only stages I ‐ IIB included.
Low risk
Statistical method in univariable analysis appropriate for the data.
The publication costs of this article were defrayed in part by page charge payment. Therefore, and solely to indicate this fact, this article is hereby marked ‘‘advertisement’’ in accordance with 18 USC section 1734.The authors declare no competing financial interests.
This work was supported by the National Cancer Institute: CA77651 (D.J.S., H.S.), CA33601 (J.L.J.), CA32291 (A.S.L., G.P.C.), CA77440 (N.L.B.), CA04457 (L.K.), CA77658 (N.C.H., S.‐H.J.), CA32291 (R.W.T.), CA47642 (M.E.J.), and CA77597 (B.D.C.). This work was supported in part by the Lymphoma Foundation, Adam Spector Fund for Hodgkin Research, the Ernest & Jeanette Dicker Charitable Foundation, and Mr Daniel Moon and Family (for D.J.S.). This work was also supported by CALGB (National Cancer Institute) with partial support by Eli Lilly and Company. The research for CALGB 50203 was supported in part by grants from the National Cancer Institute (CA31946) to the CALGB (Dr Monica M. Bertagnolli, Chair) and to the CALGB Statistical Center (Dr Stephen George, CA33601). The content of this manuscript is solely the responsibility of the authors and does not necessarily represent the official views of the National Cancer Institute.
NA
English
Retrospective, single‐centre study
University Hospital of Limoges, France
France
65.8 months (2.2‐194.5)
Total: 158
With interim‐PET: 68
Histologically proven, classic HL
Nodular lymphocyte predominant HL
Not reported
February 1995 to July 2011
38 (16‐85)
Not reported
All stages
Not reported
According to the standard of care at the time of diagnosis therapy regimens included ABVD, MOPP/ABV hybrid or BEACOPP; number of cycles not reported
Interim PET
Not reported
After cycle 2 of chemotherapy
Visual evaluation
PET‐positive if focal or diffuse accumulation of FDG in lesions higher than in surrounding tissue
FDG‐PET‐CT data (2005 and later) retrospectively reinterpreted using the Deauville 5‐point scoring system
Different PET imaging techniques over time (dual‐head coincidence until 2005, then FDG‐PET‐CT), quality assurance and quality control program to ensure comparability of methods
Not reported
Progression‐free survival (PFS), defined as time from date of diagnosis until relapse or death
Overall survival (OS), defined as time from first day of diagnosis until death from any cause
None
At 5 years
Yes
Not reported
No
Not applicable
PFS: 68
OS: 68
Kaplan‐Meier (survival analysis)
Chi‐squared test or t‐test (differences between groups)
ANOVA (comparison of means)
Binary
Unclear risk
Availability of interim PET as part of inclusion criteria, but not clear why less than 50% of participants had interim PET data. No comparison of baseline study sample (n = 357) with included participants (n = 158).
Low risk
All participants with available interim PET included.
Moderate risk
Retrospective reinterpretation of PET scans using the Deauville criteria. Method described, but unclear whether assessors were blinded to initial interpretation.
Low risk
Clear definition. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method appropriate for the data.
Low risk
Clear definition. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method appropriate for the data.
Not reported
This work was supported by the University Hospital of Limoges, CHU Limoges, F‐87042 France.
NA
Chinese, translated to English
Retrospective study
Peking University Cancer Hospital
People’s Republic of China
29.4 months (12.2‐52.4)*
Total: 50
With interim PET: 35
Newly diagnosed HL according to the 2008 WHO Hematopoietic and Lymphoid Tissue Classification
Not reported
Not reported
September 2009 to December 2012
33 (14‐74)
Not reported
All stages
Not reported
ABVD or BEACOPP with or without radiotherapy
Interim PET
From the top of the head to the middle thigh, the entire lower extremity was scanned if necessary
After 2 to 4 cycles of treatment
Interpretation of scans by 2 experienced PET‐CT physicians
Scale and cut‐off not reported
Not reported
Not reported
Progression‐free survival (PFS), defined as the interval from diagnosis to first signs of tumour progression, patient death, or end of follow‐up
None
At 3 years
Unclear, follow‐up was conducted via telephone and/or outpatient visits
Not reported
Only 35/50 participants underwent interim PET
Not reported
PFS: 35
Kaplan‐Meier curves and life tables (survival analysis)
Log‐rank tests (comparison between groups)
Binary
Low risk
Clear description of participants and study characteristics.
Low risk
No loss to follow‐up. Length of follow‐up reported.
Moderate risk
Adequate measurement and description, but no standardised criteria for PET scan evaluation.
Low risk
Clear definition. Outcome assessed differently for some participants (via telephone and/or outpatient visits).
High risk
Disease stage not accounted for.
High risk
Poor reporting of univariable analysis.
Not reported
This study was funded by Natural Science Foundation of China (NSFC) grant no. 81470328 and Youth Fund of NSFC grant no. 81600162, 81600130.
NA
English
Prospective, observational, multi‐centre study (11 centres)
11 haemato‐oncology centres
Poland
44.7 months (12.7–90.2)*
310 registered participants, out of which 24 were excluded from analysis due to treatment intensification based on PET1 and/or clinical symptoms of active HL
Newly diagnosed with classic HL
Absent/poor‐quality PET‐CT images
Yes; written informed consent
January 2008 to October 2014
30.8 (median, 18–80)
Not reported
All stages
Not reported
ABVD dependent on disease stage: stages I‐IIA 2‐4x ABVD with radiotherapy or 6x ABVD; stages IIB‐IV 6‐8x ABVD with or without radiotherapy
Interim PET
Whole‐body scan (mandibular angle to one third upper femur)
11‐13 days after end of ABVD cycle 1 (PET1)
Additional scan after ABVD cycle 2 for participants with a PET1 score of 3‐5 (PET2)
Deauville 5‐point scoring system
Scores 1‐3 considered negative, scores 4‐5 considered positive
6 reviewers interpreted all scans using the blinded independent central review method, disagreements were resolved in a joint session
No; PET2 only administered to participants with a PET1 score of 3‐5
Yes
Progression‐free survival (PFS), not defined
Kinetics of response
At 3 years
Yes
Yes
Yes; only 198 participants had PET2 scans
Not reported
286 (PET1) / 198 (PET2)
Kaplan‐Meier (survival analysis)
Cox proportional hazard regression analysis (HR between treatment groups)
Binary
Low risk
Clear description of participants and study characteristics.
Low risk
Moderate risk
Adequate measurement and description. Prognostic factor measured the same way for all participants. However, while PET1 scans were available for all participants, the availability of PET2 scans was dependent on the result of PET1. No further scans were performed if PET1 was negative
High risk
No definition of outcome.
High risk
Disease stage not accounted for.
High risk
No detailed description of analysis.
The authors have declared no conflicts of interest.
No funders to report.
Zinzani 2006
English
Retrospective, multi‐centre study (2 centres)
Bologna and Florence, Italy
Italy
45 months (6‐100)
304
Diagnosed with HL
Other treatment regimens than ABVD
Secondary lymphomas
Continuation of therapy during data analysis
Yes; written informed consent
June 1997 to June 2009
32 (13‐78)
Not reported
All stages
Assessed, but not reported
ABVD dependent on disease stage: early stages 6x ABVD or 4x ABVD with radiotherapy; advanced stages 6x ABVD
Interim PET
Not reported
After cycle 2 of ABVD
Juweid criteria
PET positive considered if focal FDG uptake that could not be attributed to physiological biodistribution, benign uptake or normal anatomy, with clearly increased activity relative to the background, excluding participants with minimal residual uptake
2 experienced board‐certified nuclear medicine physicians interpreted all scans
Yes
No
Response at the end of first‐line treatment and at follow‐up
Progression‐free survival (PFS), defined as time from diagnosis to first observation of progressive disease or death from any cause
Overall survival (OS), defined as time from diagnosis to time of most recent visit or death
At 9 years (for PFS and OS)
Yes
No
No
Not applicable
PFS: all
OS: all
Kaplan‐Meier (survival analysis)
Log‐rank test (comparison between groups)
Binary
Low risk
Clear description of participants and study characteristics.
Low risk
No loss to follow‐up.
Low risk
Adequate measurement and description. Prognostic factor measured the same way for all participants. No blinding of assessors.
Low risk
Clear definition of outcome. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method appropriate for the data.
Low risk
Clear definition of outcome. Outcome measured the same way for all participants.
High risk
Disease stage not accounted for.
Low risk
Statistical method appropriate for the data.
None
This work was partially supported by BolognAIL (Bologna, Italy).
To establish the prognostic value of interim PET scans in limited stage patients with non‐bulky disease
Retrospective
USA (Massachusetts General Hospital, Dana‐Farber Cancer Institute, Harvard School of Public Health, Boston MA)
96
Non‐bulky limited stage cHL treated at the institutions between 2000 and 2008; Bulk was defined as a mass >=10 cm or >=1/3 of the intrathoracic diameter.
None
4 to 6 cycles of ABVD with or without IFRT
Overall survival
Progression‐free survival
To investigate the prognostic value of qualitative and semi‐quantitative evaluations of interim‐PET in mediastinal lymphoma
Retrospective
Not reported
48
Previously untreated, age under 60 at diagnosis and at least one interim‐PET evaluation available
Individuals with sub‐diaphragmatic or medullar localisations of lymphoma
Not reported
Event‐free survival
To evaluate whether binary response assessment criteria (positive or negative) has any prognostic significance after 2 cycles of ABVD therapy
Retrospective
UK
99
Newly diagnosed adults with advanced‐stage HL undergoing baseline and interim (post‐2 cycles ABVD) 18F‐FDG PET‐CT
None
ABVD
Recurrence‐free survival after 1 year
To explore the prognostic role of i‐PET in individuals with HL
Retrospective
Italy
83
Newly diagnosed HL, stage I‐IV disease
None
Not reported
Overall survival
Disease‐free survival
To compare the effect on prognosis of the currently applied MRU definitions
Prospective
Hungary
82
Newly‐diagnosed HL
None
6 courses of ABVB/EBVD, additional radiotherapy according to the protocol
Overall survival
Progression‐free survival
To measure agreement between experienced reporters reading interim PET‐CT scans from an international cohort of patients according to the Deauville criteria
To measure progression‐free survival in advanced HL according to interim PET
Not reported
International study
262
Individuals diagnosed with stage IIB‐IV HL
None
ABVD
Progression‐free survival
To evaluate the significance of iPET in elderly individuals with HL
Retrospective study (1998 to 2016)
Unclear, multicentre study (5 centres)
95
Individuals diagnosed with HL between 1998 to 2016
Older adults (>= 60 years)
Not reported
Fifty‐nine participants received first‐line treatment with ABVD, in 13 participants chemotherapy was followed by IVRT (treatment unclear for the remaining participants)
Overall survival
Progression‐free survival
Time frame: five years
To determine if Negative Predictive Value (NPV) remains high in individuals who present with mediastinal bulky disease
Retrospective
France
38
Previously untreated individuals with HL, with localiSed mediastinal bulky disease
None
Chemotherapy with or without additional radiotherapy
Progression‐free survival
NPV and PPV of iPET
To evaluate the outcome of individuals with HL receiving ABVD alone or in combination with RT
Retrospective
India (Haemato Oncology Care Centre, Vadodara)
63
Not reported
Not reported
ABVD alone or in combination with RT
Overall survival
Progression‐free survival
To evaluate the contribution of PET combined with computed tomography (PET‐CT) and contrast enhanced computed tomography (ceCT) in the staging and in the prognostication of untreated advanced HL
Retrospective
Italy, Poland, Denmark (multicentre)
162
Not reported
Not reported
ABVD with or without RT
Overall survival
Event‐free survival
Single‐centre experience with using 18FDG‐PET as a prognostic factor for long term complete remission (CR)
Retrospective
Italy
65
Newly diagnosed with HL
Not reported
VBM or ChlVPP/ABVVP followed by IFRT
Complete remission
Freedom from treatment failure
To evaluate the prognostic value of interim 18 FDG‐PET in advanced HL patients treated with intensified ChlVPP/ABVVP
Not reported
Italy
70
Not reported
Not reported
6 cycles of ChlVPP/ABVVP
Overall survival
Freedom from treatment failure
To evaluate the use of interim PET for the identification of individuals with classic HL, who are at risk for relapse after first‐line therapy
Retrospective
USA (MD Anderson Cancer Center, Houston TX)
57
Newly diagnosed cHL
Not reported
ABVD
Event‐free survival
To explore the prognostic role of i‐PET in individuals with HL
Retrospective
Italy
83
Newly diagnosed HL, stage I‐IV disease
Not reported
Not reported
Overall survival
Disease‐free survival
To determine the value of 18F‐FDG PET‐CT after the second and sixth cycle of first line therapy with ABVD or BEACOPP in the outcome of individuals with HL
Retrospective, January 2007 to December 2012
Spain
79
HL diagnosis
Not reported
ABVD or BEACOPP
Progression‐free survival
To verify the prognostic role both of "early‐FDG PET" and of macrophagic infiltration, and to test if "early‐FDG PET" positivity could correlate with high macrophagic infiltration in diagnostic specimens
Retrospective, February 2007 to July 2010
Italy (Siena and Florence haematology departments)
52
Diagnosed HL
Not reported
4 to 6 cycles of ABVD with or without IFRT
Complete remission
CD68 expression
To examine the predictive role on treatment outcome of early interim FDG‐PET in individuals with HL, treated with BEACOPP (4 escalated + 4 baseline cycles)
Retrospective
Italy (8 haematological institutions)
44
Diagnosed HL, advanced stage (IIB to IVB, or IIA with adverse prognostic factors)
Not reported
BEACOPP
Complete remission
Failure‐free survival
To assess the importance of PET‐CT as method of evaluation of early response to treatment in HL
Retrospective, 2002 to 2011
Spain (The Miguel Servet's Hospital, Zaragoza)
61
Diagnosed HL, first‐line therapy
Not reported
ABVD or BEACOPP
Progression‐free survival
To evaluate if integration of the response evaluation with iPET with parameters available at diagnosis could add prognostic information, allowing a better risk‐stratification of individuals with HL
Retrospective, 2007 to 2014
Italy (Università Cattolica del Sacro Cuore, Rome)
102
Diagnosed classic HL
Not reported
ABVD
Progression‐free survival
To study the correlation of PET results after one cycle and after two cycles of chemotherapy, and to investigate if the high predictive value of PET after two cycles is obtainable already after one cycle of chemotherapy
Prospective trial
Denmark (Copenhagen), USA (New York)
36
Diagnosed HL
Not reported
ABVD or BEACOPPesc
Negative predictive value
To determine the best predictor of PFS among various variables of tumour metabolic measurements at baseline and at interim PET‐CT compared to conventional methods in individuals with classic HL
Retrospective
Not reported
58
Diagnosed classic HL, ABVD therapy, minimal follow‐up of 2 years
Not reported
ABVD
Progression‐free survival
To evaluate the use of interim PET to guide treatment in advanced stage individuals with classic HL
Not reported, December 2009 to December 2013
Russia
36
Newly diagnosed classic HL (stages IIB to IV, or IIA with bulk), adults
Not reported
EACOPP‐14 with or without RT
Progression‐free survival
To assess how FDG‐PET is currently used in individuals with HL
Unclear, 2006 to 2008
Italy (Cancer Registries in Modena, Ferrara, Parma and Reggio Emilia)
136
Diagnosed HL, adults (18 to 75 years), HIV negative
None
Not reported
Overall survival
Relapse‐free survival
Failure‐free survival
To investigate how PET is currently used in daily practice and whether results obtained in clinical trials and retrospective series can be generalised to all individuals with HL.
Retrospective
Italy
Participants were identified from archives of four population‐based Italian cancer registries
Total: 136
Registration in one of the four population‐based Italian cancer registries (Modena, Reggio Emilia, Parma, Ferrara)
Histologicallyconfirmed diagnosis of HL between 1 January 2006 and 31 December 2008, age between 18 and 75 years, and human immunodeficiency virus (HIV) negativity
Missing data
N = 116 (85%) participants received ABVD chemotherapy, 11 participants (8%) received intensified regimens such as BEACOPP or COPP/EBV/CAD, six participants (4%) received chemotherapy without anthracycline such as VBM or MOPP, and three participants (3%) received other therapies such as radiotherapy alone
Failure‐free survival
Overall survival
To assess the role of interim PET‐CT and compare it with PET‐CT results after the end of treatment in individuals with advanced stage classic HL
Not reported
Russia
114
Newly diagnosed classic HL
None
6 cycles of EACOPP‐14 with or without RT
Complete metabolic response
To summarise our experience with 18F‐FDG PET‐CT in HL
Retrospective, November 2006 to January 2010
Hungary (National Institute of Oncology, Budapest)
60
Not reported
Not reported
ABVD or BEACOPPesc, with or without RT
Prognostic value
To summarise our experience with 18F‐FDG PET‐CT in interim staging
Retrospective, November 2007 to January 2010
Hungary (National Institute of Oncology, Budapest)
19
Not reported
Not reported
ABVD or BEACOPPesc, with or without RT
Prognostic value
To evaluate the prognostic value of interim PET scan (PET2) and end‐of‐treatment PET (PET6) in the outcome of individuals with HL
Retrospective, January 2004 to December 2011
Portugal (Porto, single‐centre)
261
Diagnosed HL
PET‐guided treatment adaptation
ABVD, BEACOPPesc or CVP/CEB
Complete remission
Overall survival
Progression‐free survival
To investigate the value of PET performed after 2 (PET2) and 4 (PET4) cycles of therapy for the management of patients with HL
Not reported, September 2006 to September 2008
Italy (University of Bari)
26
Newly diagnosed HL
None
ABVD
Complete remission
Partial remission
Progression‐free survival
To assess the impact of disease bulk, interim PET and treatment modality on outcomes
Retrospective, 1995 to 2011
USA (Cleveland Clinic, Cleveland OH)
121
Previously untreated HL, early stages (I and II)
Missing clinical data
ABVD or other chemotherapy (not specified)
Overall survival
Progression‐free survival
To identify characteristics, both at baseline and during therapy, predictive for survival outcomes in HL
Retrospective
Italy (Niguarda Hospital, Milan)
105
Diagnosed HL
None
3 to 8 cycles of ABVD with or without IFRT
Overall survival
Event‐free survival
Relapse‐free survival
To explore the prognostic role of iPET in individuals with HL by correlating Deauville criteria and qPET to DFS and OS
Retrospective
Italy
131
Newly diagnosed HL, disease stages I to IV
None
Not reported
Overall survival
Disease‐free survival
To assess the prognostic value of anytime negative PET scan in the course of first line treatment in individuals with HL receiving ABVD
Retrospective
USA (Henry Ford Health System, Detroit MI)
32
Newly diagnosed HL
Not reported
ABVD
Overall survival
To compare iPET with a series of histological and immunohistochemical parameters obtained on tissue‐micro‐arrays as possible predictive factors
Retrospective
Italy (Bologna)
209
Biopsy‐proven HL, complete clinical and iPET data
None
Not reported
Overall survival
Progression‐free survival
Randomised clinical trial (phase III) including 1150 participants with HL
1st Department of Medicine, Cologne University Hospital, Cologne, Germany
Total: 1150
Hodgkin lymphoma
Adults (18 to 75 years)
CS I and II without risk factors (large mediastinal mass (> 1/3 of maximum transverse thorax diameter), extranodal involvement, elevated ESR, three or more involved nodal areas)
Written informed consent
Leukocytes < 3000/µl
Platelets < 100000/µl
Hodgkin lymphoma as composite lymphoma
Activity index (WHO) > 2
Active comparator (A): two cycles ABVD followed by 20 Gy IF‐RT, irrespective of FDG‐PET results after chemotherapy
Expertimental (B): two cycles ABVD followed by 20 Gy IF‐RT if FDG‐PET is positive after chemotherapy; 2 cycles ABVD and treatment stop if FDG‐PET is negative after chemotherapy
Progression‐free survival
Time frame: five years
Overall survival
Acute toxicity
Late toxicity
Complete response rate
Time frame: five years
May 2020
We included studies that evaluated both adult and adolescent participants (the youngest being 13 years old), as opposed to including adult participants ( ≥ 18 years of age) only as stated in the protocol of this review. Hodgkin lymphoma is a disease with a typical onset in adolescence to mid‐adulthood, with little physiological differences between adolescents and adults. In the studies included in this review, participants under the age of 18 were treated in the same clinic and received the same treatment as participants over ≥ 18 years of age. We believe that the results regarding interim PET are equally relevant to adolescents as they are to adult participants and, therefore, did not see reasons against the inclusion of studies including both younger and older adults. Nevertheless, we did not include studies that evaluated solely paediatric participants. In studies where only paediatric participants are included, it is more likely they will be treated in paediatric clinics and receive a different treatment regimen than adult participants.
We used an amended version of the Quality in Prognostic Factor Studies (QUIPS) tool to assess the risk of bias of the included studies. In consultation with Hayden and colleagues (Hayden 2013), we adapted the QUIPS tool by adding 'unclear (no information)' as a fourth judgement in the tool. In addition, we renamed the fifth domain of the tool, originally named 'study confounding', into 'other prognostic factors (covariates)', to highlight the importance of adjusting for other prognostic factors and distinguish it from confounding. Lastly, we assessed all six domains (study participation, study attrition, prognostic factor measurement, outcome measurement, other prognostic factors (covariates), statistical analysis and reporting) per outcome (OS and PFS) in each study. The first three domains ended up always receiving the same judgement as they are indeed to be considered at study level. With regard to the outcomes, however, we identified differences in analysis and reporting within studies.
With regard to data extraction, we developed our own data extraction form specific to prognostic factor studies (particularly those that are included in this review), which includes more items than stipulated in the protocol of this review.
Lastly, we searched Embase as an additional database, as well as one trial registry (ClinicalTrials.gov).
Contributions of authorsAngela Aldin: screening and selection of studies, development of data extraction form, data extraction, 'Risk of bias' assessment, GRADE assessment, data analysis interpretation, 'Summary of findings' tables, writing and drafting of the review, communication with and between authors.
Lisa Umlauff: 'Risk of bias' assessment, characteristics of included and excluded studies (texts and tables), abstract and Plain language summary, proofread and commented on the draft.
Karel Moons: methodological input on reviews of prognosis studies.
Lise J Estcourt: screening and selection of studies, data extraction, risk of bias assessment, clinical and methodological input.
Andreas Engert: medical and content input, particularly on the clinical comparability of studies and subgroup analysis.
Carsten Kobe: nuclear medical input on PET‐CT.
Bastian von Tresckow: clinical input, particularly on the clinical comparability of studies and subgroup analysis.
Gary Collins: methodological input on reviews of prognostic studies.
Madhuri Haque: screening and selection of studies.
Farid Foroutan: input on risk of bias and GRADE assessment of prognostic factor studies.
Nina Kreuzberger: 'Risk of bias' assessment, proofread and commented on the review draft.
Marialena Trivella: screening and selection of studies, data extraction, risk of bias assessment, statistical analysis, proofread and commented on the review draft.
Nicole Skoetz: protocol development, screening and selection of studies, data extraction, risk of bias assessment, GRADE assessment, proofread and commented on the review draft.
Sources of support Internal sourcesUniversity Hospital of Cologne, Germany
Cochrane Haematological Malignancies, Department 1 of Internal Medicine
NHS Blood and Transplant, UK
Federal Ministry of Education and Research, Germany
Funding Number 01KG1709
Angela Aldin: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest.
Lisa Umlauff: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest.
Karel Moons: none known.
Lise J Estcourt: award of the grant by Federal Ministry of Education and Research to the University of Oxford to perform this systematic review does not lead to a conflict of interest.
Andreas Engert: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest. Principal investigator of the HD18 trial, does not lead to a conflict of interest. Received funds from Takeda Pharma GmbH, BMS and MSD for consultancy and educational presentations, but these were not related to the intervention in this review. No competing interests.
Carsten Kobe: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest.
Bastian von Tresckow: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest. Received funds from Novartis Pharma GmbH, Takeda Pharma GmbH and MSD for consultancy and educational presentations, but these were not related to the intervention in this review. No competing interests.
Gary Collins: supported by the NIHR Biomedical Research Centre, Oxford, and Cancer Research UK (programme grant: C49297/A27294). No conflict of interest.
Madhuri Haque: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest.
Farid Foroutan: none known.
Nina Kreuzberger: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest.
Marialena Trivella: part of the grant from the Federal Ministry of Education and Research to the University Hospital of Cologne, was paid to the University of Oxford for author time spent working on this review. However, the funder played no part in the design and execution of the project and it does not constitute a conflict of interest.
Nicole Skoetz: award of the grant by Federal Ministry of Education and Research for the University Hospital of Cologne to perform this systematic review does not lead to a conflict of interest.
contributed equally
contributed equally
Edited (no change to conclusions)
References References to studies included in this review Andre 2017 {published data only}RetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4