Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
MSc Physics and Astronomy
Physics of Life and Health
Master Thesis
Knowledge-based radiotherapy treatment planning for
stage III lung cancer patients
by
Evgenia Tourou
11128879
June 2018
60EC
Supervisor/Examiner: Examiner:
Wilko F.A.R. Verbakel, PhD Geert J. Streekstra, PhD
Daily Supervisor:
Alexander R. Delaney, MSc
Radiotherapy Department, VUmc medical center
ABSTRACT
Treatment of large volume lung cancer is carried out mostly using two techniques, the full-RapidArc
(f-RA) or the hybrid-RapidArc (h-RA). The choice between the two methods depends on the
individual characteristics of the patient, while the treatment planners often have to make both plans in
order to choose for the optimal treatment technique for the patient. However, manual treatment
planning is a labor-intensive and time consuming process which, in many cases, does not
yield consistent or optimal plans. RapidPlan (Varian Medical Systems, Palo Alto, USA), a
knowledge-based-planning solution, uses the dosimetry and geometry of previous treatment
plans to model and predict organ-at-risk (OAR) dose-volume-histograms (DVHs) for future
patients based solely on their geometry. The present study investigates the possibility of utilizing
RapidPlan, as a tool for selecting f-RA or h-RA technique for individual lung cancer patients, without
the requirement of creating actual treatment plans. A f-RA and a f-RA model were created,
consisting of 50 clinical plans each, and were used to generate dose predictions and
subsequently to optimize model-based plans (MBPs) for a group of 10 patients. MBPs quality
was analyzed by benchmarking MBPs against the manual plans (MPs) made by experienced
radiotherapy treatment planners. DVH prediction accuracy was analyzed by comparing predicted
vs achieved OAR dose metrics. Finally, the number of patients that would have been selected
for f-RA or h-RA based solely on OAR predictions was compared to the corresponding
number of patients that would have been selected based on the achieved OAR doses in
MBPs. MBPs improved contralateral lung (CL) and total lung (TL-PTV) mean dose compared to
the manual plans in both techniques. However, CL V5 in the f-RA MBPs increased compared
to the MPs. The target coverage was inferior in the MBPs compared to the MPs. RapidPlan
was able to accurately predict the mean dose of CL, but it consistently underestimated the
amount of sparing that could be achieved for TL-PTV. Based only on comparing single OAR
dose volumes, RapidPlan can accurately predict which technique gives the lower dose in 7-9
/10 cases. The results showed that RapidPlan is able to generate MBPs of comparable quality
to the MPs for f-RA and h-RA techniques, nevertheless, it requires further validation with a
more wise selection of priorities and using generated point-objectives instead of line
objectives.
CONTENTS
1. INTRODUCTION ................................................................................................................. 1
2. METHODS AND MATERIALS ........................................................................................... 5
2.1 Treatment planning of large volume lung cancer patients .......................................... 5
2.1.1 Full-RapidArc ...................................................................................................... 5
2.1.2 Hybrid-RapidArc ................................................................................................. 6
2.2 RapidPlan .................................................................................................................... 8
2.2.1 Data extraction ..................................................................................................... 9
2.2.2 Model Training .................................................................................................. 11
2.2.3 Generation of DVH Estimations ........................................................................ 13
2.2.4 Placement of Optimization Objectives .............................................................. 13
2.3 Model Libraries ......................................................................................................... 14
2.3.1 Patient geometries .............................................................................................. 15
2.3.2 Dosimetry ........................................................................................................... 16
2.3.3 Field set-up......................................................................................................... 16
2.4 Evaluation of Model Training ................................................................................... 17
2.4.1 RapidPlan-provided statistical metrics .............................................................. 17
2.4.2 Outlier analysis .................................................................................................. 18
2.4.3 Field Geometry .................................................................................................. 19
2.5 Evaluation of model-based plans .............................................................................. 19
2.5.1 Evaluation group geometries and field-set up ................................................... 20
2.5.2 Assigning Optimization objectives .................................................................... 21
2.5.3 Evaluation of prediction accuracy ..................................................................... 22
2.5.4 Using predictions to select treatment technique ................................................ 23
3. RESULTS ............................................................................................................................ 24
3.1 Evaluation of Model Training .................................................................................... 24
3.1.2 Outlier analysis .................................................................................................. 25
3.1.2 Field Geometry .................................................................................................. 26
3.2 Evaluation of Model-Based Plans .............................................................................. 27
3.3 Evaluation of prediction accuracy .............................................................................. 32
3.3.1 Contralateral Lung ............................................................................................. 32
3.3.2 Total Lung – PTV .............................................................................................. 36
3.3.3 Esophagus .......................................................................................................... 37
3.4 Individualized analysis .............................................................................................. 38
3.5 Using predictions to select treatment technique ........................................................ 43
4. DISCUSSION AND CONCLUSION.................................................................................. 47
References ................................................................................................................................ 50
APPENTIX A .......................................................................................................................... 54
APPENTIX B........................................................................................................................... 58
1
1. INTRODUCTION
Lung cancer is the most often diagnosed cancer and the first cause of death amongst
cancer patients, leading to 1.6 million deaths worldwide every year 1 . The major treatments
for lung cancer are surgery, radiotherapy, and chemotherapy. In treatment of large volume
lung cancer, radiotherapy, usually in combination with chemotherapy plays an important
role2. Radiotherapy treatment of large volume lung cancer is challenging because it requires
delivery of high dose levels to large tumor volumes, while sparing the proximal critical
organs-at-risk (OARs) such as the esophagus, spinal cord, and the heart. High dose levels to
the healthy tissues increases toxicity3 and can lead to side effects such as symptomatic
pneumonitis4–6 and esophagitis7.
Radiation therapy makes use of ionizing radiation to kill cancer cells by absorbed
energy. Thus, the aim is to deliver maximum dose to the tumor and as low dose as possible to
the surrounding normal tissue. The radiation therapy process starts with a computed
tomography (CT) scan of the patient to locate the tumor. Then, the physician delineates the
relevant targets/tumor volumes and the surrounding OARs that need to be spared. The
delivery of the treatment is done by a linear accelerator or a cobalt machine. In order to
minimize the dose to normal tissue while ensuring sufficiently high dose to the target,
multiple field directions are used. Additionally, a multileaf collimator (MLC) is utilized to
shape the radiation beams. The MLC consists of multiple metal leaves that move
independently. The leaves are placed such as the aperture of the MLC forms the shape of the
tumor, and thus shields the surrounding tissue from radiation.
Three-dimensional conformal radiotherapy (3D-CRT)8 treatment technique involves
the use of flattened radiation beams with fixed MLC leave configuration, to deliver uniform
radiation dose, while the contribution of each feild to the final dose can vary. Intensity
modulated radiotherapy (IMRT)9 delivers non-uniform radiation beam intensities per field by
computer-controlled movement of the MLC leaves. The summation of all fields leads to
relatively homogenous dose in the target, while the dose in the surrounding healthy structures
is minimized. It was proven that IMRT can reduce the dose to the healthy lung, esophagus,
and heart in lung cancer patients compared to 3D-CRT 10,11. Volumetric modulated arc
therapy (VMAT)12 is an advanced form of IMRT, where the gantry is rotated with
simultaneous movement of the MLC leaves and dose rate modulation. VMAT decreases the
2
treatment time and, with the use of full gantry rotation, generates highly conformal treatment
plans13.
The planning of 3D-CRT plans is done manually by the planner, who defines the
gantry angles, collimator angles, MLC configurations and relative weights of individual
fields. In IMRT and VMAT treatment planning, optimization algorithms are needed to
determine the MLC leaf movements. During a process called inverse treatment planning, the
planner specifies a desirable dose distribution to the target and the normal tissue by a set of
dose-volume objectives and priority factors for each delineated structure. These dose-volume
objectives include the minimum required dose to the target and the maximum required dose
to the OARs. The optimization algorithm tries to find the MLC configurations and dose rates
that will approximate the desired dose distribution by minimizing a cost function which
weights all dose-volume objectives 14.
Defining the appropriate dose-volume objectives and priorities is an essential part of
the process since these will define the final dose distribution and help to achieve the clinical
goal. Optimal selection of dose-volume objectives depends on the geometrical characteristics
of the patient such as the location and volume of the tumor and its proximity to OARs15,
therefore each patent requires special attention. The current practice is that the treatment
planner has to evaluate the plan and interactively maneuver the optimization objectives
during the optimization process until the best possible set of optimization objectives, and
consequently dose distribution- is achieved. This process is time-consuming, and leads to
inconsistencies and inter-planner and inter-institutional variability in plan quality16–18.
In recent years, there has been wide interest in the development and application of
automated treatment planning solutions, aiming to improve the consistency and quality of
radiotherapy treatment plans. Knowledge-based planning utilizes a large number of prior
treatment plans to create a model based on the dose distribution and the geometrical
characteristics of the patients19–22. This model is used to predict achievable OAR dose-
volume histograms (DVH) for prospective patients, based on its individual anatomical
characteristics. Then, for the optimization process, a line objective is generated for each OAR
below the range of the predicted OAR DVHs. Knowledge-based planning offers patient-
specific optimization objectives and thereby semi-automates the optimization process. It is
not a fully automated process because the user needs to manually define the OAR structures
and target, the prescription dose and the field set-up.
RapidPlan (Varian Medical Systems, Palo Alto, USA) is a knowledge-based treatment
solution which was developed based on the work of the groups of the Duke University15,23,24
3
and Washington University19,25. Pre-clinical evaluation of RapidPlan suggested that it is
capable of generating clinically acceptable plans for lung, head and neck, esophageal, breast,
hepatocellular and prostate cancer 20,21,26–31. Particularly for large volume lung cancer, only
Fogliata et al.26 have evaluated the use of RapidPlan on VMAT technique with promising
results.
At the VUmc radiotherapy department, treatment of large volume lung cancer is
carried out mostly using two techniques, the full-RapidArc (f-RA) or the hybrid-RapidArc (h-
RA). RapidArc is the trademark used by Varian for VMAT optimization. In f-RA plans, the
radiation fields are composed of two VMAT arcs, while h-RA is a combination of multiple
conventional 3D-CRT fields and a VMAT field32. H-RA technique usually provides better
planning target volume (PTV) coverage and reduced dose to the healthy contralateral
lung32,33, but it delivers high dose levels outside the PTV within the 3D-CRT fields. On the
other hand, f-RA can spare better the spinal cord and the heart but increases the volume of
contralateral lung receiving low dose32–34.The choice between the two methods is critical and
depends on the individual characteristics of the patient. It often happens that the treatment
planners have to make both plans in order to choose for the optimal treatment technique for
the patient. This is apparently a time-consuming process.
RapidPlan has been proven not only to generate good quality plans but also to provide
accurate achievable OAR dose predictions. Tol et al.35 showed that RapidPlan predictions
only could be used as for quality assurance of head and neck plan. Furthermore, Delaney et
al.36 suggested that RapidPlan can provide accurate predictions to be used for selecting
patients for proton therapy for head and neck cancer patients. The present study investigated
the possibility of utilizing RapidPlan as a tool for selecting f-RA or h-RA technique for
individual lung cancer patients, without the requirement of creating actual treatment plans. It
must be noted the RapidPlan is designed only for IMRT and VMAT plans, therefore, there
have been no studies which investigated the application of RapidPlan on a h-RA method.
To conduct the research, two RapidPlan models were created, one for h-RA and one
for f-RA, consisting of clinical plans of patients treated at VUmc. Both models were
validated on an initial set of patients, and then dosimetry and geometric outliers were
removed. Next, the two models were used to generate dose predictions and subsequently
optimize model-based plans (MBPs) for a group of 10 patients. To evaluate the quality of the
MBPs, they were benchmarked against the manual plans (MPs) made by experienced
radiotherapy treatment planners. Then, to evaluate the accuracy of the predictions, the
generated MBPs where compared to the predicted DVHs. Finally, the number of patients that
4
would have been selected for f-RA or h-RA based solely on OAR predictions was compared
to the corresponding number of patients that would have been selected based on the achieved
OAR doses in MBPs.
5
2. METHODS AND MATERIALS
2.1 Treatment planning of large volume lung cancer patients
Treatment planning of large volume lung cancer at our department has been detailed
previously in the studies of Verbakel 32 and Blom 37. Treatment plans are optimized using the
Progressive Resolution Optimizer (PRO) algorithm version 10.0.28 in the Eclipse treatment
planning system and dose calculation is carried out using either ACUROS 11.0.31 or the
Anisotropic Analytical Algorithm (AAA) 10.0.28 using a 2.5mm grid resolution.
The targets are the internal target volume (ITV) and the planning target volume
(PTV). ITV consists of the primary tumor and regional lymph nodes with metastatic disease
and PTV includes the ITV and a margin of 10mm to compensate for any geometric
inaccuracies. Prescription dose (PD) to the ITV and PTV is typically 50-66 Gy and is
delivered in 23-33 fractions. Treatment plans aim to deliver 97% of the PD to at least 95% of
the PTV, while V107%(the volume receiving at least 107% of the PD) should be lower than
5%.
The spared OARs typically include the contralateral lung (CL), the total lung minus
PTV (TL-PTV) (the summation of the two lungs from which the PTV volume is subtracted),
the esophagus (ESO), the spinal cord (SC), and the SC plus a 3mm margin (SC+3mm).
Generally, the CL is constrained such that the total volume receiving 5Gy (V5) is lower than
40%6. Meanwhile, objectives for the TL-PTV are V20<35%, and V5<60%. For SC and
SC+3mm, a maximum point dose objective is applied: <50Gy and <54Gy respectively. The
maximum dose for ESO should, in general, be less than 100% of the PD, but in cases where
the ESO overlaps with the PTV, a higher dose is acceptable. In order to avoid hotspots
outside the PTV, a control region (OAR-control) is created which surrounds the PTV and
contains most of the body in the planes of the PTV. Additionally, a maximum dose objective
is applied for the OAR-control at 100% of the PD.
2.1.1 Full-RapidArc
For f-RA plans, typically two full-arcs (gantry rotates from 179º to 181º) VMAT
fields with 6 MV beams are optimized simultaneously, using avoidance sectors (control
6
points where the beam is off) to avoid direct irradiation to the contralateral lung. Typically
the avoidance sectors are 90-100° long. However, there is a limitation in their use: each
beam-on and beam-off sector must be at least 15 degrees. Thus, if the beam needs to be off
from the starting angle of the rotation, partial-arcs have to be used instead. Collimator angles
are typically 10° and 15°.
The optimizer uses a simple dose calculation algorithm that does not model well
lateral electron transport, and overestimates the dose in low-density PTV regions. In the final
dose calculation, which takes into account this lateral electron transport, the dose in that
region is lower. To overcome this problem, the PTV is divided into the part that overlaps with
the lungs (PTVinLung) and the part that is out of the lung (PTVoutLung). Then, we typically
apply a PTVinLung lower objective which is placed a few Gy higher than the PTVoutLung
lower objective. Subsequently, a ‘continue previous optimization’ (CPO) is performed with
increased PTV priorities to improve PTV homogeneity38.
Typically two optimization objectives are used for each of the following treatment
planning aims : CL V5 , TL-PTV V5 and TL-PTV V20. These objectives are interactively
placed below the DVH line displayed during optimization and adapted until the lowest
possible dose is achieved, whilst maintaining good PTV dose coverage/homogeneity. For the
ESO three optimization objectives are placed around V40, V50,V60 and one maximum dose
objective of 66Gy.
2.1.2 Hybrid-RapidArc
H-RA plans consist of a conventional and a VMAT component. The conventional
component consist of typically three 3D-CRT fields of 15MV, and delivers 90% of the PD.
Field orientation generally consists of one anterior-posterior (AP) field, one posterior-anterior
(PA) field and one oblique-posterior field, with field-weights roughly set to 0.5, 0.25 and
0.25, respectively. Thus, the AP and PA fields spare the contralateral lung, while the oblique
field decreases direct irradiation through the spinal cord. The dose distribution achieved using
the conventional fields is calculated and used as a “base dose plan” for the optimizer, which
is subsequently configured to optimize the RapidArc component.
The RapidArc component delivers the remaining 10% of the dose, using 6MV beams.
Since the base dose plan delivers an inhomogeneous dose to the PTV, the RapidArc
component is meant to homogenize the dose to the PTV. A single partial-arc is typically used
7
from 181° to 30° for left-sided tumors or from 330° to 179° for right-sided tumors. The
partial-arc length can vary depending on the size and location of the tumor and, alternatively,
in some cases a full-arc with avoidance sector is used. The RapidArc component contributes
most to the outer part of the PTV, improving the dose homogeneity32.
Similar optimization objectives to the f-RA plans are used for CL and TL-PTV, while
the maximum dose objectives for the ESO, SC and SC+3mm are usually 1-2Gy lower. The
ESO is not always actively spared. The division of the PTV between in- and out-of-lung and
a CPO is not needed in h-RA plans as the majority of the dose is delivered using the
conformal-fields.
Figure 2.1 shows the field set-up and the resulting dose distribution of a left-sided
cancer patient, treated with f-RA (a,c) and h-RA (b,d). In the h-RA plan, the dose is
distributed to the anterior-posterior direction, while f-RA results in a more conformal dose
distribution. However, f-RA produces a larger low-dose volume in the surrounding normal
tissue39, which also covers a larger part of the CL.
8
Figure 2.1: Field geometry of f-RA and h-RA plans of the same patient and the
corresponding dose distributions. The PTV is depicted with red color. a) Field set-up f-RA
plan, b) Field set-up of h-RA plan, c) Dose distribution of f-RA plan, d)Dose distribution of
h-RA plan.
2.2 RapidPlan
RapidPlan is a commercial knowledge-based treatment planning system that utilizes
previous treatment plans to create DVH-estimation models which can predict a range of
DVHs for the organs-at-risk of prospective patients. It is integrated into the Eclipse treatment
planning system (Varian Medical Systems, Palo Alto, CA) and can be used to generate dose-
volume objectives which subsquently guide the optimization process of a new treatment plan.
RapidPlan is comprised of a model configuration component and a DVH estimation
component. In the model configuration component, the information of the planned patients is
9
extracted and used to train the DVH estimation model. The DVH estimation component uses
the trained model to make predictions for the OARs of a prospective patient and
automatically place dose-volume objectives on the lower boundary of these predictions. This
semi-automates the treatment planning process and these steps are further explained below.
2.2.1 Data extraction
First, a number of treatment plans (minimum 20) are selected to populate the model
library. Then, a model structure set is created by the user containing the targets and the
relevant OARs. Each structure of the treatment plans is then matched to the corresponding
structure of the model structure set. During the data extraction phase, the geometry and dose
of each structure, along with the field geometry and the prescription dose of the plan, are
extracted and converted into some characteristic metrics. Each OAR structure is divided into
the following regions (Figure 2.2):
In-field region: The part of the structure that overlaps with target projection at least
from one field view. This is the most heavily modulated region, since it receives
direct irradiation and its dose is minimized by the movement of the MLC leaves.
Overlap region: The part that is anatomically overlapping the target. This part has
dose level comparable to the target dose.
Leaf-transmission region: The part that is visible from the jaw aperture of at least one
field but is not overlapping with the target projection from any field. It receives some
dose through the closed leaves of the MLC but it does not strongly affect the
optimization.
Out-of-field region: The volume of the structure that is not visible from jaw aperture
of any field direction. This part does not receive direct irradiation.
10
Figure 2.2: Schematic represention of volume partition in transversal view40.
The relative volume of each OAR region is calculated, as well as the cumulative DVH
based on the extracted dose, sampled into 2.5mm resolution. Additionally, RapidPlan
calculates the cumulative DVH of the Geometry Expected Dose (GED) for each OAR region.
The GED is a metric that calculates the dose that would be expected in a voxel at a
certain distance from the target given the patient anatomy, prescribed dose to the target, and
position and orientation of the concerned fields. The GED does not take into account the
different levels of sparing of the OARs; it only considers general sparing of the tissue outside
the target while delivering the desired dose to the target. The dose expected in an OAR voxel
depends on the distance from the target and the fields, the orientation of the fields, the
nominal field energy, and the physical characteristics of photons15. The GED calculation also
includes heuristics about the optimal inter-field and intra-field modulation that lead to sparing
of the normal tissue. Meaning that each field can be weighted differently and each beamlet
within a field can deliver different dose by the MLC modulation, based on the shape and
orientation of the target. Therefore, if two or more fields have the same geometry, the
duplicates are not taken into account. Furthermore, the jaw position is not considered in the
calculation41.
For the whole OAR structures, the following geometric features are calculated: OAR
volume in cm3, overlap volume percentage with the joint targets, out-of-field volume
percentage and joint target volume in cm3.
11
2.2.2 Model Training
For each of the above-mentioned OAR regions, a separate DVH estimation model is
constructed during the model training phase. The in-field-region uses a combination of
Principal Component Analysis (PCA) and regression analysis, while the other three regions
use a simpler method that calculates the mean DVH and the standard deviation. A schematic
representation of training phase for the in-field region model is shown in Figure 2.3.
The PCA is applied to the DVHs of all OARs in the training set, to describe the
variance of DVH shapes in the population and to select the most significant parameters [Sohn
2007]. The methodology is based on the assumption that each DVH can be reconstructed
from the sum of the mean DVH and a few weighted Principal Components (PC). First, the
mean DVH over all the DVHs in the training set is calculated, and then subtracted from each
DVH in the training set. Then, the first PC (PC1) curve is calculated such that it explains the
most amount of variation in the training set. The projection of this PC is subtracted from each
DVH of the set and then the next PC is calculated by maximizing the variance of the
remaining curves. This process continues until at least 95% of the variance is explained by
the PCs 41. Each particular DVH of the training set is parametrized by subtracting the mean
DVH and then projecting the residual curve to each of the PCs to find the corresponding PC
coefficients or PC scores (PCS). The same principal component analysis is applied to the
GEDs of the training set and the respective GED-PCs are calculated.
The correlation between the PC scores of the DVHs (DVH-PCS) and the geometrical
features (absolute OAR volume, absolute target volume, overlap volume percentage, out-of-
field volume percentage, and GED-PCSs) in the training set is determined by stepwise
regression analysis. For each PC a separate model is used, which also includes the second
order terms of the parameters to account for the non-linear effect between two features15. The
stepwise regression procedure follows an iterative forward and backward method. First, the
most significant geometrical parameter is added in the model, and more parameters are added
in each step only if they have a significance level higher than 5%. Then, the parameters that
have a level of significance less than 5% are removed41. This results in a regression model
including the most significant geometrical parameters, whose coefficients are stored in a
matrix that can be used to estimate the DVH PCS from the geometric features. Additionally,
the standard error of each DVH PCS is calculated in order to determine the upper and lower
boundary of the estimation DVH range.
12
Figure 2.3: Schematic representation of the training phase and PCSs estimation. First, the
PCs are extracted from the DVHs. Then, each individual DVH is parameterized based on the
PCs. The regression model finds the relation between the PCSs and the geometrical
parameters. Finally, the PCSs of a new plan are estimated from the regression model and the
geomtrical parameters of patient.
13
2.2.3 Generation of DVH Estimations
The trained DVH estimation model is used to generate predictions for a new patient.
The patient's anatomy (targets and OARs), the field geometry and the prescription dose are
used as the input for the DVH estimation model. The algorithm calculates the same features
as in the data extraction phase (volume partition, geometric features, GED histograms),
except for the DVHs. For the in-field region, the GED histograms are parameterized using the
GED principal components calculated in the training phase. Then, the regression model is
applied to estimate the DVH PCSs from the geometric features. The estimated PCSs and the
stored principal components are then combined to calculate the most probable DVH curve.
The DVH estimation range is calculated from the standard error related to the regression
model. For other regions, the stored mean DVH is obtained and the estimation range is
calculated by adding and subtracting on standard deviation. Finally, the estimated DVHs
from the different OAR regions are weighted based on the relative volume of each region and
are summed together to construct the final DVH estimate for the OAR (Figure 2.4).
2.2.4 Placement of Optimization Objectives
When the DVH estimation range is generated, RapidPlan converts it into optimization
objectives to semi-automate the optimization process. A line of optimization objectives is
placed just below the lower boundary of the prediction range as seen in Figure 2.4. The
objectives pull all parts of the DVH curve with the same strength, aiming to produce a
resultant DVH which is representative of the DVH-prediction. However, if an OAR structure
overlaps with the target, the objective line corresponding to the volume that overlaps with the
target is placed horizontally so that it does not conflict with the targets lower objectives
(Figure 2.4). RapidPlan users may also specify dose-volume objectives with a certain volume
value for which the corresponding dose will be generated by the DVH-prediction range. This
can be utilized, for example, to lower specific parameters such as the V5 of an OAR, rather
than the dose to the entire OAR, which the line objective caters to.
14
Figure 2.4: Predicted DVH ranges of various organs-at-risk with the
generated optimization objective lines.
2.3 Model Libraries
Fifty-five patients previously treated with f-RA and 55 patients previously treated
with h-RA (planned according to the protocol described in section 2.1) were selected to
populate two models, the f-RA model and the h-RA model. The clinical plans were added to
the models without any modification.
In the f-RA model-library the PD for the ITV and PTV varies from 50Gy to 66Gy: 19
plans with 66Gy, 17 plans with 65Gy, 15 plans with 60Gy, 1 with 57Gy, 1 with 56Gy, 1 with
52Gy and 1 with 50Gy. Dose was delivered with fractions of 2.0-2.6Gy. The variation in PD
was not expected to affect the performance of the model as this has been previously reported
without any apparent degradation in resulting plan quality26.
In the h-RA model, the PD was 66Gy for 27 plans, 65Gy for 13 plans, and 60Gy for 15
plans. One difference between the two model-libraries was that only 16 of the 55 plans in the
h-RA model used optimization objectives for V40,V50,V60, while all plans in the f-RA model
included esophageal sparing for dose lower than 60Gy.
The following structure set was created for each model: ITV, PTV (PTVinLung and
PTVoutLung for the f-RA model), CL, TL-PTV, ESO, SC, SC+3mm, OAR-control, and
ipsilateral lung (IL). The corresponding structures were matched for each patient.
15
2.3.1 Patient geometries
A heterogeneous patient population in terms of tumor size and location was selected
in order to make the model applicable to a large variety of patient geometries. Furthermore, it
was intended that both models have patients with a similar range of volumes. Of the 55 plans
of the f-RA model, 35 had right-sided tumors while the remainder were left-sided cases, and
the PTV volume ranged from 205-1361 cm3 with an average of 589 cm3. In the h-RA model,
36/55 patients had right-sided tumors and the rest were left-sided cases, while the PTV
volume ranged from 201-1236 cm3, with a mean of 596 cm3. The geometric characteristics
of the PTV and OARs in the f-RA and h-RA models are listed in Table 2.1 and 2.2
respectively.
Table 2.1: Geometric features of structures in the f-RA model
Structure
Mean
volume
(𝐜𝐦𝟑)
Minimum
volume
(𝐜𝐦𝟑)
Maximum
volume
(𝐜𝐦𝟑)
Overlap with
the target
%
In-field
volume
%
PTV 589 ± 314 205 1361 - -
CL 2023 ± 604 769 3503 0.51 ± 1.23 73.1 ± 17.2
TL-PTV 36701 ± 1114 1564 7254 0.03 ± 0.05 70.9 ± 17.6
ESO 29.1 ± 15.7 6.8 91.1 21.0 ± 17.4 64.8 ± 21.0
SC 37.0 ± 17.6 11.4 82.7 0.22 ± 1.66 77.3 ± 23.1
OAR-control 5653 ± 2686 527.9 13967.4 0.08 ± 0.11 98.1 ± 5.7
IL 1869 ± 627 679.1 3868.6 11.52 ± 6.53 60.1 ± 15.4
Table 2.2: Geometric features of structures in h-RA model library
Structure
Mean
volume
(𝐜𝐦𝟑)
Minimum
volume
(𝐜𝐦𝟑)
Maximum
volume
(𝐜𝐦𝟑)
Overlap with
the target
%
In-field
volume
%
PTV 596 ± 262 201 1236 - -
CL 2044 ± 656 1016 3413 0.14 ± 0.27 75.0 ± 14.2
TL-PTV 3672 ± 1230 1473 6461 0.03 ± 0.06 71.9 ± 15.1
ESO 27.9 ± 12.5 5.9 78.9 27.3 ± 22.3 62.8 ± 22.3
SC 34.6 ± 15.4 14.1 74.9 0.01 ± 0.10 79.6 ± 22.9
OAR-control 4942 ± 2158 1365 11673 0.07 ± 0.11 97.3 ± 4.9
IL 1860 ± 672 657 3434 13.10 ±6.62 59.3 ± 14.5
16
2.3.2 Dosimetry
Table 2.3 contains the average dosimetry of plans in both the f-RA model and h-RA
model.
Table 2.3: Dosimetric features in the model libraries
Structure
f-RA model h-RA model
Mean
dose
(%)
Minimum
dose
(%)
Maximum
dose
(% )
Mean
dose
(%)
Minimum
dose
(%)
Maximum
dose
(% )
PTV 101.2 ± 1.3 99.0 103.9 100.7±1.2 98.6 104.0
CL 9.0 ± 6.2 1.1 27.5 6.06 ± 4.17 0.96 15.6
TL-PTV 19.4 ± 6.5 2.9 31.9 20.9 ±5.4 6.5 33.09
ESO 48.6 ± 19.8 8.6 84.7 55.0 ± 23.6 5.02 96.9
SC 34.1 ± 13.8 5.0 63.1 34.2 ± 16.2 5.2 71.5
OAR-control 39.6 ± 9.7 27.8 64.9 46.9 ± 15.2 27.5 74.5
IL 40.8 ± 13.0 7.5 68.9 48.0 ± 12.6 15.2 75.0
2.3.3 Field set-up
Field set-ups varied largely between plans of both the f-RA model and h-RA model,
as seen in Table 2.4 and Table 2.5, respectively. In the f-RA model, 48/55 plans used 2 full-
arcs, whilst using an avoidance sector to prevent irradiation through the healthy contralateral
lung (full-arc plans). 7/55 plans used 2 partial arcs for the same purpose (partial-arc plans).
The irradiation-arc length for the full-arc plans ranged from 240° to a maximum of 310° for a
centrally located tumor, with a mean of 270 ± 14°. Partial arcs were used in cases where the
tumors were relatively small (< 400 cm3) and the irradiation-arcs covered relatively shorter
angles, with their length ranging from 200° to 260° with mean 223 ± 23°.
Evaluation of plans in the h-RA model showed that the AP field was placed between
355°and 10° (at 0° in 51/55, at 10° in 1/55, at 8° in 1/55 and at 355 in 1/55 of plans), while
the PA field was placed between 170° and 195° (at 180° in 52/55, at 170° in 1/55 and at 195°
in 1/55). For right-sided tumors an oblique field was typically placed between 195 - 215°
with 210° and 200° being the most common field angles. In the left-sided cases, the oblique
field was placed at 150° in 9/19 cases, at 160° in 5/19 and at 155 in the remainder of cases.
Two plans used an additional oblique field: one at 210° for a left-sided tumor and one at 270°
for a right-sided tumor, while 2 other plans did not use any oblique field. In one particular
patient, with a centrally located tumor, 3 oblique fields were used at 340º, 160º, and 210º but
17
no AP or PA field was included. Regarding the RapidArc component, a full-arc was used in
12/55 plans with the irradiation-arc length ranging from 206° to 290°, with a mean of 251° ±
27°. The remainder 43 plans consisted of partial-arcs with the irradiation-arc length ranging
between 210-240°.
Table 2.4: Field geometry of plans in the f-RA model
Field set up # Plans Irradiation-arc length
Range Mean
Full-arc 48 240-310° 270 ± 14°
Partial-arc 7 200-260° 223 ± 23°
Table 2.5: Field geometry of plans in the h-RA model
Conventional component
Tumor
position # Plans
AP field PA field Oblique field
Angle range Angle range Angle range
Right 36 355-10° 170-195°
195-215°
Left 19 150-160°
RapidArc component
Field set up # plans Irradiation-arc length
Range Mean
Full-arc 12 206-290° 251 ± 27°
Partial-arc 43 210-240° 215 ± 8°
2.4 Evaluation of Model Training
2.4.1 RapidPlan-provided statistical metrics
Subsequent to model training, RapidPlan provides the user with certain quality
metrics regarding the goodness of fit and the goodness of estimation of the model. Amongst
others, RapidPlan provides the user with residual, regression, geometric and DVH-plots so
that the quality of the model can be visually assessed, as described by Delaney et al. 42.
According to the manufacturer, these metrics, as well as plots, should be evaluated before
model validation.
18
The goodness of fit statistics describe how well the DVH estimation model represents
the data in the training set. The coefficient of determination of the regression model
parameters, R2, describes how much of the variance is explained by the regression model. It
is scaled from 0 to 1, with a large value indicating a better fit. High values may also indicate
that the model is over-fitting the data, meaning that model overreacts to minor fluctuations in
the data. The regression model parameters such as the average chi square, 𝜒2, describes the
quality of the regression model. It is measured from the residual difference between the
original data and the estimated data. The closer the value is to 1, the better the quality of the
regression model. However, if it is very close to 1 it is possible that the model is being over-
fitted.
RapidPlan performs an internal cross-validation for the trained models. The plans are
divided into 10 groups. The plans of the 9 groups are used for model training, while the
remaining group is used for validating the model. This process is repeated 10 times until all
groups are used for validation. The average results of the validation processes represent the
ability of the model to estimate the DVH of a plan that is part of the training set. The mean
squared error between the original and the estimated data measures the distance between the
original DVH and the mean of the upper and lower bounds of the estimated DVH-range.
2.4.2 Outlier analysis
An outlier in the model can be a structure whose dosimetry or geometry differs from
the rest of the population, or it has a substantial effect on the model fit. RapidPlan provides
some statistical metrics and the reporting thresholds for the structures in the training set,
namely: cook’s distance (CD) which indicates influential data points, modified Z-score (mZ)
which indicates geometrical outliers and studentized residual (SR) which indicates dosimetric
outliers.
In order to examine if the model would be improved by removing the outliers, the
indicated outliers were removed from the model and the model was re-trained. The resulting
“cleaned model” and the initial “uncleaned model” were used to generate plans for 4 patients
not included in the models. The MBPs were then compared to decide which model provides
better plans.
19
2.4.3 Field Geometry
Since field set-ups in both models varied quite substantially, it was investigated
whether this variation was visible amongst the provided RapidPlan plots and statistical
metrics for these models. Furthermore, since both partial arcs and avoidance sectors were
used to limit irradiation of the contralateral lung, it was examined whether RapidPlan could
appropriately model both of these techniques. This was done by observation of the RapidPlan
provided CL regression plot: the principal component score 1 of the DVH (DVH-PCS1)
versus the principal component score 1 of the GED (GED-PCS1) (Figure 2.5); and the
residual plot: the DVH-PCS1 versus the estimated PCS1 for the DVH. In these plots, each
plan of the training set is represented by one data point. The lines represent one standard
deviation away from the regression line. Points falling outside of the lines are possible
outliers.
Figure 2.5 Regression plot: DVH-PCS1 versus the GED-PCS1 of a particular
OAR
2.5 Evaluation of model-based plans
A 10 patient evaluation group was selected arbitrarily, and already had respective f-RA
and h-RA MPs created according to the protocol mentioned in section 2.1. The PD for all the
plans was 66Gy delivered in 33 fractions. These MPs were optimized using PRO v10.0.28
and dose calculation was carried out with AAA v10.0.28. To investigate the performance of
both models, MBPs were created for this evaluation group and compared with respective
20
MPs. MBPs were optimized using the PO 13.6.23 optimization algorithm and the AAA
v10.0.28 for dose calculation, using a 2.5mm grid resolution. Field set-ups for MBPs were
the same as those for the respective MPs. For comparison reasons, the MBPs were
normalized such that the mean dose of the PTV is equal to the PTV mean dose of the MPs.
To assess the quality of f-RA and h-RA MBPs, MBPs were compared with respective MPs
on the basis of the following dosimetric parameters:
For the PTV: V95%(%) , V107%(%) , the maximum dose at the PTV in Gy Dmax(Gy),
homogeneity index (HI) defined as the minimum dose in 2% of the PTV (D2) minus
D98 divided by D50 , conformity index (CL) calculated as the absolute PTV volume
receiving 95% of the PD divided by absolute total body volume receiving 95% of the
PD.
For the OARs: V5(%) , Dmean(Gy) for the CL; V5(%) , V20(%), and Dmean(Gy) for
the TL-PTV; Dmax(Gy) for SC and SC+3mm; Dmax(Gy) and Dmean(Gy) for ESO;
and Dmax(Gy) for the OAR-control.
Paired, 2-sided student t-tests were performed to identify significant differences between the
MPs and MBPs with p<0.05.
2.5.1 Evaluation group geometries and field-set up
Table 2.6 shows the geometries of the 10 patient evaluation group.
Table 2.6 Geometric features of test patients
Patient
#
Tumor
side
PTV Contralateral lung Total lung-PTV Esophagus Volume
(𝐜𝐦𝟑)
Volume
(𝐜𝐦𝟑)
Overlap
%
Volume
(𝐜𝐦𝟑)
Overlap
%
Volume
(𝐜𝐦𝟑)
Overlap
%
1 L 1065 3186 0.04 4964 0.0 49.8 22.4
2 R 1017 3665 0.01 5893 0.0 63.9 11.3
3 L 936 1396 0.39 1913 0.0 75.1 27.9
4 L 987 1934 0.00 2971 0.0 60.5 48.1
5 R 861 2056 0.00 3830 0.17 33.9 10.6
6 R 1210 1920 0.54 3876 0.1 11.2 90.9
7 R 888 2510 0.27 4969 0.0 27.6 10.7
8 L 1250 1947 0.66 3090 0.0 43.2 37.6
9 R 789 1975 0.00 3858 0.18 46.8 0.4
10 L 1119 2581 0.01 3983 0.0 63.2 26.2
Average: 1012±150 2317±680 0.19±0.26 3935±1140 0.05±0.08 47.5±19.3 28.6±26.1
21
f-RA plans for all 10 evaluation patients utilized a full-arc with avoidance sector such
that the irradiation-arc length varied from 240º to 280 º. This length was within the range of
the f-RA model.
h-RA plans for 8/10 evaluation patients used partial-arcs with length between 210 º
and 220º. The remaining 2 plans (patient 6 and 8) utilized full-arc set-up with avoidance
sector, with a beam-on arc length of 250º. Regarding the conventional component, the
majority of the evaluation patients had a field set-up which was within the range of field-
geometries/characteristics of the h-RA model. However, 2 patients had a largely differing
field set-up: (1) patient 3 had a plan with an oblique field set to 140º, while in the model it is
between 150º and 160º ; (2) patient 6, although the PTV was mainly located in the right lung,
had 2 oblique fields, set on 200 º and 160 º with equal weight.
2.5.2 Assigning Optimization objectives
Both the f-RA and h-RA models were used to generate objectives for the following
OARs of the evaluation group: CL (line-objective), TL-PTV (line-objective) and ESO (line-
objective and a generated maximum point dose objective at 0% volume). The priorities for
the h-RA MBPs were reflective of those in the h-RA model. The f-RA model priorities were
adjusted after validation testing on 3 patients. It was found that the f-RA model did not
appropriately spare the contralateral lung, thus incrementally increasing the priority of this
structure, and re-testing, led to the resultant priorities in Table 2.7. Line-objectives for CL
and TL-PTV were chosen over point-objectives after validation testing on 3 patients: the line-
objectives found to reduce the mean dose of the lungs, while not affecting V5 and V20.
For certain OARs manual dose-volume objectives, stipulated during model creation,
were used for all plans: SC (maximum point dose-objective), SC+3mm (maximum point
dose-objective) and OAR-control (maximum point dose-objective). These manual dose-
volume objectives, and their respective priorities, were derived from averaging the same
values of the plans in the model libraries.
Additionally, for the f-RA and h-RA MBPs, the automatic normal tissue objective
(NTO) provided by RapidPlan which guides the dose fall-off outside the target was used with
a priority of 80 and 50 respectively. For CPOs of f-RA MBPs and MPs, the priorities of the
PTVinLung and PTVoutLung were increased to 200 and 180, respectively.
22
Table 2.7 Optimization objectives and priorities for the evaluation group MPs and MBPs
Structure
f-RA h-RA
MP MBP MP MBP
Objective
(Gy)
Priority Objective
(Gy)
Priority Objective
(Gy)
Priority Objective
(Gy)
Priority
ITV lower 66 110 66 110 66 100 66 110
PTV lower in lung: 70
out of lung: 65
120
120
in lung: 66
out of lung: 65
120
120 65 130 65 130
PTV upper in lung: 74
out of lung: 69
130
130
in lung: 69
out of lung: 68
130
130 67 120 67 120
CL a 130 Line 200 a 100 Line 110
TL-PTV b 130 Line 120 b 100 Line 110
SC 45-46 130-140 43.5 140 44 110-130 43.5 120
SC+3mm 47-48 130-150 45.7 150 46 120-140 45.7 120
OES c 90 Line 90 - - Line 90
OES max 66 120 Generated 150 65 130 Generated 130
OARcontrol 67 130 66 150 66 130 66 130
NTO - - Auto 80 - - Auto 50
a: Two objectives on V5, below the DVH line that was displayed during the optimization.
b: Two objectives on V5 and two on V20 , below the DVH line that was displayed during the optimization.
c: Objectives were placed on V40, V50, V60.
2.5.3 Evaluation of prediction accuracy
RapidPlan provides a prediction range of DVHs for each OAR. In order to evaluate
the accuracy of the predictions the mid-prediction DVH line was created running through the
middle of the prediction range as described by Tol et al.35 and can be seen in Figure 2.5.
Consequently the following parameters were calculated:
- Predicted Dmean: mean dose of the mid-prediction DVH line.
- Upper boundary Dmean: mean dose of the upper boundary of the prediction range.
- Lower boundary Dmean: mean dose of the lower boundary of the prediction range.
- Predicted Dmean range = Upper boundary Dmean - Lower prediction Dmean
Besides the mean dose of the OARs, the following point-volume predictions were extracted
from the mid-prediction DVH and upper and lower boundaries: Predicted V5, Upper
boundary V5, Lower prediction V5 for CL and Predicted V20, Upper boundary V20, Lower
prediction V20 for TL-PTV as well as the corresponding predicted ranges.
23
The prediction accuracy was investigated by comparing the achieved MBP dosimetry
metrics to the predicted aforementioned parameters. The difference between predicted and
achieved dose metrics ΔV5, ΔV20 and ΔDmean was calculated for example as Predicted V5
minus achieved MBP V5. Linear regression analysis using the least square method was
performed between the predicted and achieved dose parameters.
Figure 2.5: The DVH prediction range (shaded region) and the mid-
prediction DVH line running through the middle of it (dashed line).
2.5.4 Using predictions to select treatment technique
DVH-predictions were used to assess the possibility that RapidPlan can be used to
select which modality should be used for a prospective patient. It was examined if RapidPlan
can accurately predict which technique results in lower dose for the following parameters; CL
Dmean , TL-PTV V20 , TL-PTV Dmean and ESO Dmean. These parameters were found to be
mostly associated with high risk of pneumonitis and esophagitis after radiation treatment5,6.
CL V5 has been proven to be significantly lower in the h-RA plans compared to MPs33,34, and
was therefore excluded from this analysis. The number of patients for whom the predicted f-
RA parameter was lower than the predicted h-RA parameter (for ex. f-RA Predicted CL
Dmean < h-RA Predicted CL Dmean ), was compared to the numbers of patients for whom f-
RA MBP parameter was lower than h-RA MBP parameter (f-RA MBP CL Dmean < h-RA
MBP CL Dmean).
24
3. RESULTS
3.1 Evaluation of Model Training
3.1.1 RapidPlan-provided statistical metrics
Table 3.1 shows all the structures trained in the model, and the respective R2 and χ2,
the mean squared error between the original and the estimated data (MSE) and the number of
DVH-PCs and GED-PCs extracted during model training. Regression models for both f-RA
and h-RA showed good correlation between the geometric and dosimetric features of the CL,
TL-PTV, and ESO with R2>0.8. However, the TL-PTV of the h-RA model had a slightly
inferior R2 of 0.65. 𝜒2 values were close to 1 for all OAR structures, showing that residuals
from the regression model are independent43. MSE values for the CL, TL-PTV and ESO were
0.0013, 0.0011 and 0.0063 respectively for the f-RA model, and 0.0010, 0.0016 and 0.0045
for the h-RA model, respectively, showing good estimation capability of both models.
SC, SC+3mm, OARcontrol models were not used for generating optimization-
objectives because training results showed poor quality. However, the results are reported for
analytical purposes. SC and SC+3mm models resulted in very low R2 values. There is also
large variance in the DVH shapes, as indicated by the large number of DVH-PCs needed to
describe the dosimetric variability. The IL model training results are comparable to the CL
although it was not spared in the MPs. Nevertheless, the TL-PTV, which includes the IL, was
spared, and thus the IL DVHs shapes did not vary a lot.
Table 3.1: Summary of training results
Model
structure
f-RA model h-RA model
𝐑𝟐 𝐱𝟐 DVH-
PCs
GED-PCs
MSE 𝐑𝟐 𝐱𝟐 DVH-
PCs GED-PCs
MSE
CL 0.834 1.082 2 2 0.0013 0.807 1.099 2 2 0.0010
TL-PTV 0.848 1.116 3 2 0.0011 0.647 1.068 3 3 0.0016
IL 0.800 1.102 2 3 0.0015 0.706 1.099 2 3 0.0022
ESO 0.841 1.094 3 3 0.0063 0.886 1.102 2 3 0.0045
SC 0.586 1.024 4 4 0.0135 0.710 1.078 3 3 0.0187
SC+3mm 0.636 1.066 3 3 0.0120 0.712 1.059 3 3 0.0146
OARcontrol 0.932 1.056 2 2 0.0015 0.667 1.084 1 1 0.0071
R2: Coefficient of determination for the regression model parameters.
x2: Average chi square for the regression model parameters.
MSE: Mean square error between original and estimated data
25
3.1.2 Outlier analysis
In the f-RA model, 5 plans where indicated as outliers for the CL structure by the
model statistics: 3 plans had high CD value, indicating possible influential data points, while
2 plans had large mZ value, indicating possible geometrical outliers. Three TL-PTV
structures were indicated as outliers, one of them had high CD value and the rest high mZ
value. The same holds for the ESO model. After visual observation of the residual and
regression plots, only one plan from the suggested outliers could be noticed laying more than
2 sd away from the regression line for CL and TL-PTV, while it was close to identity line in
the residual plot. However, the geometry of this patient did not largely differ from the rest of
the patients. In the h-RA model, there was only one geometric outlier for the esophagus
structure. Since RapidPlan statistics did not suggest any outliers for CL and TL-PTV for the
h-RA model, the uncleaned h-RA model was used for the rest of the study.
After removing the suggested outliers from the f-RA model and retraining the model,
the resulting “cleaned model” had 3 new CL outliers and the R2 value was considerably
reduced from 0.834 to 0.580. There were also 3 new outliers for the TL-PTV model structure
and the R2 value was slightly decreased from 0.848 to 0.826.
The “cleaned” and “uncleaned” models were tested on 4 patients not included in the
model libraries. The dosimetric results, normalized to the mean PTV dose, are shown in
Table 3.2. Predicted Dmean range for CL were comparable for both models, while TL-PTV
predictions of the cleaned model were considerably narrower than those of the uncleaned
model. However, the uncleaned model provided lower CL and TL-PTV plan doses than that
of the cleaned model plans, while the PTV coverage was comparable between the two
models. Therefore, the uncleaned model including all the 55 plans was selected to guide the
optimization of the 10 test patients plans.
Table 3.2: Results of f-RA MBPs made using uncleaned model and cleaned model.
Patient
#
CL V5 (Gy) TL–PTV 𝐕𝟓 (Gy) TL–PTV 𝐕𝟐𝟎(Gy) PTV 𝐕𝟗𝟓%(%) Uncleaned
model
Cleaned
model
Uncleaned
model
Cleaned
model
Uncleaned
model
Cleaned
model
Uncleaned
model
Cleaned
model
1 31.9 32.9 38.9 39.5 16.2 17.5 97.3 98.7 2 33.2 44.4 50.0 56.9 26.1 27.9 97.3 97.7 3 60.6 60.5 71.3 71.2 29.4 29.3 92.8 93.0 4 39.0 43.5 60.3 63.2 27.2 27.3 99.2 98.9
Average 41.2 45.3 55.1 57.7 24.7 25.5 96.7 97.1
26
3.1.2 Field Geometry
Figure 3.1 shows the regression and residual plots for the CL and TL-PTV for the f-
RA model. The same plots of the h-RA model are shown in Figure 3.2. In Figure 3.1, the
partial-arc plans in the f-RA model are marked with red circles, while the rest of the plans
used full-arc incorporating an avoidance sector (full-arc plans). In Figure 3.2, for the h-RA
model, the full-arc plans are marked with red squares, while the rest is partial-arc plans.
The library of the f-RA model contains 7 partial-arc plans having typically low GED-
PCS1. However, in regression and residual plots, the partial-arc plans fall within 1 standard
deviation. In the h-RA model, the full-arc plans form a group on the right side of the
regression plot, having high GED-PCS1 value. Further analysis of the relationship between
the GED-PCS1 and the geometrical features showed that during the calculation of the GED
the field geometry is not modeled correctly when the avoidance sector is used (see details in
Appendix A). This problem caused the predictions of CL and TL-PTV to be higher.
However, the problem was partially solved by optimizing the plan twice (see Appendix A).
The h-RA model library contains two plans with 2 oblique fields instead of one like
the rest of the population. The first plan is also an avoidance sector plan and does not deviate
from the avoidance sector group. In the second plan, the direction of the extra oblique field is
at 270º, causing an unusually high exiting dose to the CL, for the relatively small PTV
volume of the patient (209 cm3). Thus, the plan lies above the regression and the residual
lines, meaning that the estimated CL dose is much lower than the actual dose, as expected.
Even though it can be considered as a negative outlier for the model, the CD value is 4.5,
while the threshold for outliers is 10, thus it does not have a significant effect on the
regression line. For the plan that had 3 oblique fields, the fields do not deviate more than 20º
from the AP and PA directions and thus the plan does not deviate from the rest of the training
set.
27
Figure 3.1: Regression and residual plots of the f-RA model. In red circles are the plans with partial-arc
field set-up. (a) Regression plot for CL. (b) Residual plot for CL.
Figure 3.2: Regression and residual plots of the h-RA model. In red squares are the plans with
avoidance sector field set-up. (a) Regression plot for CL. (b) Residual plot for CL.
3.2 Evaluation of Model-Based Plans
Dosimetry of the MPs and MBPs for both f-RA and h-RA method is summarized in
Table 3.3. The results are averaged over the 10-patient evaluation group. The dosimetric
results for individual patients are shown in Figure 3.3 and Figure 3.4.
In f-RA, 5/10 MBPs (#1, 2, 4, 9 and 10) and 6/10 MPs (# 1, 2, 5, 7, 9, 10) satisfied all
the clinical objectives for the OARs and the PTV. For h-RA, also 5/10 MBPs (#2, 4, 7, 9, 10)
were clinically acceptable and 6/10 MPs (# 1, 2, 4, 7, 9, 10). For the rest of them, the PTV
V95% was lower than 97%, while in some, one or more OAR requirements was also not
achieved. These plans were examined individually, and after adjusting some parameters and
repeating the optimization, it was possible to improve the plan quality (see Section 3.4).
The PTV coverage was sufficient in 5/10 MBPs and 9/10 MPs using the f-RA
method, and in 5/10 MBPs and 8/10 MPs using the h-RA method. Quantitatively, both f-RA
and h-RA MBPs resulted in lower PTV V95% on average when compared to the MPs: from
97.50±0.67 to 96.39±1.90 for the f-RA plans, and from 97.25±1.95 to 96.56±1.48 for the h-
28
RA plans. However, the difference was not statistically significant. The conformity index and
PTV V107% were slightly improved, while the PTV HI slightly increased in the MBPs.
F-RA plans were able to achieve the aim of CL V5<40% in 6/10 MBPs and 6/10 MPs.
TL-PTV V20 was clinically acceptable in 9/10 MBPs and 9/10 MPs, while TL-PTV V5 was
lower than 60% in 5/10 MBPs, and 6/10 MPs. In f-RA MBPs, CL V5 and TL–PTV V5 and
V20 increased by 4.1%, 2.5% and 1.1% on average respectively over MPs, with the increase
in TL–PTV V20 being statistically significant. However, 4/10 f-RA MBPs resulted in lower
CL V5 and TL-PTV V5 , while two of them also improved TL-PTV V20 over the respective
MPs.
Regarding the clinical aims in h-RA method, MPs and MBPs performed similarly
achieving CL V5 <40% in 7/10 patients, TL-PTV V20<35% in 9/10 patients and TL-PTV
V5<60% in 8/10 cases. In actual numbers, CL V5 and TL–PTV V5 were slightly increased -by
0.9% and 0.6% respectively- in MBPs compared to MPs, while TL–PTV V20 remained
almost the same.
MBPs improved the CL Dmean in the f-RA technique, as well as the CL Dmean
(p<0.05) and TL-PTV Dmean (p<0.05) of the h-RA technique, possibly due to the use of a
line-objective throughout the whole dose range instead of separate point objectives.
Furthermore, MBPs have a considerably lower ESO Dmax and Dmean, and SC, SC+3mm, and
OAR-control Dmax for both techniques. The MBP objectives for spinal cord and spinal cord +
3mm were stricter compared to the objectives used for the MPs and resulted in lower Dmax
for these structures. The lower ESO Dmean is attributed to the use of line objectives over the
entire dose range instead of only a few point objectives in large doses. The improved
conformity index is due to the use of the automatic NTO during the optimization of MPBs.
29
Table 3.3: Average values of dosimetric data for 10 evaluation patients
f-RA h-RA
MP MBP MP MBP
PTV
V95%(%) 97.5 ± 0.7 96.4 ± 1.9 97.3 ± 2.0 96.6 ± 1.5
V107%(%) 3.1 ± 1.5 2.8 ± 1.9 1.6 ± 2.4 0.8 ± 1.1
Dmax(Gy) 74.5 ± 0.6 74.7 ± 1.5 72.6 ± 1.6 72.0 ± 1.1
HI 0.13 ± 0.01 0.13 ± 0.02 0.11 ± 0.03 0.12 ± 0.02
CI 0.74 ± 0.06 0.78 ± 0.07ᵃ 0.50 ± 0.08 0.54 ± 0.11
CL
V5(%) 40.2 ± 13.2 44.3 ± 12.6 25.6 ± 15.0 26.5 ± 16.5
Dmean(Gy) 7.1 ± 2.4 6.8 ± 1.9 5.9 ± 2.6 5.7 ± 2.7ᵃ
TL – PTV
V5(%) 55.6 ± 11.9 58.1 ± 12.1 47.5 ± 11.6 48.1 ± 12.4
V20(%) 26.2 ± 6.3 27.3 ± 6.2ᵃ 26.3 ± 6.3 26.2 ± 6.3
Dmean(Gy) 15.6 ± 3.3 15.8 ± 3.0 16.6 ± 3.7 16.6 ± 3.8ᵃ
SC
Dmax(Gy) 49.2 ± 1.4 49.0 ± 1.7 48.4 ± 2.5 47.8 ± 2.4
SC + 3mm
Dmax(Gy) 55.6 ± 2.8 53.9 ± 3.3ᵃ 53.6 ± 3.9 53.1 ± 3.7
ESO
Dmax(Gy) 69.9 ± 0.8 68.6 ± 2.8 68.7 ± 1.3 67.8 ± 1.8
Dmean(Gy) 33.2 ± 10.4 31.8 ± 10.0 34.3 ± 11.8 33.1 ± 11.5ᵃ
OAR -control
Dmax(Gy) 74.2 ± 1.3 72.8 ± 1.1ᵃ 72.4 ± 2.6 71.6 ± 1.6
ᵃ Indicates a statistically significant difference (p<0.05) between MBPs and MPs
30
Figure 3.3: Dosimetric results of f-RA and h-RA MPs and MBPs per patient for PTV and
CL.
31
Figure 3.4: Dosimetric results of f-RA and h-RA MPs and MBPs per patient for TL-PTV,
ESO and SC.
32
3.3 Evaluation of prediction accuracy
The dose metrics predicted by the f-RA and h-RA models plotted against the achieved
MBP dose metrics for multiple organs-at-risk is shown in Figure 3.5, along with the linear fit
through all data points and the corresponding coefficient of determination R2 and the
standard error of the estimate σ. The difference between predicted and achieved dose
metrics ΔV5, ΔV20 and ΔDmean for each patient is shown in Table 3.4 and Table 3.5 for the f-
RA and the h-RA plans respectively. Furthermore, the achieved MBP metrics were visualized
in relation to the prediction ranges in Figure 3.6: the predicted Dmean, the upper boundary
Dmean and the lower boundary Dmean along with the achieved MBP Dmean for CL, TL-PTV,
and esophagus. The volume predictions for CL V5 and TL-PTV V20 with the corresponding
V5 and V20 of the upper and lower boundaries of the prediction range, and the achieved V5
and V20 are shown in Figure 3.7. The DVH plots along with the prediction ranges can be
found in the Appendix B.
3.3.1 Contralateral Lung
For f-RA plans, predicted and achieved CL Dmean shows good correlation with R2
value of 0.89 and σ of 0.70 Gy, while for the h-RA the correlation is slightly poorer with R2
of 0.84 and σ of 1.2 Gy. f-RA model was able to predict the mean dose with ΔDmean of less
than 1 Gy for all the cases (Table 3.4) and also achieved Dmean was between the predicted
range (Figure 3.6). The h-RA model was accurate in predicting the mean CL dose with
ΔDmean less than 1 Gy only for 6 of the patients (Table 3.5), while two of them were also out
of the predicted range (Figure 3.6). Although the predicted Dmean range of h-RA model
(2.4±1.2 Gy) is on average relatively narrow compared to that for the f-RA model (3.6±0.5
Gy), the predictions are less accurate.
When observing the V5 predictions, the f-RA model predicts generally lower V5 than
what is achieved by the MBPs and the standard error is 5.5 %. The predicted CL V5 is on
average 6.1% lower than the achieved (Table 3.4). On the other hand, h-RA model predicts
either higher or lower CL V5, yet with a higher standard error of 8%. The predicted V5 range
depicted in Figure 3.7 is on average 14.7±3.3% for the f-RA model and 11.9±2.1% for the h-
RA model. Nevertheless, 7/10 f-RA plans and 6/10 h-RA plans fall within the predicted
range.
33
Figure 3.5: The correlation between predicted and achieved DVH metrics of h-RA and f-RA
MBPs for multiple organs-at-risk. The solid lines represent linear regression fits (with R2 the
coefficient of determination), while the dashed lines represent one standard error of the
regression σ. The dashed black line represents the identity line (y=x).
34
Table 3.4: Difference between predicted and achieved MBP dose metrics for the f-RA model
Patient # Contralateral lung Total lung-PTV Esophagus
𝚫𝐕𝟓(%) 𝚫𝐃𝐦𝐞𝐚𝐧(𝐆𝐲) 𝚫𝐕𝟐𝟎(%) 𝚫𝐃𝐦𝐞𝐚𝐧(𝐆𝐲) 𝚫𝐃𝐦𝐞𝐚𝐧(𝐆𝐲)
1 -5.6 -0.1 8.2 2.8 0.3
2 -3.6 0.3 5.3 1.7 1.1
3 -8.2 0.5 1.8 0.4 0.9
4 1.2 0.7 1.5 1.4 -1.4
5 -11.2 -0.7 -2.7 -1.9 0.6
6 -15.4 -0.9 -0.2 -0.6 -1.1
7 -1.1 0.9 1.0 1.1 3.2
8 -14.5 -0.7 -0.6 -1.3 -1.4
9 -0.2 0.7 0.4 -0.4 4.1
10 -2.5 0.7 3.5 1.0 -1.0
Average -6.1±6.0 0.1±0.7 1.8±3.1 0.4±1.5 0.5±1.9
Negative values mean that the predicted value is lower than the achieved MBP value.
Table 3.5: Difference between predicted and achieved MBP dose metrics for the h-RA model
Patient # Contralateral lung Total lung-PTV Esophagus
𝚫𝐕𝟓(%) 𝚫𝐃𝐦𝐞𝐚𝐧(𝐆𝐲) 𝚫𝐕𝟐𝟎(%) 𝚫𝐃𝐦𝐞𝐚𝐧(𝐆𝐲) 𝚫𝐃𝐦𝐞𝐚𝐧(𝐆𝐲)
1 1.2 -0.46 2.9 0.9 -0.6
2 8.5 0.75 -0.8 -1.5 -2.5
3 -16.89 -2.07 0.6 0.1 -4
4 1.2 -0.8 5.6 2.4 -3.1
5 -3.7 -1.5 -1.2 -1.6 -0.5
6 -1.8 -0.26 -0.5 -0.8 2
7 8.4 1.16 -0.1 1.1 -7.1
8 -0.1 1.9 -3.6 -3.5 0
9 8 0.75 1.3 0.3 4.9
10 5.9 0.5 0.7 -0.6 -2.6
Average 1.1±7.7 0.0±1.2 0.5±2.5 -0.3±1.7 -1.4±3.3
Negative values mean that the predicted value is lower than the achieved MBP value.
35
Figure 3.6: Predicted and achieved Dmean for CL, TL-PTV, and ESO for f-RA and h-RA
techniques. Error bars represent the upper boundary Dmean and the lower boundary Dmean.
36
Figure 3.7: Predicted and achieved MBP CL V5, and TL-PTV V20 for both f-RA and h-RA
techniques. The error bars represent V5 and V20 of the upper boundary the lower boundary of
the prediction range.
3.3.2 Total Lung – PTV
The two models showed comparable quality of linear correlation between predicted
and achieved TL-PTV mean dose (Figure 3.5). The slope is higher than 1 for both methods,
while R2 and σ values are 0.80 and 1.4 Gy for the f-RA model respectively, and 0.85 and 1.5
37
Gy for the h-RA model. For the f-RA model, the prediction range was on average 3.1±0.4 Gy
and the achieved Dmean was within the predicted range in 7/10 cases. For the h-RA model,
the predicted range was slightly wider; 3.6±1.8 Gy on average, and it was representative for
8/10 patients.
Predicted and achieved V20 showed poor correlation for f-RA plans with R2 value of
0.77 and σ of 3.1 %. Although achieved MBP V20 was within the prediction range for 7/10
patients, achieved V20 was lower than the mid prediction most of the cases. The h-RA plans
showed stronger correlation and smaller standard error with R2 and σ values of 0.90 and
2.2% respectively, and achieved MBP V20 was within the prediction range for 7/10 patients.
The predicted V20 range was the same on average for both techniques; 5.0±1.2% for f-RA
and 5.0±0.9% for h-RA.
3.3.3 Esophagus
The linear regression analysis revealed a strong correlation between predicted and
achieved ESO Dmean , for both techniques, with R2 values of 0.97 and 0.93 for f-RA and h-
RA respectively, while the slope was close to 1. However, σ was 1.8Gy for the f-RA plans
and 3.2Gy for the h-RA. The average predicted Dmean range was 6.3±3.1 Gy for the f-RA
model and 5.0±1.7 for the h-RA model. Achieved esophagus Dmean was within the predicted
range in 9/10 f-RA plans, while only in 5/10 h-RA plans. Moreover, the h-RA model
predicted lower Dmean than the achieved for 7/10 patients.
38
3.4 Individualized analysis
PTV
PTV homogeneity and V95% of the MBPs were inferior compared to the MPs. In
particular, HI was increased in 8/10 of f-RA MBPs and in 6/10 h-RA MBPs. A more detailed
analysis revealed the reasons for that. The line objectives used for the CL and TL-PTV in the
MBPs, especially the objectives on doses higher than 66 Gy compete with the lower target
objective resulting in lower target coverage. An example of this can be seen in Figure 3.8:
when line objective is used for the lungs (a), the 95% isodose line covers less part of the PTV
at the side which is adjacent to the lung. When point objectives -generated by the model at
3Gy and 4Gy for CL and at 2Gy, 3Gy, 16Gy, 17Gy for TL-PTV -with the same priorities as
the line-objectives in the original MBPs- are used (b), PTV coverage is improved, while CL
V5 and TL-PTV V20 is not affected. The CL and TL-PTV line objective was found to affect
the PTV homogeneity also in h-RA plans, an example of which can be seen in Figure 3.9:
when using point objectives for the lungs the V95% was increased from 96.1% to 97.4%. The
effect of line objective for ESO was also examined and shown in Figure 3.9 for comparison.
If the line objective for ESO is not used, the PTV V95% is improved to 97.1%, however the
ESO V60Gy increases substantially.
On a patient-specific level, the lower PTV coverage is attributed to the sparing of the
esophagus (patients 3 and 8) and spinal cord (patient 3). After re-optimizing these plans using
lower priority for ESO, the PTV coverage considerably increased. Especially, for patient 3,
the spinal cord is adjacent to the PTV. In the MP, the SC+3mm dose exceeds the clinical
criterion of 54Gy while in the MBPs it is lower. In such case, the clinician can choose to
allow higher dose to the spinal cord for a more homogenous PTV dose, or the opposite.
39
Figure 3.8: f-RA MBPs of the same patient. (a) Plan optimized using line objective for CL
and TL-PTV. (b) Plan optimized using point objectives for CL and TL-PTV. Red line
represents the PTV outline. Green, pink and orange lines represent the 62,7Gy (95% of
prescribed dose), 20Gy and 5Gy isodose line respectively.
Figure 3.9: h-RA MBP DVHs of the same patient using: 1) Line objectives for CL, TL-PTV,
and ESO. 2) Point objectives for CL and TL-PTV, and line objective for ESO. 3) Line
objectives for CL and TL-PTV and only maximum objective for ESO.
Contralateral Lung
To understand the observed difference between MPs and MBPs, we have to also look
at the predictions, and consequently the model-generated objectives, in relation to the MPs.
In the f-RA MBPs of three patients (3, 5, 7) the CL V5 increased by more than 5%
compared to the MPs. This is attributed to the fact that, in the MPs the objectives for CL V5
40
are placed at very low volume in order to achieve the lowest possible V5. On the other hand,
in the MBPs, the objective line is placed automatically below the lower boundary of the
prediction range, and thus the objectives around 5Gy are placed always higher compared to
the manual objectives (Figure 3.10a). In particular, the manual objectives in the 10 evaluation
patients were set to dose 2-4 Gy and volume 7.9 – 23.5 % while the generated objective line
for the same dose was placed at volume 21.1% - 64.7%. To compensate for that, a priority of
200 was used for the objective-line in all f-RA MBPs, while the priority in MPs was 130.
Nevertheless, it was not possible to achieve the same V5 for the majority of the plans.
Regarding the f-RA MBP of patient 9, although V5 is clinically acceptable (33.7%),
there is a large deviation from the MP (20.6%). This happened because the model-generated
objective line was placed above the manual DVH-line making it impossible to achieve dose
as low as in the MP (Figure 3.10b).
Figure 3.10: CL DVHs of the f-RA MPs and MBPs along with the respective objectives.
(a)Patient 5: manual objectives are placed considerably lower than the model-generated
objective line. (b) Patient 9: the model-generated line objective is placed higher than the CL
DVH-line of the MP.
An alternative approach (MBP-alt) was investigated for the cases that the f-RA MBP
had increased CL V5. Instead of line-objective, point-objectives were generated per 1 Gy,
from 2Gy to 7Gy for the CL, and between 2-7 Gy and 16-22Gy for the TL-PTV as seen in
41
Figure 3.11a. Furthermore, the priority of CL and TL-PTV was increased to 300 and 150
respectively. The results for patients 3, 5 and 7 are shown in Table 3.6 along with the results
of the MPs and the original MPBs. The use of point objectives with higher priority for the
lungs improved the CL V5 and TL-PTV V5 compared to the original MBPs, while for patient
5 CL V5 and TL-PTV V5 were even lower than the MP. However, TL-PTV V20 was not
improved because the TL-PTV predictions were too high as explained below.
Figure 3.11: (a) f-RA generated point objectives for CL in the dose region 2-7 Gy and for
TL-PTV in dose 2-7Gy and 16-22Gy. Generated line objective for ESO. (b) f-RA plans of
patient 5.
42
Table 3.6: Results of f-RA plans using the alternative method for patient 3,5 and 7, and
comparison with MPs and MBPs. Patient 3 Patient 5 Patient 7
MP MBP MBP-alt MP MBP MBP-alt MP MBP MBP-alt
PTV
V95%(%) 97.4 92.8 96.3 97.5 96.3 97.1 97.1 95.5 95.6
CL
V5(%) 44.8 60.6 50.2 39.9 47.9 31.8 31.9 37.4 32.5
Dmean(Gy) 7.6 8.1 8.6 6.5 6.1 6.1 6.6 6.4 7.2
TL – PTV
V5(%) 59.8 71.3 63.8 63.0 66.7 58.3 49.8 54.0 51.4
V20(%) 28.4 29.4 31.1 29.5 30.8 30.2 24.2 27.6 28.4
Dmean(Gy) 17.1 17.7 18.0 17.4 17.7 17.5 13.7 14.8 15.1
When observing the h-RA predictions for CL, the DVH that deviates most from the
predicted DVH-range is that of patient 3 (Appendix B). That plan has an unusual field set up
of the conventional fields. The oblique field is set to 140 degrees, while the field direction in
the model library is between 150 and 160 degrees for left-sided tumors (Table 2.5).
Therefore, a larger part of the lung is located in the beam direction, resulting in an unusually
high exiting dose for the CL that the model is not able to predict. This part of the lung is
already in the in-field region of the RapidArc component, thus the in-field region is the same
whether the conventional fields are used or not. Furthermore, the model takes into account
the final field set-up including the RapidArc component and only takes into account the
direction of the fields, not how many beams are there. Consequently, the conventional fields
do not play any role in the estimation algorithm. After excluding this plan, the correlation
coefficient between predicted and achieved Dmean increases to 0.90 and σ drops to 0.9 and
are now comparable to the figures for the f-RA model. The same happens for the V5
predictions; R2increaces to 0.90 and σ is 5.0%, while also the slope decreases to 1.05.
The MBP CL mean dose of the h-RA plan of patient 8 deviated from the predicted by
1.9Gy. For the RapidArc component of this plan, a full-arc with avoidance sector was used
containing a 250° irradiation arc. That caused the GED PC1 to be higher as explained before.
Full-arc was used also for patient 6. However, an additional oblique field was used on 160°,
even though the tumor is located in the right lung, causing direct irradiation to the CL. The
extra dose delivered by this field, compensated for the higher GED PC1.
43
TL-PTV
The dose of TL-PTV in the f-RA MBPs was generally increased in comparison to the
MPs. When looking at the DVH predictions (Appendix B) the objective line is placed higher
than the MP DVH-line in 7/10 cases (patients 1,2,3,4,7,9,10) in the low dose region, showing
that the model underestimated the amount of sparing that could be achieved.
A possible cause for the high predictions, despite that model statistics indicate that the
model training has been successful, is the presence of an outlier in the model that it may
overfit the data. There is a plan with high CD value (CD=26.8, while the threshold is 10), that
is away from the regression line but close to identity line in the residual plot. To further
investigate that, the cleaned model, from which the suggested outliers were removed (Section
3.1.2), was used to generate predictions for the TL-PTV. The predictions were again
consistently higher than the MPs. More specifically, in 8/10 patients the objective line was
placed above the manual TL-PTV DVH-line.
The largest deviation between the predicted and achieved TL-PTV DVH was
observed for patients 1 and 2. The TL-PTV structure of patient 2 was indicated as an outlier
by the f-RA model because of large volume (Table 2.1 and Table 2.6). The TL-PTV structure
of patient 1, although not indicated as an outlier by the DVH estimation algorithm, has
volume higher than mean+1sd of the model library structures. Consequently, it is possible
that the model dose not perform well for large structure volumes.
It should be noted that RapidPlan was able to pull the DVH-line much lower than the
optimization objective line for patient 1, 2 and 10 as can be seen in Appendix B. This shows
that the TL-PTV sparing during the optimization process was mainly driven by the CL lung
objective, as the CL volume accounts for around 60% of the TL-PTV volume. To confirm
that, the plan of patient 1 was optimized again, without the use of dose-volume objectives for
the TL-PTV. The resultant plan had only minor differences in mean doses: CL Dmean was
0.09 Gy lower, TL-PTV Dmean was 0.15Gy higher and IL Dmean increased by 0.55Gy, while
TL-PTV V5 and V20 remained the same.
3.5 Using predictions to select treatment technique
Achieved MBP f-RA dose parameters plotted against the achieved h-RA dose
parameter for CL, TL-PTV and ESO are shown in Figure 3.12. The cases where the
44
predicted f-RA parameter was lower than the predicted h-RA parameter (for ex. f-RA
Predicted CL Dmean < h-RA Predicted CL Dmean ) are marked in red.
Data points on the left side of the identity line show that the f-RA MBP gave lower
dose than the h-RA MBP, while data points highlighted with red indicate that the f-RA
method would result in lower dose based on the predictions. There are 8/10 correct
predictions for CL Dmean, and 7/10 for TL-PTV Dmean, 9/10 for TL-PTV V20, and 8/10 for
ESO Dmean.
Based on the MBPs results, the h-RA technique is almost always better for CL Dmean
and TL-PTV V20, and f-RA method is better for TL-PTV Dmean and ESO Dmean. However,
considering that the f-RA TL-PTV V20 predictions are consistenlty high as described above,
we need to take a closer look at the TL-PTV V20 resaults. When examining the results of the
MPs in Figure 3.13, f-RA MPs resaulted in lower TL-PTV V20 in 5/10 patients. In that case,
RapidPlan gave correct prediction for 7/10 cases.
45
Figure 3.12: MBP f-RA dose parameters plotted against the achieved h-RA dose parameter
for CL, TL-PTV and ESO. Marked in red are the cases where the predicted f-RA parameter
was lower than the predicted h-RA parameter. Solid line represents the identity line.
46
Figure 3.13: Manual f-RA TL-PTV V20 plotted against the manual h-RA TL-PTV V20.
Marked in red are the cases where the predicted f-RA TL-PTV V20 was lower than the
predicted h-RA TL-PTV V20.
47
4. DISCUSSION AND CONCLUSION
The purpose of this thesis was to evaluate the performance of the RapidPlan
knowledge-based planning solution in providing f-RA and h-RA plans for large volume lung
cancer patients. Additionally, it was examined whether RapidPlan predictions can serve as a
tool for selection of the optimal treatment technique for a new patient.
The results showed that RapidPlan can generate MBPs of comparable quality to the
MPs made by experienced treatment planners. The use of line-objectives throughout the
whole dose range improved CL and TL-PTV Dmean, while in some cases MBPs also reduced
the CL V5 and TL-PTV V20 for specific patients. The most notable difference between MBPs
and MPs for both techniques was the decreased PTV V95%. The use of line-objective for CL
and TL-PTV negatively affected the PTV coverage. On top of that, the use of somewhat
higher priorities for ESO and SC also prevented the dose to be equally distributed in the PTV,
reducing its homogeneity. Another possible reason for the lower homogeneity it might be the
use of NTO in the MBPs, which on the other hand led to increased conformity.
Another remarkable result is the increased CL V5 in the f-RA MBPs, caused by the
fact that manual optimization objectives are placed at very low dose in order to pull down the
DVH as much as possible. However, with a quite accurate model, the predictions will never
be low enough to generate objectives in a similar position to the manual ones. Since
increasing the priority of the whole objective-line would affect the PTV coverage, the only
solution seems to be the use of generated point-objectives with increased priorities, at the low
dose region. This could also solve the issue with low PTV V95%. This approach was tested in
three patients and showed a reduction in CL V5. RapidPlan does have a feature to
automatically generate the priorities. Ideally, the suitable priorities that ensure sufficient
OAR sparing while maintaining good homogeneity of the PTV should be generated by
RapidPlan itself. However, generated priorities still do not seem to work in this way.
Fogliata et al.26 have shown that RapidPlan VMAT plans improved IL Dmean, CL
Dmean, and CL V20. The last is contrary to the findings of the present study. However,
Fogliata et al. have used line-objective on IL instead of TL-PTV, while no objectives were
used around the CL V5 in the manual plans. On the other hand, at the VUmc reducing the CL
V5 as low as possible is of high clinical importance, and manual plans have very low V5.
Therefore is difficult to further reduce V5 in the MBPs. Furthermore, in Fogliata et al. paper,
48
RapidPlan improved PTV homogeneity. However, their study involved smaller PTV
volumes.
A limitation of this study was the problem in the GED calculation algorithm which is
incorrectly modeling the avoidance sector, and this could be a contributing reason to
inaccuracies and poor results. Because of this, the optimization of f-RA plans had to be done
twice, consuming more time as explained in the Appendix A. However, the problem is not
completely solved as it also exists in the model training algorithm. The avoidance sector
plans are considered as having beamlets in the avoidance sector directions, and therefore
some radiation is expected from these directions, while the partial-arc plans are correctly
modeled. This leads to inconsistencies in a model containing both avoidance sector plans and
partial-arc plans. The problem was communicated to a Varian representative who stated that
the bug will be corrected in the next version of RapidPlan.
RapidPlan was able to accurately predict the mean dose of CL, but it consistently
underestimated the amount of sparing that could be achieved for TL-PTV. After excluding
the possibility that this is caused by any outliers in the model, the reasoning may be found in
the modeling of TL-PTV structure. TL-PTV structure has a large variation in dose
distribution because CL is spared as much as possible whereas IL receives much more dose.
RapidPlan solves the problem of varying dose within a structure by using volume partitioning
and constructing distinct models for the parts of the organ that is out-of-field or overlapping
the target. However, in case there is large dose variation within one model structure, it is
possible that this is more difficult to present with the PC’s. This hypothesis is further
supported by the observation that predictions for the IL, although not used for the
optimization, are of good quality (Figure B5 in Appendix B). If this is the cause of the
problem, a possible solution could be to utilize the predictions of the individual lungs to be
sync to the total lung predictions. However, there is no such mechanism in RapidPlan.
Furthermore, it is possible that the problem with not modeling correctly the avoidance
sector causes the inaccuracy in TL-PTV predictions. The GED-PCS1 seems to have a more
important role in the TL-PTV predictions since its partial coefficient of determination is
0.837 for the TL-PTV, while for the CL and IL is considerably lower: 0.737 and 0.663
respectively.
The h-RA MBPs, apart from the decreased PTV V95%, were of very good quality, and
resembled well the MPs since 90% of the dose is delivered by the conventional fields, which
were the same as in the MPs. RapidPlan is not designed for hybrid techniques and thus does
not take into account the conventional component of the field. However, because the model
49
consisted mostly of fields having a similar arrangement, the model incorporated correctly the
dosimetry of the training set, and thus generated quite accurate predictions for CL and TL-
PTV Dmean. On the contrary, the model was not able to predict correct dose in plans with
different field set-up than the standard. Nevertheless, the h-RA predictions for ESO were
lower than the achieved MBPs in 7/10 patients. That is because the esophagus is a small
structure, compared to the PTV, and the MLC movement and dose modulation in VMAT, can
sufficiently reduce its dose.
RapidPlan can better predict CL and TL-PTV mean doses, instead of dose-specific
volumes such as V5 and V20. When it comes to comparison between f-RA and h-RA, based
on the simple method used in this study, by only comparing single OAR dose volumes,
RapidPlan can accurately predict which technique gives the lower dose in 7-9 /10 cases.
To conclude, this study demonstrated that RapidPlan is capable of generating
clinically acceptable f-RA and h-RA plans for lung cancer patients. However, the plans could
be improved after a more wise selection of priorities and using generated point-objectives
instead line objectives. Furthermore, the bug in the algorithm which is not incorporating the
avoidance sector should be corrected for more accurate results. This study could serve a
starting point for further validation of the use of RapidPlan for large volume lung cancer
patients with the aim to be implemented in the clinical practice.
50
References
1. Ferlay J, Soerjomataram I, Dikshit R, et al. Cancer incidence and mortality worldwide: Sources, methods and major patterns in GLOBOCAN 2012. doi:10.1002/ijc.29210.
2. Eberhardt WEE, De Ruysscher D, Weder W, et al. 2nd ESMO Consensus Conference in Lung Cancer: locally advanced stage III non-small-cell lung cancer. Ann Oncol. 2015;26(8):1573-1588. doi:10.1093/annonc/mdv187.
3. Wijsman R, Dankers F, Troost EGC, et al. Comparison of toxicity and outcome in advanced stage non-small cell lung cancer patients treated with intensity-modulated (chemo-)radiotherapy using IMRT or VMAT. Radiother Oncol. 2017;122(2):295-299. doi:10.1016/j.radonc.2016.11.015.
4. Palma DA, Senan S, Tsujino K, et al. Predicting Radiation Pneumonitis After Chemoradiation Therapy for Lung Cancer: An International Individual Patient Data Meta-analysis. Int J Radiat Oncol. 2013;85(2):444-450. doi:10.1016/j.ijrobp.2012.04.043.
5. Khalil AA, Hoffmann L, Moeller DS, Farr KP, Knap MM. New dose constraint reduces radiation-induced fatal pneumonitis in locally advanced non-small cell lung cancer patients treated with intensity-modulated radiotherapy. Acta Oncol (Madr). 2015;54(9):1343-1349. doi:10.3109/0284186X.2015.1061216.
6. Pinnix CC, Smith GL, Milgrom S, et al. Predictors of radiation pneumonitis in patients receiving intensity modulated radiation therapy for Hodgkin and non-hodgkin lymphoma. Int J Radiat Oncol Biol Phys. 2015. doi:10.1016/j.ijrobp.2015.02.010.
7. Palma DA, Senan S, Oberije C, et al. Predicting Esophagitis After Chemoradiation Therapy for Non-Small Cell Lung Cancer: An Individual Patient Data Meta-Analysis. Int J Radiat Oncol. 2013;87(4):690-696. doi:10.1016/j.ijrobp.2013.07.029.
8. Mittal BB, Purdy JA, Ang KK. Advances in Radiation Therapy. 1st ed. (Mittal BB, Purdy JA, Ang KK, eds.). Springer US; 1998. doi:10.1007/978-1-4615-5769-2.
9. Bentzen SM. Radiation therapy: intensity modulated, image guided, biologically optimized and evidence based. Radiother Oncol. 2005;77(3):227-230. doi:10.1016/j.radonc.2005.11.001.
10. Murshed H, Liu HH, Liao Z, et al. Dose and volume reduction for normal lung using intensity-modulated radiotherapy for advanced-stage non–small-cell lung cancer. Int J Radiat Oncol. 2004;58(4):1258-1267. doi:10.1016/j.ijrobp.2003.09.086.
11. Liu HH, Wang X, Dong L, et al. Feasibility of sparing lung and other thoracic structures with intensity-modulated radiotherapy for non-small-cell lung cancer. Int J Radiat Oncol Biol Phys. 2004;58(4):1268-1279. doi:10.1016/j.ijrobp.2003.09.085.
12. Otto K. Volumetric modulated arc therapy: IMRT in a single gantry arc. Med Phys. 2007;35(1):310-317. doi:10.1118/1.2818738.
13. Li Y, Wang J, Tan L, et al. Dosimetric comparison between IMRT and VMAT in irradiation for peripheral and central lung cancer. Oncol Lett. 2018;15(3):3735-3745. doi:10.3892/ol.2018.7732.
14. Webb S. Optimisation of conformal radiotherapy dose distribution by simulated annealing. Phys Med Biol. 1989;34(10):1349-1370. doi:10.1088/0031-9155/34/10/002.
51
15. Yuan L, Ge Y, Lee WR, Yin FF, Kirkpatrick JP, Wu QJ. Quantitative analysis of the factors which affect the interpatient organ-at-risk dose sparing variation in IMRT plans. Med Phys. 2012;39(11):6868. doi:10.1118/1.4757927.
16. Das IJ, Cheng C-W, Chopra KL, Mitra RK, Srivastava SP, Glatstein E. Intensity-Modulated Radiation Therapy Dose Prescription, Recording, and Delivery: Patterns of Variability Among Institutions and Treatment Planning Systems. JNCI J Natl Cancer Inst. 2008;100(5):300-307. doi:10.1093/jnci/djn020.
17. Everitt S, Kron T, Fimmell N, et al. Interplanner variability in carrying out three-dimensional conformal radiation therapy for non-small-cell lung cancer. J Med Imaging Radiat Oncol. 2008;52(3):293-296. doi:10.1111/j.1440-1673.2008.01957.x.
18. Nelms BE, Robinson G, Markham J, et al. Variation in external beam treatment plan quality: An inter-institutional study of planners and planning systems. Pract Radiat Oncol. 2012;2(4):296-305. doi:10.1016/j.prro.2011.11.012.
19. Appenzoller LM, Michalski JM, Thorstad WL, Mutic S, Moore KL. Predicting dose-volume histograms for organs-at-risk in IMRT planning. Med Phys. 2012;39(12):7446-7461. doi:10.1118/1.4761864.
20. Fogliata A, Wang P-M, Belosi F, et al. Assessment of a model based optimization engine for volumetric modulated arc therapy for patients with advanced hepatocellular cancer. Radiat Oncol. 2014;9(1):236. doi:10.1186/s13014-014-0236-0.
21. Tol JP, Delaney AR, Dahele M, Slotman BJ, Verbakel WFAR. Evaluation of a knowledge-based planning solution for head and neck cancer. Int J Radiat Oncol Biol Phys. 2015;91(3):612-620. doi:10.1016/j.ijrobp.2014.11.014.
22. Delaney AR, Dahele M, Tol JP, Slotman BJ, Verbakel WFAR. Knowledge-based planning for stereotactic radiotherapy of peripheral early-stage lung cancer. Acta Oncol. 2017;56(3):490-495. doi:10.1080/0284186X.2016.1273544.
23. Chanyavanich V, Das SK, Lee WR, Lo JY. Knowledge-based IMRT treatment planning for prostate cancer. Med Phys. 2011;38(5):2515-2522. doi:10.1118/1.3574874.
24. Good D, Lo J, Lee WR, Wu QJ, Yin F-F, Das SK. A Knowledge-Based Approach to Improving and Homogenizing Intensity Modulated Radiation Therapy Planning Quality Among Treatment Centers: An Example Application to Prostate Cancer Planning. Int J Radiat Oncol. 2013;87(1):176-181. doi:10.1016/j.ijrobp.2013.03.015.
25. Moore KL, Brame RS, Low DA, Mutic S. Experience-Based Quality Control of Clinical Intensity-Modulated Radiotherapy Planning. Int J Radiat Oncol. 2011;81(2):545-551. doi:10.1016/j.ijrobp.2010.11.030.
26. Fogliata A, Belosi F, Clivio A, et al. On the pre-clinical validation of a commercial model-based optimisation engine: Application to volumetric modulated arc therapy for patients with lung or prostate cancer. Radiother Oncol. 2014;113(3):385-391. doi:10.1016/j.radonc.2014.11.009.
27. Hussein M, South CP, Barry MA, et al. Clinical validation and benchmarking of knowledge-based IMRT and VMAT treatment planning in pelvic anatomy. Radiother Oncol. 2016;120(3):473-479. doi:10.1016/j.radonc.2016.06.022.
28. Nwankwo O, Mekdash H, Sihono DSK, Wenz F, Glatting G. Knowledge-based radiation therapy (KBRT) treatment planning versus planning by experts: validation of a KBRT algorithm
52
for prostate cancer treatment planning. Radiat Oncol. 2015;10(1):111. doi:10.1186/s13014-015-0416-6.
29. Chin Snyder K, Kim J, Reding A, et al. Development and evaluation of a clinical model for lung cancer patients using stereotactic body radiotherapy (SBRT) within a knowledge-based algorithm for treatment planning. J Appl Clin Med Phys. 2016;17(6):263-275. doi:10.1120/jacmp.v17i6.6429.
30. Fogliata A, Nicolini G, Clivio A, et al. A broad scope knowledge based model for optimization of VMAT in esophageal cancer: validation and assessment of plan quality among different treatment centers. Radiat Oncol. 2015;10:220. doi:10.1186/s13014-015-0530-5.
31. Fogliata A, Nicolini G, Bourgier C, et al. Performance of a Knowledge-Based Model for Optimization of Volumetric Modulated Arc Therapy Plans for Single and Bilateral Breast Irradiation. Gilhuijs KGA, ed. PLoS One. 2015;10(12):e0145137. doi:10.1371/journal.pone.0145137.
32. Verbakel WFAR, Van Reij E, Ladenius-Lischer I, Cuijpers JP, Slotman BJ, Senan S. Clinical application of a novel hybrid intensity-modulated radiotherapy technique for stage III lung cancer and dosimetric comparison with four other techniques. Int J Radiat Oncol Biol Phys. 2012;83(2):e297-e303. doi:10.1016/j.ijrobp.2011.12.059.
33. Chan OSH, Lee MCH, Hung AWM, Chang ATY, Yeung RMW, Lee AWM. The superiority of hybrid-volumetric arc therapy (VMAT) technique over double arcs VMAT and 3D-conformal technique in the treatment of locally advanced non-small cell lung cancer - A planning study. Radiother Oncol. 2011;101(2):298-302. doi:10.1016/j.radonc.2011.08.015.
34. Silva SR, Surucu M, Steber J, Harkenrider MM, Choi M. Clinical Application of a Hybrid RapidArc Radiotherapy Technique for Locally Advanced Lung Cancer. Technol Cancer Res Treat. 2016. doi:10.1177/1533034616670273.
35. Tol JP, Dahele M, Delaney AR, Slotman BJ, Verbakel WFAR. Can knowledge-based DVH predictions be used for automated, individualized quality assurance of radiotherapy treatment plans? Radiat Oncol. 2015;10(1):234. doi:10.1186/s13014-015-0542-1.
36. Delaney AR, Dahele M, Tol JP, Kuijper IT, Slotman BJ, Verbakel WFAR. Using a knowledge-based planning solution to select patients for proton therapy. Radiother Oncol. 2017;124(2):263-270. doi:10.1016/j.radonc.2017.03.020.
37. Blom GJ, Verbakel WF a R, Dahele M, Hoffmans D, Slotman BJ, Senan S. Improving radiotherapy planning for large volume lung cancer: a dosimetric comparison between hybrid-IMRT and RapidArc. Acta Oncol. 2015;54(3):427-432. doi:10.3109/0284186X.2014.963888.
38. Tol JP, Dahele M, Doornaert P, Slotman BJ, Verbakel WFAR. Toward optimal organ at risk sparing in complex volumetric modulated arc therapy: An exponential trade-off with target volume dose homogeneity. Med Phys. 2014;41(2):021722. doi:10.1118/1.4862521.
39. Wolff D, Stieler F, Welzel G, et al. Volumetric modulated arc therapy (VMAT) vs. serial tomotherapy, step-and-shoot IMRT and 3D-conformal RT for treatment of prostate cancer. Radiother Oncol. 2009;93(2):226-233. doi:10.1016/j.radonc.2009.08.011.
40. No Title. https://varian.force.com/CpWebSummary?Id=a0OE000000NrOvaMAF.
41. Varian Medical Systems. Eclipse Photon and Electron Algorithms Reference Guide Version
53
15.1. Palo Alto, CA 94304-1038 United States of America; 2016. doi:P1015026-002-B.
42. Delaney AR, Tol JP, Dahele M, Cuijpers J, Slotman BJ, Verbakel WFAR. Effect of Dosimetric Outliers on the Performance of a Commercial Knowledge-Based Planning Solution. Int J Radiat Oncol Biol Phys. 2016;94(3):469-477. doi:10.1016/j.ijrobp.2015.11.011.
43. Varian Medical Systems. Eclipse Photon and Electron Reference Guide Version 13.5. Palo Alto, CA 94304-1038 United States of America; 2014. doi:P1005653-002.
54
APPENDIX A
Geometry Expected Dose calculation algorithm not incorporating the avoidance sector.
To investigate the reason of full-arc plans in the h-RA model have higher GED-PC1, the
GED-PC1 for CL was ploted against the following geometrical features: overlap with target,
in-field volume, target volume and mean dose (Figure A1). Full-arc with avoidance sector
plans (red squares) deviated from the rest of the plans, despite not having different
geometrical features.
Figure A1: Overlap with target, in-field volume, target volume and mean dose of CL plotted
against GED-PC1 for h-RA model. Red squares include the full-arc with avoidance sector
plans.
To further examine the handling of avoidance sectors by the model, for two patients
three treatment plans were created and interactively optimized. The first treatment plan
incorporated a full-arc and avoidance sector (a), the second a partial-arc covering the same
irradiative angles of the first plan (b) and the third using full irradiating arc without any
avoidance sector. (c).These plans were subsequently added to the f-RA model, and the model
was re-trained to visualize where these plans lie in the generated regression plot for the CL
(Figure A2). The full-arc without avoidance sector has higher GED-PCS compared to the
other two plans, as expected. One would expect that when the irradiating field directions are
55
the same for both plans, the GED and consequently the GED-PCS1 would be the same for the
two plans. However, after model training, the avoidance sector plan had a considerably
higher GED-PCS1, suggesting that the GED is not accurately calculated. The RapidPlan
provided geometric plots (Figure A2 (B) and (C)) show that the geometrical features (CL
volume, overlap with target, out-of-field volume and target volume) are the same for the three
plans. The only other factor that could cause the GED to be different is the field set-up.
Figure A2: (A) Regression plot of CL of the f-RA model. (B) Geomteric plots for patient 1,
for the (a) partial-arc plan, (b) full-arc with avoidance sector plan (c) full-arc plan. (C)
Geometric plots for patient 2 for the corresponding plans.
56
This bug in the algorithm, had an effect on the generation of predictions. The DVH
estimation algorithm does not take into account the avoidance sector in the optimization
window, and thus the predictions for the CL and TL-PTV are high (1st prediction in Figure
A3). After optimizing the plan incorporating the avoidance sector, and then using the
optimized plan to generate new predictions, the predictions appear to be lower (2nd prediction
in Figure A3). The 1st prediction corresponds to the case c/full-arc without avoidance sector
in Figure A2, while the 2nd prediction corresponds to the case b/full-arc with avoidance
sector. If we use partial-arcs, generate the predictions and optimize the plan, the predictions
seem to be far lower than the achieved DVH.All the f-RA MBPs in the study were optimized
two times to incorporate the avoidance sector. However, since when using partial-arc the
predictions are even lower, the problem is not completely solved.
The problem was communicated to a Varian representative who gave the explanation
to our observations. The GED calculation algorithm does not recognize the avoidance sectors
and ‘discretize’ the arc fields. If the difference in gantry angle is less than 5 degree between
two control points -beam directions-, the control points are used for the discretization, but if
the difference is more than 5 degree, additional control points are interpolated between the
two control points. The new control points are equally spaced and the amount is deduced so
that the new spacing is less or equal than 5 degree. When we start with new fields to generate
predictions for a plan (1st prediction), the control points are spaced in 5 degree along the full
arc, so the predictions are generated as if there is no avoidance sector. When optimizing the
plan, we manually set the control points in around 2 degree interval in the irradiation-arc and
no control points in the avoidance sector. When using the optimized plan -whose fields have
already control points- to generate predictions (2nd prediction), any avoidance sector is
considered as a long jump between two control points and additional control points are
generated with 5 degree spacing. Therefore, the 2nd prediction is generated as if in the
avoidance sector the ‘density’ of control points has been dropped from one control point in
every 2 degree into one control point in every 5 deg. This is actually affecting the GED since
it assumes less dose from those directions. If we use partial arcs, there are no control
points/beamlets at all in the avoidance sector directions. The correct functionality would be
that the partial-arc fields and the full-arc fields with avoidance sector should give the same
results.
However, the problem is not encountered only in the predictions, it is also in the
models since the same GED calculation algorithm is used for the model training. Using
57
partial-arc instead of avoidance sector in the MBPs does not solve the problem because our
models is mainly consisted of avoidance sector plans.
Figure A3: DVH predictions for a new plan (1st prediction) and for an already optimized
plan (2nd prediction) of the same patient, with full-arc with avoidance sector, for CL and TL-
PTV.
58
APPENDIX B
Supplementary material
Figure B1: Predicted DVH-ranges (shaded areas), achieved MBP (black) and manual plan
DVH-lines (red) of Contralateral Lung with the f-RA method for the 10 test patients.
59
Figure B2: Predicted DVH-ranges (shaded areas), achieved MBP (black) and manual plan
DVH-lines (red) of TL-PTV with the f-RA method for the 10 test patients.
60
Figure B3: Predicted DVH-ranges (shaded areas), mid-prediction DVH-lines (dashed lines),
and achieved MBPs (solid line) of CL with the h-RA method for the 10 test patients.
61
Figure B4: Predicted DVH-ranges (shaded areas), mid-prediction DVH-lines (dashed lines),
and achieved MBPs (solid line) of TL-PTV with the h-RA method for the 10 test patients.
62
Figure B5: Achieved MBP Dmean plotted against predicted Dmean of IL, for f-RA and h-RA
MBPs.