Machine learning models to identify the simplest way to screen for lung cancer have been developed by researchers from UCL and the University of Cambridge, bringing personalised screening one step closer.
The model was found to be as good or better at predicting an individual’s risk of getting lung cancer within five years compared to the best risk models available, and was able to do so using just a quarter of the information needed.
Lung cancer is the most common cause of cancer death worldwide, with poor survival in the absence of early detection. It is estimated that there were 1.8 million lung cancer deaths globally in 20201.
Screening for lung cancer amongst those at high-risk could reduce lung cancer-specific mortality by 20-24% amongst those screened, but the ideal way to determine if someone is high-risk remains uncertain and existing approaches are resource intensive.
The UK is currently planning a national screening programme for lung cancer, which will include people aged 55-74 who have ever smoked, using a risk model based on 17 questions. This information is complex and time-consuming to gather and will require a 50-100 person-strong call centre to collect the data from one million people.
In this study, researchers from UCL and the University of Cambridge used data from the UK Biobank and US National Lung Screening Trial to develop models to simplify the prediction of a person getting lung cancer within the next five years.
The team used the datasets to experiment with over 60 different machine learning pipelines to see which were the most effective at predicting lung cancer risk using just three variables – age, how many years the individual smoked for, and the average number of cigarettes per day.
From these, they selected four model pipelines and combined them into an ‘ensemble’ that was able to predict lung cancer risk with the same or improved accuracy, compared to the best available models currently is use2. Importantly, they were able to achieve this accuracy using only a third of the variables, greatly simplifying the process of gathering the data required.
Dr Tom Callender (UCL Medicine), first author of the study, said: “Screening for cancer and other diseases saves lives and we are increasingly able to personalise this process. But such personalised screening and disease prevention programmes present important logistical challenges at scale. Our study shows that artificial intelligence can be used to accurately predict lung cancer risk using just three pieces of information that would be easy to gather during routine GP appointments, online or via apps. This approach has the potential to greatly simplify population level screening for lung cancer and help to make it a reality.”
The models used in the study were externally validated in the US Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial and benchmarked against models that are either in use or have performed strongly in previous analyses. The authors believe the same approach could be viable for simplifying screening process for other diseases, such as type-2 diabetes and cardiovascular disease.
Professor Mihaela van der Schaar, an author of the study from the University of Cambridge, said: “This research is a prime example of how machine learning tools such as AutoPrognosis, combined with innovative clinical researchers, can make a real impact in healthcare at a population level. While AutoPrognosis has already been applied for risk prediction and prognosis in numerous diseases, this is the first time it has been used to determine the minimal information needed to screen patients. I think this is the future of preventive medicine and I’m optimistic that the same approach could be applied to screening for other diseases.”
The authors hope the findings will be used to make any national lung cancer screening programme quicker, easier and cheaper to implement, while still achieving the primary aim of reducing lung cancer mortality.
Professor Sam Janes (UCL Medicine), senior author of the study, said: “It’s great news that the UK is working towards a national screening programme for lung cancer, which remains the leading cause of cancer-related deaths in this country as it does across the world. But as we’ve seen in the US, whose screening programme uptake is just eight per cent, there are hurdles to overcome. For any national screening programme to work, it will need to be feasible to run and succeed in getting people to participate. Our findings are good news on both counts.”
This work was supported by Wellcome, the National Science Foundation, the Medical Research Council and Cancer Research UK.
Notes to Editors:
1 For more information on lung cancer, see the WHO website.
2 The current NHS pilot combines two of the best-performing models available, the US Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial 2012 (PLCOm2012) and Liverpool Lung Project version 2 (LLPv2).
Publication:
Thomas Callender et al. ‘Assessing eligibility for lung cancer screening using parsimonious ensemble machine learning models: A development and validation study’ will be published in PLOS Medicine and is strictly embargoed until 3 October 2023 19:00 BST / 14:00 ET.
DOI: https://doi.org/10.1371/
About UCL – London’s Global University
UCL is a diverse global community of world-class academics, students, industry links, external partners, and alumni. Our powerful collective of individuals and institutions work together to explore new possibilities.
Since 1826, we have championed independent thought by attracting and nurturing the world’s best minds. Our community of more than 50,000 students from 150 countries and over 16,000 staff pursues academic excellence, breaks boundaries and makes a positive impact on real world problems.
The Times and Sunday Times University of the Year 2024, we are consistently ranked among the top 10 universities in the world and are one of only a handful of institutions rated as having the strongest academic reputation and the broadest research impact.
We have a progressive and integrated approach to our teaching and research – championing innovation, creativity and cross-disciplinary working. We teach our students how to think, not what to think, and see them as partners, collaborators and contributors.
For almost 200 years, we are proud to have opened higher education to students from a wide range of backgrounds and to change the way we create and share knowledge.
We were the first in England to welcome women to university education and that courageous attitude and disruptive spirit is still alive today. We are UCL.
www.ucl.ac.uk | Follow @uclnews on Twitter | Read news at www.ucl.ac.uk/news/ | Listen to UCL podcasts on SoundCloud | View images on Flickr | Find out what’s on at UCL Minds