HPC and AI makes genomic research feasible at population level

Given Kao Data’s first of three data centre campuses is located in the heart of the UK Innovation Corridor – a hotbed for bioinformatics and life sciences, we are regularly involved in discussions revolving around the latest advancements in this field of work.

Thanks to advances in next-generation sequencing technologies (NGS), genomic research is thriving since its origins back in the 1990s. Evolution in AI and machine learning, combined with the advent of high-performance computing (HPC) have accelerated progress further, reducing analysis times from days to hours, minutes even, if cutting-edge supercomputers like NVIDIA Cambridge-1, hosted at our Harlow campus, are being used.

Associated costs have also come down to the point where individual genes can be sequenced routinely. Indeed, some R&D facilities are now able to routinely sequence millions of cases annually and an entire genome can be sequenced for a fraction of the cost that it took just 24 months ago. Genome sequencing is a computational discipline that continues to surprise and amaze such is the exponential rate of year-on-year progress.

The ability to carry out genomic research at population level brings with it a myriad of health and wellbeing benefits because gaining insight into our DNA makeup allows medical scientists to accurately predict our susception to future illnesses and/or chronic conditions before they affect us. Not only does this buy time for research labs and pharmaceutical companies to discover potential cures and/or and rollout counteracting medicines, it empowers individuals to take measures to prevent/ delay the impact of said conditions, thus increasing their life expectancy rates and/or improving their quality of life.

Buying time for the researchers and pharmaceuticals

Since Covid-19 hit the headlines, genomic sequencing has emerged as a key tool in efforts to contain the disease, as health experts the world over have grappled with the inevitability of continued transmission until a) we reach “herd immunity” or b) entire populations are vaccinated.

The UK are world leaders in this space. Indeed, the Department of Health and Social Care published a press release at the beginning of October announcing that over one million SARS-Cov-2 genome sequences have been uploaded to the GISAID database (an international initiative set up to share info pertaining to avian influenza), accounting for nearly a quarter of all sequences published globally to date.

The GISAID database stores sequences from across the globe – and believe me, that’s a heck of a lot of data. The discipline has become so integral to advances in medical science in this country that the Government has established an entire department dedicated to genomic sequencing, Genomics England.

Too much knowledge can be counterproductive

On the flip side you could argue that genomic sequencing is accelerating to the point of becoming intrusive. According to an article in the Guardian, genetic detective work in Victoria, Australia, traced the second Delta variant outbreak down to one (very unfortunate) family, and the sequencing evidence significantly influenced the state’s subsequent quarantine measures. I can only imagine that family’s grief realising they were the eye of the Covid storm in their neighbourhood…

Given that example, there is understandable concern that too much insight is dangerous, leading to unnecessary anxiety which impacts mental health and aspirations for cures for conditions that are not currently treatable.

Genomic sequencing is data intensive

Regardless of our opinions on medical advances, social ethics etc, one thing we know for sure is that genomic sequencing facilities around the globe are generating data in unfathomable quantities. And this begs the question as to how and where we supposed to analyse and store this highly sensitive information? Capacity was initially managed by subdividing DNA samples into bitesize chunks, (exomes etc). However, as technologies and processes have evolved and storage has become more sophisticated, it’s now possible to analyse DNA in its entirety but it comes with obvious compute and bandwidth consequences.

As with all computer-generated research, genomic sequencing is data-intensive because millions of samples must be analysed to obtain meaningful patterns and trends. But how does this high-volume data impact data centres? Not only is the keyed-in data loaded with information, when you combine this with AI and machine learning (also data heavy) to speed up processes and outcomes, the IT load needed is huge. And the more we utilise genomic sequencing for medical research, the more data will be produced.

This is where future-ready campuses and colocation facilities like ours come into play. Equipped with all-fibre networks for swift processing and interconnectivity via secure connections in line with personal data protection laws, we are able to provide the robust, high-density colocation infrastructure needed to drive research in this field. We are seeing more and more demand from life sciences organisations to host power-hungry hardware in the data centre and given this we’ve been fast to move from a one-campus solution to a multi-site offering across three London-perimeter locations that together offer more than 55MW of capacity.

With NVIDIA Cambridge-1 already at our Harlow headquarters, along with the European Bioinformatics Laboratory (EMBL-EBI) and smaller startups using AI and machine learning to explore niche bio-computational workloads including genetics, we’re well placed to support the UK’s continuing push to be a world leader in genetic sequencing.



Share

Other articles

March 31, 2026

Hype Cycle to Power Cycle: The Industrial Era of AI kicked off in San Jose

March 18, 2026

The UK’s AI Ambition Gap

January 22, 2026

The Quiet Revolution: Slough’s story shows how data centres can benefit the UK’s economy and communities for good

Test Text

test job title

Details

If your application is successful, Harlow Council will transfer the grant by BACS.  Bank details (account name, number and sort code) will need to be supplied with a summary of accounts. 

Funding conditions:  If your application is successful, your project must be delivered by 31 December 2026. You will have to return any grant funds if the project is not delivered or the organisation receiving the funding stops operating.

A contract agreement will need to be signed between your organisation and Harlow District Council before any funding is granted.

Monitoring and Evaluation: Grant recipients will be required to provide an end of project report to establish whether the project has met its aims and objectives, as well as to assess the overall impact on participants. Funding for the project is provided on the basis that the Project Evaluation form is returned within the agreed project timescale (no later than 31 January 2027).

Risks and Liabilities: In giving grants the ‘Harlow Council’ will require the supported project organisation or groups to accept all risks and liabilities associated with the activity being supported. This will be a condition of the grant.  Copies of relevant documents may need to be provided if the application has been approved.

Data protection statement – how we will use your information

The Council is committed to handling your personal information in line with the data processing principles.  The Data Protection Legislation and the General Data Protection Regulation 2016/679 (GDPR) sets the legal framework for how we collect, handle and process personal data and for your rights as a ‘data subject.’

General Data Protection Regulation:  Personal data provided by you will be processed in accordance with this protocol. For more details, please see https://www.harlow.gov.uk/privacy-notice

Thank you for taking the time to read these guidance notes. If you have any questions, please get in touch with [email protected].

FAQs

  1. Who can apply?
    The fund is open to not-for-profit community groups and grassroots initiatives based in Stockport.
     
  2. What types of projects are eligible?
    We encourage projects focused on environmental sustainability, community cohesion, and local economic development.
     
  3. How much funding is available?
    Grants range from £500 to £2,500 for pilot projects or to enhance existing initiatives.
     
  4. Is this the sole funding for this project?
    If not, please expand on the additional match funding that you currently have or are in the process of applying for.
     
  5. Are there any restrictions on grants under £500?
    Yes, please note that grants under £500 may be subject to different guidelines or restrictions, which will be communicated upon application.
     
  6. When will we know if our project was selected?
    Notifications will be sent by March 31, 2025.
     
  7. How can I apply for a grant?
    Applications can be submitted through our online portal, where you will find detailed guidelines and forms.
     
  8. What happens if my application is unsuccessful?
    If your application is not successful, we encourage you to seek feedback and consider reapplying in future funding rounds.
     
  9. When is the application deadline?
    Please check our website for the latest application deadlines and any upcoming funding rounds.