Support 110 years of independent journalism.

  1. Science & Tech
23 April 2015updated 07 Sep 2021 10:17am

Diamonds in the data: the LHC is back (and better than ever!)

Cern is about to become very good at mining data for insight - vital for breaking through the information fog of the 21st century.

By Michael Brooks

The Large Hadron Collider (LHC) is back. If you foresee “been there, done that” ennui, don’t let it ruin the moment. After all, we didn’t know that we craved insights into the Higgs boson until researchers found it.

It’s likely that the newly upgraded collider will give us something else we didn’t know we wanted. This is a long-established pattern for innovation. We have recently learned, for instance, that Google Images was conceived in order to find pictures of Jennifer Lopez. It is somehow disappointing but not surprising: we already knew that the creators of YouTube were searching for a way to view Janet Jackson’s accidentally exposed nipple.

So, it’s a good time to be reminded that our higher purposes can also bear fruit. The European Organisation for Nuclear Research (Cern) has discovered a glittering array of subatomic particles but it also gave us the internet. And that’s not all: other Cern spin-offs include cancer therapies, new medical imaging technologies, touch screens and major improvements to solar panels.

The next unsought spin-off is likely to be in data processing. As boring as it sounds, this will almost certainly be the key to every innovation over the next 50 years. Learning to sift data – whether presented in pictures, strings of numbers, or words – will be vital for breaking through the information fog of the 21st century.

Cern is about to become very good at mining data for insight. In the new runs of the LHC, there will be a billion collisions per second. The subsequent events – involving particle characteristics, trajectories, lifetimes, and so on – are what enable Cern’s scientists to find the physics answers they want, but the volume of information is overwhelming. Most of the collisions will produce nothing of interest, with only a few hundred per second worth tracking. Those are selected using complex and subtle algorithms; data from the rest is discarded. It’s not easy to make those selections, which is why searching and sifting algorithms are likely to be one major spin-off from the collider’s development.

Select and enter your email address Your weekly guide to the best writing on ideas, politics, books and culture every Saturday. The best way to sign up for The Saturday Read is via The New Statesman's quick and essential guide to the news and politics of the day. The best way to sign up for Morning Call is via
  • Administration / Office
  • Arts and Culture
  • Board Member
  • Business / Corporate Services
  • Client / Customer Services
  • Communications
  • Construction, Works, Engineering
  • Education, Curriculum and Teaching
  • Environment, Conservation and NRM
  • Facility / Grounds Management and Maintenance
  • Finance Management
  • Health - Medical and Nursing Management
  • HR, Training and Organisational Development
  • Information and Communications Technology
  • Information Services, Statistics, Records, Archives
  • Infrastructure Management - Transport, Utilities
  • Legal Officers and Practitioners
  • Librarians and Library Management
  • Management
  • Marketing
  • OH&S, Risk Management
  • Operations Management
  • Planning, Policy, Strategy
  • Printing, Design, Publishing, Web
  • Projects, Programs and Advisors
  • Property, Assets and Fleet Management
  • Public Relations and Media
  • Purchasing and Procurement
  • Quality Management
  • Science and Technical Research and Development
  • Security and Law Enforcement
  • Service Delivery
  • Sport and Recreation
  • Travel, Accommodation, Tourism
  • Wellbeing, Community / Social Services
Visit our privacy Policy for more information about our services, how Progressive Media Investments may use, process and share your personal data, including information on your rights in respect of your personal data and how you can unsubscribe from future marketing communications.

Data compression and storage will also be pushed to new heights by the physics going on in Geneva. Every year, the collider creates the equivalent of over 250 years of HD video in data, much of which has to be archived. US researchers illustrated the value of long-term archiving recently when they released new results that used four-year-old data to pin down the characteristics
of the Higgs boson.

Cern researchers have already identified a new species of particle using archived data and the ongoing analysis will help direct new experiments. However, the truly exciting breakthrough is probably not in the data but in the way the data is handled.

Thanks to Cern, the world now has a new network: the Grid. It connects over 200,000 computers across the world, making their processing power available to researchers looking to carry out particular searches or computations in LHC data. The software and networking innovations needed to create the Grid are certain to fuel developments outside of physics. Medicine will benefit: new possibilities in distributed computing are already allowing people to volunteer the spare capacity of their Android phone or tablet to help in the hunt for new pharmaceutical drugs.

Medical innovations are low-hanging fruit, however. As with the internet, techniques developed for esoteric physics will trickle down to enable us all to perform tasks that we haven’t yet realised are essential to modern life. If we have learned how to discard boring experimental results, for instance, perhaps we could speed up internet data transfer by stripping out all images of cats. Because of physics, we can dare to dream.

Content from our partners
What you need to know about private markets
Work isn't working: how to boost the nation's health and happiness
The dementia crisis: a call for action