LCG Project Overview  
 

 

The Large Hadron Collider (LHC), currently being built at CERN near Geneva, is the largest scientific instrument on the planet. When it begins operations in 2008, it will produce roughly 15 Petabytes (15 million Gigabytes) of data annually, which thousands of scientists around the world will access and analyse. The mission of the Worldwide LHC Computing Grid (LCG) project is to build and maintain a data storage and analysis infrastructure for the entire high energy physics community that will use the LHC.

The data from the LHC experiments will be distributed around the globe, according to a four-tiered model. A primary backup will be recorded on tape at CERN, the “Tier-0” centre of LCG. After initial processing, this data will be distributed to a series of Tier-1 centres, large computer centres with sufficient storage capacity and with round-the-clock support for the Grid.

The Tier-1 centres will make data available to Tier-2 centres, each consisting of one or several collaborating computing facilities, which can store sufficient data and provide adequate computing power for specific analysis tasks. Individual scientists will access these facilities through Tier-3 computing resources, which can consist of local clusters in a University Department or even individual PCs, and which may be allocated to LCG on a regular basis.

The Goals of the LCG project

Discovering new fundamental particles and fields and analysing their properties with the LHC accelerator is possible only through statistical analysis of the massive amounts of data gathered by the LHC detectors ALICE, ATLAS, CMS and LHCb, and detailed comparison with compute-intensive theoretical simulations. LCG provides tools and computing services to help the research physicists in their work. The goals of the LCG project include:

  • Developing components to support the physics application software, such as general-purpose scientific libraries, tools for storing, managing and accessing physics data, adaptation of general scientific simulation frameworks and toolkits to the needs of LHC, interfacing batch and interactive analysis systems to a Grid environment.
  • Developing and deploying computing services based on a distributed Grid model using resources from more than a hundred computing centres around the globe.
  • Managing users and their rights in an international, heterogeneous and non-centralized Grid environment.
  • Collaborating with the regional and national education and research network organisations (NRENs) to ensure high bandwidth data paths between the major LCG centres.
  • Coordinating the programme of tests and pilot services for commissioning of the LCG service in 2007.

Rationale for using a Grid to process LHC data

When the LHC accelerator is running optimally, access to experimental data needs to be provided for the 5000 scientists in some 500 research institutes and universities worldwide that are participating in the LHC experiments. In addition, all data need to be available over the 15 year estimated lifetime of the LHC. The analysis of the data, including comparison with theoretical simulations, requires of the order of 100 000 CPUs at 2006 measures of processing power. A traditional approach would be to centralize all of this capacity at one location near the experiments. In the case of the LHC, however, a novel globally distributed model for data storage and analysis – a computing and data Grid – was chosen because it provides several key benefits. In particular:

  • The significant costs of maintaining and upgrading the necessary resources for such a computing challenge are more easily handled in a distributed environment, where individual institutes and participating national organisations can fund local computing resources and retain responsibility for these, while still contributing to the global goal. As the use of Grid technology grows these resources can be provided from Grid infrastructures shared with other sciences.
  • Also, in a distributed system there are fewer single points of failure. Multiple copies of data and automatic reassigning of computational tasks to available resources ensures load balancing of resources and facilitates access to the data for all the scientists involved, independent of geographical location. Spanning all time zones also eases round-the-clock monitoring and support.

Of course, a distributed system also presents a number of significant challenges. These include ensuring adequate levels of network bandwidth between the contributing resources, maintaining coherence of software versions installed in various locations, coping with heterogeneous hardware, managing and protecting the data so that it is not lost or corrupted over the lifetime of the LHC, and providing accounting mechanisms so that different groups have fair access, based on their needs and contributions to the infrastructure. These are some of the challenges that the LCG project is addressing.

LCG in context

The major computing resources for LHC data analysis are provided by the Worldwide LHC Computing Grid Collaboration – comprising the LHC experiments, the accelerator laboratory and the Tier-1 and Tier-2 computer centres. The LCG project manages the deployment and operation of the distributed computing services for LHC on behalf of the collaboration.

The computing centres providing resources for LCG are embedded in different operational Grid organisations, in particular EGEE (Enabling Grids for E-SciencE) and OSG (the Open Science Grid), but also several national and regional Grid structures such as GridPP in the UK, INFN Grid in Italy and NorduGrid in the Nordic region.

The Enabling Grids for E-sciencE (EGEE ) project is funded by the European Commission and aims to integrate current national, regional and thematic Grid efforts, in order to create a seamless Grid infrastructure available to scientists 24 hours-a-day, for the support of scientific research. LCG and
EGEE are tightly coupled and provide complementary functions.

OSG (Open Science Grid) is a U.S. distributed computing infrastructure for large-scale scientific research, built and operated by a consortium of universities, national laboratories, scientific collaborations and software developers. The OSG integrates computing and storage resources from more than 50 sites in the United States, Asia and South America. Researchers from many fields of science use the OSG infrastructure, including the U.S. participants in the ATLAS and CMS experiments at the LHC. The OSG is supported by the U.S. National Science Foundation and Department of Energy's Office of Science.

The Globus Alliance involves several universities and research laboratories conducting research and development to create fundamental Grid technologies and produce open-source software. The LCG project is actively involved in the support of Globus and uses the Globus-based Virtual Data Toolkit (VDT) as part of the project middleware.

The LCG project is also following developments in industry, in particular through CERN openlab, where leading IT companies are testing and validating cutting-edge Grid technologies using the LCG environment.