SN0518 : Petaminer: Using ROOT for efficient data storage in MySQL database

Author(s):J Cranshaw, D Malon, A Vaniachine, V Fine, J Lauret and P Hamill
Date:May. 13, 2009
File(s): 1742-6596_219_4_042036.pdf
Abstract:

High Energy and Nuclear Physics (HENP) experiments store Petabytes of event data and Terabytes of calibration data in ROOT files. The Petaminer project is developing a custom MySQL storage engine to enable the MySQL query processor to directly access experimental data stored in ROOT files. Our project is addressing the problem of efficient navigation to PetaBytes of HENP experimental data described with event-level TAG metadata, which is required by data intensive physics communities such as the LHC and RHIC experiments. Physicists need to be able to compose a metadata query and rapidly retrieve the set of matching events, where improved efficiency will facilitate the discovery process by permitting rapid iterations of data evaluation and retrieval. Our custom MySQL storage engine enables the MySQL query processor to directly access TAG data stored in ROOT TTrees. As ROOT TTrees are column-oriented, reading them directly provides improved performance over traditional row-oriented TAG databases. Leveraging the flexible and powerful SQL query language to access data stored in ROOT TTrees, the Petaminer approach enables rich MySQL index-building capabilities for further performance optimization.

Submitted: CHEP2009
Status: Published
Ref: J Cranshaw, D Malon, A Vaniachine, V Fine, J Lauret  and P Hamill, 2010 J. Phys.: Conf. Ser. 219 042036

Keywords:MySQL, ROOT trees, TAG, RHIC, LHC
Category:Computing