Replicating Data from the Large Electron Positron (LEP) collider at CERN (ALEPH Experiment)


Replicating Data from the Large Electron Positron (LEP) collider at CERN

(Aleph Experiment)

Under the DPHEP umbrella

Marcello Maggi / INFN-Bari, Tommaso Boccali / INFN-Pisa

International Collaboration for Data Preservation and Long Term Analysis in High Energy Physics

The HEP Scientist

The Standard Model of Elementary Particles

[Chart: Fermions – the quarks u, d, c, s, t, b and the leptons e, μ, τ, νe, νμ, ντ; Bosons – the force carriers γ, Z, W, g and the Higgs boson h, the just-discovered piece.]

Dark Matter, Matter/Anti-matter Asymmetry, Supersymmetric Particles

FROM MICROCOSM TO MACROCOSM

The use case

• The ALEPH Experiment took data from the CERN e+e− collider LEP from 1989 to 2000
• Collaboration of ~500 scientists from all over the world
• More than 300 papers were published by the ALEPH Collaboration

ALEPH data – still valuable

• While the Collaboration practically stopped a couple of years later, some late papers were published, and we still get requests from Theoretical Physicists for additional checks / studies on ALEPH Data
• Current policy: any paper can use ALEPH data if at least one of the authors is a former ALEPH Member (moving to CC0?)

Data Sharing & Data Management: a Fundamental Issue

Birth of Web @ CERN

Some Facts

Looking at Reconstructed samples (the format closest to physics utilization), ALEPH data consists of:
• 150k files, avg size 200 MB
• 30 TB in total
• Split between real collected events and Monte Carlo simulated events
• Processing time (analysis) on recent machines is such that 1 week is enough to process the whole sample – splitting between machines helps!
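
A back-of-the-envelope check (not in the original slides) of what the one-week turnaround implies for a single sequential pass over the sample:

$$
\frac{30\ \text{TB}}{1\ \text{week}} \approx \frac{3\times10^{13}\ \text{B}}{6.0\times10^{5}\ \text{s}} \approx 50\ \text{MB/s},
$$

a sustained read rate well within reach of a single modern node; splitting the 150k files across N machines divides the per-node requirement by N.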

The Computing Environment

The last blessed environment (blessed = blessed for physics) is Linux SL4:
• GCC 3.4
• G77 3.4
• LIBC6
• All the software ALEPH uses will have a CC licence
• We can recompile everything on our own
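
As a small illustration (not part of the original slides), a sketch like the following could verify that a freshly instantiated environment actually provides the blessed SL4 toolchain before attempting a rebuild; the expected version strings are assumptions taken from the list above:

```python
import shutil
import subprocess

# Assumed toolchain for the last blessed ALEPH environment on SL4 (see the list above).
EXPECTED = {"gcc": "3.4", "g77": "3.4"}

def tool_version(tool):
    """Return the first line of `<tool> --version`, or None if the tool is missing."""
    if shutil.which(tool) is None:
        return None
    out = subprocess.run([tool, "--version"], capture_output=True, text=True)
    return out.stdout.splitlines()[0] if out.stdout else None

if __name__ == "__main__":
    for tool, wanted in EXPECTED.items():
        version = tool_version(tool)
        status = "OK" if version and wanted in version else "MISSING/UNEXPECTED"
        print(f"{tool:4s} -> {version!r} [{status}]")
```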

Data workflows

[Diagram: RAW DATA is reconstructed ("Julia") into Reconstructed data; Kinematic Simulation (various tools) feeds Particle/Matter Simulation (Geant3 based), producing Reconstructed Simulated Events; User Code (Fortran + HBOOK) runs over both Real DATA and Simulated Events.]

Current Strategy

Computing environment via a VM approach:
• Currently using uCERN-VM
• Provides batch-ready VMs, interactive-ready VMs, development-ready VMs

Data to be served via POSIX to the executables:
• The current approach (pre-Eudat) was via WebDAV (Apache, Storm, …)
• Seen by the VM as a FUSE/DavFS2-mounted POSIX filesystem
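
Where no DavFS2 mount is available, the same WebDAV endpoint can in principle be browsed directly with the standard PROPFIND/GET verbs; a minimal sketch follows (the endpoint URL is a placeholder, not the actual ALEPH server):

```python
import xml.etree.ElementTree as ET
import requests

# Placeholder endpoint; the real ALEPH WebDAV server is not given in the slides.
BASE_URL = "https://example.org/webdav/aleph/"

def list_collection(url, auth=None):
    """List the entries of a WebDAV collection via PROPFIND (Depth: 1)."""
    resp = requests.request("PROPFIND", url, headers={"Depth": "1"}, auth=auth)
    resp.raise_for_status()
    tree = ET.fromstring(resp.content)
    # Each <D:response> carries the href of one file or sub-collection.
    return [href.text for href in tree.iter("{DAV:}href")]

def fetch(url, dest, auth=None):
    """Download one file with a plain GET, streaming it to disk."""
    with requests.get(url, stream=True, auth=auth) as resp:
        resp.raise_for_status()
        with open(dest, "wb") as out:
            for chunk in resp.iter_content(chunk_size=1 << 20):
                out.write(chunk)

if __name__ == "__main__":
    for entry in list_collection(BASE_URL):
        print(entry)
```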

What is missing? Metadata!

• What is a file containing?
• Where is a file available?
• "Give me all the data taken at 203 GeV in Spring 1999"
• "Give me all the simulations @ 203 GeV for hadronic W decays"

We had a tool, SQL based:
• We still have the SQL dump
• The tool is only reachable via low-level commands (a query sketch is given below)
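
To illustrate the kind of query the old catalog answered (and that any new catalog must answer), here is a sketch against the SQL dump loaded into SQLite; the table and column names are invented for the example, since the actual schema is not shown in the slides:

```python
import sqlite3

# Hypothetical schema: the real ALEPH catalog dump uses its own table/column names.
QUERY = """
SELECT filename, location
FROM run_files
WHERE energy_gev = ?          -- centre-of-mass energy
  AND data_type  = 'real'     -- real data, not Monte Carlo
  AND taken_on BETWEEN ? AND ?
"""

def files_at_energy(db_path, energy_gev, start, end):
    """Return (filename, location) pairs, e.g. all real data taken at 203 GeV in Spring 1999."""
    with sqlite3.connect(db_path) as conn:
        return conn.execute(QUERY, (energy_gev, start, end)).fetchall()

if __name__ == "__main__":
    for name, loc in files_at_energy("aleph_catalog.db", 203, "1999-03-01", "1999-06-21"):
        print(name, loc)
```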

The Picture

[Diagram: a Data Infrastructure (Ingestion, PID, Trust, Integrity, Replica, Sharing, …) built on the Network Infrastructure and Storage, providing Disaster Recovery, Data Curation, Bit Preservation and Knowledge Preservation. Local batch, HPC, Grid and Cloud resources run the Experiment VM, which fetches data & environment; Data Discovery and (open?) access go through a Digital Library (OA reps) and the EUDAT services B2SAFE, B2SHARE, B2FIND and B2STAGE.]

MetaData Ingestion – complexity is the norm…

[Example: 1999 (Real Data)]

Eudat? The plan is "simple"

• Move all the files WebDAV -> Eudat node
  – Already started, to the CINECA Eudat node
• Use B2FIND to re-implement a working metadata catalog (SQL to B2FIND??) – a possible record mapping is sketched below
• Use other B2* tools (we need to study here) to:
  – Prestage files on the VM before execution? (B2STAGE)
  – Access via streaming? (B2STREAM?)
• Use B2* to save outputs in a safe place, with correct metadata (B2SHARE)
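
B2FIND is populated by harvesting metadata records (commonly Dublin Core over OAI-PMH). A sketch of how one SQL catalog row could be exported as a minimal oai_dc record is shown below; the field mapping and example values are invented for illustration, not an agreed ALEPH/B2FIND schema:

```python
import xml.etree.ElementTree as ET

DC = "http://purl.org/dc/elements/1.1/"
OAI_DC = "http://www.openarchives.org/OAI/2.0/oai_dc/"

def catalog_row_to_dc(row):
    """Turn one (assumed) catalog row into a minimal oai_dc record ready for harvesting."""
    ET.register_namespace("dc", DC)
    ET.register_namespace("oai_dc", OAI_DC)
    rec = ET.Element(f"{{{OAI_DC}}}dc")
    ET.SubElement(rec, f"{{{DC}}}title").text = row["filename"]
    ET.SubElement(rec, f"{{{DC}}}identifier").text = row["pid"]      # PID returned by the Eudat node
    ET.SubElement(rec, f"{{{DC}}}date").text = row["taken_on"]
    ET.SubElement(rec, f"{{{DC}}}description").text = (
        f"ALEPH {row['data_type']} data, sqrt(s) = {row['energy_gev']} GeV"
    )
    return ET.tostring(rec, encoding="unicode")

if __name__ == "__main__":
    # Placeholder values, Handle-style PID included only as an example.
    example = {"filename": "run12345.reco", "pid": "prefix/0000-0000-0000-0000",
               "taken_on": "1999-05-12", "data_type": "real", "energy_gev": 203}
    print(catalog_row_to_dc(example))
```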

Some Early Feedback

How it is now:
1) Provide identity
2) Transfer files via GridFTP
3) Copy back the PIDs
4) Build metadata
5) Feed the OA repositories
6) Give the OAI-PMH link

[Diagram: the USER sends Data + MetaData (template needed) to the Eudat node; the node returns a PID, and location, checksum, size and PID feed B2FIND (to be integrated) and an Open Access Repository exposing OAI-PMH.]

150,000 files… A bulk "B2Ingestion" is missing: B2SHARE, B2SAFE and B2STAGE each do "just" part of it (a per-file loop is sketched below).
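
Since no single bulk "B2Ingestion" service exists, the six steps above would have to be scripted per file. The skeleton below is only a sketch: every helper is a dry-run placeholder for whichever EUDAT/GridFTP clients are actually chosen.

```python
# Dry-run skeleton of a per-file bulk-ingestion loop for the ~150,000 ALEPH files.

def transfer_gridftp(local_path, remote_url):
    # Step 2: placeholder – wrap globus-url-copy or an equivalent GridFTP client here.
    print(f"[dry-run] would transfer {local_path} -> {remote_url}")

def register_pid(remote_url):
    # Step 3: placeholder – call the EUDAT/Handle PID service and return the new PID.
    return f"PID-for-{remote_url}"

def build_metadata(local_path, pid):
    # Step 4: minimal metadata record; the real template is still to be defined.
    return {"pid": pid, "source": local_path}

def feed_oa_repository(record):
    # Steps 5-6: placeholder – push the record to the Open Access repository (OAI-PMH side).
    print(f"[dry-run] would publish {record}")

def ingest(file_list, remote_base):
    """Run steps 2-6 for each file; step 1 (identity) is assumed to be done once, up front."""
    for local_path in file_list:
        remote_url = f"{remote_base}/{local_path.rsplit('/', 1)[-1]}"
        transfer_gridftp(local_path, remote_url)
        pid = register_pid(remote_url)
        feed_oa_repository(build_metadata(local_path, pid))

if __name__ == "__main__":
    ingest(["/data/aleph/run12345.reco"], "gsiftp://eudat-node.example.org/aleph")
```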

[Diagram: OA repositories and Data repositories expose OAI-PMH end-points; harvesters (running on grid/cloud) collect the records and feed a linked-data search engine with semantic-web enrichment.]
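
The harvesting step follows the standard OAI-PMH protocol; a minimal sketch (the end-point URL is a placeholder) could look like this:

```python
import xml.etree.ElementTree as ET
import requests

OAI = "{http://www.openarchives.org/OAI/2.0/}"

def harvest(endpoint, metadata_prefix="oai_dc"):
    """Yield all records from an OAI-PMH end-point, following resumption tokens."""
    params = {"verb": "ListRecords", "metadataPrefix": metadata_prefix}
    while True:
        tree = ET.fromstring(requests.get(endpoint, params=params).content)
        for record in tree.iter(f"{OAI}record"):
            yield record
        token = tree.find(f".//{OAI}resumptionToken")
        if token is None or not (token.text or "").strip():
            return
        params = {"verb": "ListRecords", "resumptionToken": token.text.strip()}

if __name__ == "__main__":
    # Placeholder end-point; any OAI-PMH repository would do.
    for rec in harvest("https://example.org/oai"):
        ident = rec.find(f"{OAI}header/{OAI}identifier")
        print(ident.text if ident is not None else "<no identifier>")
```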

Data & Knowledge Infrastructure
(Roberto Barbera / INFN and University of Catania)

Knowledge Base – not only data

• ALEPH Virtual Environment
• Link to Digital Library (Open Access Reps)
• File Catalogues
• Documentation
• Instructions

HEP status

• ALEPH is now a small experiment
• KLOE has 1 PB
• CDF has 10 PB
• LHC has > 100 PB

Data Nodes can federate – especially useful for sites where computing is available.

[Diagram: Eudat Node 1, Node 2 and Node 3 federate over the NETWORK, with both low-level and high-level links.]

The (Big) DATA today

• 10^7 "sensors" produce 5 PByte/sec
• Complexity is reduced by a Data Model
• Real-time analytics (the Trigger) filters this down to 0.1–1 GByte/sec
• Data + replicas move according to a Data Management Policy
• Analytics produce "Publication Data" that are shared – and finally the publications
• Re-use of the data relies on Data Archive Management Plans
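
For scale (a rough figure implied by the numbers above, not stated in the slides), the rejection power of the Trigger is about

$$
\frac{5\ \text{PB/s}}{0.1\text{–}1\ \text{GB/s}} = \frac{5\times10^{15}\ \text{B/s}}{10^{8}\text{–}10^{9}\ \text{B/s}} \approx 5\times10^{6}\text{–}5\times10^{7},
$$

i.e. only roughly one part in ten million of the raw sensor output survives to offline storage.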