ATLAS ONLINE MONITORING. FINISHED! Now what? How to check quality of the data?!! DATA FLOWS!

Preview:

Citation preview

ATLAS ONLINE MONITORINGATLAS ONLINE MONITORING

FINISHED!FINISHED! Now what? How to check quality of the data?!!

DATA FLOWS!DATA FLOWS!

Online Monitoring:Online Monitoring:Surveillance of the data and its Surveillance of the data and its qualityquality

Why monitor?Why monitor?

To be able to discover in a relatively easy way that the detector is not behaving as it should and take action

Examples: detector modules that do not work, synchronization problems btw SCT and TRT, efficiencies sink ....

............

............

......

.........

......

......

HISTOGRAM REPOSITORY

(HISTOGRAM DISPLAY)

PUBLISH

OHP Monitoringdisplay SUBSCRIBE

UPDATE HISTODQM Monitoring display

I see tracks in the SCT!

OHP

Tools for retrievingdata from dataflowbooking, filling and publishing histograms

Readout System

ROB,ROS,SFI,SFO,

LVL2/EF

Tier 0

Calibration FARM

Gatherer

Mon Mon Mon

Mon Mon Mon

IntelligentMonitoring

DisplayExperts

Archiver Archiver

Data Quelity Assessment

Alarms notifications

DQMF

Slow Ctrl.DBS

Var. Ref.DBS

MonitoringDBS

Data Qual.DBS

Monitoring Scheme

Mon Mon Mon

LVL1

Shifterdisplays

OHP

Gatherer

Where and what is monitoredWhere and what is monitored

Detector monitoring:• DCS: detector hardware status and conditions

Online Monitoring ROD Crate, ROS: data quality and integrity Event Builder: correlation between sub-detectors, consistency of

LVL1 information Trigger Monitoring

• LVL1, LVL2: sample rejected events to check the trigger decision• Event Filter: information attached to a sub-set of accepted and

rejected events• DAQ monitoring:

• ROS, EB: operational monitoring (buffer occupancies, throughput, s/w and h/w status, errors, etc.)

Online Histogram Presenter (OHP) Atlas Data Quality Framework (DQMF) Online Histogram Display (OH) Trigger Monitoring

Online Monitoring applicationsOnline Monitoring applications

Trigger OHP DQM OH

Monitoring Displays:Monitoring Displays:Online Histogram Presenter (OHP)Online Histogram Presenter (OHP)

OHP displays already existing histograms Displayed when histos are updated, i.e. when

the detector is running and histos are published

OHP subscribes to one or several histograms (or even all) and displays them online

If the detector is not running a message will be displayed saying that no histogram is available

Histograms - online monitoringHistograms - online monitoring Each detector or detector-subsystem expert

has written tools for retrieving data from detector and filling histograms

Global Inner Detector uses 7 tools for Athena PT Responsible: Arshak Tonoyan& Heidi Sandaker

monitoring of LVL1, BCIDs matching of ID segments at TRT/SCT and SCT/Pixel boundary surface ID noise occupancies monitoring of no. of hits, residuals on combined tracks matching parameters of top and bottom tracks, SCT&TRT extrapolation of SCT segments to TRT: TRT straw efficiencies, residuals, no. of TRT hits

on extrapolated track TRT straw efficiencies

https://twiki.cern.ch/twiki/bin/view/Atlas/InDetGlobalMonitoring?topic=IDGlobalMonitoring

Produces many histograms available in Online Histogram Display – only 10 allowed for each sub-detector in Online Histogram Presenter

OHPOHP

OHP example:OHP example:SCT Hits for all tracks in the eventSCT Hits for all tracks in the event

THE NUMBER OF HITS PERALL TRACKS SHOULD BECOMPARED WITH A REFERENCE HISTO

SCT + SCT + TRTTRT example: example:The BCIDs of the SCT and TRTThe BCIDs of the SCT and TRT

VALUE IS 1IF EVERYTHINGIS OKTHEN SCT &TRT AREREADINGSAME EVENT!

ROD ID

DIFFERENT BCIDBETWEEN SCT ANDTRT -SOMETHING IS WRONG

Online display: Data Quality Online display: Data Quality Monitoring (DQM)Monitoring (DQM) Automatic checks on data quality using

predefined algorithms on histograms or counters etc.

Input from Online Histogram Service Can also be done offline using ROOT files

Puts flags or alarms if values are off expected limits or e.g. histograms are empty

States are green, yellow, red States per subsystem but also overall state

Milestone 6 (M6)Milestone 6 (M6)Cosmic RunCosmic Run

Global Inner Detector monitoring included for the first time

Basic functionalities fully working Looked at the online histograms Did offline work – reported results Work will continue to improve histograms

what they display, how and which to choose

CONCLUSIONSCONCLUSIONS

Monitoring is crucial for good data-taking Several monitoring applications exist and are

working M6 was first time the Global Inner Detector

was fully tested Doing shift-work really lets you learn a lot

about ATLAS and the jungle of software that is out there....

https://twiki.cern.ch/twiki/bin/view/Atlas/InDetGlobalMonitoring?topic=IDGlobalMonitoringhttps://twiki.cern.ch/twiki/bin/view/Atlas/EventFilterMonitoringhttps://twiki.cern.ch/twiki/bin/view/Atlas/AthenaPThttp://atlas-tdaq-monitoring.web.cern.ch/atlas-tdaq-monitoring/

MORE INFO, not included in talk

Atlas Online Monitoring System

Highly distributed, different processes run on different machines at once

Low level: hardware states, noisy modules, dead channels, electronic mappings, data quality e.g. Synchronization btw different sub-detectors

High-level (Athena): run on full events, check physics quantities momentum, spin etc.

Both produce histograms presented in ROOT TDAQ (Trigger and data quality ) group

provide useful services e.g.: Online Histogram Service OHS provides input to OHP

(Online Histogram Presenter)

TDAQ: Trigger and Data Acquisition System HLT: High Level Trigger EF: Event Filter DF: Data Flow software EDF: Event Data Flow PT: Processing Task ROD: Read Out Driver ROS: Read Out Subsystem RODC:Read Out Driver Crate ROC: Read Out Crate EMON: Atlas Event Monitoring – sampler

system

ACRONYMSACRONYMS

Atlantis: Atlas Standalone Event Display Athena: Atlas Offline software framework DQM: Data Quality Monitoring DQMF: Atlas Data Quality Framework OH: Online Histogramming service IS: Information Service DCS: Detector Control System SFI: SubFarm Input SFO: SubFarm Output DSP: Digital Signal Processor OKS: Object Kernel Support

Library to support simple in-memoryobject manager. Suitable for real-timeobject manager for e.g. Data Acquisition

ACRONYMSACRONYMS

Recommended