DEVELOPMENT OF DEVELOPMENT OF ONLINE EVENT SELECTION IN ONLINE EVENT SELECTION IN
CBMCBM I. Kisel (for CBM Collaboration) I. Kisel (for CBM Collaboration)
GSI Helmholtzzentrum für Schwerionenforschung GmbH, Darmstadt, Germany;GSI Helmholtzzentrum für Schwerionenforschung GmbH, Darmstadt, Germany; E-mail: [email protected]@gsi.de Open Charm Event Open Charm Event
SelectionSelectionD (c = 312 m): D+ K-++ (9.5%)D0 (c = 123 m): D0 K-+ (3.8%) D0 K- + + - (7.5%) D
s (c = 150 m): D+
s K+ K- + (5.3%) +
c (c = 60 m): +
c pK-+ (5.0%)
No simple trigger primitive, like high pt, available to tag events of interest. The only selective signature is the detection of the decay vertex.
Very efficient tracking algorithms are essential for the feasibility of the open charm event selection
Up to 109 tracks/sec in the Silicon Tracker
Develop algorithms which exploit the full potential
of modern processors. First step:- use SIMD instructions
Best results were obtained with aCellular Automaton based track finderwith integrated Kalman filter track fit
allows usage of double-side strip detectors even at high track densities
highly optimized code- field approximated by polynomials- compact, cache-efficient data- most calculations SIMDized- fast on standard PC's- well adapted to next generation
many-core and wide-SIMD processors
- already ported to IBM Cell processor and NVIDIA graphics cards
very fast when only hard quasi-primary
tracks are reconstructed, as needed
in the online first level event selection
of open charm candidates
supports reconstruction of soft tracks
down to 100 MeV/c, as needed in the
offline analysis
High Speed Tracking AlgorithmsHigh Speed Tracking Algorithms
Source: CBM Progress Report, 2008.
Cell: Heterogeneous multi-coreCell: Heterogeneous multi-core
Inte
l P4
Inte
l P4
Cell
Cell
lxg1411
eh102blade11bc
4
Optimization steps for the track fit routine
Performance on different platforms
CPU time for track reconstruction and fitTypical reconstructed Au+Au collision
Concept of SIMD
R&D RoadmapR&D Roadmap
Detailed simulation and co-optimization of the
tracking system and the analysis algorithms
- alternate sensor types (single-sided sensors)
- alternate module layouts
Detailed studies of event selection algorithms
- open charm selector covering all relevant channels (D0,D±,Ds,Λc)
- design of multi-level event selection
Mathematical and computational optimization
of all algorithms
Determine best platform for:- Hit/Cluster finding- Tracklet finding- Tracking/Vertexting
Go beyond SIMDization (from scalars to vectors)
Address MIMDization (multi-threads, multi-coresand many-core systems)
Exploit the numerical throughputof dedicated purpose processorslike GPU's (Graphics Processors)
Be ready for the emerging heterogeneousmany-core systems
Re-design algorithms to run efficiently onall CPU/GPU architectures
Investigate new languages for the performance
critical core of algorithms, like OpenCL, Ct or CUDA
CPU/GPUCPU/GPU AMD: AMD: FusionFusion
CPU/GPUCPU/GPU AMD: AMD: FusionFusion
OpenCL?OpenCL?OpenCL?OpenCL?
GamingGaming STI: STI: CellCell
GamingGaming STI: STI: CellCell
GP CPUGP CPU Intel: Intel: LarrabeeLarrabee
GP CPUGP CPU Intel: Intel: LarrabeeLarrabee
GP GPUGP GPU Nvidia: Nvidia: TeslaTesla
GP GPUGP GPU Nvidia: Nvidia: TeslaTesla
CPUCPU Intel: Intel: XXX-coresXXX-cores
CPUCPU Intel: Intel: XXX-coresXXX-cores
FPGAFPGA XilinxXilinx
FPGAFPGA XilinxXilinx
CPU: SIMD, multi-coreCPU: SIMD, multi-core GPU: Controller plus many ALUGPU: Controller plus many ALU
Deutsche Physikalische Gesellschaft e.V.
Bochum 09
Tracking ChallengeTracking Challenge
Fixed-target heavy-ion experiment 107 Au+Au collisions/sec ~ 1000 charged particles/collision Non-homogeneous magnetic field Double-sided strip detectors Track reconstruction in STS/MVD and displaced vertex search required in the first trigger level
Scalability on Intel multi-core CPUs
Porting to NVIDIA CUDA
CoresCores
HW ThreadsHW ThreadsSIMD widthSIMD width
NNspeed-upspeed-up = N = Ncorescores*(N*(Nthreadsthreads/2)*W/2)*WSIMDSIMD
KK--
+
First level event selection is done in a processor farm fed with data from the event building network
FP
GA
FP
GA
FP
GA
FP
GA
PCPC PCPCPCPCPCPC PCPC
Sub-FarmSub-Farm
Winner of the DPG Poster Session 2009 Winner of the DPG Poster Session 2009