10/12/2002 WP4 Workshop - CERN

fMon

WP4 - Monitoring

Progress report

[email protected]

Work plan 3Q02 – 1Q03

• Repository server

• TCP transport

• C-Implementation of the repository API

• Alarm display

• Packaging, documentation and unit testing for release 2

• Integration and deployment on release 2 Testbed

• Dynamic proxy logic in TCP transport

• Perl implementation of the repository API

• Java implementation of the repository API

• Extended EDG sensor set

• ORACLE interface

• Database survey

• Interface open source Database

• Investigation on metric schemas

• Revisit the metric configuration

• Security

• Web based report GUI

Repository server

• Released

• Multithreaded

• Uses MR API for insertion of samples

• Much better performance observed with UDP transport (reduced packet loss)

• SOAP connectivity for metric subscription

TCP transport

• Released

• End-to-end reliable push transport of blobs, with caching and recovery if the server is down

• Stress tests to be completed with a large number of clients

• Proxy configuration currently manual
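
The cache-and-recovery behaviour described above can be sketched in a few lines. This is a minimal illustration, not fMon's actual implementation: the class name `CachingSender`, the length-prefixed framing, and the one-byte acknowledgement in `tcp_send` are all assumptions made for the example, and the cache is held in memory where a real agent would more likely spool to disk.

```python
import socket
import struct
from collections import deque
from typing import Callable, Deque


def tcp_send(host: str, port: int, blob: bytes, timeout: float = 5.0) -> None:
    """Send one length-prefixed blob and wait for a 1-byte acknowledgement.

    The framing and the ACK byte are hypothetical, chosen for illustration.
    """
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(struct.pack("!I", len(blob)) + blob)
        if sock.recv(1) != b"\x06":
            raise ConnectionError("server did not acknowledge blob")


class CachingSender:
    """Push blobs in order; keep them cached while the server is unreachable."""

    def __init__(self, send: Callable[[bytes], None]) -> None:
        self._send = send
        self._cache: Deque[bytes] = deque()

    def push(self, blob: bytes) -> None:
        # Always enqueue first so nothing is lost if the send fails.
        self._cache.append(blob)
        self.flush()

    def flush(self) -> int:
        """Try to deliver cached blobs oldest-first; return how many went out."""
        delivered = 0
        while self._cache:
            try:
                self._send(self._cache[0])
            except OSError:
                # Server down or connection broken: stop and retry later.
                break
            # Drop the blob only after a successful, acknowledged send.
            self._cache.popleft()
            delivered += 1
        return delivered

    @property
    def pending(self) -> int:
        return len(self._cache)
```

A sender wired to a real server would be built as `CachingSender(lambda b: tcp_send("repo.example.org", 12409, b))`, where the host and port are placeholders. Blobs pushed while the server is down stay in the cache and are delivered in order on the next successful `push` or `flush`.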

C-Implementation of the repository API

• Released

• Implements the flat file system

• Includes subscription mechanism (SOAP)

Alarm display

• Ongoing

• Java implementation

• SOAP connectivity causing delays

Packaging, documentation and unit testing for release 2

• Completed

• Integrated successfully

• LCFGng object

Integration and deployment on release 2 Testbed

• Waiting for the testbed

Dynamic proxy logic in TCP transport

• Not started

• Planned beginning 2003

• Logic to be defined

• Needs large-scale deployment to test behavior

Perl implementation of the repository API

• Ongoing

• Requires a simplified MR API

• String oriented interface

• First tests successful with SWIG

– Tcl prototype works fine

Java implementation of the repository API

• To be done

Extended sensor set

• Ongoing (forever?)

• Some new metrics:

– spaceUsed

– daemonCheck

– executeScript

– file.dump

– file.tail (including pipe from lcfg logs)

– processAccounting

– snmp

• Activity will go on with deployment, following demand

ORACLE interface

• Ongoing

• But an advanced prototype is ready

• More details in following presentation

Database survey

• Completed

• Prototyping necessary to confirm results in real-life environment

Interface open source Database

• Not started

• … but soon

• Should be faster because table structures now defined

Investigation on metric schemas

• A few thoughts already

• Planned for next year

• Deals with sample types

• Complex: can have arrays, etc.

Revisit the metric configuration

• Not started

• Schema defined already

• But needs some updates

Security

• Not started

• Concerns mainly transport

• Also some sensitive configuration items

Web based report GUI

• Ongoing

• First prototype released

• CGI interface for DB query available

Out of plan items

• Various MSA improvements

– Daemon facilities

– Now using EDG environment variables

– Syslog redirection of logs

– Multiple transports at the same time

• Tutorial … to be tested next Thursday

Users

• MSA used extensively at CERN: CCS project

– uses UDP transport, with central PVSS DB/alarm

– local correlation connected to SURE

– 19 clusters with different configurations (CCS system)

– 900 nodes (half of the computer center)

– CCS sensors

– production quality

– mainly exception monitoring

• Good feedback from Rutherford RAL

• Connectivity to fault tolerance

Plans after 1Q2003

• Complete previous list

• Then…

– Final picture has few gaps left

– Test and consolidate newer components

– Deploy on a larger scale

Conclusion

• Software package is becoming complete and usable

• Architecture and software stable

• No major problem foreseen

• It’s going well