44
Toward High-Confidence and Full Automation of Seismic Data Processing for CTBT Monitoring Jin Ping (Northwest Institute of Nuclear Technology, China)

Toward High-Confidence and Full Automation of Seismic Data

  • Upload
    vanphuc

  • View
    218

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Toward High-Confidence and Full Automation of Seismic Data

Toward High-Confidence and Full Automation of Seismic Data Processing for CTBT Monitoring

Jin Ping (Northwest Institute of Nuclear Technology, China)

Page 2: Toward High-Confidence and Full Automation of Seismic Data

Contents

•  Introduction; •  Review of existed technologies; •  Progresses achieved; •  Summary;

Page 3: Toward High-Confidence and Full Automation of Seismic Data

1. Introduction

•  Seismic monitoring activities, either for CTBT or earthquake-induced disaster control, are facing a new era of big data as the number of seismic stations with data open available or to be dealt with increasing rapidly.

Page 4: Toward High-Confidence and Full Automation of Seismic Data

•  For the CTBT monitoring, the 170 IMS seismic stations would have the capability to detect events of mb>3.2 for a very large part of the globe, equivalent to signals of more than 100,000 events may be yearly registered and to be processed.

Fig.1 Network detection capability map of the 41 International Monitoring System (IMS) primary seismic stations operational by 1 January 2012. Source: Kvaerna & Ringdal (2013), Bull. Seism. Soc. Am,103, 759-772.

Page 5: Toward High-Confidence and Full Automation of Seismic Data

•  For earthquake monitoring, as the result of the application of modern digital technologies in the field of seismic observation, many parts of the world are covered by densely-distributed seismic stations. These non-IMS seismic stations can also be used for CTBT verification.

Fig.2 Seismic stations affiliated to the National Seismological Bureau of China.

Page 6: Toward High-Confidence and Full Automation of Seismic Data

•  To process the huge amount of seismic waveform data, automatic technologies which can reliably and efficiently detect, localize and categorize seismic events is crucial for both CTBT and natural seismicity monitoring;

Page 7: Toward High-Confidence and Full Automation of Seismic Data

•  However, existing automatic technology for seismic data processing still facing great technical obstacles in improving its reliability and efficiency;

Page 8: Toward High-Confidence and Full Automation of Seismic Data

•  Summary of accuracy of IDC automatic processing ü Almost 30% of events which appear in SEL3 have been

discarded by analysts as invalid; ü Analysts have substantially modified a further 50% of SEL3

events; ü  Events added by analysts constitute on average about 20% of

events in the reviewed event list (LEB); -Source: Pearce et al (2009), Exploiting the skills of waveform

data and analysts in the quest for improved automatic processing, poster presented at the ISS09 Conference, 10-12 June, 2009.

Page 9: Toward High-Confidence and Full Automation of Seismic Data

•  Therefore up to date, routine seismic monitoring still heavily relied on manual interactive analyses, and this reality can hardly fit the situation of the era of rapidly increasing amount of seismic data.

Page 10: Toward High-Confidence and Full Automation of Seismic Data

2. Review of existing technologies for automatic seismic data processing.

•  Conventional procedure of seismic monitoring for CTBT can be split into four stages;

Automa'c  Data  Processing    (Event  Detec'on  and  Loca'on)

Analyst  Review  (Review  and  correct  automa'cally  produced  events  one  by  one)

Automa'c  Event  Characteriza'on  &  Screening

Special  Event  Analyses

Fig.3 Principal stages of CTBT seismic monitoring

Page 11: Toward High-Confidence and Full Automation of Seismic Data

•  To improve the automation degree of the monitoring, it is crucial to develop more reliable and effective technologies to detect, localize and categorize seismic events automatically;

Page 12: Toward High-Confidence and Full Automation of Seismic Data

•  For automatic event detection and localization, the conventional method is by the “DIAL” approach;

Signal Detection

Phase type Identification

Signal Association

Event Location

Fig.4 Typical approach for automatic seismic data processing.

Page 13: Toward High-Confidence and Full Automation of Seismic Data

•  This approach usually is characterized by features including:

ü  Signals are detected individually, though in many cases (especially under the situation of regional seismograms) they may arrive collectively at a station as wavetrains;

Fig. 5 A diagram to illustrate wavetrains and detections.

wavetrain  1

wavetrain  2 wavetrain  3

Page 14: Toward High-Confidence and Full Automation of Seismic Data

ü Through the whole processing, seismic signals are principally characterized by features that are local in time domain, features characterizing the whole shape of respective wavetrains are ignored;

Page 15: Toward High-Confidence and Full Automation of Seismic Data

Sta$on xxxxxxx

Arrival  Time xxxxxx

SNR xxxxxx

Amplitude xxxxxx

Period xxxxxx

Azimuth xxxxxx

Slowness xxxxxx

…… …….

Sta$on xxxxxxx

Arrival  Time xxxxxx

SNR xxxxxx

Amplitude xxxxxx

Period xxxxxx

Azimuth xxxxxx

Slowness xxxxxx

…… …….

The whole seismogram

Segmented waveform for individual detections

Parameterized features;

Fig.6 A diagram to illustrate the measurement of local features that are used to identify and associate seismic signals by conventional DIAL approach. During the process, the information about the shape of the whole seismograms is lost and links between different detections is basically cut off.

Page 16: Toward High-Confidence and Full Automation of Seismic Data

ü Decision methods used for signal detection, identification and association basically are based on theories for simple scenarios. Their performances usually are significantly reduced in the situation of real, complex data;

Fig. 7 Illustration of the possible gap between theoretically assumed (left) and practical (right) distribution of feature parameters.

theoretically assumed practical

Page 17: Toward High-Confidence and Full Automation of Seismic Data

ü And the processing is basically sequential, other viable explanations are ignored in many cases by this way;

Fig. 8 Illustration of potential mistakes due to sequential processing.

associatable  correct

associatable,  false

……

sequen'al  in  space

sequen'al  in  'me

Sta 1

Sta 2

Sta N

unassociatable  

location uncertainty of seed origin

false location due to false association

correct location

Page 18: Toward High-Confidence and Full Automation of Seismic Data

ü Visit to waveform data is limited to the detection & feature extraction stage, following processing stages are only based on derived, information-incomplete signal parameters;

Detection & Feature Extraction

Parameterized Tables

Post-Detection Processing 1

Post-Detection Processing 2

Post-Detection Processing N

Raw waveforms

Fig. 9 The model of waveform data accessing of automatic processing systems.

Page 19: Toward High-Confidence and Full Automation of Seismic Data

•  The low performance of seismic data processing systems is intrinsically related to these features. It can be illustrated by comparing computer processing methods with that used by human being analysts;

Automa$c Interac$ve

Associa'on  mode from  part  to  whole;   from  whole  to  part;

Signal  feature  used local  only; both  local  and  integral;

Decision  criteria sequen'al,  rigorous; parallel,  flexible;

Visit  to  waveform  data limited; through  the  analyses;

Table 1 Comparison of reasoning approaches between automatic processing and interactive analyses

Page 20: Toward High-Confidence and Full Automation of Seismic Data

Detec'on  of    individual  signals;

Characterize   and   iden'fy   signals  types  by  features  local  in  'me;

Associate   signals   to   form   signal  groups;  

Detect   wavetrains   with   associated  signals;

Iden'fy   wavetrain   type   principally  by   features   characterizing   its  envelope    shape;

Iden'fy   major   phases   by   their  rela've   posi'ons   in   the   wavetrain  and      their  local  features;

Seismogram  Level

Individual  Signal  Level

Fig.9 Illustration of differences in approaches for signal detection, phase type identification and association between computer programs and human analysts. Contrary to the former, the processing procedure used by analysts normally is from whole wavetrains to specific signals, applying both local and integral features.

Page 21: Toward High-Confidence and Full Automation of Seismic Data

•  To develop more reliable and accurate automatic seismic data processing systems, approaches, including signal features and reasoning rules, methods used by human being analysts should be generalized and applied;

Page 22: Toward High-Confidence and Full Automation of Seismic Data

3. Progresses Achieved;

•  Automatic seismic data processing using both integral and local features of seismograms;

•  Progressive signal association algorithm for detecting teleseismic/network-outside events using regional seismic networks;

•  Prototype Computer Analyst System for seismic event review;

Page 23: Toward High-Confidence and Full Automation of Seismic Data

3.1Data processing using integral features.

•  A novel technique has been developed to incorporate integral features into conventional DIAL approach for automatic seismic data processing; (Jin et al, 2014, A novel technique for automatic seismic data processing using both integral and local feature of seismograms, Earthq. Sci., 27, 337-349.)

Page 24: Toward High-Confidence and Full Automation of Seismic Data

•  Specialties of the technique. ü Detect individual signals together with associated

wavetrains; ü Detections are categorized into the beginning or the

following type;

the  beginning  detec'on following  detec'ons

Fig.10 Illustration of the beginning detection and following detections of a wavetrain. Normally direct P waves can not be following detections and they can be associated to respective beginning detection straightforwardly.

Page 25: Toward High-Confidence and Full Automation of Seismic Data

ü Integral features are introduced to characterize the full shape of wavetrains;

Fig.11 Illustration of the concept of integral features of wavetrains (left). The rough envelope shape of seismograms can be inferred from these features (right).

Page 26: Toward High-Confidence and Full Automation of Seismic Data

ü Use the integral features along with conventional local features for seismic phase type identification and association;

Fig.12 Illustration of the method to search for regional P/S pairs by both integral and local features.

Page 27: Toward High-Confidence and Full Automation of Seismic Data

•  Very promising results has been achieved by applying this technique with fairly low false and missed event ratio simultaneously obtained.

—Summary of test results using 16 days’ data of Xinjiang Seismic Network (XJSN).

ü  False events ratio: ~8%; ü  Missed events ratio: <2% for ML>1.0; ü  Average Location error: 15.6km ; (in the case of the sparse regional

network XJSN)

Page 28: Toward High-Confidence and Full Automation of Seismic Data

Fig.14 Left: Statistics on detected Xinjiang Event Bulletin (XJEB) events; Right: Comparison of automatically determined epicenter locations with that listed in XJEB for events of ML>2.0;                      

automatic

XJEB

Page 29: Toward High-Confidence and Full Automation of Seismic Data

Fig. 15 An example of automatic processing results for a far regional event.

Page 30: Toward High-Confidence and Full Automation of Seismic Data

Fig. 16 Examples of automatically detected local/regional events which have been missed by XJEB.

Page 31: Toward High-Confidence and Full Automation of Seismic Data

3.2 A novel association algorithm for detecting teleseismic/network-outside events using regional seismic networks

•  Monitoring underground nuclear explosions using regional seismic networks is a common practice;

•  For various reasons, little attention has been paid to this problem;

•  To deal with this issue, we have developed a special progressive association algorithm which can reliably detect teleseismic/network-outside events using regional seismic networks;(Jin et al, 2015, A Novel Progressive Signal Association Algorithm for Detecting Teleseismic/Network-outside Events Using Regional Seismic Networks, Geophys. J. Int., 201,1950-1960.)

Page 32: Toward High-Confidence and Full Automation of Seismic Data

•  This association algorithm exploits the relationship between the arrival times and the slowness of teleseismic signals as well as the unique slowness range of teleseismic P waves;

Fig.17 Left: Diagram illustrates a teleseismic signal propagating across a regional seismic network; Right: The relationship between slowness and epicentral distances for frequently observable seismic phases.

Page 33: Toward High-Confidence and Full Automation of Seismic Data

•  It takes the so-called primary triangle station arrays (PTAs) as the starting point to search for P waves of teleseismic events progressively;

Fig. 18 Predefined PTAs for XJSN to detect teleseismic events

Page 34: Toward High-Confidence and Full Automation of Seismic Data

•  This algorithm can effectively and reliably detect network-outside seismic events with very low ratio of false events(<3%);

Fig. 19 Magnitude and distance distributions of (a) detected and (b) undetected IDC REB events using data of XJSN from 2010/06/01 to 2010/06/16. REB events undetected basically have no signal recorded at XJSN stations.

Page 35: Toward High-Confidence and Full Automation of Seismic Data

Fig.20 Comparison of automatically determined epicenters using XJSN with those in IDC REB.

Page 36: Toward High-Confidence and Full Automation of Seismic Data

3.3 prototype Computer Analyst System

•  To realize the full automation of seismic data processing ultimately, we are making effort to develop a prototype Computer Analyst System (pCAS);

High Confidence Automatic Data Processing

Computer Analyst Review

Enhanced Event Characterization & Screening

Special Event Analyses

Fig. 21 Proposed approach for full automation of seismic data processing for CTBT monitoring.

Page 37: Toward High-Confidence and Full Automation of Seismic Data

•  Its function is to substitute human being analysts for seismic event review, including:

ü remove false origins produced by APS and search for missed events;

ü remove falsely associated phases and add previously undetected/unassociated ones;

ü revise mistakes in phase timing and labeling; ü reconstruct seriously mislocated events produced by

APS;

Page 38: Toward High-Confidence and Full Automation of Seismic Data

•  A primitive version of pCAS has been designed for test.

OriginsCheck EventsBuildup EventsScan

• E v a l u a t e   o r i g i n s  reliability;  • Iden'fy   and   merge   split  origins;  • Iden'fy  and   remove   false  origins;  • Iden'fy  and   remove   false  associated   seismic   phases  for  real  events;  

•    Revise   mistakes   in   phase  'ming  and    labeling;  • Add   previously   undetected  or   unassociated   signals,  inc luding   a l l   secondary  phases;    • Iden'fy   and   reconstruct  events    with  serious  mistakes  in   signal   associa'on   and  event  loca'on;  

•    Search   for   and   add  p r e v i ou s l y   unde t e c t ed  events  ;

Fig.22 Principal modules of pCAS and related functions.

Page 39: Toward High-Confidence and Full Automation of Seismic Data

•  Preliminary test results are promising, though there are still a lot of technical challenges to overcome;

Page 40: Toward High-Confidence and Full Automation of Seismic Data

•  Summary of preliminary test results of pCAS; ü  Split and false events were removed nearly without the loss of

meaningful events; ü  Falsely associated phases were reduced by more than 60%

while the total number of associated signals averagely increased by 30% for accepted events;

ü  Severely mislocated events were successfully reconstructed, significantly reducing the upper limit of location errors relative to results of interactive analyses;

Page 41: Toward High-Confidence and Full Automation of Seismic Data

Right down: Revised signal pickups and associations by pCAS. The modified epicenter is very close to that determined by interactive analyses.

Left up: Example of severely mislocated event detected by APS. Pn waves at many stations are not correctly picked up and associated probably due to its emergent property at these stations, leading to a location error of about 250km.

Fig.23 Illustration of improvement in signal pickups and associations completed by pCAS for severely mislocated events.

Page 42: Toward High-Confidence and Full Automation of Seismic Data

Summary

•  High-confidence and full automation of seismic data processing is crucial for effective CTBT monitoring;  

•  Conventional local-feature based approaches for automatic seismic data processing hardly can satisfy this requirement;

•  Novel processing technologies in better patterns and incorporating signals integral features as well as new methods for reasoning are needed;

Page 43: Toward High-Confidence and Full Automation of Seismic Data

•  With the application of these techniques along with state-of-art theories or technologies in the filed of signal and information processing, especially those about artificial intelligence, the full automation of seismic data processing can be expected.

Page 44: Toward High-Confidence and Full Automation of Seismic Data

Thank  you  for  your  aVen'on!