29
U. S. DEPARTMENT OF INTERIOR GEOLOGICAL SURVEY x/? A computer program for creating keyword indexes to textual data files by < D. V7. Moody Open-File Report VJashingtcn, D. C, June J972 2426*1 > 108

2426*1 - pubs.usgs.gov · PDF fileABSTRACT A keyword-in-context (KWIC) or out-of-context (KWOC) index is a convenient means of organizing information. This keyword index program can

Embed Size (px)

Citation preview

U. S. DEPARTMENT OF INTERIOR

GEOLOGICAL SURVEY

x/?

A computer program for creating keyword

indexes to textual data files

by

< D. V7. Moody

Open-File Report

VJashingtcn, D. C,

June J972

2426*1

>108

CONTENTS

Page

Abstract . . . . .'. , . . . . . . . . . v . . . . .. . . 1

Introduction ............. ..... . . . . . 2

Program Organization ... . . » . . . . . .* . . . . ... 4

Program Options . . . . . . . . . .. . . ... . . . . . 5* , '. ' i

Card Input . . . . . . . . . . . . . . .......10

Optical Scanner Input .. . . ...... .... . .11

Gipsy Input .;. . ...... . . . . . . . ... . . 12

Applications . . . . . ..... . . . . . . . . . . . . . 18

References Cited . . . . . .... . . . . . . . . -. . . . * 23

ILLUSTRATIONS

Page

Figure 1. Job control language cards needed to make aGIPSY search and create a data set for processing by the keyword program. . . . , 15

Figure 2* GIPSY control cards needed to search andcopy selected records- from Selected Water Resources Abstracts. . . . ...... 16

Figure 3. Job control language cards and program!;control cards needed to execute the keyword program. . . » ..... » . * . . . 17

Figure 4. Example of the bibliographic section of_..". -, , .; l > the index., , - " -- . . »,<.. »,r »,,», *-.. _; ».,; »- . ..,;;«:..-. .;.--*--'. :w - 19.

Figure 5. Example of the KWIC index. ^..I. « » * . . 20

Figure 6. < Example of the KWOC index. ..>... . ^ ^ 21

Figure 7. Example of the Index term index. .. » . . 22

Table 1, Summary of keyword program control cards . . 24

ABSTRACT

A keyword-in-context (KWIC) or out-of-context (KWOC)

index is a convenient means of organizing information.

This keyword index program can be used to create either

KWIC or KWOC indexes of bibliographic references or other

types of information punched on cards, typed on optical

scanner sheets, or retrieved from various Department of

Interior's data bases using the Generalized Information

Processing System (GIPSY). The index consists of a

"bibliographic" section and a keyword section based on

the permutation of. document titles, project titles,

environmental impact statement titles, maps, etc. or-.-' ; ' --.- - - - :". - . ' .. ' - . ' - «, -...'>:

lists of descriptors. The program can also create a

back-of-the-book index to documents from a list of

descriptors. By providing the user with a wide range

of input and output options, the program provides the

researcher, manager, or librarian with a means of main­

taining a list and index to documents in a small library,

reprint collection, or office file.

INTRODUCTION

Machine produced indexes have been used for more than

a decade to inexpensively organize information. The most

common types of indexes prepared by the computer are the

keywprd-in-context (KWIC) index and the keyword-out-of-

context (KWOC) index. Both indexes are produced in much

the same way. A string of characters, such as the title

of a bibliographic citation, is fragmented into substrings

(e.g., words) using break characters such as blanks, commas,

and periods. The words are then used as index terms.

Because many words occur so frequently as to have little

information value, provision is usually made to eliminate

them by providing the computer with a list of stop words,

i.e., words which are not to be used as index terms. The

remaining terms are referred to as keywords.

The KWIC index format lists the keywords alphabet­

icallyini the center of the index page. The remaining

;words in the title are printed to the left and right of

the keyword in their original order. If there is insuffic-

ient space on the line for the words tb the right of the

|Jteyjwordt in the title, the title is wrapped around and

Sprinted beginning at the far left of the same line. Thus,

the keyword appears in the cbntext in which it is used in

the title* Each line is identif ied^Jby" a number which

references the full citation of the title in the biblio­

graphic section of the index.

The KWOC index format presents each keyword alpha­

betically in .the left margin of the index followed by a

list of all the titles which contain that keyword. In

this case the keyword is out of context with the title.. > -. *

This report describes the use of a keyword index

program to produce KWIC and KWOC indexes of document

titles punched on cards, typed on optical scanner sheets,

or retrieved from various Department of the Interior's

data bases using the Generalized Information Processing

System (GIPSY). While bibliographic records are used

as illustrations, the program can also be used to produce

indexes of projects, maps, computer programs, geologic

names, index terms, or any other information contained

in a GIPSY data base or punched on cards in the keyword

index program format. Instructions are given for pre­

paring and maintaining a personal bibliographic file.

PROGRAM ORGANIZATION

This keyword index program is written in PL/1 for

the IBM 360/65 computer and is a modification of the Uni­

versity of California's Quick Index Compiler (IBM SHARE

Program Library 360D 03.3.002). The program is organized

into 3 parts. The first part of the program reads a set

of control cards, a list of stop words, and the biblio­

graphic data. The program then breaks the title or other

information into words, removes stop words, and prepares

two output files: the file named OUTPUT contains the

bibliographic section of the index; the file named INPUT

contains the keyword index. Note that indexing will occur

on words followed by a pound sign (§), blank, period,

comma, double apostrophe, left and right parentheses,

hyphen, serai-colon, colon, question mark, and quote mark.

-The second part of the program uses the cataloged

procedure DISKSORT to alphabetize the keywords in file

INPUT and add the sorted index to file OUTPUT. The third

part of the program prints the 'bibliographic and keyword

sections of the index contained in file OUTPUT.

PROGRAM OPTIONS

Control Cards

The following control words are punched on separate

cards beginning in column 1. Numbers are separated from

the control words by a blank. The order of the cards is

unimportant with the exception that 'STOP WORDS 1 must

precede the control card 'TITLES 1 . The 'TItLES' card is

the only card required. All others are optional.

PRINT BIBLIOGRAPHY - This card causes all citations

to be printed. Each citation is

separated by a double space. If this

option is used, the input data should be

sorted by reference number prior to

using the program.

LENGTH n - The length parameter sets the index line

length to n characters. The maximum

line length is 132 characters. If no

length is specified, n defaults to 80.

PAGESIZE n - This card specifies the number of lines

to be printed on each page of the index.

If the page size is not specified, n

defaults to 60.

COPIES n - This card specifies the number of copies

.of the index to be printed. If the

number of copies is not specified, n

defaults to ond.

STORAGE n - The storage card specifies the

maximum length of a character string

that can be processed by the program.

The default value of n, the number of

characters in the string, is 180*

If using punch cards (or optical

scanning sheets) as input, 180 char­

acters will be sufficient to handle »

document titles which fill 3 cards

(60 characters per card). Any excess

characters will be ignored unless the

STORAGE parameter is increased in

multiples of 60. If using input from

a GIPSY retrieval, the default value

will accommodate the first 180 char­

acters of the document title record.

STORAGE may be set to any value?

however, it must equal the length of

the "title" string put out by the

COPY command in the GIPSY retrieval

(see p. 13), In this case the record

length and the block size parameters

on the DATA DD card (job control

language card) will also have to be

adjusted (see p. 14Ki

COLUMN - The column command is used with the

INDEX TERMS to format a double column

list of terms. It should only be used

with this option.

MARGIN - The margin command disables the auto­

matic margin adjustments made for

line lengths less than 132 characters.

In general this card will not be used.

SCANNER INPUT - The program will expect an input data

set on a tape or disk pack^whose file

name is defined as DATA by the Job

Control Language. The input data

must be in the optical scanner format.

GIPSY INPUT - The program will expect an input data

set on a» tape or disk pack whose file

name is defined as DATA by the Job

Control Langau^e. The input data

must be in the GIPSY copy format.

(Default Input)- If the type of input is not specified,

the program assumes that the data is

punched on cards in the format required

by the program. Data cards immediately

follow the TITLES cards.

KWIC INDEX - The program will produce a keyword-in-

context index.

KWOC INDEX - The program will produce a keyword-out-r

of-context index using terms from the

citation titles.

KWOC TERMS - The program will produce an index in

KWOC format using the index terms.

Only the first 10 characters of each

2/ term .word is used.

INDEX TERMS c - The program will produce a back-of-

the-book type of index fr&m the list

of index terms associated with a cita­

tion. If "c u is a blank character

(col. 13), then every term will bet

indexed. If terra sets are separated

by a comma or other punctuation, then

place this character in column 13 and

the term set will be treated as a single

unit. A maximum of 25 characters from

the multiple-term descriptor is printed.

I/ If bibliography is stored on disk in card-imate format, use DISK INPUT on control card. . The program will expect an input^data set on a disk- pack whose file name is defined as DATA by the Job Control Language.

j2/ To use this option a list of descriptors (index terms) must be associated with each bibliographic citation.

8

STOP WORDS - Up to 500 words may be punched on sep­

arate cards following this one. Each

word begins in column 1. Only the

first 10 characters (including blanks)

of each word are used for comparison.

Columns 19-80 should be blank.

TITLES - The 'TITLES 1 card is the only required

control card. Any information punched

in columns 11-60 will be used as a

heading for each page of the bibliog-

. v raphy. section. . -_;-

INDEX AUTHORS - The authortsl names will be indexed

with the keywords in the i-ndex.

Card Input:

Card input is assumed by the program, if no other

input option is specified. Sets of one or more punched

cards must immediately follox* the 'TITLES 1 card. The

format of the punched cards is as follows:

Cols.

1-60 Text. The text on each card may be continued on

up to nine additional cards as indicated by the

card's sequence number. Because the text is

printed exactly as it appears on the card,

do not break text in the middle of a word at

th'e end of a line. The punctuation us^d in the

text depends upon the type of information con­

tained in the card (see types of cards listed

below).

61 Type of card,

'I 1 - Author card. Place author's last name

first followed by a comma, first name

or initials, and a semi-colon. Repeat

for each author to be listed. For

examples 'JOE, JOHN? SMITH, A. A.;'

2 1 - Title card. Place colon (or comma) at

end of title.

'3' - Citation. Use standard OSGS biblio­

graphic format.

«4» - Descriptor or index term card. Separate

10

terms by commas. For example: 'WATER,

CHEMISTRY, ATLANTIC OCEAN, WATER RESOURCES. 1

End the term list with a period.

'5 1 - Identifiers terms (or other information)

related to card 4. If card type 5 appears,

it will be combined with the information

on card type 4. Otherwise, do not include

card type 5.

Other types of cards may be added as needed

(card type 6 - 9),

62-63 Sequence number. Number successive cards of the

same type from 01-10.

64-73 * Reference code of document. This is - a 10 char­

acter alphanumeric code which appears next to

each reference in the bibliography and to the

right of each line in the index.

Optical Scanner Input

Bibliographic data can also be prepared on optical

scanning sheets using a 10-pitch IBM Selectric typewriter

equipped with an OCS typing element. The following format

is used:

Cols. '

1 Type of card:

f l f Author card

«2 f - Title card

3 1 - Citation card

11

4 1 - Descriptor term card

"5 1 - identifier terra card

Other types of cards may be added as needed

(card type 6-9). .

2-3 Sequence number. Number successive cards of the

same type from 01-10.

4-13 Reference code of document.'

14-73 Text. The text may be continued on up to nine

continuation cards.

Since the optical scanner reads up to 75 characters

on a line, there is space to make two corrections of one

letter each on any given line of input text. Otherwise,

the entire line must be deleted. After the citations are

typed they are read by the optical scanner and placed as

card images on magnetic tape. The tape can then be pro­

cessed by the keyword program.

GIPSY Input ' -

Records can also be used as input to the keyword

program. It is assumed that the user of this version of

the program has read the GIPSY application manual (Addison

and others, 1969) and is familiar with the GIPSY cataloged

procedures on the Geological Survey's IBM 360/65.

\I/ Copies of the GIPSY cataloged procedures may be Obtained

from Chief, Office of Systems Research and Development, Computer Center Division, Geological Survey^ Washington,D. C. . *

12

The GIPSY program is an integral part of the Geo­

logical Survey's Bibliographic System. Using GIPSY the

user can search the title, index terms, or abstracts of

bibliographic references for specific combinations of

words, word stems, or phrases. The selected records can

then be sorted by author or accession number and printed.

A key-word index of these records provides a convenient

means of scanning the retrieved documents for items of

interest.

The user prepares the key-word program input data

set by (1) making a GIPSY search of one of the GIPSY data

files, (2) sorting the retrieved records by reference

number, and (3) placing the records into a temporary disk

storage area on the computer system, e,g., TEMP01. The

temporary data set is then ready for processing by the

keyword program. The use of the TEMP01 disk pack is

described in the U.S.G.S. Computer User's Manual (p. 5.3),

and the necessary job control language instructions and

GIPSY commands are shown in figures 1 and 2.

The principal limitations to this use of the keyword

program are the length of the data elements copied, e.g.,

AUTHOR, TITLE, CIT, and DESCR from the bibliographic file,

and the number of records selected. Only the first 180l

characters of the author, title, reference, and inde^x

\terms fields are copied. If any of these fields are more\

than 180 characters, the remaining characters will be

dropped. The total length of each record is 193 characters.

13

This includes the "card type," sequence number, and the

10-character reference code. Sixty tracks of temporary

disk space is sufficient to store 740 references. If

larger character strings are copied, the STORAGE parameter

will need to be set accordingly. The length of record

parameter on the DATA DD card will also need to be changed

to reflect the increase in the length"of the record copied

on to the temporary disk. The new length will be equal to

STORAGE plus 13 characters, e.g., LRECL-STORAGE+13, Simi­

larly, the BLKSIZE parameter will need to be adjusted.

The overall effect of these changes will be to reduce the

number of records which can be stored on a disk track.

The-required job control language cards to execute

the keyword program are shown in figure 3.

14

//,... JOB (...)/*SETUP CCDnnn/DlSK -//JOBL1B DD DSM=SYS1.GIPL1B,DSN=SHR//STEP1 EXEC QUESTRAN,DNAME=dictionary,DVOL=CCDnnn,// RNAME=recordf i 1 e, RVOL=CCDnnn//QUESTRAN.SYSVIRKO DD UN I T=SYSDA, D I SP=( NEW, KEEP/ DELETE) ,

// SPACE=(TRK, (10 / 10) / RLSE) / DSM=Annnnnn.aaaB200.nafr.e //QUESTRAN.SYSRDR DD *

(Gipsy control cards go here.)

/*

Figure 1. ---Job Control Language cards needed to rr.ake a GIPSY search and create a data set for Input to the keyword prograr.

15

Col. 1

FORM WRSICSELECT A. DESCR B. § <..

LOGIC A AND B AND .. SORTNUMBER 10 COPYAUTHOR 180'101 1

NUMBER 10NEW RECORDTITLE 180'201'

NUMBER 10NEW RECORDCIT 180'301 1

NUMBER 10NEW RECORDDESCR 180'401'

NUMBER 10

Selected Wate'r Resources Abstracts

Figure 2. GIPSY control cards needed to search andcopy selected records from Selected Water Resources Abstracts.

16

//:... JOB (...)//STEPl EXEC PL1LFCLG,PA3M.PL1L='SIZE='007782U,ST,A,NT,X I , // T! N'E.PL1L = ( 0^30) ,71 ME. LKED-C 0,5), TIME. G0=(2, 10), // REGION. GO=193K //PLIL.SYSIf! DO * .

(Source deck KWIC prograir- part 1)

/*//GO. OUTPUT DD UH1 T=SYSDK, D I SP=( NEW, PASS, DELETE) ,II SPACE=(TRK,(60,20)),DCB=(RECFM=FB,LRECL=133,3LKSIZE=7182),

//GO. INPUT DD UNlT=SYSpK,DISP = (NEV<f , PASS, DELETE),

// DSN-&&INPT//GO. DATA DD UN 1 T=SYSDA, D I SP=SHR,// VOL=SER=TEMP01,DSN=Annnnnn'.aaaB 200* name//GO.SYSIN DD *

TITLES (j^ust be last control card

(If CARD Input option is used place cards here and "durcrcy 11 the "GO. DATA!1 DD card.)

/* . ',

//STEP2 EXEC PROC=niSCSORT,REGlON=100K,TIME=(0,30) //SORTIN DD DISP^OU^DELETE, DELETE), DSN=&&1NPT //SORTOUT DD D 1 SP = (MOD / PASS, DELETE) , DSN=&&OUTPT //SYS IN DD *

SORT FIELDS=(62 / 26 / CH,A),SIZE=E30000 (Use for KWIC Index.)

SORT FIELDS=(6,10,CH,A),SIZE=E30000 (Use for KWOC Index.)

SORT FIELDS=(21/ 25 / CH / A) / SIZE-^30000 (Use for INDEX TERMS.)

/* .

//STEP3 EXEC PL1LFCLG, TIME. PL1L = ( 0,10), TIME. Lked = ( / 5) , // TIME.GO=(,30) / REGION.GO=IOOK //PL1L.SYSIM DD *

(Source deck KWIC program- part 2)

/*//GO. PRINTER DD SYSOUT=A, DC3=( RECFM=UA/ CLKS I ZE=133)

nn ^^DD DUMMY/DCB-CRECFM.F.aLKSiZE-eO)""

Flcure 3. -- Job Control Language cards needed to execute the keyword procrarr.

17

APPLICATIONS

The card version of the keyword program 'can be- used

to develop an index to a personal reprint collection or a

small office library. Each item is given a reference code

as it is received which identifies the item's owner (or

location) and perhaps the date it was received. Thus, if

more than one collection were to be merged for indexing

purposes, the reference code will identify the report's

location. The following reference code (columns 64-73 on

card input) might be used to identify and locate documents

Cols.

64-66 Document owner's initials (3 characters)

67 Hyphen

68-69 Year reprint received or year of publication

70 Hyphen

71-73 Accession number of reprint within year

For example, JDD-70 001 represents the first reprint

received by John D. Doe in 1970. When submitting cards

for processing, the citations must be in accession number

order. By placing each private collection in alphabetical

order by the owner's" initials, more than one collection

can be merged into one index^. Thus, as research personnel*

move from office to office they can take their index with

them and.merge with existing collections at their next- ' \

assignment. Figures 4 to 7 are examples of indexes from

a private reprint collection. ' *

' 18

l/n

n f

v~

wv*

w..

»..

..._

...

__

UN

Ufc

RG

RU

UN

U

WA

ST

E

DIS

PO

SA

L -

CO

NC

EP

TS

A

NU

E

NV

IRO

NM

EN

TA

L

SC

IEN

CE

A

NU

T

EC

HN

ULU

GY

, V

. 4

, N

O.

P.

64

2-

*fl

. .

<

,/,?

f 1970'.

. »

' '

!

£"

"

GK

OU

NU

WA

TE

R,

PO

LL

UT

ION

* W

AS

TE

D

ISP

OS

AL

* O

EE

P>J

ELL

D

ISP

OS

AL.

G

'*

OW

M-7

0-0

02

O

UO

IUN

S,

0.

A.{

R

AG

LA

NO

, P

. C

.;

JOH

NS

ON

, J.

D.;

ro

WA

ltK

-CLA

Y

INT

ER

AC

TIO

NS

IN

N

OR

TH

C

AR

OLIN

A'S

P

AM

LIC

U

ES

TU

AR

Y:

1*

, tN

VlK

UN

Mb

NT

AL

S

CIt

NC

U

AN

O1

TG

CM

NO

LOG

Y,

V.

/*,

NO

. 9

t P

. 7

43

-'*

. I

7'i8

t 1V

70.

tSTUAKYi SGOIMENT, WATCR QUALITY, SALINITY, SEDIMENTATION,

w

IUN

bXCHANGE, SUSPENDED SEDIMENT, NORTH CAROLINA.

x '

g

OWH-70-003

KLEIN, 0.

H.

; GOLUHCRG, U.

D.;

,2

...

MtRC

UftY

IN

THE MARINE ENVIRONMENT:

M

ENVIRONMENTAL SCIENCE ANU TECHNOLOGY, V. 4, NO

. 9, P. 765-

»

. 76«J,

l»)70.

MEis

CUKY

, MARINE ORGANISMS, SEDIMENTS, CALIFORNIA, POLLUTION.

HiOWM-70-004

KISIHL, C.

C.

; DURUM, W. H.

; LANGOE1N, W. 0,

;J

DATA COLLECTION SYSTEMS FUR WATER QUALITY SURVEILLANCE*

g

U.S.

Gb

'OL.

SURVEY, UNPUOLISHED MANUSCRIPT, 21 P.

, 1968.

WATER UUALITY, UATA COLLECTION, DATA PROGRAMS, SAMPLING

V

TIICURY, NE

TV/O

RK DESIGN.

Q^ H

OWM-70-005

K1SICL, C. C.

; H.

MATHEMATICAL METHODOLOGY IN HYDROLOGY:

O IN

TbKN

AT.

SEMI

NAR

FOR

HYDROLOGY, PROC., JU

LY 19

69,

URBA

NA,

^

' ILLINOIS,

P. 36^-390.

P EDUCATION; MO

DELS

, MATHEMATICAL M

ODELS, HYOKOLOGIC M

O'OE

LS.

tj J

OWM-70-006

MSIDL, C.

C.

; %

O

THR US

E OF WATER QUALITY PREDICTIONS AS

AN AID TO

(A

' (D n rt H- o 9 o H» ft ar A H«

3 a (D

X

C

O in W.

fi) M

O O H O a (D X

C SIMULATION LANGUAGES WITH

CRUSAOK ON

EAM SE

OlMf-N/

VARIATION OF CATION EXCHANGE

WATE

R-CL

AY INTERACTIONS IN N

ORTH

ICLF SUE

IN STRFAM S

F.OI

MEN/

VA

RIAT

ION

OF

N A

SHA/

DIGITAL-COMPUTER APPLICATIONS

IN

CO ESTUARY:

WATE

R-

LLAN

CF:

"^

DATA

TUDIES 0

« SU

RFAC

E WATER

IN A

SMA/

DIGITAL"

UNDE

RGRO

UND

WASTF. D

ISPOSAL

-

URVE

ILLA

NCE:

UALI

TY S

TUDIES O

F SU

RFAC

E WA

TER

IN A

SMA

/ E

TO A

BIO

- SI

MULA

TION

- A

USE

R'S

IMPRRS/

SCIE

NCES

TO

POLI

CY S

CIENCES- M

ATERIALS F

ORUNDERGROUND WA

STE

MERCURY- IN

THE MARINE

Y INTERACTIONS IN NORTH CAROLINA'S PAMLICO

SAND RIBBONS OF

2E

IN STREAM SEOIMEN/

VARIATION OF

CATION

MATHEMATICAL METHODOLOGY IN

ING:

SOME NORMATIVE

REFERENCE TO A

BIO-.SIMULATION -

A USER'S

TUARY:

WATER-CLAY

tuu-

>

irir

r\u

J.?

* w

y

v ri

*

ON - A USER'S IMPRES/

DISCRETE SIMULATION'

ERIALS FO

R DISCUSSIONS

FROM

OF WATER QUALITY PREDICTIONS AS

AN

AID TO

MERCURY

IN' TH

E DM

MANAGEMENT SCIENCES TO

POLICY SCIENCES-

MATHEMATICAL

UNDERGROUND WASTE DISPOSAL - CONCEPTS AND

POLICYMAKING:

SOME

WATER-CLAY INTERACTIONS IN

ATER-CLAY INTERACTIONS IN NORTH CAROLINA'S

OF CATION F.

XCHA

NGE

CAPACITY AND RATE WITH

FRUM MANAGEMENT SCIENCES TO

CREATIVE IMPLICATIONS OF A

SYSTEMS VIEW O?

THt

IJSF: OF WATER UUALITY

BOTTLES:

CAPACITY AND RATE WITH PARTICLE SUE IN STR

CAROLINA'S PAMLICO ESTUARY J

CASWCLLt C.

A.

CATION EXCHANGE CAPACITY AND RATE WITH PA

RTCHEMICAL-QUALITY STUDIES OF

SURFACE WATER

ICLAY INTERACTIONS IN NORTH CAROLINA'S PAMLI

COLLECTION SYSTEMS FOR WATER QUALITY SURVEI

CUMPUTFR APPLICATIONS IN CH

F.MIC AL-QUAL I.TY

SCONCEPTS AND MISCONCEPTIONS:

CRUSADE ON

ROTTLES:

DATA

COLLECTION SYSTEMS FO

R WATER QUALITY S

DIGITAL-COMPUTER APPLICATIONS IN CHEMICAL-Q

DISCRETE SIMULATION LANGUAGES WITH REFERENC

DISCUSSION:

FROM MANAGEMENT

DISPOSAL -

CONCEPTS AND MISCONCEPTIONS:

DUrWINS't

D. A.

OKU

ft r YEHEZKEL

ORO*

t YEHCZKEL

DURUM t

W. H.

ENVIRONMENT:

ESTUARY:

* WA

TER-

CLA

EUROPEAN TIDAL SE

AS«

EXCHANGE CAPACITY AND RATE WI

TH' PARTICLE SI

GHLDTERG,

E. 0-

'HY

DROL

OGY:

IMPLICATIONS OF

A

SYSTEMS VIEW OF

POLICYMAK

IMPRESS IONS:

/TE

SIMULATION LANGUAGES WITH

INTFRACTIONS W NORTH CAROLINA'S PAMLICO ES

JOHNS ON t

J. D.

KENNEDY, V.

C.

KENYUN

t'N.H.

KISIELr C. C.

KISIELt C. C.

KISICL, C. Ca

KLEIN» 0. H.

LANGQEINt W. 0.

LANGUAGES WITH REFERENCE T3

A

010- SIMULATI

MALCOM, R. L.

MANAGEMENT SCIENCES TO

POLICY SCIENCES- MAT

MANG

EMEN

T:

THE

USE

MARINE ENVIRONMENT:

MATERIALS FOR DISCUSSION:

FRMATHEMATICAL METHODOLOGY IN. HYDROLOGY:

MERCURY IN THE MARINE ENVIRONMENT:

METHODOLOGY IN HYDROLOGY:

MISCONCEPTIONS:

NIFOnRUOA, A. W.

NHRMATIVC IMPLICATIONS OF A

SYSTEMS VIEW OF

NflRTH CAROLINA'S PAMLICO ESTUARY:

PAMLICO

ESTUARY:

. w

PARTICLE SIZE IN STREAM SEDIMENT:

/RIATION

POLICY SCIENCES- MATERIALS FO

R DISCUSSION:

PULICYMAKING:

, '

SOME N

PREDICTIONS AS AN AID TO

MANGEMENT:

DWM-70-010

OWM-70-014

OWX- 70-002

OWM-70-OCI

DWM- 70-01 4

OWM-70-OC7

DWM-70-002

DWM-70-OC4

DWM- 70-007

DWM- 70-00?.

DWM-7C-010

DWM-70-OC^

DWM- 70-007

OWX-70-013

DWM-70-C12

DWM- 7 0-0 01

DWM-70-002

DWM- 70-0 12'

OWM-7C-011

DWM-7C-OC4

DWK-7C-OC3

DWM-70-002

DWM- 70-003

DWM-7 0-0 I 4

OWM-70-003

OWM-70-005

DWM- 7 0-0 1

1DWM-70-013

DWM-70-002

DWM-70-002

' OWM-70-014

DWM-7 0-008

DWX-70-CC6

DWM- 70-004

DWM-70-005

DWX-70-003

DWM-70-OC<V

OWM-70-013

DWM-7 0-0 14

DWM-7 0-0 12

DWV.-70-OC6

DWM- 7 0-0 03

DWM-70-012

OWM- 70-005

DWM- 70-003

DWM-70-005

DWM-70-OOl

DWX-70-P-C9

DWM-7 0-0 11

OWM-70-002

DWM-7C-002

DWM-7 0-0 14

DWM- 7 0-0 12

DWV-70-Oll

owv-

7C-iro(>

CAPA

CITY

CHP.

V.IC

ALCL

AYCC

'LL'T

.TIO

N

CO

';C

li»

TS

OA

TAD

IGIT

AL

DIV

.KF

T5

fJV

!'->n

N1.'

r:N

£ST

!iA

"-Y

£

]:'.

) :

; \?

i E

XC

'«/.f

.%C

H

Y>

"?L

MG

Y

I ' '

PL I

CA

T 1

0

«"r»

^!:

S5I

'J?

J ir

.'Tt'J

-'.'.r

.T 1

0

M.V.

-GI XP.NT

VAKI

NF

"ATI

: RIA

LSJ-

'AT

H'-'

-'AT

1C

*v

c'..r

-;:>

.Y

N!3

UX

AT

IVE

N

tJK

TH

'- A

y.T

;CL

«:

PK

fe'J

ICT

ION

« O

'JA

LIT

Y^

t>

AT

fi

P r,

^

»L

Fr.

SJ.r

\'C

(

SA

M)

sc

ienc

esS

CIS

'CC

'iSS:

-AS

.S

r/J

lV5

M

VARI

ATIO

N or C

ATION

EXCHANGE:

CAPACITY A

ND K

ATE

WITH

PARTICLE

size IN s

WATf-H-CLAV

INTr

RACT

IONS

IN

NO«

»TM

CARO

LINA

'S PAXLICO

ESTU

ARY:

VACATION O

F CATION F

XCHA

NGC

CAPACITY A

ND M

ATT: W

ITH

PARTICLE

DIGITAL-COMPUTER APPLICATIONS

IN C

HEMI

CAL -DUALITY STUDIES

OFXA

TI R-C

LAY

INTERACTIONS IN

NORTH C

AROL

INA'

S PA

^LIC

O FSTUARY:

HATA

COL

LECT

ION

SYST

EMS

FOR

WATFR

QUALITY

SURVf

I LLANCT

:DI

GITA

L-CO

MPUT

ER AP

PLIC

ATIO

NS IN

CHF. MICAL-OUAL

IT Y

STUD

IES

OFUN!)

rr*C,KU'JNO

WAST

E DISPOSAL.- C

ONCE

PTS

AND

MISCONCEPTIONS:

CRUSAJJF ON

BO

TTLE

S:DA

TA C

OLLC

CTIO

N SYSTEMS

FOR

WATER

QUALITY

SURV

C I LL AN

CF :

DIGI

TAL-

COMP

UTER

APPLICATIONS

IN C

HFMI

CAL-

QUAL

I TY STUDIES

OFDI

SCRE

TE SI

VULATION LA

NGUA

GES

WITH R

EFER

ENCE

TO

A

UIQ-

SI

MUL

F»OM

MANAGEMENT

SCIt

NCFS

TO

POLI

CY SCIPNCES-

MATERIALS

FOR

DUN

MfTR

GHUU

NO W

ASTE

DIS

POSA

L -

CONCEPTS A

ND M

I SCCNCEPTI ONS

:vf

JCU^

Y IN TH

E MA

KING

ENVIRONMENT:

WATT

R-CL

AY

I NTrRAC

TI ONS

IN N

ORTH C

AROLINA'S

PAVL

ICO

FSTUARY:

SAND

RI

IWU

NS OF

EUROPEAN TIDAL

SFAS

: '

VA".!ATIf]N

OF CATION EXCHANGE CA

PACI

TY AND

RATE

WIT

H PARTICLE. SUE

MATH

EMAT

ICAL

ME

THOO

OLOG

Y IN

HYOP.OLOGY:

Sr^f

: NORMATIVE. I

MPLICATIONS

OF A

SYSTEMS

VIEW OF

PO

L IC

YXAK

I NG

:OISC^ETr SI

MULA

TION

LAN

GUAG

ES WITH R

EFER

ENCE

TO A

?.in-

SIMU

LWA

T«:R-f.LAY INTERACTIONS IN

NORTH C

AROL

INA'

S PA

MLIC

O ESTUARY:

OISC

y.ET

C SIMULATION LANGUAGES

WITH R

EFERENCE T

O A

fUC-

SI

MUL

FOU

V,

v.A

NA

GE

MF

NT

S

CIE

NC

ES

TO

P

OL

ICY

S

CIE

NC

ES

- M

AT

ER

IAL

S

FOR

D

rue

USE

OF WATER

QUALITY

PRED

ICTI

ONS

AS A

N AI

D TO

MANGEMENT:

vfri

<cu*

Y ijj

THE

MARINE ENVIRONMENT:

FROM MANAGEMENT SCIENCES TO POLICY SCIENCES- MATERIALS FOR D

MATHEMATICAL METHOHQLOGY IN HYDROLOGY:

w^HCU'.Y IN THE MARINE EN

VATH

EVAT

ICAL

M

5THO

')OLO

GY

IN

UNOC

KCIO

UNO

WAS

TE

DISP

OSA

L Sn

.XC

NUH.

MAT

IVE

IMPL

ICAT

ION

SWATUR-CLAY INTERACTIONS IN

HYDR

OLOG

Y:* CO

NCEP

TS AN

D MISCONCEPTIONS:

OF A

SYST

EMS

VIEW O

F POL

I CYM

AK ING:

NOKTH CAROLINA'S PA^LICO ESTUARY:

WATCR-CLAY INTFRACTIONS IN NORTH CAROLINA'S PAVLICC ESTUARY:

VA!>IATIUN OF CATION EXCHANGE CAPACITY AND RATE WITH PARTICLE

F«(1M

MANAGEMENT SCIENCES TO POLICY SCIENCES- MATERIALS FOR D

S'Jv

l: MJRXATIVE IMPLICATIONS OF

A SYSTEMS VIEW OF POL I

CYMAK ING:

THE U3E OF WATER QUALITY PREDICTIONS AS AN AID TO MANGEMENT:

PRELlKINArtY STUDY OF TRANSVEHSE

L\A".S:

DATA COLLFCTION SYSTEMS FO

P. WATE5.

OUALITY SURVEILLANCE:

THF USE

'JF WATFR OUALITY PREDICTIONS AS AN AIO TO MANGFMCNT:

DIGITAL-COMPUTER APPLICATIONS

IN CHEMICAL-QUALITY ST'JOIF.S

OF

VARIATION OF CATION EXCHANGE CAPACITY AND RATE WITH PARTICLE

DISCRETE SIMULATION LANGUAGHS WITH REFERENCE TO A

IUD- SIMUL

SANO Kl-UJONS OF EURDPFAN TIDAL SEAS:

TIDAL SbAS:

TO POLICY SCIENCES- MATERIALS FOR D

TO POLICY SCIENCES- MATERIALS FOR D

TIDAL SEAS:

SA-V

O F»UM

SAMf

)

RlilBONS OF

MANAGEMENT

MANAGEMENT

RIinONS OF

EUROPEAN

SCIENCES

SCIENCES

EUR'

JPF.

Ar.'

VARIATION OF CATION EXCHANGE CAPACITY AND RATE WITH

SU

E

IN

SSU

RFA

CE

W

SUR

FAC

F W

SUR

FAC

E W

TIO

N -

ASC

USS

ION

:

SU

E

IN

S

* TIO

N -

A

TIO

N -

ASC

USS

ION

:

SCU

SSIO

N:

;: SIZ

E

IN

SSC

USS

ION

: .

SUR

FAC

E W

SIZ

E

IN

S,T

ION

-

A

SCU

SSIO

N:

SCU

SSIO

N:

SIZ

E

IN

S

DW

M-7

0-00

2D

WM

-70-

014

DW

M-7

0-00

7OW

M- 7

0-0

02

DW

M-7

0-O

O't

OW

V-7

0-0

07

OW

M- 7

0 -0

01

nW

M-7

0-0

10

nwM

- 70-0

04

DW

M-7

0-00

7O

WM

-70-

013

DW

M- 7

0-0

12

DW

M- 7

0-0

0 1

OW

M- 70-0

03

DW

M-7

0-00

2

DW

M-7

0-01

4O

WM

-70-

005

DW

M- 7

0-0

11

OW

M- 7

'J-0

13

DW

M-7

0-00

2 D

WM

- 70

-0 1

3D

WM

- 7 0

-6 1

2D

WM

-70-

006

OW

.X- 70-0

03

DW

M-7

0-01

2O

W.M

-70-

005

DW

M- 70-0

0 3

DW

M- 7

0-0

09

OW

.V-7

0-O

Ol

DW

M-7

0-01

1D

WM

-70-

002

DW

V- 70-0

02

DW

V-7

0-01

AO

WV

-70-0

12

DW

M-7

0-01

1D

WM

-70-

006

DW

M- 70-0

09

UW

M- 7

0-0

04

OW

M- 7

0-0

06

DW

M-7

0-00

7D

WM

- 70-0

1'*

OW

M-7

0-01

3D

WM

- 70-0

08

DW

M-7

0-O

OS

OW

M- 7

0-0

i?

DW

V.-

70-0

12D

WM

-70-

OO

BO

WM

- 70

-0 1

4

CA

LIF

OR

NIA

D

Wti-

70-0

03

CA

TIO

N

tXC

HA

NG

G

' D

WM

-70-

01A

CO

AS

TAL

GbO

MQ

KP

HO

LOG

V

OV

?M-7

07p0

9'C

OA

ST

AL

S

tOlM

tNT

AU

ON

D

WMr

70-0

09 ,

CO

MP

UT

ER

S

' '

DW

M-7

0-«

00

7C

UM

f>U

TF.f<

$ -O

WM

-70-0

13

CO

NT

INE

NT

AL

S

HE

LF

D

WM

-70

-00

8D

AT

A

CO

LLE

CT

ION

O

WM

-fO

-00

'»D

AT

A

PR

OG

RA

MS

.

OW

M-7

0-0

04

hj

OttP

WcLL

DIS

PO

SA

L

OW

M-7

0-0

01

H-

ED

UC

AT

ION

O

WM

-70

-00

5*2

E

NG

LIS

H

CH

AN

NtL

O

UM

-70

-00

8h<

'

EN

VIR

ON

ME

NT

O

WM

-70

-01

0

ft

LN

VU

UN

ME

NT

AL

MO

VE

ME

NT

'

OW

M-7

0-0

10

. rS

T'J

AK

Y

' .

OW

M-7

0-0

02

.

F1

NG

EK

R

AK

S

'

DW

M-7

0i

009

FL

OR

IDA

'

' '

OW

M-7

0i

OO

?F

OR

EC

AS

TIN

G

' D

wM

-70

-00

6'

??

GK

CU

NO

WA

TE

R

'

' D

WM

-7C

J-O

Ol

0,

'HY

J^O

LO

GIC

M

OD

ELS

O

WM

-70

-00

53

I0\

EX

CH

AN

GE

O

WM

-70

-C0

2^

MA

NA

GE

ME

NT

'

DW

M-7

Q-0

06

O

' M

AN

AG

EM

EN

T

SC

IEN

CE

O

WM

-70

-01

2M

AK

ING

O

RG

AN

ISM

S

OW

M-7

0-0

03

O

' M

AK

ING

S

ED

IME

NT

AT

ION

'

DW

M-7

0-C

Ott

MA

RY

LAN

D

DW

M-7

C-G

10

ft

MA

TH

EM

AT

ICA

L

MO

DE

LS

DW

M-7

0-0

13

V

MA

TH

EM

AT

ICA

L

MO

DE

LS

OW

M-7

0-0

06

0

MA

TH

EM

AT

ICA

L

MO

DE

LS

OW

M-7

0-0

05

H

.' M

AT

HE

MA

TIC

AL

M

OD

ELS

.

DW

M-7

0-0

07

3

\

ME

KC

UA

Y

OW

M-7

0-0

03

5*

»\

MO

DE

LS

. O

WM

-70-0

0 b

NE

TW

UK

K

DE

SIG

N

DW

M-7

0-0

0<

»

NO

HM

AT

IVG

G

EN

ER

AL

SY

ST

EM

S

OW

M-7

0-O

ll£

\N

OR

TH

C

AR

OLIN

A

DW

M-7

0-0

02

2

\ \P

OL

ICY

A

NA

LY

SIS

O

WM

-70

-Oll

g

PO

LIC

Y

AN

AL

YS

IS

OW

M-7

0-0

12

PO

LIC

Y

SC

IEN

CE

O

WM

-70

-01

2g"

PO

LIC

Y

SC

IEN

CE

D

WM

-70-0

11

0,

PO

LIC

YM

AK

ING

D

WM

r70

-01

1fl>

P

CL

LU

TIU

N

DW

M-7

0-0

03

X

PO

LLU

TIO

N

' O

WM

-70-C

01

RE

CY

CL

ING

W

AS

Tt

' D

W.X

-70

-01

0K

CG

KE

SS

ION

A

NA

LY

SIS

O

V/M

-70

-00

7K

ET

UK

MA

bLE

B

OT

TL5S

D

WM

-7C

-010

SA

LIN

ITY

O

WM

-70

-00

2S

AyP

LIJ

JG

TH

EO

RY

.

OV

.'M-7

0-p

O*

SA

'-i'J

D

EP

OS

ITS

O

WM

-70

-CO

3S

AN

i) K

lOG

kS

' D

WM

-70-C

C9

SA

M)

WA

VE

S

' D

WM

-70-C

08

StO

1M

bN

T

OW

M-7

0-0

02

Although the bibliographic example just given will

probably be the- most frequent application of the keyword

index program, other potential applications should not be

overlooked. Recall that up to 9 types of "cards" (records)

can be identified with one reference code. These cards in

the bibliographic example were called:

Card type

1 Author card

2 Title card

3 Citation card

4 Descriptor card

5 - Identifier card

6-7 Additional card types

The information on all card types is printed in the

bibliographic section of the index. Information on card

type 1 (INDEX AUTHORS) and on card types 4 and 5 (INDEX

TERMS) may be extracted in a back-of-the-book type of

index. Information on card type 2 may be permuted in a

keyword-in-context index (KWIC INDEX) or a keyword-out-of-

context index (KWOC INDEX); in both cases information on

card type 1 may be merged into the index (INDEX AUTHORS).

Similarly f- information on card types 4 and 5 may also be

permuted in KWIC (KWIC TERMS) or KWOC (KWOC TERMS) indexes;

again information on card type 1 may be merged into the

index (INDEX AUTHORS). These commands are summarized in

table 1. .

23

Tabl

e 1. S

umma

ry o

f ke

ywor

d pr

ogra

m co

ntro

l ca

rds.

X

- re

quir

ed c

ard;

0 -

optional

card

.

Keyw

ord pr

ogra

m control

cards

PRINT

BIBLIOGRAPHY X

. __ 0 0 0 0 0 0

XtfDEX

AUTHORS

..7.. X 0 0 0 0 0

ti H

O 8»

..,__... X - - - -

1 :

8 M 8 .:_ - X - - -

o CO

&H Q H - - * - X - -

i

o -B

, rr - -_ X -

p o -j - - - - X

COLUMN X - X - -

.

Sort pro

gram

control

cards

p fc o u> W

0) p s s H X - X - -

t o to O X - X -

t o (0 o

o - X

' - X

ft

<u -p (0 p h

O Ul ft

H & X - - - -

\Description

of o

utpu

t

1 '

Bibliographic

sect

ion

of rep

ort

only

- a

ll car

d ty

pes

printed

Doub

le c

olumn

inde

x of i

nfor

mati

on o

n ca

rd typ

e 1

KWIC

ind

ex of

card

typ

e 2

KWOC

index of

card

typ

e 2

Double c

olumn

inde

x of i

nfor

mati

on o

n card t

ypes

U a

nd 5

KWIC i

ndex o

f ca

rd typ

es U

and

5

KWOC

ind

ex -

inf

orma

tion

On

card t

ypes U

and 5

app

ears

in

left-

hand

margin; i

nformation o

n card t

ype

2 appears

on l

ine.

TITLES c

ontr

ol c

ard

required,

for

all

runs

and

must

appear l

ast.

STOP

WORDS c

ontr

ol car

d an

d Stop w

ords l

ist, if use

d mu

st i

mmediately pre

cede

TIT

LES

card

.

Having reviewed the effects of the commands, now

consider a research project file stored on GIPSY. Using

the COPY command, an index might be created to the file

making the following card type assignments:

Card type

1 Project leader

2 Project title

3 Project location (city and state)

4 Descriptors

6 Project summary

The possible types of indexes were discussed above.

Other potential applications that come to mind are map

catalogs, lists of geologic and geographic names, mineral

and fossil inventories, certain types of personnel rosters,

and indexes to publications (see the cumulative index to

the U. S. Geological Survey's Sysnotes),

25

STORAGE REQUIREMENTS AND RUN TIME

The core storage requirements of the keyword program

depend upon the line length of the index to be produced.

The REGION parameter on the KWIC program-part 1 execute

card should be specified as follows:

LENGTH n REGION (or REGION,GO) parameter

LENGTH 80 (default) . REGION-114K

LENGTH 120 REGION-178K

LENGTH 132 REGION=198K

Run times depend upon the number of documents to be

indexed and the length of the card type 2, card types 4,

and cards type 5 records as specified by the STORAGE control

card (STORAGE 180 is the default value). If STORAGE 180, a

KWIC index to 120 bibliographic references requires the

following run times:

" 1 / Program step Run time (seconds)

Step 1 KWIC program-part 1 37Step 2 Sort program - Balsort 3Step 3 KWIC program - part 2 7

Run times for larger collection of documents or larger

storage values must be adjusted upwards*

I/ Run times do not include times required to compile and link edit program source decks. '

26

REFERENCES CITED

, C. H.; Coney, M. D., Jones, M. A., Shields, R. W. r

and Sweeney, J. W., 1969, General information processing

system application description: Univ. Oklahoma Infor­

mation Science Series, mono, no, 4, 126 p.

U. S. Geological Survey, 1970, U.S.G.S. Computer user's

manual: Washington, D. C., U. S. Dept. of the Interior,

Geological Survey, Computer Center Division.

27