Upload
dangngoc
View
238
Download
0
Embed Size (px)
Citation preview
U. S. DEPARTMENT OF INTERIOR
GEOLOGICAL SURVEY
x/?
A computer program for creating keyword
indexes to textual data files
by
< D. V7. Moody
Open-File Report
VJashingtcn, D. C,
June J972
2426*1
>108
CONTENTS
Page
Abstract . . . . .'. , . . . . . . . . . v . . . . .. . . 1
Introduction ............. ..... . . . . . 2
Program Organization ... . . » . . . . . .* . . . . ... 4
Program Options . . . . . . . . . .. . . ... . . . . . 5* , '. ' i
Card Input . . . . . . . . . . . . . . .......10
Optical Scanner Input .. . . ...... .... . .11
Gipsy Input .;. . ...... . . . . . . . ... . . 12
Applications . . . . . ..... . . . . . . . . . . . . . 18
References Cited . . . . . .... . . . . . . . . -. . . . * 23
ILLUSTRATIONS
Page
Figure 1. Job control language cards needed to make aGIPSY search and create a data set for processing by the keyword program. . . . , 15
Figure 2* GIPSY control cards needed to search andcopy selected records- from Selected Water Resources Abstracts. . . . ...... 16
Figure 3. Job control language cards and program!;control cards needed to execute the keyword program. . . » ..... » . * . . . 17
Figure 4. Example of the bibliographic section of_..". -, , .; l > the index., , - " -- . . »,<.. »,r »,,», *-.. _; ».,; »- . ..,;;«:..-. .;.--*--'. :w - 19.
Figure 5. Example of the KWIC index. ^..I. « » * . . 20
Figure 6. < Example of the KWOC index. ..>... . ^ ^ 21
Figure 7. Example of the Index term index. .. » . . 22
Table 1, Summary of keyword program control cards . . 24
ABSTRACT
A keyword-in-context (KWIC) or out-of-context (KWOC)
index is a convenient means of organizing information.
This keyword index program can be used to create either
KWIC or KWOC indexes of bibliographic references or other
types of information punched on cards, typed on optical
scanner sheets, or retrieved from various Department of
Interior's data bases using the Generalized Information
Processing System (GIPSY). The index consists of a
"bibliographic" section and a keyword section based on
the permutation of. document titles, project titles,
environmental impact statement titles, maps, etc. or-.-' ; ' --.- - - - :". - . ' .. ' - . ' - «, -...'>:
lists of descriptors. The program can also create a
back-of-the-book index to documents from a list of
descriptors. By providing the user with a wide range
of input and output options, the program provides the
researcher, manager, or librarian with a means of main
taining a list and index to documents in a small library,
reprint collection, or office file.
INTRODUCTION
Machine produced indexes have been used for more than
a decade to inexpensively organize information. The most
common types of indexes prepared by the computer are the
keywprd-in-context (KWIC) index and the keyword-out-of-
context (KWOC) index. Both indexes are produced in much
the same way. A string of characters, such as the title
of a bibliographic citation, is fragmented into substrings
(e.g., words) using break characters such as blanks, commas,
and periods. The words are then used as index terms.
Because many words occur so frequently as to have little
information value, provision is usually made to eliminate
them by providing the computer with a list of stop words,
i.e., words which are not to be used as index terms. The
remaining terms are referred to as keywords.
The KWIC index format lists the keywords alphabet
icallyini the center of the index page. The remaining
;words in the title are printed to the left and right of
the keyword in their original order. If there is insuffic-
ient space on the line for the words tb the right of the
|Jteyjwordt in the title, the title is wrapped around and
Sprinted beginning at the far left of the same line. Thus,
the keyword appears in the cbntext in which it is used in
the title* Each line is identif ied^Jby" a number which
references the full citation of the title in the biblio
graphic section of the index.
The KWOC index format presents each keyword alpha
betically in .the left margin of the index followed by a
list of all the titles which contain that keyword. In
this case the keyword is out of context with the title.. > -. *
This report describes the use of a keyword index
program to produce KWIC and KWOC indexes of document
titles punched on cards, typed on optical scanner sheets,
or retrieved from various Department of the Interior's
data bases using the Generalized Information Processing
System (GIPSY). While bibliographic records are used
as illustrations, the program can also be used to produce
indexes of projects, maps, computer programs, geologic
names, index terms, or any other information contained
in a GIPSY data base or punched on cards in the keyword
index program format. Instructions are given for pre
paring and maintaining a personal bibliographic file.
PROGRAM ORGANIZATION
This keyword index program is written in PL/1 for
the IBM 360/65 computer and is a modification of the Uni
versity of California's Quick Index Compiler (IBM SHARE
Program Library 360D 03.3.002). The program is organized
into 3 parts. The first part of the program reads a set
of control cards, a list of stop words, and the biblio
graphic data. The program then breaks the title or other
information into words, removes stop words, and prepares
two output files: the file named OUTPUT contains the
bibliographic section of the index; the file named INPUT
contains the keyword index. Note that indexing will occur
on words followed by a pound sign (§), blank, period,
comma, double apostrophe, left and right parentheses,
hyphen, serai-colon, colon, question mark, and quote mark.
-The second part of the program uses the cataloged
procedure DISKSORT to alphabetize the keywords in file
INPUT and add the sorted index to file OUTPUT. The third
part of the program prints the 'bibliographic and keyword
sections of the index contained in file OUTPUT.
PROGRAM OPTIONS
Control Cards
The following control words are punched on separate
cards beginning in column 1. Numbers are separated from
the control words by a blank. The order of the cards is
unimportant with the exception that 'STOP WORDS 1 must
precede the control card 'TITLES 1 . The 'TItLES' card is
the only card required. All others are optional.
PRINT BIBLIOGRAPHY - This card causes all citations
to be printed. Each citation is
separated by a double space. If this
option is used, the input data should be
sorted by reference number prior to
using the program.
LENGTH n - The length parameter sets the index line
length to n characters. The maximum
line length is 132 characters. If no
length is specified, n defaults to 80.
PAGESIZE n - This card specifies the number of lines
to be printed on each page of the index.
If the page size is not specified, n
defaults to 60.
COPIES n - This card specifies the number of copies
.of the index to be printed. If the
number of copies is not specified, n
defaults to ond.
STORAGE n - The storage card specifies the
maximum length of a character string
that can be processed by the program.
The default value of n, the number of
characters in the string, is 180*
If using punch cards (or optical
scanning sheets) as input, 180 char
acters will be sufficient to handle »
document titles which fill 3 cards
(60 characters per card). Any excess
characters will be ignored unless the
STORAGE parameter is increased in
multiples of 60. If using input from
a GIPSY retrieval, the default value
will accommodate the first 180 char
acters of the document title record.
STORAGE may be set to any value?
however, it must equal the length of
the "title" string put out by the
COPY command in the GIPSY retrieval
(see p. 13), In this case the record
length and the block size parameters
on the DATA DD card (job control
language card) will also have to be
adjusted (see p. 14Ki
COLUMN - The column command is used with the
INDEX TERMS to format a double column
list of terms. It should only be used
with this option.
MARGIN - The margin command disables the auto
matic margin adjustments made for
line lengths less than 132 characters.
In general this card will not be used.
SCANNER INPUT - The program will expect an input data
set on a tape or disk pack^whose file
name is defined as DATA by the Job
Control Language. The input data
must be in the optical scanner format.
GIPSY INPUT - The program will expect an input data
set on a» tape or disk pack whose file
name is defined as DATA by the Job
Control Langau^e. The input data
must be in the GIPSY copy format.
(Default Input)- If the type of input is not specified,
the program assumes that the data is
punched on cards in the format required
by the program. Data cards immediately
follow the TITLES cards.
KWIC INDEX - The program will produce a keyword-in-
context index.
KWOC INDEX - The program will produce a keyword-out-r
of-context index using terms from the
citation titles.
KWOC TERMS - The program will produce an index in
KWOC format using the index terms.
Only the first 10 characters of each
2/ term .word is used.
INDEX TERMS c - The program will produce a back-of-
the-book type of index fr&m the list
of index terms associated with a cita
tion. If "c u is a blank character
(col. 13), then every term will bet
indexed. If terra sets are separated
by a comma or other punctuation, then
place this character in column 13 and
the term set will be treated as a single
unit. A maximum of 25 characters from
the multiple-term descriptor is printed.
I/ If bibliography is stored on disk in card-imate format, use DISK INPUT on control card. . The program will expect an input^data set on a disk- pack whose file name is defined as DATA by the Job Control Language.
j2/ To use this option a list of descriptors (index terms) must be associated with each bibliographic citation.
8
STOP WORDS - Up to 500 words may be punched on sep
arate cards following this one. Each
word begins in column 1. Only the
first 10 characters (including blanks)
of each word are used for comparison.
Columns 19-80 should be blank.
TITLES - The 'TITLES 1 card is the only required
control card. Any information punched
in columns 11-60 will be used as a
heading for each page of the bibliog-
. v raphy. section. . -_;-
INDEX AUTHORS - The authortsl names will be indexed
with the keywords in the i-ndex.
Card Input:
Card input is assumed by the program, if no other
input option is specified. Sets of one or more punched
cards must immediately follox* the 'TITLES 1 card. The
format of the punched cards is as follows:
Cols.
1-60 Text. The text on each card may be continued on
up to nine additional cards as indicated by the
card's sequence number. Because the text is
printed exactly as it appears on the card,
do not break text in the middle of a word at
th'e end of a line. The punctuation us^d in the
text depends upon the type of information con
tained in the card (see types of cards listed
below).
61 Type of card,
'I 1 - Author card. Place author's last name
first followed by a comma, first name
or initials, and a semi-colon. Repeat
for each author to be listed. For
examples 'JOE, JOHN? SMITH, A. A.;'
2 1 - Title card. Place colon (or comma) at
end of title.
'3' - Citation. Use standard OSGS biblio
graphic format.
«4» - Descriptor or index term card. Separate
10
terms by commas. For example: 'WATER,
CHEMISTRY, ATLANTIC OCEAN, WATER RESOURCES. 1
End the term list with a period.
'5 1 - Identifiers terms (or other information)
related to card 4. If card type 5 appears,
it will be combined with the information
on card type 4. Otherwise, do not include
card type 5.
Other types of cards may be added as needed
(card type 6 - 9),
62-63 Sequence number. Number successive cards of the
same type from 01-10.
64-73 * Reference code of document. This is - a 10 char
acter alphanumeric code which appears next to
each reference in the bibliography and to the
right of each line in the index.
Optical Scanner Input
Bibliographic data can also be prepared on optical
scanning sheets using a 10-pitch IBM Selectric typewriter
equipped with an OCS typing element. The following format
is used:
Cols. '
1 Type of card:
f l f Author card
«2 f - Title card
3 1 - Citation card
11
4 1 - Descriptor term card
"5 1 - identifier terra card
Other types of cards may be added as needed
(card type 6-9). .
2-3 Sequence number. Number successive cards of the
same type from 01-10.
4-13 Reference code of document.'
14-73 Text. The text may be continued on up to nine
continuation cards.
Since the optical scanner reads up to 75 characters
on a line, there is space to make two corrections of one
letter each on any given line of input text. Otherwise,
the entire line must be deleted. After the citations are
typed they are read by the optical scanner and placed as
card images on magnetic tape. The tape can then be pro
cessed by the keyword program.
GIPSY Input ' -
Records can also be used as input to the keyword
program. It is assumed that the user of this version of
the program has read the GIPSY application manual (Addison
and others, 1969) and is familiar with the GIPSY cataloged
procedures on the Geological Survey's IBM 360/65.
\I/ Copies of the GIPSY cataloged procedures may be Obtained
from Chief, Office of Systems Research and Development, Computer Center Division, Geological Survey^ Washington,D. C. . *
\«
12
The GIPSY program is an integral part of the Geo
logical Survey's Bibliographic System. Using GIPSY the
user can search the title, index terms, or abstracts of
bibliographic references for specific combinations of
words, word stems, or phrases. The selected records can
then be sorted by author or accession number and printed.
A key-word index of these records provides a convenient
means of scanning the retrieved documents for items of
interest.
The user prepares the key-word program input data
set by (1) making a GIPSY search of one of the GIPSY data
files, (2) sorting the retrieved records by reference
number, and (3) placing the records into a temporary disk
storage area on the computer system, e,g., TEMP01. The
temporary data set is then ready for processing by the
keyword program. The use of the TEMP01 disk pack is
described in the U.S.G.S. Computer User's Manual (p. 5.3),
and the necessary job control language instructions and
GIPSY commands are shown in figures 1 and 2.
The principal limitations to this use of the keyword
program are the length of the data elements copied, e.g.,
AUTHOR, TITLE, CIT, and DESCR from the bibliographic file,
and the number of records selected. Only the first 180l
characters of the author, title, reference, and inde^x
\terms fields are copied. If any of these fields are more\
than 180 characters, the remaining characters will be
dropped. The total length of each record is 193 characters.
13
This includes the "card type," sequence number, and the
10-character reference code. Sixty tracks of temporary
disk space is sufficient to store 740 references. If
larger character strings are copied, the STORAGE parameter
will need to be set accordingly. The length of record
parameter on the DATA DD card will also need to be changed
to reflect the increase in the length"of the record copied
on to the temporary disk. The new length will be equal to
STORAGE plus 13 characters, e.g., LRECL-STORAGE+13, Simi
larly, the BLKSIZE parameter will need to be adjusted.
The overall effect of these changes will be to reduce the
number of records which can be stored on a disk track.
The-required job control language cards to execute
the keyword program are shown in figure 3.
14
//,... JOB (...)/*SETUP CCDnnn/DlSK -//JOBL1B DD DSM=SYS1.GIPL1B,DSN=SHR//STEP1 EXEC QUESTRAN,DNAME=dictionary,DVOL=CCDnnn,// RNAME=recordf i 1 e, RVOL=CCDnnn//QUESTRAN.SYSVIRKO DD UN I T=SYSDA, D I SP=( NEW, KEEP/ DELETE) ,
// SPACE=(TRK, (10 / 10) / RLSE) / DSM=Annnnnn.aaaB200.nafr.e //QUESTRAN.SYSRDR DD *
(Gipsy control cards go here.)
/*
Figure 1. ---Job Control Language cards needed to rr.ake a GIPSY search and create a data set for Input to the keyword prograr.
15
Col. 1
FORM WRSICSELECT A. DESCR B. § <..
LOGIC A AND B AND .. SORTNUMBER 10 COPYAUTHOR 180'101 1
NUMBER 10NEW RECORDTITLE 180'201'
NUMBER 10NEW RECORDCIT 180'301 1
NUMBER 10NEW RECORDDESCR 180'401'
NUMBER 10
Selected Wate'r Resources Abstracts
Figure 2. GIPSY control cards needed to search andcopy selected records from Selected Water Resources Abstracts.
16
//:... JOB (...)//STEPl EXEC PL1LFCLG,PA3M.PL1L='SIZE='007782U,ST,A,NT,X I , // T! N'E.PL1L = ( 0^30) ,71 ME. LKED-C 0,5), TIME. G0=(2, 10), // REGION. GO=193K //PLIL.SYSIf! DO * .
(Source deck KWIC prograir- part 1)
/*//GO. OUTPUT DD UH1 T=SYSDK, D I SP=( NEW, PASS, DELETE) ,II SPACE=(TRK,(60,20)),DCB=(RECFM=FB,LRECL=133,3LKSIZE=7182),
//GO. INPUT DD UNlT=SYSpK,DISP = (NEV<f , PASS, DELETE),
// DSN-&&INPT//GO. DATA DD UN 1 T=SYSDA, D I SP=SHR,// VOL=SER=TEMP01,DSN=Annnnnn'.aaaB 200* name//GO.SYSIN DD *
TITLES (j^ust be last control card
(If CARD Input option is used place cards here and "durcrcy 11 the "GO. DATA!1 DD card.)
/* . ',
//STEP2 EXEC PROC=niSCSORT,REGlON=100K,TIME=(0,30) //SORTIN DD DISP^OU^DELETE, DELETE), DSN=&&1NPT //SORTOUT DD D 1 SP = (MOD / PASS, DELETE) , DSN=&&OUTPT //SYS IN DD *
SORT FIELDS=(62 / 26 / CH,A),SIZE=E30000 (Use for KWIC Index.)
SORT FIELDS=(6,10,CH,A),SIZE=E30000 (Use for KWOC Index.)
SORT FIELDS=(21/ 25 / CH / A) / SIZE-^30000 (Use for INDEX TERMS.)
/* .
//STEP3 EXEC PL1LFCLG, TIME. PL1L = ( 0,10), TIME. Lked = ( / 5) , // TIME.GO=(,30) / REGION.GO=IOOK //PL1L.SYSIM DD *
(Source deck KWIC program- part 2)
/*//GO. PRINTER DD SYSOUT=A, DC3=( RECFM=UA/ CLKS I ZE=133)
nn ^^DD DUMMY/DCB-CRECFM.F.aLKSiZE-eO)""
Flcure 3. -- Job Control Language cards needed to execute the keyword procrarr.
17
APPLICATIONS
The card version of the keyword program 'can be- used
to develop an index to a personal reprint collection or a
small office library. Each item is given a reference code
as it is received which identifies the item's owner (or
location) and perhaps the date it was received. Thus, if
more than one collection were to be merged for indexing
purposes, the reference code will identify the report's
location. The following reference code (columns 64-73 on
card input) might be used to identify and locate documents
Cols.
64-66 Document owner's initials (3 characters)
67 Hyphen
68-69 Year reprint received or year of publication
70 Hyphen
71-73 Accession number of reprint within year
For example, JDD-70 001 represents the first reprint
received by John D. Doe in 1970. When submitting cards
for processing, the citations must be in accession number
order. By placing each private collection in alphabetical
order by the owner's" initials, more than one collection
can be merged into one index^. Thus, as research personnel*
move from office to office they can take their index with
them and.merge with existing collections at their next- ' \
assignment. Figures 4 to 7 are examples of indexes from
a private reprint collection. ' *
' 18
l/n
n f
v~
wv*
w..
»..
..._
...
__
UN
Ufc
RG
RU
UN
U
WA
ST
E
DIS
PO
SA
L -
CO
NC
EP
TS
A
NU
E
NV
IRO
NM
EN
TA
L
SC
IEN
CE
A
NU
T
EC
HN
ULU
GY
, V
. 4
, N
O.
8»
P.
64
2-
*fl
. .
<
,/,?
f 1970'.
. »
' '
!
£"
"
GK
OU
NU
WA
TE
R,
PO
LL
UT
ION
* W
AS
TE
D
ISP
OS
AL
* O
EE
P>J
ELL
D
ISP
OS
AL.
G
'*
OW
M-7
0-0
02
O
UO
IUN
S,
0.
A.{
R
AG
LA
NO
, P
. C
.;
JOH
NS
ON
, J.
D.;
ro
WA
ltK
-CLA
Y
INT
ER
AC
TIO
NS
IN
N
OR
TH
C
AR
OLIN
A'S
P
AM
LIC
U
ES
TU
AR
Y:
1*
, tN
VlK
UN
Mb
NT
AL
S
CIt
NC
U
AN
O1
TG
CM
NO
LOG
Y,
V.
/*,
NO
. 9
t P
. 7
43
-'*
. I
7'i8
t 1V
70.
tSTUAKYi SGOIMENT, WATCR QUALITY, SALINITY, SEDIMENTATION,
w
IUN
bXCHANGE, SUSPENDED SEDIMENT, NORTH CAROLINA.
x '
g
OWH-70-003
KLEIN, 0.
H.
; GOLUHCRG, U.
D.;
,2
...
MtRC
UftY
IN
THE MARINE ENVIRONMENT:
M
ENVIRONMENTAL SCIENCE ANU TECHNOLOGY, V. 4, NO
. 9, P. 765-
»
. 76«J,
l»)70.
MEis
CUKY
, MARINE ORGANISMS, SEDIMENTS, CALIFORNIA, POLLUTION.
HiOWM-70-004
KISIHL, C.
C.
; DURUM, W. H.
; LANGOE1N, W. 0,
;J
DATA COLLECTION SYSTEMS FUR WATER QUALITY SURVEILLANCE*
g
U.S.
Gb
'OL.
SURVEY, UNPUOLISHED MANUSCRIPT, 21 P.
, 1968.
WATER UUALITY, UATA COLLECTION, DATA PROGRAMS, SAMPLING
V
TIICURY, NE
TV/O
RK DESIGN.
Q^ H
OWM-70-005
K1SICL, C. C.
; H.
MATHEMATICAL METHODOLOGY IN HYDROLOGY:
O IN
TbKN
AT.
SEMI
NAR
FOR
HYDROLOGY, PROC., JU
LY 19
69,
URBA
NA,
^
' ILLINOIS,
P. 36^-390.
P EDUCATION; MO
DELS
, MATHEMATICAL M
ODELS, HYOKOLOGIC M
O'OE
LS.
tj J
OWM-70-006
MSIDL, C.
C.
; %
O
THR US
E OF WATER QUALITY PREDICTIONS AS
AN AID TO
(A
' (D n rt H- o 9 o H» ft ar A H«
3 a (D
X
C
O in W.
fi) M
O O H O a (D X
C SIMULATION LANGUAGES WITH
CRUSAOK ON
EAM SE
OlMf-N/
VARIATION OF CATION EXCHANGE
WATE
R-CL
AY INTERACTIONS IN N
ORTH
ICLF SUE
IN STRFAM S
F.OI
MEN/
VA
RIAT
ION
OF
N A
SHA/
DIGITAL-COMPUTER APPLICATIONS
IN
CO ESTUARY:
WATE
R-
LLAN
CF:
"^
DATA
TUDIES 0
« SU
RFAC
E WATER
IN A
SMA/
DIGITAL"
UNDE
RGRO
UND
WASTF. D
ISPOSAL
-
URVE
ILLA
NCE:
UALI
TY S
TUDIES O
F SU
RFAC
E WA
TER
IN A
SMA
/ E
TO A
BIO
- SI
MULA
TION
- A
USE
R'S
IMPRRS/
SCIE
NCES
TO
POLI
CY S
CIENCES- M
ATERIALS F
ORUNDERGROUND WA
STE
MERCURY- IN
THE MARINE
Y INTERACTIONS IN NORTH CAROLINA'S PAMLICO
SAND RIBBONS OF
2E
IN STREAM SEOIMEN/
VARIATION OF
CATION
MATHEMATICAL METHODOLOGY IN
ING:
SOME NORMATIVE
REFERENCE TO A
BIO-.SIMULATION -
A USER'S
TUARY:
WATER-CLAY
tuu-
>
irir
r\u
J.?
* w
y
v ri
*
ON - A USER'S IMPRES/
DISCRETE SIMULATION'
ERIALS FO
R DISCUSSIONS
FROM
OF WATER QUALITY PREDICTIONS AS
AN
AID TO
MERCURY
IN' TH
E DM
MANAGEMENT SCIENCES TO
POLICY SCIENCES-
MATHEMATICAL
UNDERGROUND WASTE DISPOSAL - CONCEPTS AND
POLICYMAKING:
SOME
WATER-CLAY INTERACTIONS IN
ATER-CLAY INTERACTIONS IN NORTH CAROLINA'S
OF CATION F.
XCHA
NGE
CAPACITY AND RATE WITH
FRUM MANAGEMENT SCIENCES TO
CREATIVE IMPLICATIONS OF A
SYSTEMS VIEW O?
THt
IJSF: OF WATER UUALITY
BOTTLES:
CAPACITY AND RATE WITH PARTICLE SUE IN STR
CAROLINA'S PAMLICO ESTUARY J
CASWCLLt C.
A.
CATION EXCHANGE CAPACITY AND RATE WITH PA
RTCHEMICAL-QUALITY STUDIES OF
SURFACE WATER
ICLAY INTERACTIONS IN NORTH CAROLINA'S PAMLI
COLLECTION SYSTEMS FOR WATER QUALITY SURVEI
CUMPUTFR APPLICATIONS IN CH
F.MIC AL-QUAL I.TY
SCONCEPTS AND MISCONCEPTIONS:
CRUSADE ON
ROTTLES:
DATA
COLLECTION SYSTEMS FO
R WATER QUALITY S
DIGITAL-COMPUTER APPLICATIONS IN CHEMICAL-Q
DISCRETE SIMULATION LANGUAGES WITH REFERENC
DISCUSSION:
FROM MANAGEMENT
DISPOSAL -
CONCEPTS AND MISCONCEPTIONS:
DUrWINS't
D. A.
OKU
ft r YEHEZKEL
ORO*
t YEHCZKEL
DURUM t
W. H.
ENVIRONMENT:
ESTUARY:
* WA
TER-
CLA
EUROPEAN TIDAL SE
AS«
EXCHANGE CAPACITY AND RATE WI
TH' PARTICLE SI
GHLDTERG,
E. 0-
'HY
DROL
OGY:
IMPLICATIONS OF
A
SYSTEMS VIEW OF
POLICYMAK
IMPRESS IONS:
/TE
SIMULATION LANGUAGES WITH
INTFRACTIONS W NORTH CAROLINA'S PAMLICO ES
JOHNS ON t
J. D.
KENNEDY, V.
C.
KENYUN
t'N.H.
KISIELr C. C.
KISIELt C. C.
KISICL, C. Ca
KLEIN» 0. H.
LANGQEINt W. 0.
LANGUAGES WITH REFERENCE T3
A
010- SIMULATI
MALCOM, R. L.
MANAGEMENT SCIENCES TO
POLICY SCIENCES- MAT
MANG
EMEN
T:
THE
USE
MARINE ENVIRONMENT:
MATERIALS FOR DISCUSSION:
FRMATHEMATICAL METHODOLOGY IN. HYDROLOGY:
MERCURY IN THE MARINE ENVIRONMENT:
METHODOLOGY IN HYDROLOGY:
MISCONCEPTIONS:
NIFOnRUOA, A. W.
NHRMATIVC IMPLICATIONS OF A
SYSTEMS VIEW OF
NflRTH CAROLINA'S PAMLICO ESTUARY:
PAMLICO
ESTUARY:
. w
PARTICLE SIZE IN STREAM SEDIMENT:
/RIATION
POLICY SCIENCES- MATERIALS FO
R DISCUSSION:
PULICYMAKING:
, '
SOME N
PREDICTIONS AS AN AID TO
MANGEMENT:
DWM-70-010
OWM-70-014
OWX- 70-002
OWM-70-OCI
DWM- 70-01 4
OWM-70-OC7
DWM-70-002
DWM-70-OC4
DWM- 70-007
DWM- 70-00?.
DWM-7C-010
DWM-70-OC^
DWM- 70-007
OWX-70-013
DWM-70-C12
DWM- 7 0-0 01
DWM-70-002
DWM- 70-0 12'
OWM-7C-011
DWM-7C-OC4
DWK-7C-OC3
DWM-70-002
DWM- 70-003
DWM-7 0-0 I 4
OWM-70-003
OWM-70-005
DWM- 7 0-0 1
1DWM-70-013
DWM-70-002
DWM-70-002
' OWM-70-014
DWM-7 0-008
DWX-70-CC6
DWM- 70-004
DWM-70-005
DWX-70-003
DWM-70-OC<V
OWM-70-013
DWM-7 0-0 14
DWM-7 0-0 12
DWV.-70-OC6
DWM- 7 0-0 03
DWM-70-012
OWM- 70-005
DWM- 70-003
DWM-70-005
DWM-70-OOl
DWX-70-P-C9
DWM-7 0-0 11
OWM-70-002
DWM-7C-002
DWM-7 0-0 14
DWM- 7 0-0 12
DWV-70-Oll
owv-
7C-iro(>
CAPA
CITY
CHP.
V.IC
ALCL
AYCC
'LL'T
.TIO
N
CO
';C
li»
TS
OA
TAD
IGIT
AL
DIV
.KF
T5
fJV
!'->n
N1.'
r:N
£ST
!iA
"-Y
£
]:'.
) :
; \?
i E
XC
'«/.f
.%C
H
Y>
"?L
MG
Y
I ' '
PL I
CA
T 1
0
«"r»
^!:
S5I
'J?
J ir
.'Tt'J
-'.'.r
.T 1
0
M.V.
-GI XP.NT
VAKI
NF
"ATI
: RIA
LSJ-
'AT
H'-'
-'AT
1C
*v
c'..r
-;:>
.Y
N!3
UX
AT
IVE
N
tJK
TH
'- A
y.T
;CL
«:
PK
fe'J
ICT
ION
« O
'JA
LIT
Y^
t>
AT
fi
P r,
^
»L
Fr.
SJ.r
\'C
(
SA
M)
sc
ienc
esS
CIS
'CC
'iSS:
-AS
.S
r/J
lV5
M
VARI
ATIO
N or C
ATION
EXCHANGE:
CAPACITY A
ND K
ATE
WITH
PARTICLE
size IN s
WATf-H-CLAV
INTr
RACT
IONS
IN
NO«
»TM
CARO
LINA
'S PAXLICO
ESTU
ARY:
VACATION O
F CATION F
XCHA
NGC
CAPACITY A
ND M
ATT: W
ITH
PARTICLE
DIGITAL-COMPUTER APPLICATIONS
IN C
HEMI
CAL -DUALITY STUDIES
OFXA
TI R-C
LAY
INTERACTIONS IN
NORTH C
AROL
INA'
S PA
^LIC
O FSTUARY:
HATA
COL
LECT
ION
SYST
EMS
FOR
WATFR
QUALITY
SURVf
I LLANCT
:DI
GITA
L-CO
MPUT
ER AP
PLIC
ATIO
NS IN
CHF. MICAL-OUAL
IT Y
STUD
IES
OFUN!)
rr*C,KU'JNO
WAST
E DISPOSAL.- C
ONCE
PTS
AND
MISCONCEPTIONS:
CRUSAJJF ON
BO
TTLE
S:DA
TA C
OLLC
CTIO
N SYSTEMS
FOR
WATER
QUALITY
SURV
C I LL AN
CF :
DIGI
TAL-
COMP
UTER
APPLICATIONS
IN C
HFMI
CAL-
QUAL
I TY STUDIES
OFDI
SCRE
TE SI
VULATION LA
NGUA
GES
WITH R
EFER
ENCE
TO
A
UIQ-
SI
MUL
F»OM
MANAGEMENT
SCIt
NCFS
TO
POLI
CY SCIPNCES-
MATERIALS
FOR
DUN
MfTR
GHUU
NO W
ASTE
DIS
POSA
L -
CONCEPTS A
ND M
I SCCNCEPTI ONS
:vf
JCU^
Y IN TH
E MA
KING
ENVIRONMENT:
WATT
R-CL
AY
I NTrRAC
TI ONS
IN N
ORTH C
AROLINA'S
PAVL
ICO
FSTUARY:
SAND
RI
IWU
NS OF
EUROPEAN TIDAL
SFAS
: '
VA".!ATIf]N
OF CATION EXCHANGE CA
PACI
TY AND
RATE
WIT
H PARTICLE. SUE
MATH
EMAT
ICAL
ME
THOO
OLOG
Y IN
HYOP.OLOGY:
Sr^f
: NORMATIVE. I
MPLICATIONS
OF A
SYSTEMS
VIEW OF
PO
L IC
YXAK
I NG
:OISC^ETr SI
MULA
TION
LAN
GUAG
ES WITH R
EFER
ENCE
TO A
?.in-
SIMU
LWA
T«:R-f.LAY INTERACTIONS IN
NORTH C
AROL
INA'
S PA
MLIC
O ESTUARY:
OISC
y.ET
C SIMULATION LANGUAGES
WITH R
EFERENCE T
O A
fUC-
SI
MUL
FOU
V,
v.A
NA
GE
MF
NT
S
CIE
NC
ES
TO
P
OL
ICY
S
CIE
NC
ES
- M
AT
ER
IAL
S
FOR
D
rue
USE
OF WATER
QUALITY
PRED
ICTI
ONS
AS A
N AI
D TO
MANGEMENT:
vfri
<cu*
Y ijj
THE
MARINE ENVIRONMENT:
FROM MANAGEMENT SCIENCES TO POLICY SCIENCES- MATERIALS FOR D
MATHEMATICAL METHOHQLOGY IN HYDROLOGY:
w^HCU'.Y IN THE MARINE EN
VATH
EVAT
ICAL
M
5THO
')OLO
GY
IN
UNOC
KCIO
UNO
WAS
TE
DISP
OSA
L Sn
.XC
NUH.
MAT
IVE
IMPL
ICAT
ION
SWATUR-CLAY INTERACTIONS IN
HYDR
OLOG
Y:* CO
NCEP
TS AN
D MISCONCEPTIONS:
OF A
SYST
EMS
VIEW O
F POL
I CYM
AK ING:
NOKTH CAROLINA'S PA^LICO ESTUARY:
WATCR-CLAY INTFRACTIONS IN NORTH CAROLINA'S PAVLICC ESTUARY:
VA!>IATIUN OF CATION EXCHANGE CAPACITY AND RATE WITH PARTICLE
F«(1M
MANAGEMENT SCIENCES TO POLICY SCIENCES- MATERIALS FOR D
S'Jv
l: MJRXATIVE IMPLICATIONS OF
A SYSTEMS VIEW OF POL I
CYMAK ING:
THE U3E OF WATER QUALITY PREDICTIONS AS AN AID TO MANGEMENT:
PRELlKINArtY STUDY OF TRANSVEHSE
L\A".S:
DATA COLLFCTION SYSTEMS FO
P. WATE5.
OUALITY SURVEILLANCE:
THF USE
'JF WATFR OUALITY PREDICTIONS AS AN AIO TO MANGFMCNT:
DIGITAL-COMPUTER APPLICATIONS
IN CHEMICAL-QUALITY ST'JOIF.S
OF
VARIATION OF CATION EXCHANGE CAPACITY AND RATE WITH PARTICLE
DISCRETE SIMULATION LANGUAGHS WITH REFERENCE TO A
IUD- SIMUL
SANO Kl-UJONS OF EURDPFAN TIDAL SEAS:
TIDAL SbAS:
TO POLICY SCIENCES- MATERIALS FOR D
TO POLICY SCIENCES- MATERIALS FOR D
TIDAL SEAS:
SA-V
O F»UM
SAMf
)
RlilBONS OF
MANAGEMENT
MANAGEMENT
RIinONS OF
EUROPEAN
SCIENCES
SCIENCES
EUR'
JPF.
Ar.'
VARIATION OF CATION EXCHANGE CAPACITY AND RATE WITH
SU
E
IN
SSU
RFA
CE
W
SUR
FAC
F W
SUR
FAC
E W
TIO
N -
ASC
USS
ION
:
SU
E
IN
S
* TIO
N -
A
TIO
N -
ASC
USS
ION
:
SCU
SSIO
N:
;: SIZ
E
IN
SSC
USS
ION
: .
SUR
FAC
E W
SIZ
E
IN
S,T
ION
-
A
SCU
SSIO
N:
SCU
SSIO
N:
SIZ
E
IN
S
DW
M-7
0-00
2D
WM
-70-
014
DW
M-7
0-00
7OW
M- 7
0-0
02
DW
M-7
0-O
O't
OW
V-7
0-0
07
OW
M- 7
0 -0
01
nW
M-7
0-0
10
nwM
- 70-0
04
DW
M-7
0-00
7O
WM
-70-
013
DW
M- 7
0-0
12
DW
M- 7
0-0
0 1
OW
M- 70-0
03
DW
M-7
0-00
2
DW
M-7
0-01
4O
WM
-70-
005
DW
M- 7
0-0
11
OW
M- 7
'J-0
13
DW
M-7
0-00
2 D
WM
- 70
-0 1
3D
WM
- 7 0
-6 1
2D
WM
-70-
006
OW
.X- 70-0
03
DW
M-7
0-01
2O
W.M
-70-
005
DW
M- 70-0
0 3
DW
M- 7
0-0
09
OW
.V-7
0-O
Ol
DW
M-7
0-01
1D
WM
-70-
002
DW
V- 70-0
02
DW
V-7
0-01
AO
WV
-70-0
12
DW
M-7
0-01
1D
WM
-70-
006
DW
M- 70-0
09
UW
M- 7
0-0
04
OW
M- 7
0-0
06
DW
M-7
0-00
7D
WM
- 70-0
1'*
OW
M-7
0-01
3D
WM
- 70-0
08
DW
M-7
0-O
OS
OW
M- 7
0-0
i?
DW
V.-
70-0
12D
WM
-70-
OO
BO
WM
- 70
-0 1
4
CA
LIF
OR
NIA
D
Wti-
70-0
03
CA
TIO
N
tXC
HA
NG
G
' D
WM
-70-
01A
CO
AS
TAL
GbO
MQ
KP
HO
LOG
V
OV
?M-7
07p0
9'C
OA
ST
AL
S
tOlM
tNT
AU
ON
D
WMr
70-0
09 ,
CO
MP
UT
ER
S
' '
DW
M-7
0-«
00
7C
UM
f>U
TF.f<
$ -O
WM
-70-0
13
CO
NT
INE
NT
AL
S
HE
LF
D
WM
-70
-00
8D
AT
A
CO
LLE
CT
ION
O
WM
-fO
-00
'»D
AT
A
PR
OG
RA
MS
.
OW
M-7
0-0
04
hj
OttP
WcLL
DIS
PO
SA
L
OW
M-7
0-0
01
H-
ED
UC
AT
ION
O
WM
-70
-00
5*2
E
NG
LIS
H
CH
AN
NtL
O
UM
-70
-00
8h<
'
EN
VIR
ON
ME
NT
O
WM
-70
-01
0
ft
LN
VU
UN
ME
NT
AL
MO
VE
ME
NT
'
OW
M-7
0-0
10
. rS
T'J
AK
Y
' .
OW
M-7
0-0
02
.
F1
NG
EK
R
AK
S
'
DW
M-7
0i
009
FL
OR
IDA
'
' '
OW
M-7
0i
OO
?F
OR
EC
AS
TIN
G
' D
wM
-70
-00
6'
??
GK
CU
NO
WA
TE
R
'
' D
WM
-7C
J-O
Ol
0,
'HY
J^O
LO
GIC
M
OD
ELS
O
WM
-70
-00
53
I0\
EX
CH
AN
GE
O
WM
-70
-C0
2^
MA
NA
GE
ME
NT
'
DW
M-7
Q-0
06
O
' M
AN
AG
EM
EN
T
SC
IEN
CE
O
WM
-70
-01
2M
AK
ING
O
RG
AN
ISM
S
OW
M-7
0-0
03
O
' M
AK
ING
S
ED
IME
NT
AT
ION
'
DW
M-7
0-C
Ott
MA
RY
LAN
D
DW
M-7
C-G
10
ft
MA
TH
EM
AT
ICA
L
MO
DE
LS
DW
M-7
0-0
13
V
MA
TH
EM
AT
ICA
L
MO
DE
LS
OW
M-7
0-0
06
0
MA
TH
EM
AT
ICA
L
MO
DE
LS
OW
M-7
0-0
05
H
.' M
AT
HE
MA
TIC
AL
M
OD
ELS
.
DW
M-7
0-0
07
3
\
ME
KC
UA
Y
OW
M-7
0-0
03
5*
»\
MO
DE
LS
. O
WM
-70-0
0 b
NE
TW
UK
K
DE
SIG
N
DW
M-7
0-0
0<
»
NO
HM
AT
IVG
G
EN
ER
AL
SY
ST
EM
S
OW
M-7
0-O
ll£
\N
OR
TH
C
AR
OLIN
A
DW
M-7
0-0
02
2
\ \P
OL
ICY
A
NA
LY
SIS
O
WM
-70
-Oll
g
PO
LIC
Y
AN
AL
YS
IS
OW
M-7
0-0
12
PO
LIC
Y
SC
IEN
CE
O
WM
-70
-01
2g"
PO
LIC
Y
SC
IEN
CE
D
WM
-70-0
11
0,
PO
LIC
YM
AK
ING
D
WM
r70
-01
1fl>
P
CL
LU
TIU
N
DW
M-7
0-0
03
X
PO
LLU
TIO
N
' O
WM
-70-C
01
RE
CY
CL
ING
W
AS
Tt
' D
W.X
-70
-01
0K
CG
KE
SS
ION
A
NA
LY
SIS
O
V/M
-70
-00
7K
ET
UK
MA
bLE
B
OT
TL5S
D
WM
-7C
-010
SA
LIN
ITY
O
WM
-70
-00
2S
AyP
LIJ
JG
TH
EO
RY
.
OV
.'M-7
0-p
O*
SA
'-i'J
D
EP
OS
ITS
O
WM
-70
-CO
3S
AN
i) K
lOG
kS
' D
WM
-70-C
C9
SA
M)
WA
VE
S
' D
WM
-70-C
08
StO
1M
bN
T
OW
M-7
0-0
02
Although the bibliographic example just given will
probably be the- most frequent application of the keyword
index program, other potential applications should not be
overlooked. Recall that up to 9 types of "cards" (records)
can be identified with one reference code. These cards in
the bibliographic example were called:
Card type
1 Author card
2 Title card
3 Citation card
4 Descriptor card
5 - Identifier card
6-7 Additional card types
The information on all card types is printed in the
bibliographic section of the index. Information on card
type 1 (INDEX AUTHORS) and on card types 4 and 5 (INDEX
TERMS) may be extracted in a back-of-the-book type of
index. Information on card type 2 may be permuted in a
keyword-in-context index (KWIC INDEX) or a keyword-out-of-
context index (KWOC INDEX); in both cases information on
card type 1 may be merged into the index (INDEX AUTHORS).
Similarly f- information on card types 4 and 5 may also be
permuted in KWIC (KWIC TERMS) or KWOC (KWOC TERMS) indexes;
again information on card type 1 may be merged into the
index (INDEX AUTHORS). These commands are summarized in
table 1. .
23
Tabl
e 1. S
umma
ry o
f ke
ywor
d pr
ogra
m co
ntro
l ca
rds.
X
- re
quir
ed c
ard;
0 -
optional
card
.
Keyw
ord pr
ogra
m control
cards
BIBLIOGRAPHY X
. __ 0 0 0 0 0 0
XtfDEX
AUTHORS
..7.. X 0 0 0 0 0
ti H
O 8»
..,__... X - - - -
1 :
8 M 8 .:_ - X - - -
o CO
&H Q H - - * - X - -
i
o -B
, rr - -_ X -
p o -j - - - - X
COLUMN X - X - -
.
Sort pro
gram
control
cards
p fc o u> W
0) p s s H X - X - -
t o to O X - X -
t o (0 o
o - X
' - X
ft
<u -p (0 p h
O Ul ft
H & X - - - -
\Description
of o
utpu
t
1 '
Bibliographic
sect
ion
of rep
ort
only
- a
ll car
d ty
pes
printed
Doub
le c
olumn
inde
x of i
nfor
mati
on o
n ca
rd typ
e 1
KWIC
ind
ex of
card
typ
e 2
KWOC
index of
card
typ
e 2
Double c
olumn
inde
x of i
nfor
mati
on o
n card t
ypes
U a
nd 5
KWIC i
ndex o
f ca
rd typ
es U
and
5
KWOC
ind
ex -
inf
orma
tion
On
card t
ypes U
and 5
app
ears
in
left-
hand
margin; i
nformation o
n card t
ype
2 appears
on l
ine.
TITLES c
ontr
ol c
ard
required,
for
all
runs
and
must
appear l
ast.
STOP
WORDS c
ontr
ol car
d an
d Stop w
ords l
ist, if use
d mu
st i
mmediately pre
cede
TIT
LES
card
.
Having reviewed the effects of the commands, now
consider a research project file stored on GIPSY. Using
the COPY command, an index might be created to the file
making the following card type assignments:
Card type
1 Project leader
2 Project title
3 Project location (city and state)
4 Descriptors
6 Project summary
The possible types of indexes were discussed above.
Other potential applications that come to mind are map
catalogs, lists of geologic and geographic names, mineral
and fossil inventories, certain types of personnel rosters,
and indexes to publications (see the cumulative index to
the U. S. Geological Survey's Sysnotes),
25
STORAGE REQUIREMENTS AND RUN TIME
The core storage requirements of the keyword program
depend upon the line length of the index to be produced.
The REGION parameter on the KWIC program-part 1 execute
card should be specified as follows:
LENGTH n REGION (or REGION,GO) parameter
LENGTH 80 (default) . REGION-114K
LENGTH 120 REGION-178K
LENGTH 132 REGION=198K
Run times depend upon the number of documents to be
indexed and the length of the card type 2, card types 4,
and cards type 5 records as specified by the STORAGE control
card (STORAGE 180 is the default value). If STORAGE 180, a
KWIC index to 120 bibliographic references requires the
following run times:
" 1 / Program step Run time (seconds)
Step 1 KWIC program-part 1 37Step 2 Sort program - Balsort 3Step 3 KWIC program - part 2 7
Run times for larger collection of documents or larger
storage values must be adjusted upwards*
I/ Run times do not include times required to compile and link edit program source decks. '
26
REFERENCES CITED
, C. H.; Coney, M. D., Jones, M. A., Shields, R. W. r
and Sweeney, J. W., 1969, General information processing
system application description: Univ. Oklahoma Infor
mation Science Series, mono, no, 4, 126 p.
U. S. Geological Survey, 1970, U.S.G.S. Computer user's
manual: Washington, D. C., U. S. Dept. of the Interior,
Geological Survey, Computer Center Division.
27