Upload
others
View
2
Download
0
Embed Size (px)
Citation preview
©2016EDRMLLC
EDRMGlossary
http://www.edrm.net/resources/glossaries/glossary
Version1.002,April22,2016
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC i
TheEDRMGlossaryisEDRM'smostcomprehensivelistingofelectronicdiscoveryterms.Itincludestermsfromthespecializedglossarieslistedbelowaswellastermsnotinthoseglossaries.
Thetermsarelistedinalphabeticalorderwithdefinitionsandattributionswhereavailable.
EDRMCollectionStandardsGlossary
TheEDRMCollectionStandardsGlossaryisaglossaryoftermsdefinedaspartoftheEDRMCollectionStandards.
EDRMMetricsGlossary
TheEDRMMetricsGlossarycontainsdefinitionsfortermsusedinconnectionwiththeupdatedEDRMMetricsModelpublishedinJune2013.
EDRMSearchGlossary
TheEDRMSearchGlossaryisalistoftermsrelatedtosearchingESI.
EDRMSearchGuideGlossary
TheEDRMSearchGuideGlossaryispartoftheEDRMSearchGuide.TheEDRMSearchGuidefocusesonthesearch,retrievalandproductionofESIwithinthelargere-discoveryprocessdescribedintheEDRMModel.
IGRMGlossary
TheIGRMGlossaryconsistsofcommonlyusedInformationGovernanceterms.
TheGrossman-CormackGlossaryofTechnology-AssistedReview
DevelopedbyMauraGrossmanofWachtell,Lipton,Rosen&KatzandGordonCormackoftheUniversityofWaterloo,theGrossman-CormackGlossaryofTechnology-AssistedReviewcontainsdefinitionsfortermsusedinconnectwiththediscoveryprocessesreferredtobyvarioustermsincludingComputerAssistedReview,TechnologyAssistedReview,andPredictiveCoding.
Ifyouwouldliketosubmitanewtermwithdefinitionoranewdefinitionforanexistingterm,pleasegotoourSubmitaDefinitionpage,http://www.edrm.net/23482,andcompleteandsubmittheform.Weappreciateyourcontributions!
Exceptwhereotherwisenoted,contentincludedinthisdocumentislicensedunderaCreativeCommonsAttribution3.0UnportedLicense.Thatmeansyouarefreetoshare,remixormakecommercialuseofthecontentsolongasyouprovideattribution.Toprovideattribution,pleaseciteto"EDRM(edrm.net)."Ifyouhavequestions,[email protected].
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC ii
1.....................................................................................................................................................1
A.....................................................................................................................................................1
B...................................................................................................................................................19
C...................................................................................................................................................38
D...................................................................................................................................................70
E....................................................................................................................................................99
F..................................................................................................................................................115
G.................................................................................................................................................134
H.................................................................................................................................................139
I...................................................................................................................................................146
J..................................................................................................................................................162
K.................................................................................................................................................166
L..................................................................................................................................................172
M................................................................................................................................................180
N.................................................................................................................................................205
O.................................................................................................................................................216
P.................................................................................................................................................223
Q.................................................................................................................................................246
R.................................................................................................................................................249
S..................................................................................................................................................266
T..................................................................................................................................................296
U.................................................................................................................................................310
V.................................................................................................................................................322
W................................................................................................................................................327
X.................................................................................................................................................334
Y..................................................................................................................................................334
Z..................................................................................................................................................334
....................................................................................................................................................336
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 1
1
10b(5)
SecuritiesandExchangeCommissionregulationgoverningtherightsofshareholders.ManylawsuitsbyshareholdersarefiledunderRule10b(5).
Source: IbisConsulting,Glossary.
17a4
SecuritiesandExchangeCommissionregulationrelatingtodataretentionforfinancialservicesfirms.
Source: IbisConsulting,Glossary.
A
Ablate
Toremove.Usedtodescribethelaser-readable"pits"intherecordedlayerofopticaldisks.
Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
AcceptonZero
Astatisticalsamplingprocedure(anacceptancesamplingprocedure)thatdrawsarandomsampleofobjectsfromapopulationandcheckseachonetodeterminewhetheritisadefect.Ifnoneoftheobjectsinthesampleisfoundtobedefective,thenwecanconcludewithaspecifiablelevelofconfidencethattherewerenomorethanaspecifiableproportionofdefectsintheoriginalpopulation.Findingzerodefectsinthesampledoesnotmeanthattherewerezerodefectsinthepopulation,onlythattherewerenomorethanaspecifiablepercentage.OneapplicationofthisprocedureineDiscoveryistodrawarandomsamplefromthepopulationofdocumentsdeterminedbythereviewtobenonresponsive.Thesizeofthesampleisdeterminedbyyourspecifiedconfidencelevelandbythemaximumacceptablepercentageofresponsivedocumentsthatwerenotretrieved.Ifnoneofthedocumentsinthesampleisfoundtoberesponsive,thenwecansaywithconfidenceX%thattherewerenomorethanY%responsivedocumentsleftbehind.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
AcceptonZeroError
AtechniqueinwhichthetrainingofaMachineLearningmethodisgaugedbytakingaSampleaftereachtrainingstep,anddeemingthetrainingprocesscompletewhenthelearningmethodcodesaSamplewith0%Error(i.e.,100%Accuracy).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 2
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Accuracy
ThefractionofDocumentsthatarecorrectlycodedbyasearchorrevieweffort.NotethatAccuracy+Error=100%,andthatAccuracy=100%–Error.WhilehighAccuracyiscommonlyadvancedasevidenceofaneffectivesearchorrevieweffort,itsusecanbemisleadingbecauseitisheavilyinfluencedbyPrevalence.Consider,forexample,aDocumentPopulationcontainingonemillionDocuments,ofwhichtenthousand(or1%)areRelevant.Asearchorrevieweffortthatidentified100%oftheDocumentsasNotRelevantandtherefore,foundnoneoftheRelevantDocuments,wouldhave99%Accuracy,belyingthefailureofthatsearchorrevieweffort.
Source:MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Acetate-BaseFilm
Afilmsubstrateusedinmicrofilmproduction.Consideredasafetyfilm(ANSIStandard).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Acrobat
Adobe'selectronicdocumentformat.Documentscanbecreatedfromwithinawordprocessor,frompostscript,orfromscannedpages.Thedocumentsarehighlyportable,yetmaintainthelookoftheoriginal.AcrobatisespeciallyusefulinthisareabecauseAdobemakesthereaderavailableforfree.Version3.0alsomakesitintegratewellwithwebbrowsers.
Source: RSI,Glossary.
Seealso:
Externallink:
AdobeAcrobatFamily,http://www.adobe.com/products/acrobat/main.html
ActiveData
Datacurrentlydisplayedonacomputerscreen,and/orfilesonacomputerthatcanbeaccessedwithouthavingtousearestorationprocess.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingRichardA.Lazar,TheGuidetoElectronicDiscovery,at37(Fios,Inc.2002).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 3
Theinformationreadilyavailableandaccessibletousers,includingwordprocessingfiles,spreadsheets,databases'data,e-mailmessages,electroniccalendarsandcontactmanagers.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingFeldman,TheEssentialsofComputerDiscovery,ComputerForensicsInc.(1/1/2001),http://www.forensics.com/pdf/Essentials_of_Discovery.pdf#page=2.
Activedataisinformationresidingonthedirectaccessstoragemediaofcomputersystems,whichisreadilyvisibletotheoperatingsystemand/orapplicationsoftwarewithwhichitwascreatedandimmediatelyaccessibletouserswithoutundeletion,modificationorreconstruction(i.e.,wordprocessingandspreadsheetfiles,programsandfilesusedbythecomputer’soperatingsystem).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms.
Activedataisinformationresidingonthedirectaccessstoragemediaofcomputersystems,whichisreadilyvisibletotheoperatingsystemand/orapplicationsoftwarewithwhichitwascreatedandimmediatelyaccessibletouserswithoutundeletion,modificationorreconstruction.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Dataexistingonthedataandfilestoragemediaofcomputersystems.Activedataiseasilyviewedontheoperatingsystemand/orapplicationsoftwarethatwasusedtocreateitandisdirectlyavailabletouserswithoutun-deletion,alteration,orrestoration.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Datacurrentlydisplayedonacomputerscreen.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html.
Source: RSI,Glossary.
Informationresidingonthecomputerwhichisvisibleandfullyavailabletotheuser.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ActiveLearning
AnIterativeTrainingregimeninwhichtheTrainingSetisrepeatedlyaugmentedbyadditionalDocumentschosenbytheMachineLearningAlgorithm,andcodedbyoneormoreSubjectMatterExpert(s).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 4
Aformofsupervisedmachinelearningthatpresentsforrevieworhumancategorizationthedocumentswiththehighestcurrentuncertainty,thosedocumentsthatwillbemostinformativeabouthowtoupdatethelearningprocess.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
ActiveRecord
Activerecordsarerecordsrelatedtocurrent,ongoingorinprocessactivitiesandarereferredtoonaregularbasistorespondtoday-to-dayoperationalrequirements.Anactiverecordresidesinnativeapplicationformatandisaccessibleforpurposesofbusinessprocessingwithnorestrictionsonalterationbeyondnormalbusinessrules.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms.
Activity
Singlelogicalqueryortheprogressionofsinglelogicalqueriesperformedinteractivelyinanefforttoaccumulateintelligence.
Source: EDRMSearchGlossary.
AdHocSearch
Singlelogicalqueryortheprogressionofsinglelogicalqueriesperformedinteractivelyinanefforttoaccumulateintelligence.
Source: EDRMSearchGlossary.
Seealso:
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 5
AdHocWorkflow
Asimplemanualprocessbywhichdocumentscanbemovedaroundamulti-userimagingsystemonan“as-needed”basis.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
AdHocWorkflow
Rule-BasedWorkflow
Workflow
AdaptivePatternRecognition
Thesystemindexeseveryletteroneverypage.Whentheuserconductsasearch,thesystemconductsasearchbasedondiscretepatternsinthetext.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
ADC(AnalogtoDigitalConverter)
Changesanalogsignalstodigitalrepresentations(numbers).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 6
AdditiveColor
Allthecolorsinthelightspectrumadduptomakewhitelight.Computermonitorsuseathreeadditivecolors,Red,Green&Blue(RGB).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ADF(AutomaticDocumentFeeder)
Adevicethatholdspagesandfeedsthemoneafteranotherintoascanner.
Source: RSI,Glossary.
Thisisthemeansbywhichascannerfeedsthepaperdocument.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Admissible
Evidencethatisacceptableorallowableincourt.
Source: RenewData,Glossary(10/5/2005).
Admissibleevidenceinacourtoflawisanytestimonial,documentary,ortangibleevidencethatmaybeintroducedtoafactfinder--usuallyajudgeorjury--inordertoestablishortobolsterapointputforthbyapartytotheproceeding.Inorderforevidencetobeadmissible,itmustberelevant,withoutbeingprejudicial,anditmusthavesomeindiciaofreliability.
Forevidencetoberelevant,itmusttendtoproveordisprovesomefactthatisatissueintheproceeding.However,suchevidencewillnotbeadmissibleiftheutilityoftheevidenceisoutweighedbyitstendencytocausethefactfindertodisapproveofthepartyitisintroducedagainstforsomeunrelatedreason.Furthermore,certainpublic-policyconsiderationsbartheadmissionofotherwiserelevantevidence.
Forevidencetobereliableenoughtobeadmitted,thepartyprofferingtheevidencemustbeabletoshowthatthesourceoftheevidencemakesitso.Iftheevidenceisintheformofwitnesstestimony,thepartyintroducingtheevidencemustlaythegroundworkforthecredibilityofthewitness,andhisknowledgeofthethingstowhichheattests.Hearsayisgenerallybarredforitslackofreliability.Iftheevidenceisdocumentary,thepartyprofferingtheevidencemustbeabletoshowthatitisauthenticandmustbeabletodemonstratethechainofcustodyfromtheoriginalauthortothepresentholder.
Thetrialjudgeperformsa"gatekeeping"roleinexcludingunreliabletestimony.TheUnitedStatesSupremeCourtfirstaddressedthereliabilityrequirementforexpertsinthelandmarkcaseDaubertv.MerrellDowPharmaceuticals,Inc.509U.S.579(1993).TheCourtlaidoutfournon-exclusivefactorsthattrialcourtsmayconsiderwhenevaluatingscientificexpertreliability:(1)whetherscientificevidencehasbeentestedandthemethodologywithwhichithasbeentested;(2)whethertheevidencehasbeensubjectedtopeerrevieworpublication;(3)whetherapotentialrateoferrorisknown;and(4)whethertheevidenceisgenerallyacceptedinthe
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 7
scientificcommunity.Id.at592-94.KumhoTireCo.,Ltd.v.CarmichaellaterextendedtheDaubertanalysistoincludeallexperttestimony.526U.S.137(1999).
Source: EDRMPresentationGuide.
Agreement
ThefractionofallDocumentsthattworeviewerscodethesameway.WhilehighAgreementiscommonlyadvancedasevidenceofaneffectiverevieweffort,itsusecanbemisleading,forthesamereasonthattheuseofAccuracycanbemisleading.WhenthevastmajorityofDocumentsinaPopulationareNotRelevant,ahighlevelofAgreementwillbeachievedwhenthereviewersagreethattheseDocumentsareNotRelevant,irrespectiveofwhetherornottheyagreethatanyoftheRelevantDocumentsareRelevant.
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
AI(ArtificialIntelligence)
See: ArtificialIntelligence
AIIM(AssociationforInformationandImageManagement)
TheAssociationforInformationandImageManagement–focusedonelectronicimaging.
Externallinks:
AIIM,http://www.aiim.org/
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Algorithm
Amathematicalsetofstepsdesignedtosolveaproblemorruninstructionsinaprogram.Forexample,analgorithminacasemanagementprogramwouldperformthefunction,“Checkthefilingdateofthecomplaintinthismatter,determinethedatetofileananswer,determineiftheanswerhasbeensentoutand,ifnot,sendemailtotheattorneyinchargeofthecasewarninghimoftheimpendingdate.”
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Aformallyspecifiedseriesofcomputationsthat,whenexecuted,accomplishesaparticulargoal.TheAlgorithmsusedinE-Discoveryareimplementedascomputersoftware.
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
Aspecificsetofstepsthatwhenaccuratelyexecutedleadstoaspecificoutcome.Algorithmscanbecreatedformanydifferentkindsofprocesses,includingcalculation,dataprocessing,automatedreasoning,andmathematicalcomputations.Algorithmsshouldbedistinguished
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 8
fromheuristics.Theword“algorithm”isoftenmisusedtorefertoanycomputer-implementedprocess.
Source: HerbRoitblat,Search2020:TheGlossary.
Aliasing
Whencomputergraphicsoutputhasjaggededgesorastairsteppedappearancewhenmagnified.Homonymis"anti-aliasing”.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Alpha
Thecomplementofconfidencelevel.1–confidencelevel.Statisticiansrefertothealphaleveltodecidewhetheravalueissignificant.A95%confidencelevelhasa5%alphalevel.A99%confidencelevelhasa1%alphalevel.
Source: HerbRoitblat,PredictiveCodingGlossary.
Alphanumeric
Setofcharacterscomposedoflettersandnumbers;mayincludepunctuationmarksorothersymbols;excludesprintercontrolcharacterssuchas"carriagereturn"andflowcontrolcharacterssuchasXONandXOFF.
Source: RSI,Glossary.
Characterscomposedofletters,numbers(andsometimespunctuationmarks).Excludesprinter/flowcontrolcharacters,(CarriageReturn/XON&XOFF).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ALS(AutomatedLitigationSupport)
Theprocessofusingcomputerstocontroldataduringlitigation.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AmbientData
Datastoredinnon-traditionalcomputerstorageareasandformats,suchasWindowsswapfiles,unallocatedspaceandfileslack.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Ambientdata
Fragmenteddata
Freespace
Residualdata
Slackspace
Swapfile
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 9
Unallocatedspace
AmericanNationalStandardsInstitute(ANSI)
TheAmericanNationalStandardsInstitute,orANSI,"isaprivate,non-profitorganization(501(c)3)thatadministersandcoordinatestheU.S.voluntarystandardizationandconformityassessmentsystem.TheInstitute'smissionistoenhanceboththeglobalcompetitivenessofU.S.businessandtheU.S.qualityoflifebypromotingandfacilitatingvoluntaryconsensusstandardsandconformityassessmentsystems,andsafeguardingtheirintegrity."
Source: http://www.ansi.org/about_ansi/overview/overview.aspx?menuid=1
Externallink:
AmericanNationalStandardsInstitute,http://www.ansi.org
AmericanStandardCodeforInformationInterchange(ASCII)
Allocatesanumbertoeachkeyonthekeyboardthatcanbetradedandreadbymostcomputersystems.Atextfile.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
ASCII:TheacronymfortheAmericanStandardCodeforInformationInterchange,whichhasassignedacodedsetofnumberstorepresentlettersandotherspecialcharacters.ASCIIdataconsistsonlyoftextwithnoformatting(e.g.boldoritalics).
Source: IbisConsulting,Glossary.
Astandardcodeusedfordataexchangebetweencomputers.AnASCII(pronounced“as-key“)textfilecontainsonlythelettersofthealphabet,numbers,punctuation,andcertaincommunicationssymbols,butnoembeddedword-processingcodes.AnASCIIdatafile(orASCIIdelimitedfile)hasthedatainfieldsthatareseparatedbyquotationmarksorcommasandthatallowseasytransferintoadatabaseorspreadsheet.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Pronouncedask-ee.AmericanStandardsCommitteeII.Aneightbitcomputercodingstructureforletters,numbersandcharactersinwhichsevenbitsareusedtoidentifyeachindividualentity(128maximum),withonebitforparity.Whennoparitybitisused,alleightbitscanbeusedtorepresentupto256characters;thischaractersetisextendedASCII.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ASCIIisacodethatassignsanumbertoeachkeyonthekeyboard.ASCIItextdoesnotincludespecialformattingfeaturesandthereforecanbeexchangedandreadbymostcomputersystems.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 10
ThecodebywhichEnglishlettersarerepresentedinsideacomputer.Mostcommonlyusedtodiscussadocumentfromwhichallformattinginformation(otherthanspacesandparagraphbreaks)hasbeenremoved.Thetextofthosedocuments.MSWorddocuments,forexample,includealotofinformationinadditiontothetextthatspecifieshowthedocumentshouldlook,revisions,andsoforth.Theso-calledASCIItextofthisdocumentjustcontainsthetextofthedocument,witheverythingelseremoved.
Analog
Theelectricalreplicaorwaveformofaphysicalprocesscausedbychangesinamplitudeorfrequency.Oppositeofdigital(Zeros&Ones).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AnalogtoDigitalConverter
See: ADC(AnalogtoDigitalConverter)
AnalysisPhase
EvaluatingESIforcontentandcontext,includingkeypatterns,topics,peopleanddiscussion.
Source: EDRMStages
CorrespondstoUTBMSCodeL660.Activitiesandactionsrequiredbylitigationteamstobeabletomakeinformeddecisionsaboutstrategyandscopethroughreliablemethodsbasedonverifieddata.
Source: EDRMMetricsGlossary
Annotation
Anoteplacedinafull-textrecordtocommentonthetextualmaterial.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thechangesoradditionsmadetoadocumentusingstickynotes,ahighlighter,orotherelectronictools.Documentimagesortextcanbehighlightedindifferentcolors,redacted(blacked-outorwhited-out),stamped(e.g.“FAXED”or“CONFIDENTIAL”),orhaveelectronicstickynotesattached.Annotationsshouldbeoverlaidandnotchangetheoriginaldocument.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ANSI
See: AmericanNationalStandardsInstitute(ANSI)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 11
ApertureCard
AnIBMpunchcardwithawindowwhichholdsa35mmframeofmicrofilm.Indexinginformationispunchedinthecard.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Applet
Anapplicationprogramthatusestheclient'swebbrowsertoprovideauserinterface.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Application
Aprogram,thatinstructsacomputertoperformaspecificsetofinstructionsorexecuteaprocess.Somesoftwareapplicationsareuser-drivenlikeMicrosoftWordorNotepad,whileothersaresystem-drivenliketheWindowssystemclockorautomaticvirusscanningprograms.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Anapplicationisacollectionofoneormorerelatedsoftwareprogramsthatenablesausertoenter,store,view,modifyorextractinformationfromfilesordatabases.Thetermiscommonlyusedinplaceof“program,”or“software.”Applicationsmayincludewordprocessors,Internetbrowsingtoolsandspreadsheets.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Asetofelectronicinstructions,alsoknownasaprogram,whichinstructsacomputertoperformaspecificsetofprocesses.
Source: RSI,Glossary.
Aprogramthatperforms“people”functions,suchaswordprocessing,spreadsheets,orlitigationsupport.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Aprogramusedtogeneratedocuments(e.g.,MSWord,WordPerfect,Pagemaker,Visio).Eachapplicationistypicallyassociatedwithoneormorefileextensions(e.g.,.doc,.xls).)
ApplicationFile
Computerfilesthatrunsoftwareapplications(suchasMSOffice,LotusWordProandLotus1‑2‑3,AdobeAcrobat,TXTfiles,TIFFs,etc.)notassociatedwithmailcontainersortheirmessages,attachments,ornon-mailitems.
Source: IbisConsulting,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 12
ApplicationFileUser
Theuserdatasetcontainingapplicationfiles.
Source: IbisConsulting,Glossary.
ApplicationServiceProvider(ASP)
AnapplicationserviceproviderisacompanythatdeliverssoftwareapplicationstomultipleusersovertheInternetorothernetwork.Insteadofpurchasingsoftwarelicensesdirectlyfromvendorsorre-sellers,companiesrentthesoftwarefromanASP,whichhosts,maintainsandupgradessoftwareapplicationsandcomputerhardware.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Providingacomputerprogramorapplicationacrossabroadbandconnectionasathird-partyprovider.Allowsuserstolowerthecostofdeployinganapplication.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ApplicationSoftware
See: Application
Architecture
Thedesignorphysicalstructureofthecomputer’sinternalcomponentsandhowtheywork.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ArchivalData
Archivaldataisinformationthatisnotdirectlyaccessibletotheuserofacomputersystembutthattheorganizationmaintainsforlong-termstorageandrecordkeepingpurposes.ArchivaldatamaybewrittentoremovablemediasuchasaCD,magneto-opticalmedia,tapeorotherelectronicstoragedevice,ormaybemaintainedonsystemharddrivesincompressedformats(i.e.,datastoredonbackuptapesordisks,usuallyfordisasterrecoverypurposes).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Informationthatisnotdirectlyaccessibletotheuserofacomputersystembutthattheorganizationmaintainsforlong-termstorageandrecord-keepingpurposes.ArchivaldatamaybewrittentoremovablemediasuchasaCD,magneto-opticalmedia,tapeorotherelectronicstoragedevice,ormaybemaintainedonsystemharddrivesincompressedformats.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Datathatisnotimmediatelyavailabletothecomputeruserbutthattheorganizationpreservesforstorageandrecordkeepingpurposes,oftenstoredonCD-ROMs,tapes,orotherelectronicstoragedevices.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 13
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Informationthatisnotdirectlyavailabletotheuserofacomputerbuthasbeenstoredonthecomputersystemandcanberetrievedthroughaspecialprocess.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Archive
Along-termcomputerstoragearea.
Source: RenewData,Glossary(10/5/2005).
Archivesarelongtermrepositoriesforthestorageofrecords.Electronicarchivespreservethecontent,preventortrackalterationsandcontrolaccesstoelectronicrecords.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Acopyofdataonacomputerdrive,oronaportionofadrive,maintainedforhistoricalreference.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingFios'seDiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Acontainerthatholdsfiles,eithercompressedoruncompressed(ZIP,CAB,TAR,GZ,JAR,PST,NSF,orotherfiletypes).Therearetwotypesofarchives–mailcontainersandfilecontainers.
Source: IbisConsulting,Glossary.
Theprocedureoftransferringtextordatafromaharddisktooff-linestoragemediaforlateraccess.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AreaUndertheROCCurve(AUC)
FromSignalDetectionTheory,asummarymeasureusedtoassessthequalityofPrioritization.AUCistheProbabilitythatarandomlychosenRelevantDocumentisgivenahigherprioritythanarandomlychosenNon-RelevantDocument.AnAUCscoreof100%indicatesaperfectranking,inwhichallRelevantDocumentshavehigherprioritythanallNon-RelevantDocuments.AnAUCscoreof50%meansthePrioritizationisnobetterthanchance.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 14
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ArtificialIntelligence(AI)
Acategoryofcomputersciencedealingwiththeabilityofmachinestoperforminamannerassociatedwithhumanbeings,suchasreasoning,learning,orunderstandinglanguage.Currentlyassociatedwithvoicerecognitiontechnologyand,toalesserdegree,opticalcharacterrecognition(OCR).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anumbrellatermforcomputermethodsthatemulatehumanjudgment.TheseincludeMachineLearningandKnowledgeEngineering,aswellasPatternMatching(e.g.,voice,face,andhandwritingrecognition),robotics,andgameplaying.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ASCII
See: AmericanStandardCodeforInformationInterchange(ASCII)
ASP
See: ApplicationServiceProvider(ASP)
AspectRatio
Therelationshipoftheheightandwidthofanyimage.Thismustalwaysbepreservedtopreventdistortion.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aspects
Themajorelementscommontoeache-discoveryPhasearoundwhichidentifiablemetricsactivityaggregates:Custodians,Systems,Media,Status,Format,QA&Control,andActivities.
Source: EDRMMetricsGlossary
ASR
See: AutomatedSpeechRecognition(ASR)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 15
Asset
ThespecificcontainerofinformationthatITstoresandsecuresundertheirmanagement.Theprimarydriveristoincrease“Efficiency”andlowercostsassociatedwiththisfunction.
Source: IGRMWhitePaper
AssociationforInformationandImageManagement,The
See: AIIM(AssociationforInformationandImageManagement)
AssociativeRetrieval
Whencertaintermsappearfrequentlyinthevicinityofthetermsforwhichtheuserissearching,theseassociativewordsmayprovidecluesforfurthersearching.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Attachment
Anyfiletypeassociatedwithorattachedtoane-mail.
Source: RenewData,Glossary(10/5/2005).
Amemorandum,letter,spreadsheet,oranyotherelectronicdocumentappendedtoanotherdocumentoremail.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 16
Filesattachedtomailmessage(orsometimesembeddedintomailmessage).
Source: IbisConsulting,Glossary.
Anenclosuretoatransmittalletteroranexhibittoaprimarydocument.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anyelectronicdocumentappendedtoanotherdocument,typicallyemail.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anattachmentisarecordorfileassociatedwithanotherrecordforthepurposeofstorageortransfer.Theremaybemultipleattachmentsassociatedwithasingle“parent”or“master”record.Theattachmentsandassociatedrecordmaybemanagedandprocessedasasingleunit.Incommonuse,thistermreferstoafile(orfiles)associatedwithane-mailfortransferandstorageasasinglemessageunit.Becauseincertaincircumstancesthecontextoftheattachment—forexample,theparente-mailanditsassociatedmetadata—canbeimportant,anorganizationshouldconsiderwhetheritspolicyshouldauthorizeorrestrictthedisassociationofattachmentsfromtheirparentrecords.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
AttachmentField
Adatafieldusedtorecordinformationaboutenclosuresand/orattachmentstoa“parent”document.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 17
AttorneyNotesField
Adatafieldusedforongoingattorneynotesandcomments.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
Attribute
Adataattributeisacharacteristicofdatathatsetsitapartfromotherdata,suchaslocation,length,ortype.Thetermattributeissometimesusedsynonymouslywith“dataelement”or“property.”
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Audio-VideoInterleave(AVI)
AMicrosoftstandardforWindowsanimationfiles.Theformatinterleavesaudioandanimationtoprovidemediumqualitymultimedia.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AuditTrail
Incomputersecuritysystems,achronologicalrecordofwhenusersloggedin,howlongtheywereengagedinvariousactivities,whattheyweredoing,andwhetheranyactualorattemptedsecurityviolationsoccurred.Anaudittrailisanautomatedormanualsetofchronologicalrecordsofsystemactivitiesthatmayenablethereconstructionandexaminationofasequenceofeventsand/orchangesinanevent.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 18
Authentication
Authenticationistheactofestablishingorconfirmingsomething(orsomeone)asauthentic.Thismightinvolveconfirmingtheidentityofaperson,theoriginsofanartifact,orassuringthatacomputerprogramisatrustedone.
Source: EDRMPresentationGuide.
Author
Theauthorofadocumentistheperson,officeordesignatedpositionresponsibleforitscreationorissuance.Inthecaseofadocumentintheformofaletter,theauthorororiginatorisusuallyindicatedontheletterheadorbysignature.Insomecases,thesoftwareapplicationproducingthedocumentmaycapturetheauthor’sidentityandassociateitwiththedocument.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
AuthorField
Adatafieldusedforrecordingnamesofindividualsand/orbusinessentitieswhowrote,sent,ortransmittedadocument.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
Auto-Categorization
Theprocessofusingmachinelearningorotherrule-basedsystemsforcategorizingdocumentswithoutdirecthumanintervention.Forexample,emailsmaybeauto-categorizedastheyarriveatanarchiveastotheirretentionperiod.Thecategoriesmaybebasedonataxonomyorontology.
Source: HerbRoitblat,Search2020:TheGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 19
AutoFunction
ThisfunctiondynamicallyupdatesfieldcodevaluesenteredbyapplicationusersinMicrosoftOfficeandAutoCADdocuments.
Source: IbisConsulting,Glossary.
Autoexec.bat
Usuallypronouncedautoexecdotbat,thisisaspecialbatchfileusedonPCsthatrunswhenthecomputeristurnedonandtellsthecomputerwhatprogramstoexecutefirst.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AutomatedLitigationSupport(ALS)
See: ALS(AutomatedLitigationSupport)
AutomatedSpeechRecognition(ASR)
Alsocalledautomatedvoicerecognition(AVR).Aprogramthatwill“translate”wordsspokenintoamicrophoneconnectedtoacomputerintowrittentextinaword-processingprogramorperformafunctioninadatabaseprogram.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AutomatedVoiceRecognition(AVR)
See: AutomatedSpeechRecognition(ASR)
AutomaticDocumentFeeder(ADF)
See: ADF(AutomaticDocumentFeeder)
AVI
See: Audio-VideoInterleave(AVI)
AVR
See: AutomatedSpeechRecognition(ASR)
B
Back-End/Front-End
Expressionsthatdescribeprogramsrelativetotheuser.Afront-endprogramisonethatusersinteractwithdirectly,whileaback-endprogramsupportsthefront-endservices.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 20
Backfile
Anexistingpaperormicrofilmfile.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Backup
Tocreateacopyofdataasaprecautionagainstthelossordamageoftheoriginaldata.Mostusersbackupsomeoftheirfiles,andmanycomputernetworksutilizeautomaticbackupsoftwaretomakeregularcopiesofsomeorallofthedataonthenetwork.Somebackupsystemsusedigitalaudiotape(DAT)asastoragemedium.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Aduplicateofinformationasapreventativemeasureagainstthepotentiallossofdatathatisdoneregularlybymanycomputerusers.Manyorganizationsalsoutilizeautomaticbackupsoftwarethatregularlystoresdata.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Acopyofinactivedata,intendedforuseintherestorationofdatalosttocatastrophicfailureofsystemmemory.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
BackupData
Backupdataisinformationthatisnotpresentlyinusebyanorganizationandisroutinelystoredseparatelyuponportablemedia,tofreeupspaceandpermitdatarecoveryintheeventofdisaster.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Informationstoredseparatelyfromthecomputersystemtopermitdatarecoveryintheeventofdisaster.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 21
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
BackupTape
Tapedevicesthattransferactivedatatoinactivedata,intendedforuseindatarestoration.Backuptapestypicallyusedatacompression,whichmakesrestorationtime-consumingandexpensive,especiallygiventhelackofuniformstandardsgoverningdatacompression.
Source: IbisConsulting,Glossary.
Backupordisasterrecoverytapesareportablemediausedtostoredatathatisnotpresentlyinusebyanorganizationtofreeupspacebutstillallowfordisasterrecovery.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Tapemediausedtobackupdata.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Oneofvarioustypesofmagneticrecordingtapesthatareusedtosaveasnapshotofthecurrentstateofafilesystem.Backuptapesaredesignedtobeabletorestorethecontentofafilesystemifitshouldbecomecorrupted.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
BackupTapeRecycling
Theprocesswherebyanorganization'sbackuptapesareoverwrittenwithnewarchiveddatausuallyonafixedschedule(e.g.,theuseofnightlybackuptapesforeachdayoftheweekwiththedailybackuptapeforaparticulardaybeingoverwrittenonthesamedaythefollowingweek;weeklyandmonthlybackupsbeingstoredoffsiteforaspecifiedperiodoftimebeforebeingplacedbackintherotation).
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingAppliedDiscovery'sBoneuponBackup,http://www.lexisnexis.com/applieddiscovery/NewsEvents/PDFs/BoneUpOnBackup.pdf
Backuptaperecyclingistheprocesswherebyanorganization’sbackuptapesareoverwrittenwithnewbackupdata,usuallyonafixedschedule(i.e.,theuseofnightlybackuptapesforeachdayoftheweekwiththedailybackuptapeforaparticulardaybeingoverwrittenonthesame
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 22
daythefollowingweek;weeklyandmonthlybackupsbeingstoredoffsiteforaspecifiedperiodoftimebeforebeingplacedbackintherotation).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Theactbywhicholdbackuptapesareoverwrittenwithnewdata.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary
BagandTag
Theprocessofreceiving,recording,andsecuringclientsourcedataasevidence.Thefirstlinkinthechainofcustody.
Source: IbisConsulting,Glossary.
BagofWords
AFeatureEngineeringmethodinwhichtheFeaturesofeachDocumentcomprisethesetofwordscontainedinthatDocument.DocumentsaredeterminedtobeRelevantorNotRelevantdependingonwhatwordstheycontain.ElementaryKeywordSearchandBooleanSearchmethods,aswellassomeMachineLearningmethods,usetheBagofWordsmodel.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Bandwidth
Thequantityofinformationthatcanbesentoveranetworkinacertainamountoftime.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Theamountofinformationordatathatcanbesentoveranetworkconnectioninagivenperiodoftime.Bandwidthisusuallystatedinbitspersecond(bps),kilobitspersecond(kbps),ormegabitspersecond(mps).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
BarCode
Asmallpatternofverticallinesthatisreadbyalaseroranopticalscanner,andwhichcorrespondstoarecordinadatabase.Anadd-oncomponenttoimagingsoftware,thisfeatureisdesignedtoincreasethespeedwithwhichdocumentscanbearchived.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
BasicDe-Duplication
Performedonaselectandlimitedbasis,suchasforfilenamesandtypes,andisusuallybasedonthehashvalueoftheentireelectronicdocument.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 23
Source: RenewData,Glossary(10/5/2005).
Seealso:
Casede-duplication
Custodiande-duplication
De-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
Productionde-duplication
VerticalDeduplication
BasicInputOutputSystem(BIOS)
Aspecialprogramcontainedinthecomputer’sROMthatcontrolsthecomponentsofthecomputerandhowtheyinteractandworktogether.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ThespecificPCinput/output"rules"andtheprogramswhichexecutethesetoallowthetransferofinformationto/fromthe"centralprocessingunit"ofthePC.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Batch
Afilecontainingoneormorecommandsthatexecuteconsecutively,oneatatime.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acollectionofmaterialforinputintothecomputer,suchasabatchofdocumentssegregatedforcodingorabatchofdatarecordstoberestoredfromabackuptape.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
BatchPrinting
Theprocessofprintingagroupofdocuments,usuallyfromtheTIFF’sorPDF’s.
BatchProcessing
Thenameofthetechniqueusedtoinputalargeamountofinformationinasinglestep,asopposedtoindividualprocesses.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Batchload
Aclient-specificdocumentloadfile,generatinganoutputdirectorystructurefordeliverables(usuallyTIFForPDFimages,metadataandtextrepresentationsoffiles,butsometimesfilesintheirnativeformat).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 24
Source: IbisConsulting,Glossary.
BatesNumber
Adocumentidentificationtechniqueinwhicheverypage(orimage)ofeverydocumentinadocumentcollectionisassignedaunique,sequentialidentificationnumber.Batesnumbersmaybethenprintedontothedocumentpagebeforethepageisdistributedtomultiplepartiestoensurethateachdistributedpagecanbeidentifiedandcomparedtotheoriginal.
Source: IbisConsulting,Glossary.
TheBate®numberisanumberthatuniquelyidentifieseachpageofadocument.
Source: RSI,Glossary.
Abatesproductionnumberisatrackingnumberassignedtoeachpageofeachdocumentintheproductionset.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Auniquenumberthatisattachedtoeachpageofadocument(inelectronicormanualform)toidentifyit.ThewordBatescomesfromtheBatesCompany,whichwasoneoftheoriginatorsofnumeric(andalpha)stampingmachines.
Seealso:
Batesnumber
Batesprefix
Batesstamp
Batesstamping
Documentnumber
BatesPrefix
Aproject-specific,clientspecificationintheformofanalphanumericprefixthatprecedesaproject’scontrolnumber(thedigitalequivalentofaBatesnumber).Alsocalled"controlnumberprefix."
Source: IbisConsulting,Glossary.
Seealso:
Batesnumber
Batesprefix
Batesstamp
Batesstamping
Documentnumber
BatesStamp
Seealso:
Batesnumber
Batesprefix
Batesstamping
Documentnumber
©2016EDRMLLC
BatesStamping
TheprocessofaddinganimagerepresentationofaBatesnumbertoapageorimage(thisincludesPDFsandTIFFs).
Source: IbisConsulting,Glossary.
Seealso:
Batesnumber
Batesprefix
Batesstamp
Batesstamping
Documentnumber
Baud
Aunitofdata-transmissionspeedusedindiscussingmodems.Onebaudequalsonebitpersecond(bps).Divideby10togetcharacterspersecond(e.g.,a9600baudmodemsendsdataat960characterspersecond).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Baud
Fileserver
Laptopcomputer
Microcomputer
Minicomputer
Notebookcomputer
Personalcomputer
Workstation
BaudRate
Seealso:
Baud
Fileserver
Laptopcomputer
Microcomputer
Minicomputer
Notebookcomputer
Personalcomputer
Workstation
Bayes/Bayesian/Bayes’Theorem
AgeneraltermusedtodescribeAlgorithmsandothermethodsthatestimatetheoverallProbabilityofsomeeventuality(e.g.,thataDocumentisRelevant),basedonthecombinationofevidencegleanedfromseparateobservations.InElectronicDiscovery,themostcommonevidencethatiscombinedistheoccurrenceofparticularwordsinaDocument.Forexample,aBayesianAlgorithmmightcombinetheevidencegleanedfromthefactthataDocumentcontainsthewords“credit,”“default,”and“swap”toindicatethatthereisa99%ProbabilitythattheDocumentconcernsfinancialderivatives,butonlya40%Probabilityifthewords“credit”and“default,”butnot“swap,”arepresent.ThemostelementaryBayesianAlgorithmisNaïveBayes;howevermostAlgorithmsdubbed“Bayesian”aremorecomplex.BayesianAlgorithmsarenamedafterBayes’Theorem,coinedbythe18thcenturymathematician,
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 26
ThomasBayes.Bayes’TheoremderivestheProbabilityofanoutcome,giventheevidence,from:(i)theprobabilityoftheoutcome,independentoftheevidence;(ii)theprobabilityoftheevidence,giventheoutcome;and(iii)theprobabilityoftheevidence,independentoftheoutcome.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
BayesianCategorizer
Aninformationretrievaltoolthatcomputestheprobabilitythatadocumentisamemberofacategoryfromtheprobabilitythateachwordisindicativeofeachcategory.Theseestimatesarederivedfromexampledocuments.Usestheprobabilityofeachwordgiveneachcategorytocomputetheprobabilityofeachcategorygiveneachword.AlsocalledanaïveBayesianCategorizer.
Source: HerbRoitblat,PredictiveCodingGlossary.
BayesianClassifier
Bayesianclassifierisaprocessofidentifyingconceptsusingacertainrepresentativedocumentsinaparticularcategory.Theclassifierhastheabilitytodiscernotherresponsivedocumentsinthelargercollectionandplacetheminacategory.Typically,acategoryisrepresentedbyacollectionofwordsandtheirfrequencyofoccurrencewithinthedocument.Theprobabilitythatadocumentbelongstoacategoryisbasedontheproductofeachwordofthedocumentappearinginthatcategoryacrossalldocuments.Thus,thelearningclassifierisabletoapplywordspresentinasamplecategoryandapplythatknowledgetoothernewdocuments.Inthee-discoverycontext,aBayesianclassifiercanquicklyplacedocumentsintoconfidential,privileged,responsivedocumentsandotherwell-knowncategories.
Source: EDRMSearchGlossary
BayesianClassifier/BayesianFilter/BayesianLearning
AcolloquialtermusedtodescribeaMachineLearningAlgorithmthatusesaBayesianAlgorithmNaïveBayes.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
BBS(BulletinBoardSystem)
Abulletinboardsystem(BBS)isacomputeroranapplicationdedicatedtothesharingorexchangeofmessagesorotherfilesonanetwork.Originallyanelectronicversionofthetypeofbulletinboardfoundonthewallinmanykitchensandworkplaces,theBBSwasusedtopostsimplemessagesbetweenusers.TheBBSbecametheprimarykindofonlinecommunitythroughthe1980sandearly1990s,beforetheWorldWideWebarrived.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 27
Source: WhatIs.comdefinition,bulletinboardsystem(BBS),http://whatis.techtarget.com/definition/bulletin-board-system-BBS
Abulletinboardsystem,orBBS,isacomputerserverrunningcustomsoftwarethatallowsuserstoconnecttothesystemusingaterminalprogram.Onceloggedin,theusercanperformfunctionssuchasuploadinganddownloadingsoftwareanddata,readingnewsandbulletins,andexchangingmessageswithotherusersthroughemail,publicmessageboards,andsometimesviadirectchatting.ManyBBSesalsoofferon-linegames,inwhichuserscancompetewitheachother,andBBSeswithmultiplephonelinesoftenprovidechatrooms,allowinguserstointeractwitheachother.BulletinboardsystemswereinmanywaysaprecursortothemodernformoftheWorldWideWeb,socialnetworksandotheraspectsoftheInternet.Low-cost,high-performancemodemsdrovetheuseofonlineservicesandBBSesthroughtheearly1990s.Infoworldestimatedtherewere60,000BBSesserving17millionusersintheUnitedStatesalonein1994,acollectivemarketmuchlargerthanmajoronlineserviceslikeCompuServe.
Source: Wikipedia,Bulletinboardsystem,https://en.wikipedia.org/wiki/Bulletin_board_system
BeginningDocumentNumber
Thefirstpageofadocumentorrecord.OftenBegDoc#.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
BeginningNumberField
Adatafieldforrecordingthenumberofthefirstpageofadocument.Alsousedasadocumentidentifiertofindhardcopiesortoretrieveimages.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 28
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
Bibliographic/ObjectiveCoding
Objectiveinformation,oftenmanuallyrecordedfromdocumentssuchasthedocumentdate,theauthorsorrecipientsofthedocuments,orthetitleofadocument.Bibliographiccodingusuallytakesplaceagainstdocumentsoriginatingaspaperwithnoelectronicallystoredinformation.
Source: EDRMSearchGlossary.
Theenteringofobjectiveinformationsuchasdate,documentnumber,anddocumenttypeintodatafields.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Extractinginformationfromelectronicdocumentssuchasdatecreated,authorrecipient,CCandlinkingeachimagetotheinformationinpre-definedobjectivefields.IndirectoppositiontoSubjectivecodingwherelegalinterpretationsofdatainadocumentarelinkedtoindividualdocuments.Alsocalledobjectivecoding.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
©2016EDRMLLC
BigData
Anill-definedtermforlargecollectionsofdataofvarioussorts.Bigdatamaybebigbecauseitincludesalargenumberofrecords(e.g.,allofthetransactionsonAmazon),becauseitincludesalargenumberofvariables(allofthecharacteristicsorfeaturesthatabankknowsabouteachcustomer),orboth.Bigdataoftensuffersfromthe4-Vs:Velocity,Variety,Volume,andVeracity.Bigdataaccumulateveryrapidly,theyconsistofmanydifferentkindsofdata,athighvolume,andthequalityofthedataoftenpresentsachallenge.Bigdatacanalsorefertoverylargecollectionsofelectronicallystoredinformation.
Source: HerbRoitblat,Search2020:TheGlossary.
Bigram
AnN-GramwhereN=2(i.e.,a2-gram).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Binary
Mathematicalbase2,ornumberscomposedofaseriesofzerosandones.Sincezero'sandone'scanbeeasilyrepresentedbytwovoltagelevelsonanelectronicdevice,thebinarynumbersystemiswidelyusedindigitalcomputing.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
BinomialCalculator/BinomialEstimation
AstatisticalmethodusedtocalculateConfidenceIntervals,basedontheBinomialDistribution,thatmodelstherandomselectionofDocumentsfromalargePopulation.BinomialEstimationisgenerallymoreaccurate,butlesswellknown,thanGaussianEstimate.ABinomialEstimateissubstantiallybetterthanaGaussianEstimate(which,incontrast,reliesontheGaussianorNormalDistribution)whentherearefew(orno)RelevantDocumentsintheSample.WhentherearemanyRelevantandmanyNon-RelevantdocumentsintheSample,BinomialandGaussianEstimatesarenearlyidentical.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
BinomialDistribution
TheProbabilitythataRandomSamplefromalargePopulationwillcontainanyparticularnumberofRelevantDocuments,giventhePrevalenceofRelevantDocumentsinthePopulation.UsedasthebasisforBinomialEstimation.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 30
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
BinomialEstimate
AStatisticalEstimateofaPopulationcharacteristicusingBinomialEstimation.ItisgenerallyexpressedasaPointEstimateaccompaniedbyaMarginofErrorandaConfidenceLevel,orasaConfidenceIntervalaccompaniedbyaConfidenceLevel.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
BIOS
See: BasicInputOutputSystem(BIOS)
Bit
Abitisthesmallestunitofinformationrecognizedbyacomputer;itcorrespondstoachoicebetweenoneandzero,thebasisforallinformationstorageinbinarylanguagecomputers.Eightbitsmakeupabyte.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Ameasurementofdata.Itisthesmallestunitofdata.Abitiseitherthe"1"or"0"componentofthebinarycode.Acollectionofbitsisputtogethertoformabyte.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterm
Singlepositioninbase2arithmetic(2n)–eitheron(1)oroff(0).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Byte
KB-kilobyte
MB-megabyte
GB-gigabyte
TB-terabyte
PB-petabyte
EB-exabyte
BitMap
Bitmapimages,alsocalledrasterorpaintimages,aremadeofindividualdotscalledpixels(pictureelements)thatarearrangedandcoloreddifferentlytoformapattern.Whenyouzoomin,youcanseetheindividualsquaresthatmakeupthetotalimage.Increasingthesizeofabitmaphastheeffectofincreasingindividualpixels,makinglinesandshapesappearjagged.Reducingthesizedistortstheoriginalimagebecausepixelsareremovedtoreducetheoverall
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 31
imagesize.Becauseabitmapiscreatedasacollectionofarrangedpixels,itspartscannotbemanipulated(e.g.,moved)individually.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Creatingcharactersorimagesbycreatinga"picture"(matrix)ofindividualbits(pixels).Theindividualbitsmayjustbebinary(blackandwhite)orhighdefinitioncolor.Incolorsystems,the"z-axis"ofeachpixelhasavaluewhichrepresentsthe"shadeofgray"orcolorofthebit.Thisvaluecanbeashighas32bitsforveryhighresolutioncolor.Thisresultsinalarge,uncompressedfile.Forinstance,a300dpi,E-Sizedrawingbitmapisapproximately16MB.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Bit-by-BitCopy
See: BitstreamCopy
Bitmap
Bitmapimages,alsocalledrasterorpaintimages,aremadeofindividualdotscalledpixels(pictureelements)thatarearrangedandcoloreddifferentlytoformapattern.Whenyouzoomin,youcanseetheindividualsquaresthatmakeupthetotalimage.Increasingthesizeofabitmaphastheeffectofincreasingindividualpixels,makinglinesandshapesappearjagged.Reducingthesizedistortstheoriginalimagebecausepixelsareremovedtoreducetheoverallimagesize.Becauseabitmapiscreatedasacollectionofarrangedpixels,itspartscannotbemanipulated(e.g.,moved)individually.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Creatingcharactersorimagesbycreatinga"picture"(matrix)ofindividualbits(pixels).Theindividualbitsmayjustbebinary(blackandwhite)orhighdefinitioncolor.Incolorsystems,the"z-axis"ofeachpixelhasavaluewhichrepresentsthe"shadeofgray"orcolorofthebit.Thisvaluecanbeashighas32bitsforveryhighresolutioncolor.Thisresultsinalarge,uncompressedfile.Forinstance,a300dpi,E-Sizedrawingbitmapisapproximately16MB.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Bitonal
Animageorfilecomprisedofpixelordotvaluesofeitherblackorwhite.
Source: RSI,Glossary.
Bi-tonal(blackandwhiteonly,onebitperpixel).ABi-tonalimageiscreatedbyathresholdingprocessfromagrayscaleinput,eitherduringthescanningprocessorsubsequently.Thresholdingisanirreversibleprocesswhichresultsinspeckledimageswithnoticeably"stair-stepped"diagonallines.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 32
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
BitsPerInch(BPI)
Thisdefinesdatadensitiesindiskandmagnetictapesystems.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
BitsPerSecond(BPS)
Indatacommunications,bitspersecond(abbreviatedbpsorbit/sec)isacommonmeasureofdataspeedforcomputermodemsandtransmissioncarriers.Asthetermimplies,thespeedinbpsisequaltothenumberofbitstransmittedorreceivedeachsecond.
Source: TechTargetdefinition,bitspersecond(bpsorbit/sec),http://searchnetworking.techtarget.com/definition/bits-per-second
BitstreamCopy
Bitstreambackup(alsoreferredtoasmirrorimagebackup)involvesthebackupofallareasofacomputerharddiskdriveoranothertypeofstoragemedia.Suchabackupexactlyreplicatesallsectorsonagivenstoragedevice.Thus,allfilesandambientdatastorageareasarecopied.Bitstreambackups-sometimesalsoreferredtoas"evidencegrade"backups-differsubstantiallyfromtraditionalcomputerfilebackupsandnetworkserverbackups.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingNTI'sComputerForensicsDefinitions,http://www.forensics-intl.com/def2.html
BitstreamImage
Asector-by-sector,bit-by-bitcopyofaphysicalharddriveoralogicaldrive.
Source: EDRMCollectionStandards
SeeBitstreamcopy:Bitstreambackup(alsoreferredtoasmirrorimagebackup)involvesthebackupofallareasofacomputerharddiskdriveoranothertypeofstoragemedia.Suchabackupexactlyreplicatesallsectorsonagivenstoragedevice.Thus,allfilesandambientdatastorageareasarecopied.Bitstreambackups-sometimesalsoreferredtoas"evidencegrade"backups-differsubstantiallyfromtraditionalcomputerfilebackupsandnetworkserverbackups.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingNTI'sComputerForensicsDefinitions,http://www.forensics-intl.com/def2.html
BlairandMaron
Authorsofaninfluential1985studystudy(DavidC.Blair&M.E.Maron,AnEvaluationofRetrievalEffectivenessforaFull-TextDocument-RetrievalSystem,28COMMC’NSACM289(1985)),showingthatattorneyssupervisingskilledparalegalsbelievedtheyhadfoundatleast
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 33
75%oftheRelevantDocumentsfromaDocumentCollection,usingSearchTermsanditerativesearch,whentheyhadinfactfoundonly20%.Thatis,thesearchersbelievedtheyhadachieved75%Recall,buthadachievedonly20%Recall.IntheBlairandMaronstudy,theattorneysandparalegalsusedaniterativeapproach,examiningtheretrievedDocumentsandrefiningtheirsearchtermsuntiltheybelievedtheyweredone.ManycurrentcommentatorsincorrectlydistinguishtheBlairandMaronstudyfromcurrentiterativeapproaches,failingtonotethattheBlairandMaronsearchersdidinfactrefinetheirsearchtermsbasedontheirreviewoftheDocumentsthatwerereturnedinresponsetotheirqueries.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Blog
Aweblog.Ajournalavailableonawebpage,typicallyonaspecificsubjectandupdateddaily.Legalblogsaresometimescalled“Blawgs.”
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Blogs,alsoreferredtoasWeblogs,arefrequent,chronologicalWebpublicationsconsistingoflinksandpostings.Themostrecentpostingappearsatthetopofthepage.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Blowback
Printingelectronicfilestopaperforrevieworproductioninhardcopyform.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingAlbertBarsocchini,DataCollectionStandards(LTN1/15/04),http://www.law.com/special/supplement/e_discovery/data_collection.shtml
Theto-be-printedelectronicfilesmayhavepreviouslybeenscannedfrompaperintoelectronicform(hencetheterm"blowback")and/ororiginatedinnativeelectronicform.Ineitherevent,somewherealongthewaythefilesmayhavebeenconvertedinto.pdfor.tifand/orendorsedwithBatesnumbers,privilegestamps,confidentialityredactionoverlays,etc.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).
Printingtifforpdfimagesonpaper.Itiscalledblowbackbecauseoriginallypaperdocumentsmaybescannedandthenreprintedonpaperfromthescannedimages.
BMP
AnativefileformatofWindowsforstoringimagescalledbitmaps.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 34
BooleanSearch
AsearchtechniquethatutilizesBooleanLogictoconnectindividualkeywordsorphraseswithinasinglequerysuchasAND,OR,andNOT,within(w/5),andNOTwithinN(notw/5).
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
AKeywordSearchinwhichtheKeywordsarecombinedusingoperatorssuchas“AND,”“OR,”and“[BUT]NOT.”TheresultofaBooleanSearchispreciselydeterminedbythewordscontainedintheDocuments.(SeealsoBagofWordsmethod.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Theterm"Boolean"referstoasystemoflogicdevelopedbyanearlycomputerpioneer,GeorgeBoole.InBooleansearching,an"and"operatorbetweentwowordsresultsinasearchfordocumentscontainingbothofthewords.An"or"operatorbetweentwowordscreatesasearchfordocumentscontainingeitherofthetargetwords.A"not"operatorbetweentwowordscreatesasearchresultcontainingthefirstwordbutexcludingthesecond.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingAppliedDiscovery'sGlossary,http://www.lexisnexis.com/applieddiscovery/clientResources/glossary_B.asp
Source: RSI,Glossary.
AsearchtypeusingBooleanlogicoperatorsbetweensearchtermsthatindicatearelationshipbetweenthem.An"AND"operatorbetweentwowordsorothervalues(forexample,"pearANDapple")meansoneissearchingfordocumentscontainingbothofthewordsorvalues,notjustoneofthem.An"OR"operatorbetweentwowordsorothervalues(forexample,"pearORapple")meansoneissearchingfordocumentscontainingeitherofthewords.
Source: IbisConsulting,Glossary.
MathematicalquerylanguagedevelopedbyEnglishmathematicianGeorgeBooleinthe19thcentury.Booleansearchingoftextisbasedontheunderlyinglogicfunctionsofvarioustrue/falsestatements.CommonBooleanoperatorsare“and,”“butnot,”and“within.”
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Asearchforinformationusing“AND,”“OR”and“NOT”commands,suchas“TombutnotJones”or“bankruptcyandtrustee.”
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 35
Theuseoftheterms“AND,”“OR”and“NOT”inconductingsearches.Usedtowidenornarrowthescopeofasearch.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Boot
Theprocesswherebyacomputerautomaticallyloadsitsstartupsoftwarewhenithasbeenturnedon.Alsocalled“bootup,”thetermderivesfromthephrase“toliftoneselfupbyone’sownbootstraps.”
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Box
Asquaregraphicelementonaformusedtoenterasinglecharacter,usuallyusedinstringsforenteringconstraineddata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
BPI
See: BitsPerInch
BPS(BitsPerSecond)
See: BitsPerSecond(BPS)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 36
Briefcase
Amethodtosimplifythetransportofagroupofdocumentsfromonecomputertoanother.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
BulkCoding
TheprocessofCodingallmembersofagroupofDocuments(identified,forexample,byDeduplication,Near-Deduplication,EmailThreading,orClustering)basedonthereviewofonlyoneorafewmembersofthegroup.AlsoreferredtoasBulkTagging.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
BulletinBoardSystem
See: BBS(BulletinBoardSystem)
Bum
Slangformaking(burning)aCD-ROMcopyofdata,whetheritismusic,software,orotherdata.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Burn CDburning
Burn
TheprocessofcreatingacopyofinformationontoaCD-RomorDVD.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
TorecordorwritedataonaCDorDVD.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Bum CDburning
Bus
Apathwaybetweenhardwaredevices,whichmaybeinternal,asisthecasewithcomponentsofacomputer,orexternal,asisthecasewithcomputersinanetwork.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
The"highway"whichconnectsthevariouscomponentsofacomputersystem.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 37
Source:FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
BusinessProcessOutsourcing
Businessprocessoutsourcingoccurswhenanorganizationturnsoverthemanagementandoptimizationofabusinessfunction,suchasaccountspayableorpurchasing,toathirdpartythatconductstheactivitybasedonasetofpredeterminedperformancemetrics.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Byte
Aunitofmeasureconsistingofeightbitsthatisthebasicmeasurementofmostcomputerdataasmultiplesofthebytevalue.Onemillionbytesareequivalenttoa"megabyte"whileonebillionbytesisa"gigabyte."
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Eightbits.
Source:LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acomputerwordorasequenceofbitsusedasoneunit,usuallyeightbitslong.Inwordprocessing,asinglecharacter,suchasaletter,isusuallyonebyteinsize.
Source:LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Eightbits.TheASCIIstandardtodefineletters,numbersandcharacters–maximumof256.KB–Kilo-bytes,athousandbytes(actually210or1024bytes).MB–Megabytes,amillionbytes,(actually220or1,024KBor1,048,576bytes)GB–Gigabytes,abillionbytes(actually230or1024MBor1,073,741,824bytes).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Eightbits.Abyteisacollectionofbitsusedbycomputerstorepresentacharacter(i.e.,"a","1",or"&").A"megabyte"isonemillionbytesoreightmillionbitsora"gigabyte"isonebillionbytesoreightbillionbits.1gigabyte=1,000megabytes.1terabyte=1,000gigabytes.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Standardunitofmeasureforcomputerstorage.Abyteis8bits(binarydigits)andcorrespondstoabout1Englishcharacter.
Seealso:
Bit
KB-kilobyte
MB-megabyte
GB-gigabyte
TB-terabyte
PB-petabyte
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 38
EB-exabyte
ByteLevelDeletion
Deletionistheprocesswherebydataisremovedfromactivefilesandotherdatastoragestructuresoncomputersandrenderedinaccessibleexceptusingspecialdatarecoverytoolsdesignedtorecoverdeleteddata.Deletionoccursinseverallevelsonmoderncomputersystems:
1. Fileleveldeletion:Deletiononthefilelevelrendersthefileinaccessibletotheoperatingsystemandnormalapplicationprogramsandmarksthespaceoccupiedbythefile'sdirectoryentryandcontentsasfreespace,availabletoreusefordatastorage.
2. Recordleveldeletion:Deletionontherecordleveloccurswhenadatastructure,likeadatabasetable,containsmultiplerecords;deletionatthislevelrenderstherecordinaccessibletothedatabasemanagementsystem(DBMS)andusuallymarksthespaceoccupiedbytherecordasavailableforreusebytheDBMS,althoughinsomecasesthespaceisneverreuseduntilthedatabaseiscompacted.Recordleveldeletionisalsocharacteristicofmanye-mailsystems.
3. Byteleveldeletion:Deletionatthebyteleveloccurswhentextorotherinformationisdeletedfromthefilecontent(suchasthedeletionoftextfromawordprocessingfile);suchdeletionmayrenderthedeleteddatainaccessibletotheapplicationintendedtobeusedinprocessingthefile,butmaynotactuallyremovethedatafromthefile'scontentuntilaprocesssuchascompactionorrewritingofthefilecausesthedeleteddatatobeoverwritten.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Deletionistheprocesswherebydataisremovedfromactivefilesandotherdatastoragestructuresoncomputersandrenderedinaccessibleexceptusingspecialdatarecoverytoolsdesignedtorecoverdeleteddata.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Removingactivefilesmakingthemunavailable.Specialdatarecoverytoolscanstillretrievethesefiles.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
C
Cache
Aformofhigh-speedmemoryusedtotemporarilystorefrequentlyaccessedinformation;oncetheinformationisstored,itcanberetrievedquicklyfrommemoryratherthanfromtheharddrive.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 39
Source: RSI,Glossary.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Adedicated,highspeedportionofcomputermemorywhichcanbeusedforthetemporarystorageoffrequentlyuseddatatomaketheapplicationrunfaster(preventshavingtoconstantlyaccessthedatafromdisk/tapestorage).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Atypeacomputermemorythattemporarilystoresfrequentlyusedinformationforquickaccess.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Caching
Ofimages:Thetemporarystorageofimagefilesonaharddiskforlatermigrationtopermanentstorage,likeanopticalorCDjukebox.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CAR
See: ComputerAssistedReview(CAR)
CaseDe-Duplication
Retainsonlysinglecopiesofdocumentspercase.Forexample,ifanidenticaldocumentresideswithMr.A,Mr.BandMr.C,onlythefirstoccurrenceofthefilewillbesaved(Mr.A's).Contrastwithcustodiande-duplicationandproductionde-duplication.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Basicde-duplication
Custodiande-duplication
De-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
Productionde-duplication
VerticalDeduplication
CaseManagementSystem(CMS)
Softwaredesignedtoregulatealllawofficefunctionsperformedwithcomputersfromonecentralapplication.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 40
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
CaseSearch/SpecifyingCase
Specifyingthatthesearchmustbecasesensitivewillmatchtheexactcaseforalllettersinthekeywordandinthedocuments.Forexample,acase-sensitivesearchonRosewillmatchthename“RoseJones”butitwillnotmatchthephrase“rosegarden”.
Source: EDRMSearchGlossary.
CCD(ChargeCoupledDevice)
Acomputerchip(withsay2048cells)whoseoutputisproportionaltothelightorcolorpassedbyit.IndividualCCD'sorarraysoftheseareusedinscannersasahigh-resolution,“digitalcamera"to"read"documents.Thesedevicesaremicro-chipsizeandtheirresolutionsrunashighas1000pixelsperinch.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CCITT(ConsultativeCommitteeforInternationalTelephone&Telegraphy)
Setsstandardsforphones,faxes,modemsetc.Thestandardexistsprimarilyforfaxdocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
CCITTGroup4
CCITTGroup4
Acompressiontechnique/formatthatreducesafilegenerally,about5:1overRLEand40:1overbitmap.Forexample,ata300bpiscanrate,theapproximatestoragerequirementsare:SizeRawRLEGroup4A1MB200K40KB2MB400K75KC4MB820K150KD8MB1.6MB300KE16MB3.2MB580K.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
CCITT
CD(CompactDiscorCompactDisk)
A43/4"diameterdevicewhichcanbereadbyalaserbeam.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 41
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aremovableopticaldiskthatcanbeusedtostoredocumentsorotherdata.CDsareavailablethatcanbebothreadandwrittenusingwidelyavailableCD“burners.”ItiscommontotransferlargeamountsofdatafromonecomputertoanotherusingCDs.
Seealso:
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
CDBurning
TheprocessofwritingoutputtoCD-ROMsorDVDs.
Source: IbisConsulting,Glossary.
Seealso:
Bum Burn
CDPublishing
Analternativetophotocopyinglargevolumesofpaperdocuments.ThismethodinvolvescouplingimageandtextdocumentswithviewersoftwareonCDs.SometimessearchsoftwareisincludedontheCDstoenhancesearchcapabilities.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CD-R(CD-Recordable)
ThisisaCDthatcanbewritten(orrecorded)onlyonce.Itcanbecopiedtodistributealargeamountofdata.CD-RscanbereadonanyCD-ROMdrivewhetheronastandalonecomputerornetworksystem.Thismakesinterchangebetweensystemseasier.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
OftenalsousedasanacronymforCD-ROM'sthatcanbewrittenmorethanonce.Thesucceedingwritingsmustutilizeunusedsectionsoftheoriginal,withalibraryodirectoryofthetotaluse.OpticalstoragetechnologyusingformatscompatiblewithCD-ROM's.CD-ROMdiscs
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 42
mustbe"pre-mastered"toinsurethatthedataiscorrectlyformatted.Usinga"doublespeed"recorder,ittakesaboutahalfhourtoburnacomplete650MBdisc.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
CD
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
CD-Recordable
See: CD-R(CD-Recordable)
CD-ROM(ComputerDiskReadOnlyMemory)
OpticaldiskstorageusingthesametechnologyasaudioCDs.AcomputercanreadaCD-ROMdiscbutcannotwriteonit.Typicallyusedtodistributelargeamountsoftextualinformation,sinceoneCD-ROMholdsabout650MBofdata,orapproximately15,000pagesoftext.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Atypeofhighdensityopticaldiskwitha4"diameteranda650MBcapacity.Theinformation(1'sor0's)ispermanentlyetchedbyalaserintothesurfaceofthediskandreadbyalaserbeam.TheISO9660standarddefineshowaCD-ROMiswrittenforcomputerinterface.Itisnotrewritable.Itislegallyacceptedandwrittenonasingle-side.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
WrittenonalargescaleandnotonastandardcomputerCDburner(CDwriter),theyareanopticaldiskstoragemediapopularforstoringcomputerfilesaswellasdigitallyrecordedmusic.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Datastoragemediumthatusescompactdiscstostoreabout1,500floppydiscs’worthofdata.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
©2016EDRMLLC
CD
CD-R
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
CD-ROMDrive
Acomputerdrivethatreadscompactdiscs.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CD-RW(CompactDiscRe-Writable)
Seealso:
CD
CD-R
CD-ROM
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
CDMA(Code-DivisionMultipleAccess)
Anemergingwirelesscommunicationtechnologyforalldigitalvoiceanddatanetworks.
Source: RenewData,Glossary(10/5/2005).
CDPD(CellularDigitalPacketData)
Adatacommunicationstandardwhichusestheunusedcapacity(bandwidth)ofcellularvoiceproviders.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 44
CDR(ComputerDiskRecorder)
Themachinethatactually“burns”informationontoaCD.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
CellularDigitalPacketData
See: CDPD(CellularDigitalPacketData)
CentralProcessingUnit(CPU)
The“brain”ofthecomputer.InaPC,theCPUiscontainedonasinglemicroprocessorchipandperformsalllogicaloperationstorunprogramsandsolveproblems.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Theportionofacomputerwhichperformsmostofthelogicalandarithmeticfunctions.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CentronicsInterface
Aparallelinterfacestandardforconnectingprintersandotherdevicestocomputers.PioneeredbytheCentronicsInc.,aprintermanufacturerinNewHampshire.Usesa36pinconnector.SeeSPP.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CertifiedForensicExaminer
Apersonholdingoneofanumberofcommonlyrecognizedcertificationsinthefield.Duetoalackofindustrywidecertificationsitiscriticaltoresearchthecertificationsandanyrequirementswithinyourstateorjurisdiction.
Source: EDRMCollectionStandards
CGA(ColorGraphicsAdapter)
ShortforColorGraphicsAdapter,CGAwasanearlyIBMvideoadapterthatreplacedmonochromeandwasfirstintroducedin1981.CGAhasthehighestresolutionof640x200,colordepthof4-bit,andsupports16colors(24=16).
Source: ComputerHope,CGAdefinition,http://www.computerhope.com/jargon/c/cga.htm
TheColorGraphicsAdapter(CGA),originallyalsocalledtheColor/GraphicsAdapterorIBMColor/GraphicsMonitorAdapter,introducedin1981,wasIBM'sfirstgraphicscardandfirst
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 45
colordisplaycardfortheIBMPC.Forthisreason,italsobecamethatcomputer'sfirstcolorcomputerdisplaystandard.
Source: Wikipedia,ColorGraphicsAdapter,https://en.wikipedia.org/wiki/Color_Graphics_Adapter
Seealso:
VGA
ChainofCustody
Allinformationonafile'stravelsfromitsoriginalcreationversiontoitsfinalproductionversion.Adetailedaccountofthelocationofeachdocument/filefromthebeginningofaprojectuntiltheend.Asoundchainofcustodyverifiesthatyouhavenotalteredinformationeitherinthecopyingprocessorduringanalysis.Ifyoucannotshowthechainofcustody,youmayhaveadifficulttimedisprovingthatoutsideinfluencesmighthavetamperedwiththedata.Achainofcustodyfailure—i.e.,themishandlingofelectronicevidence(evenfullyrecoveredfiles)—cancausealitigationdefeat.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingFeldman,TheEssentialsofComputerDiscovery,ComputerForensicsInc.(1/1/2001),http://www.forensics.com/pdf/Essentials_of_Discovery.pdf#page=12
Aprocessusedtomaintainanddocumentthechronologicalhistoryofthehandlingofelectronicevidence.Achainofcustodyensuresthatthedatapresentedis"asoriginallyacquired"andhasnotbeenalteredpriortoadmissionintoevidence.Someprovidersmaintainanelectronicchain-of-custodylinkbetweenallelectronicdataanditsoriginalphysicalmediathroughouttheproductionprocess.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Anaccountingofthecontrol(custody)ofrealevidenceatalltimesuntilthemomentitisofferedinevidence.Chainofcustodyhelpstoshowthattheevidencebeingofferedhasnotbeentamperedwithandisauthentic.Chainofcustodyisimportantforelectronicevidencebecauseitcanbeeasilyaltered.
Source: IbisConsulting,Glossary.
Chainofcustodyreferstothechronologicaldocumentationand/orpapertrailshowingtheseizure,custody,control,transfer,analysis,anddispositionofevidence,physicalorelectronic.Becauseevidencecanbeusedincourttoconvictpersonsofcrimes,itmustbehandledinascrupulouslycarefulmannertoavoidlaterallegationsoftamperingormisconduct,whichcancompromisethecaseoftheprosecutiontowardacquittalorbecomegroundsforoverturningaguiltyverdictuponappeal.Theideabehindrecordingthechainofcustodyistoestablishthattheallegedevidenceisinfactrelatedtotheallegedcrime-ratherthan,forexample,havingbeenplantedfraudulentlytomakesomeoneappearguilty.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 46
Establishingchainofcustodyisespeciallyimportantwhentheevidenceconsistsoffungiblegoods.Inpractice,thismostoftenappliestoillegaldrugsthathavebeenseizedbylawenforcementpersonnel.Insuchcases,thedefendantmaydisclaimanyknowledgeofpossessionofthecontrolledsubstanceinquestion.Accordingly,thechainofcustodydocumentationandtestimonyispresentedbytheprosecutiontoestablishthatthesubstanceinevidencewasinfactinthepossessionofthedefendant.
Anidentifiablepersonmustalwayshavethephysicalcustodyofapieceofevidence.Inpractice,thismeansthatapoliceofficerordetectivewilltakechargeofapieceofevidence,documentitscollection,andhanditovertoanevidenceclerkforstorageinasecureplace.Thesetransactions,andeverysucceedingtransactionbetweenthecollectionoftheevidenceanditsappearanceincourt,shouldbecompletelydocumentedchronologicallyinordertowithstandlegalchallengestotheauthenticityoftheevidence.Documentationshouldincludetheconditionsunderwhichtheevidenceisgathered,theidentityofallevidencehandlers,durationofevidencecustody,securityconditionswhilehandlingorstoringtheevidence,andthemannerinwhichevidenceistransferredtosubsequentcustodianseachtimesuchatransferoccurs(alongwiththesignaturesofpersonsinvolvedateachstep).
Source: EDRMPresentationGuide.
Seealso:
Chainofevidence Forensicallysoundprocedures
ChainofEvidence
The"sequencing"ofthechainofevidencefollowsthisorder:identificationandcollection;analysis;storage;preservation;transportation;presentationincourt;returntoowner.Thechainofevidenceshows:whoobtainedtheevidence;whereandwhentheevidencewasobtained;whosecuredtheevidence;whohadcontrolorpossessionoftheevidence.1
Seealso:
Chainofcustody Forensicallysoundprocedures
CharacterEncoding
Electronicdataisrepresentedassequencesofbits,ornumbers.Eachalphabetorscriptusedinalanguageismappedtoauniquenumericvalue.Thisisreferredtoascharacterencoding.SeealsoUnicode.
Source: EDRMSearchGlossary.
CharacterTreatment
Theuseofallcapsoranotherstandardformoftreatinglettersinacodingproject.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 47
CharactersPerInch(CPI)
ChargeCoupledDevice
See: CCD(ChargeCoupledDevice)
Chip
Apieceofsiliconcontainingelectroniccircuitsthatperformcomputingfunctionsprocessedbythechip.Thechipismountedontoasocketthathasanumberofprojectingpinsandfitsintoareceptacleonthemotherboard.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
CIE(CommissionInternationaldel’Eclairage)
Theinternationalcommissiononcolormatchingandilluminationsystems.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Cine-Mode
Datarecordedonafilmstripsuchthatitcanbereadbyahumanwhenheldvertically.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Cinepak
Acompressionalgorithm,seeMPEG.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CITIS(ContractorIntegratedTechnicalInformationService)
TheDepartmentofDefensenowrequirescontractorstohaveanelectronicdocumentimageandmanagementsystem.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CivilProcedureRules(CPR)
TheprocessformerlyknownasdiscoverybywhichdocumentsareexchangedbetweenpartiesinlitigationinEnglandandWales.Technicallytheprocesshasthreephases-a)disclosure-makingitknownthatthedocumentsexistbyprovidingtheotherpartywithalist,b)inspection-allowingtheotherpartytolookatthedocumentsc)theprovisionofcopies.Typicallyallthree
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 48
phasesaredealtwithtogether.UnliketheUS,productionofdocumentsisinitiallydrivenby"push"i.e.thereisanobligationonapartytodisclosetheirdocumentswhicharematerialtothecase.ForfulldetailsseeCPR31andPD31.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
Classical,Gaussian,orNormalCalculator/Classical,Gaussian,orNormalEstimation
AmethodofcalculatingConfidenceIntervalsbasedontheassumptionthatthequantitiestobemeasuredfollowaGaussian(Normal)Distribution.Thismethodismostcommonlytaughtinintroductorystatisticscourses,butyieldsinaccurateConfidenceIntervalswhenthePrevalenceofitemswiththecharacteristicbeingmeasuredislaw.(C.f.BinomialCalculator/BinomialEstimation.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Classifier/Classification/Classified/Classify
Toarrangeordesignateaccordingtocategorizationsuchaspotentiallyresponsiveorprivilegedversusnon-responsiveornot-privileged.
Source: EDRMSearchGlossary.
AnAlgorithmthatLabelsitemsastowhetherornottheyhaveaparticularproperty;theactofLabelingitemsastowhetherornottheyhaveaparticularproperty.InTechnology-AssistedReview,ClassifiersarecommonlyusedtoLabelDocumentsasResponsiveorNon-Responsive.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Classify/Classification
Toarrangeordesignateaccordingtocategorizationsuchaspotentiallyresponsiveorprivilegedversusnon-responsiveornot-privileged.
Source: EDRMSearchGuideGlossary.
CleanInstall
Acleaninstallisasoftwareinstallationinwhichanypreviousversioniseradicated.Thealternativetoacleaninstallisanupgrade,inwhichelementsofapreviousversionremain.
Source: http://searchitchannel.techtarget.com/definition/clean-install.
Client/ServerNetwork
Acomputersystemfunctionallydistributedacrossseveralnodesonanetwork,sometimescalledadistributedapplication.Thebasictheoryisthatthevariouscomponentsofthesystem
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 49
canbetailoredtoperformspecificfunctions,hopefullyforthegoodoftheentirenetwork.Client/Serversystemsarealsotypifiedbyahighdegreeofparallelprocessingacrossdistributednodes.UsuallytheclientsareindividualPC'sconnectedtoserver(s)whichactascentralstorehousesand"trafficcops"forinformationandapplications.
Comparewithfile-sharingapplications,whereallsearchesoccurontheworkstation,whilethedocumentdatabaseresidesontheserver.Withclient-serverarchitecture,CPUintensiveprocesses(suchassearchingandindexing)arecompletedontheserver,whileimageviewingandOCRoccurontheclient.File-sharingapplicationsareeasiertodevelop,buttheytendtogeneratetremendousnetworkdatatrafficindocumentimagingapplications.Theyalsoexposethedatabasetocorruptionthroughworkstationinterruptions.Client-serverapplicationsarehardertodevelop,butdramaticallyreducenetworkdatatrafficandinsulatethedatabasefromworkstationinterruptions.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
LAN-localareanetwork
MAN-metropolitanareanetwork
Network
Peer-to-peernetwork
SAN-storageareanetwork
Standalonecomputer
WAN-wideareanetwork
ClockSpeed
Thespeedwithwhichthecomputerprocessesinformation.PCclockspeedismeasuredinmegahertz,e.g.,60megahertz.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
CloudComputing
Massiveandsharedcomputingresourceswhereeachuserobtainsthecomputingandstorageresourcestheyneedfromacommonpool,usuallyownedbyathird-partydataserviceandhousedintheirdatacenter.Cloudcomputingis,insomeways,reminiscentoftimesharingmain-framecomputing,whereafewbigcompaniescontrolledtocomputationalresourcesformostusers.
Source: HerbRoitblat,Search2020:TheGlossary.
Cluster
Inoperatingsystemsthatuseafileallocationtable(FAT)architecture,thesmallestunitofstoragespacerequiredfordatawrittentoadrive.Alsocalledanallocationunit.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 50
Source: RSI,Glossary.
Thesmallestunitofstoragespacerequiredforcomputerdatatobewrittentoadrive.Sometimescalledanallocationunit.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Cluster(File):Thesmallestunitofstoragespacethatcanbeallocatedtostoreafileonoperatingsystemsthatuseafileallocationtable(FAT)architecture.WindowsandDOSorganizeharddiscsbasedonclusters(alsoknownasallocationunits),whichconsistofoneormorecontiguoussectors.Discsusingsmallerclustersizeswastelessspaceandstoreinformationmoreefficiently.
Cluster(System):Acollectionofindividualcomputersthatappearasasinglelogicalunit.Alsoreferredtoasmatrixorgridsystems.
Clustering
AnUnsupervisedLearningmethodinwhichDocumentsaresegregatedintocategoriesorgroupssothattheDocumentsinanygrouparemoresimilartooneanotherthantothoseinothergroups.Clusteringinvolvesnohumanintervention,andtheresultingcategoriesmayormaynotreflectdistinctionsthatarevaluableforthepurposeofasearchorrevieweffort.1
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Groupingdocumentsorotherobjectsbysimilarity.Thesimilaritybetweentwodocumentsinaclusterisgreaterthanthesimilarityofdocumentsintwodifferentclusters.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary
CMS
See: CaseManagementSystem(CMS)
CMYK(Cyan,Magenta,YellowandBlack)
Asubtractivemethodusedinfourcolorprintinganddesktoppublishing.
Sourec: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 51
Co-Processor
Anadditionalprocessor,whichperformsspecifictaskswhilethemainprocessorrunstheprimaryfunctionsofthesystem.Amathco-processor,forexample,performsarithmeticoperationstotakethatburdenoffthemainprocessorresultinginfasteroperations.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Code/Coded/Coding
TheactionofLabelingaDocumentasRelevantorNon-Relevant,orthesetofLabelsresultingfromthataction.Sometimesinterpretednarrowlytoincludeonlytheresult(s)ofaManualRevieweffort;sometimesinterpretedmorebroadlytoincludeautomatedorsemi-automatedLabelingefforts.Codingisgenerallythetermusedinthelegalindustry;LabelingistheequivalentterminInformationRetrieval.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Code-DivisionMultipleAccess
See: CDMA(Code-DivisionMultipleAccess)
Coder
Anindividualassignedtoinputinformationfromdocumentsintoadocumentdatabase.
Sourec: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Coding
Anautomatedorhumanprocesswheretextcontentiscomparedtopre-determinedcodes,andtheresultsofthosecomparisonsarelogged.Codingusuallyidentifiesnames,dates,andrelevanttermsorphrases.Codingmaybestructured,i.e.,aYes/Nooptionastoanissueortheselectionofoneofthefinitenumberofchoices,orunstructured,i.e.,anarrativecommentaboutadocument.Codingmaybeobjective,i.e.,thenameofthesenderorthedate,orsubjective,i.e.,evaluationastothedocuments.
Source: IbisConsulting,Glossary.
Ameansofcapturingspecific,standardizeddatafromacollectionofdocumentsandcreatingadatabaselinkingthedatatotheimages.Theterm“coding”isgenerallyusedinthelegalandmedicalmarkets.Itissimilarto“indexing”inthecommercialmarketplace.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 52
Documentcodingistheprocessofcapturingcase-relevantinformation(i.e.author,dateauthored,datesent,recipient,dateopened,etc.)fromapaperdocument.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
BibliographicCoding
Indexing
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
CodingManual
Asetofinstructionsprovidedtocodersthatincludesadescriptionoftheproject,subjectcodes,andrulesfordataconformanceandconsistency.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
CognitiveComputing
Astyleofcomputingintendedtomimicthewaythehumanmindworks.Itisintendedtoaddressthekindsofproblemswherehumanjudgmenthaspreviouslybeenrequired,problemsinvolvinghighamountsofambiguityanduncertainty.Cognitivecomputingtypicallyinvolvessophisticatedformsofnaturallanguageprocessingandautomaticreasoning.
Source: HerbRoitblat,Search2020:TheGlossary.
COLD(ComputerOutputtoLaserDisk)
ThecomputersystemcontainsfilesofASCIIdata(frominputorapplicationprograms)orbit-mappedfilespreviouslyscannedfrommicrofilmdocumentsorpictures.Theseoutputfilesarecompressedbyafactorof5-20:1fromtheoriginaldocumentsandstoredonWORMoptical/laserdisks.Thestoreddataisthenavailabletoallonthenetwork.Generally,theformatofthesedatabasesarecompatiblewithSQLandimagingformats.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Collection
Agroupofdocuments.Thesecanbedocumentsgatheredforaparticularmatterorpurpose.Informationretrievalscientiststenduseseveralwell-knowndocumentcollections(e.g.,RCV1)fortestingandcomparisonpurposes.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 53
CollectionPhase
GatheringESIforfurtheruseinthee-discoveryprocess(fromEDRMStageswebpage).
Source: EDRMStages
CorrespondstoUTBMSCodeL620-L629.Collection/Recovery,MediaCosts,Media/ESITransfer,Receipt,Inventory,QualityAssuranceandControl.
Source: EDRMMetricsGlossary
ColorGraphicsAdapter
See: CGA(ColorGraphicsAdapter)
COM(ComputerOutputtoMicrofilm)
Thecomputerconvertsandstoresdatadirectlyonmicrofilm/fichefromavarietyofavailableinputs.Thisoldertechnologyischeaperandmoreconvenientthanpaper,butoneofthemostdifficulttouseinactuallystoringandretrievingthedata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Comb
Aseriesofboxeswiththeirtopsmissing.Tickmarksguidetextentry.Usedinformsprocessingratherthanboxes.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CombinedWordSearch
Awordsearchthatcombinessynonym,proximity,and/orBooleansearches.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 54
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
ComicMode
Human-readabledata,recordedonastripoffilmwhichcanbereadwhenthefilmismovedhorizontallytothereader.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CommissionInternationaldel’Eclairage
See: CIE(CommissionInternationaldel’Eclairage)
CommonUserInterface(CUI)
IBM’sanswertotheAppleMacintosh,itisastandardformenusandwindowsdevelopedbyIBM.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
CompactDisc
See: CD(CompactDiscorCompactDisk)
CompactDiscRe-Writable
See: CD-RW(CompactDiscRe-Writable)
CompactDiscRecordable
See: CD-R(CD-Recordable)
CompactDisk
See: CD(CompactDiscorCompactDisk)
Compatibility
Acharacteristicofacomputerorsoftwarebywhichdatapreparedinanothercomputerorsoftwarecanbeprocessed.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Theinterchangeabilityofcomputercomponents,eitherhardwareorsoftware.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 55
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ComplianceSearch
Searchingforthepurposesofidentificationofspecifiedrelevantinformationinresponsetoadiscoveryrequest.AcompliancesearchshouldbepairedwithamethodologysearchasAd-HocorIterativesearching.
Source: EDRMSearchGlossary.
Comply
Usedinthecontextofadiscoveryrequest.Whenonecomplieswithadiscoveryrequest,itisthroughaproduction(ofeitherawitnessordocument).
Source: IbisConsulting,Glossary.
CompositeVideo
Avideostreamthatcombinesred,green,blueandsynchronizationsignalsintoonesoitonlyrequiresoneconnector.CompositevideoisusedbymosttelevisionsandVCR's.Separateluminosityandcolorsignalsthatprovidethehighestpossiblesignalquality.DistinctfromvideostandardssuchasNTSCorPAL.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CompoundDocument
Adocumentthatcontainslinkedorembeddedobjectsaswellasitsowndata.
Source: IbisConsulting,Glossary.
Compression
Atechnologyforstoringdatainfewerbits,itmakesdatasmallersolessdiskspaceisneededtorepresentthesameinformation.CompressionprogramslikeWinZipandUNIXcompressarevaluabletonetworkusersbecausetheysavebothtimeandbandwidth.Datacompressionisalsowidelyusedinbackuputilities,spreadsheetapplications,anddatabasemanagementsystems.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Atechnologyforstoringdatainfewerbits,itmakesdatasmallersolessdiskspaceisneededtorepresentthesameinformation.Datacompressioniswidelyusedtobackuputilities,spreadsheetapplications,anddatabasemanagementsystems.Compressedfilesmustbedecompressedinordertobeuseable.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 56
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Atechnologythatreducesthesizeofafile.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anymethodwhichreducestheamountofdatanecessarytotransmitinformationfromonepointtoanother.Compressiongenerallyeliminatesredundantinformationand/orpredictswherechangeswilloccur."Lossless"compressiontechniquestotallypreservetheintegrityoftheinput."Lossy"methodsdisregardsomeoftheoriginals.Theratioofthefilesizesofacompressedfiletoanuncompressedfile,e.g.,witha20:1compressionratio,anuncompressedfileof1MBiscompressedto50KB.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Atechnologythatreducesthesizeofafile.Compressionprogramsarevaluabletonetworkusersbecausetheyhelpsavebothtimeandbandwidth.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Computer
Includesbutisnotlimitedtonetworkservers,desktops,laptops,notebookcomputers,employees’homecomputers,mainframes,PDAs(personaldigitalassistants,suchasPalmPilot,Cassiopeia,HPJornadaandothersuchhandheldcomputingdevices),digitalcellphonesandpagers.
Source: RSI,Glossary.
Seealso:
Fileserver
Laptopcomputer
Microcomputer
Minicomputer
Notebookcomputer
Personalcomputer
Workstation
ComputerAssistedReview(CAR)
Anyofanumberoftechnologiesthatusecomputerstofacilitatethereviewofdocumentsfordiscovery.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
CAR PredictiveCoding TAR
ComputerDiskReadOnlyMemory
See: CD-ROM(ComputerDiskReadOnlyMemory)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 57
ComputerDiskRecorder
See: CDR(ComputerDiskRecorder)
ComputerEvidence
Computerevidenceisratheruniquewhencomparedtootherformsofmoretraditionaldocumentaryevidence.Unlikepaperdocumentation,computerevidenceisextremelyfragileanditoccursintheformofanidenticalcopyofaspecificdocumentthatisstoredinacomputerfile.Inaddition,thelegal"bestevidence"rulesdifferfortheprocessingofcomputerevidence.However,thereisthepotentialforunauthorizedcopiestobemadeofimportantcomputerfileswithoutleavingbehindatracethatacopywasmade.Computerevidenceisnotlimitedtodatastoredincomputerfiles,rathermostrelevantcomputerevidenceisuncoveredinuncommonlyknownlocations.Forexample,onMicrosoftWindowsandWindowsNT-basedcomputersystems,largequantifiesofevidencecanbefoundintheWindowsswapfilesorPageFiles.Inadditioncomputerevidencecanalsobeuncoveredinfileslackandunallocatedfilespace.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Computerforensics
Computerinvestigations
Discovery
Electronicdiscovery/e-discovery
Electronicevidence
Forensicanalysis
Forensics
Mirroring
ComputerForensics
Computerforensicsistheuseofspecializedtechniquesforrecovery,authenticationandanalysisofelectronicdatawhenacaseinvolvesissuesrelatingtoreconstructionofcomputerusage,examinationofresidualdata,andauthenticationofdatabytechnicalanalysisorexplanationoftechnicalfeaturesofdataandcomputerusage.Computerforensicsrequiresspecializedexpertisethatgoesbeyondnormaldatacollectionandpreservationtechniquesavailabletoend-usersorsystemsupportpersonnel.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Similartoallformsofforensicscience,computerforensicsiscomprisedoftheapplicationofthelawtocomputerscience.Computerforensicsdealswiththepreservation,identification,extraction,anddocumentationofcomputerevidence.Likemanyotherforensicsciences,computerforensicsinvolvestheuseofsophisticatedtechnologicaltoolsandproceduresthatmustbefollowedtoguaranteetheaccuracyofthepreservationofevidenceandtheaccuracyofresultsconcerningcomputerevidenceprocessing.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Theuseofspecializedtechniquesforrecovery,authentication,andanalysisofcomputerdata,typicallyofdatawhichmayhavebeendeletedordestroyed.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 58
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Computerevidence
Computerinvestigations
Discovery
Electronicdiscovery/e-discovery
Electronicevidence
Forensicanalysis
Forensics
Mirroring
ComputerInvestigations
Computercrimesarespecificallydefinedbyfederaland/orstatestatutesandanycomputerdocumentaryevidenceutilizedduringacomputerinvestigationmayincludecomputerdatastoredonfloppydiskettes,zipdisks,CDsandcomputerharddiskdrives.Theevidencenecessarytoprovecomputer-relatedcrimescanpotentiallybelocatedononeormorecomputerharddiskdrivesinvariousgeographiclocations.Thisevidencecanresideoncomputerstoragemediaasbytesofdataintheformofcomputerfilesandambientdata,however,ambientdataisusuallyunknowntomostcomputerusersandisthereforeoftenveryusefultocomputerforensicinvestigators.Computerinvestigationsrelyuponevidencestoredasdataandthetimelineofdatesandtimesthatfileswerecreated,modified,and/orlastaccessedbyacomputeruser.Timelinesofactivitiescanbeessentialwhenmultiplecomputersandindividualsareinvolvedinthecommissionofacrime.Inaddition,computerinvestigationsgenerallyinvolvethereviewofInternetlogfilestodetermineInternetaccountabuses.Usingcomputerforensicprocedures,processes,andtools,computerforensicsinvestigatorscanidentifypasswords,networklogons,Internetactivity,andfragmentsofemailmessagesthatweredumpedfromcomputermemoryduringpastWindowsworksessions.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Computerevidence
Computerforensics
Discovery
Electronicdiscovery/e-discovery
Electronicevidence
Forensicanalysis
Forensics
Mirroring
ComputerOutputtoLaserDisk
See: COLD(ComputerOutputtoLaserDisk)
ComputerOutputtoMicrofilm
See: COM(ComputerOutputtoMicrofilm)
ConceptSearch
Asearchtechniquethatprovideswordswhicharesimilarinconcepttoaqueryword.Aconceptsearchwillreturndocumentsthatrelatetothesameconceptasthequeryword,regardlessofwhetherthequerywordexistsinthesearchresultsdocuments.Conceptsearchescanbe
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 59
implementedasasimplethesaurusmatch,orbyusingsophisticatedstatisticalanalysismethods.Effectivenessofconceptsearchinane-discoveryprojectdependsgreatlyonthetypeofalgorithmusedanditsimplementation.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
Anindustry-specifictermgenerallyusedtodescribeKeywordExpansiontechniques,whichallowsearchmethodstoreturnDocumentsbeyondthosethatwouldbereturnedbyasimpleKeywordorBooleanSearch.MethodsrangefromsimpletechniquessuchasStemming,ThesaurusExpansion,andOntologysearch,throughstatisticalAlgorithmssuchasLatentSemanticIndexing.
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
Mapsrelationshipsbetweeneachwordandeveryotherwordinlargesetsofdocumentsandthenassociateswordsbasedonthecontextinwhichtheyareused.Twotechniquescanbeusedtoperformconceptsearches:theuseofamanuallyconstructedthesauruswhichrelatescertainwordstoothersorsemanticindexing,afullyautomatedmethodtoshowassociationsamongwordsbased,inpart,onstatisticalanalysisoftheoccurrenceofproximityofcertainwordstoothers.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Alsocalled"thesaurus"or"related"searching;sometimescalled"synonymsearching."Searchesthatprovideotherwordssimilarorcloseinmeaningtotheprimaryword.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 60
ConfidenceInterval
AspartofaStatisticalEstimate,arangeofvaluesestimatedtocontainthetruevalue,withaparticularConfidenceLevel.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Theexpectedrangeofresults.Ifyoudrewrepeatedsamplesfromthesamepopulation,youwouldexpecttheresulttobewithintheconfidenceintervalabouttheproportionoftimesgivenbytheconfidencelevel.Forexample,inanelectionpoll,thedifferenceintheproportionofpeoplefavoringeachcandidateisdescribedasbeingwithinarangeof,say,plusorminus5%.Allotherthingsbeingequal,thesmallertheconfidenceinterval,thelargerthesamplesizeneedstobe.Saidanotherway,thelargerthesamplesize,thesmallertheconfidenceinterval.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
MarginofError
ConfidenceLevel
AspartofaStatisticalEstimate,thechancethataConfidenceIntervalderivedfromaRandomSamplewillincludethetruevalue.Forexample,“95%Confidence”meansthatifoneweretodraw100independentRandomSamplesofthesamesize,andcomputetheConfidenceIntervalfromeachSample,about95ofthe100ConfidenceIntervalswouldcontainthetruevalue.ItisimportanttonotethattheConfidenceLevelisnottheProbabilitythatthetruevalueiscontainedinanyparticularConfidenceInterval;itistheProbabilitythatthemethodofestimationwillyieldaConfidenceIntervalthatcontainsthetruevalue.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Howoftenwewouldachieveasimilarresultifwerepeatedthesameprocessmanytimes.Ifwedidthesamekindoftestfromthesamepopulationmorethanonce,theconfidencelevelwouldtellushowoftenwewouldgetaresultthatiswithinacertainrange(theconfidenceinterval)ofthetruevalueforthepopulation.Mostscientificstudiesemployaminimumconfidencelevelof0.95,meaningthat95percentofthetimewhenyourepeatedtheexperimentyouwouldfindasimilarresult.Thehighertheconfidencelevelthelargerthesamplesizethatisrequired.Technically,itistheproportionoftimeswhenthetruepopulationvaluewouldbeincludedwithintheconfidenceinterval.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 61
Config.sys
ADOSconfigurationfile,whichisusedwhenthecomputerboots,toloadspecificdevicedriverstorunhardwareorsoftwarecomponents.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ConfusionMatrix
Atwo-by-twotablelistingvaluesforthenumberofTrueNegatives(TN),FalseNegatives(FN),TruePositives(TP),andFalsePositives(FP)resultingfromasearchorrevieweffort.Asshownbelow,allofthestandardevaluationmeasuresarealgebraiccombinationsofthefourvaluesintheConfusionMatrix.AlsoreferredtoasaContingencyTable.AnexampleofaConfusionMatrix(orContingencyTable)isprovidedimmediatelybelow.
CodedRelevant CodedNon-Relevant
TrulyRelevant TruePositives(TP) FalseNegatives(FN)
TrulyNon-Relevant FalsePositives(FP) TrueNegatives(TN)
Accuracy=100%–Error=(TP+TN)/(TP+TN+FP+FN)
Elusion=100%–NegativePredictiveValue=FN/(FN+TN)
Error=100%–Accuracy=(FP+FN)/(TP+TN+FP+FN)
Fallout=FalsePositiveRate=100%–TrueNegativeRate=FP/(FP+TN)
FalseNegativeRate=100%‒TruePositiveRate=FN/(FN+TP)
NegativePredictiveValue=100%–Elusion=TN/(TN+FN)
Precision=PositivePredictiveValue=TP/(TP+FP)
Prevalence=Yield=Richness=(TP+FN)/(TP+TN+FP+FN)
Recall=TruePositiveRate=Sensitivity=TP/(TP+FN)
TrueNegativeRate=Specificity=TN/(TN+FP)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ConsultativeCommitteeforInternationalTelephone&Telegraphy
See: CCITT(ConsultativeCommitteeforInternationalTelephone&Telegraphy)
Container
Anapplicationorobjectthatcontainsotherfilesorobjectswhichcanberepresentedasfiles.Acontainermightbeanarchiveoracompounddocumentwithanembeddedorlinkedobject.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 62
Source: IbisConsulting,Glossary.
Seealso:
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
ContingencyTable
Atableofthefourresponsestatesinacategorizationtask.Therowsofthetablemaycorrespondtothecorrectortruecategoryvaluesandthecolumnsmaycorrespondtothechoicesmadebysystem.Forexample,thetoprowmaybethetrulypositivecategory(e.g.trulyresponsivedocuments)andthesecondrowmaybethetrulynegativecategory(e.g.,trulynon-responsivedocuments).Thecolumnsthenrepresentthepositivedecisionsmadebythesystem(e.g.,putativelyresponsive)andthenegativedecisionsmadebythesystem(e.g.,putativelynon-responsive).Theentriesinthesecellsarethecountsofdocumentscorrespondingtoeachresponsestate(e.g.,truepositives,falsenegatives,falsepositives,truenegatives).Contingencytablesareoftendisplayedalongwiththetotalsforeachrowandforeachcolumn.Sometimestherowsandcolumnsarereversed,sothecolumnsreflectthetruevaluesandtherowsreflectthechoices.
Source: HerbRoitblat,PredictiveCodingGlossary.
ContinuousTone
Animage(e.g.aphotograph)whichhasallthevaluesofgrayfromwhitetoblack.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ContractorIntegratedTechnicalInformationService
See: CITIS(ContractorIntegratedTechnicalInformationService)
ControlNumber
See:BatesNumber
ControlNumberPrefix
Aproject-specific,clientspecificationintheformofanalphanumericprefixthatprecedesaproject’scontrolnumber(thedigitalequivalentofaBatesnumber).Alsocalled"controlnumberprefix."
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 63
Source: IbisConsulting,Glossary.
Seealso:
Batesnumber
Batesprefix
Batesstamp
Batesstamping
Documentnumber
ControlSet
ARandomSampleofDocumentsCodedattheoutsetofasearchorreviewprocess,thatisseparatefromandindependentoftheTrainingSet.ControlSetsareusedinsomeTechnology-AssistedReviewprocesses.TheyaretypicallyusedtomeasuretheeffectivenessoftheMachineLearningAlgorithmatvariousstagesoftraining,andtodeterminewhentrainingmaycease.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Convergence
WheretheRGBsignals"converge"onasinglepixel.ThatpixelshouldbewhiteatfullbrightnessoftheRGBcomponents.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Cookie
Adatasetthatawebsiteservergivestoabrowserthefirsttimeauservisitsasite,updatedwitheachreturnvisit.TheremoteserversavescookiedataaboutauserastextfilesstoredinNetscapeorMSInternetExplorersystemfolders.Cookiesmaycontainuserorsessionspecificdatasuchasusername,dateofvisit,statisticandanythingthatserverknowsaboutremoteuser.Cookiesmaybeupdatedoneormoretimeseachvisit,oronlyonce.
Source: IbisConsulting,Glossary
Holdsinformationonthetimesanddatesauserhasvisitedwebsites.Otherinformationcanalsobesavedtoyourharddriveinthesetextfiles,includinginformationaboutonlinepurchases,validationinformationabouttheuserfor"MembersOnly"websites,etc.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Smalldatafilewrittentoauser’sharddrivebyawebserverwhichcontainsinformationthewebsiteusestoidentifytheuserinsubsequentvisits.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Smalldatafileswrittentoauser'sharddrivebyaWebserver.Thesefilescontainspecificinformationthatidentifiesusers(i.e.,passwordsandlistsofpagesvisited).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 64
Copy/Paste
Tocopyapieceofdatatoatemporarylocationandthenmakeanewcopyoftheobjectinanewlocation.Thisisusuallydonebyclickingtherightmousebuttonwhileholdingthemousecursorovertherelevantfileandthenclicking“copy”fromthemenuthatappears.Themousepointeristhenmovedtothedestinationlocation,arightmouseclickbringsupthesamefunctionmenuand“paste”isselectedtocopythefile(s)tothenewlocation.
Source: EDRMCollectionStandards
CopyeeField
Adatafieldusedtorecordthenamesofindividualsand/orbusinessentitieswhoreceivedacopyofadocument,whenthenameisnototherwiserecordedintheaddresseeorrecipientfield.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
Corpus
Acollectionofobjects,typicallydocumentsthatarethesubjectofanalysis,machinelearning,orcategorization.
Source: HerbRoitblat,Search2020:TheGlossary.
CorruptFile
Afilewithdeteriorateddataasaresultofsomeexternalagent.Hazardstodataintegrityincludenotonlycomputer-basedproblemssuchasviruses,hardwareorsoftwareincompatibilities,flaws,orfailures.Alsoenvironmentalthreatssuchaspoweroutages,dust,water,andextremetemperaturescancausehardwarefailurethatadverselyaffectfiledata.
Source: IbisConsulting,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 65
Cost
Costreferstothemeasurabledollarsassociatedwitheachidentifiabletask,activityoraction.CostisalsoavariableelementofTimeandVolume.CostmayalsoincludediscreteelementsthatmaybeindependentofTimeorVolume.
Source: EDRMMetricsGlossary
CoverageBias
CoverageBiascanoccurifthesamplesarenotrepresentativeofthepopulationduetothemethodologyused.Ine-discovery,suchcoveragebiasoccurswhenlargeportionsofESIgetexcludedfrombasedonmeta-dataortypeofESI.Asanexample,PatentLitigationmayrequiresamplingtechnicaldocumentsintheirsourceform,andcareshouldbetakentoincludethesedocumentsinthesampleselectionprocess.
Source: EDRMSearchGlossary.
CPI
See: CharactersPerInch(CPI)
CPR
See: CivilProcedureRules(CPR)
CPU
See: CentralProcessingUnit(CPU)
CRC(CyclicalRedundancyChecking)
Usedindatacommunicationstocreateachecksumcharacter(hexadecimal)attheendofadatablock.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
CrimeSceneReconstruction
Crimescenereconstructionistheuseofscientificmethods,physicalevidence,deductivereasoning,andtheirinterrelationshipstogainexplicitknowledgeoftheseriesofeventsthatmayhaveleduptothecrimeandwhatexactlyhappenedataspecificcrimescene.Itisadisciplinedandprincipledapproachtowardsobjectivelyunderstandingacrimescene.Crimereconstructionhelpsinterpretphysicalevidence.Itisanaidtohelpformulateahypothesisandarriveataconclusionaboutacertaincrime.Forensicspecialistsallcometogetherwiththeirdifferentformsofevidencesuchasphotos,sketches,andotherusefulthingsgatheredfromthecrimescenetopaintavividpicturewhichmakesitpossibletoretraceacrimethattookplace.Usingevidencefoundatapropercrimesceneyoucanreconstructwhathappenedandpossiblyfindmoreclues.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 66
Whenfocusingonothertypesofforensics,therearethreeareasofimportanceinfindingtheanswersanddeterminingthecomponentsofacrimescene:(1)specificincidentreconstruction,whichdealswithtrafficaccidents,bombings,homicides,andthingsofthatnature;(2)eventreconstruction,whichanalyzesconnections,sequence,andidentity;andthemostimportantcomponent,(3)physicalevidencereconstruction,whichfocusesonfirearms,blood,glass,andotherobjectsthatcanbestrippedforDNA.
Source: EDRMPresentationGuide.
Cross-ReferenceField
Adatafieldusedtorecordinformationthatiscross-referencedtothespecificdocumentrecord.Maybeusedtocross-reference:
1. Parentdocumentswithattachments;2. Separatetextpagestooneanother;and3. Documentswithdifferentidentifyingnumbers.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
CrossoverTrial
AnExperimentalDesignforcomparingtwosearchorreviewprocessesusingthesameDocumentCollectionandInformationNeed,inwhichoneprocessisappliedfirst,followedbythesecond,andthentheresultsofthetwoeffortsarecompared.(Cf.ParallelTrial.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
CUI
See: CommonUserInterface(CUI)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 67
Culling
Thepracticeofnarrowingalargerdatasettoasmallerdatasetforthepurposesofreview,basedonobjectivecriteria(suchasfiletypesordaterestrictors),orsubjectivecriteria(suchasKeywordSearchTerms).Documentsthatdonotmatchthecriteriaareexcludedfromthesearchandfromfurtherreview.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Cursor
AsymbolusedonthecomputerscreeninDOSsystemstoshowwheredataaretobeentered.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Custodian
Acommonelementwithineache-discoveryPhasewhichreferstotheindividual(s)responsiblefordatatypesorrepositoriesforagivenentity.Individualsinpossessionofdatathatispotentiallyrelevanttoacase.
Source: EDRMMetricsGlossary
Seealso:
Datacustodian
CustodianDe-Duplication
Cullsadocumentifmultiplecopiesofthatdocumentresidewithinthesamecustodian'sdataset.Forexample,ifMr.AandMr.Beachhaveacopyofaspecificdocument,andMr.Chastwocopies,thesystemwillmaintainonecopyeachforMr.A,Mr.B,andMr.C.Contrastwithcasede-duplicationandproductionde-duplication.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Basicde-duplication
Casede-duplication
De-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 68
Productionde-duplication VerticalDeduplication
CustodianSearch
Custodiansearchisacommonformofconstrainingsearchresults.Tosearchbasedonacustodian,themetadatasearchusingthemetadataname“Custodian”canbeused.CustodiansearchmayrelyonassigningcustodianstocollecteddataduringtheIdentificationPhasesothatsearchingdoesn’tmissoutoncustodians.Forexample,instantmessageswithbuddy-namesmaybemissedifthesearchtermisspecifiedaslast-name/first-nameorasemailaddresses.
Source: EDRMSearchGlossary.
Custodians
Acommonelementwithineache-discoveryPhasewhichreferstotheindividual(s)responsiblefordatatypesorrepositoriesforagivenentity.Individualsinpossessionofdatathatispotentiallyrelevanttoacase.
Source: EDRMMetricsGlossary
Seealso:
Datacustodian
Customer-AddedMetadata
Dataorworkproductcreatedbyauserwhilereviewingadocument.Forexample,annotationtextofadocumentorsubjectivecodinginformation.Contrastwithvendor-addedmetadata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Documentmetadata
Emailmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
File-specificmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
CustomizedDataField
Aspeciallynamedanddefineddatafieldinadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
©2016EDRMLLC
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
CustomizedFieldDefinition
Theprocessofdefiningthecharacteristicsofcustomizeddatafieldsinadatabase,includingfieldstructure(date,text,orintegerfield),fieldsize(numberofcharacters),multiplevalues(morethanonenameorcodeinafield),andfieldname.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
CutandPaste
Tohighlightablockoftextthenmoveorcopyit,eithertoanotherareaofthesamedocumentortoacompletelyseparatedocument.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 70
Cutoff
AgivenscoreorrankinaPrioritizedlist,resultingfromaRelevanceRankingsearchorMachineLearningAlgorithm,suchthattheDocumentsabovetheCutoffaredeemedtobeRelevantandDocumentsbelowtheCutoffaredeemedtobeNon-Relevant.Ingeneral,ahigherCutoffwillyieldhigherPrecisionandlowerRecall,whilealowerCutoffwillyieldlowerPrecisionandhigherRecall.AlsoreferredtoasaThreshold.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Cyan
Acoloredink.Reflectsblue&green&absorbsred.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Cyan,Magenta,YellowandBlack
See: CMYK(Cyan,Magenta,YellowandBlack)
CyclicalRedundancyChecking
See: CRC(CyclicalRedundancyChecking)
D
DaSilvaMoore
DaSilvaMoorev.PublicisGroupe,CaseNo.11Civ.1279(ALC)(AJP),2012WL607412(S.D.N.Y.Feb.24,2012),aff’d2012WL1446534(S.D.N.Y.Apr.26,2012).ThefirstfederalcasetorecognizeComputerAssistedReviewas“anacceptablewaytosearchforrelevantESIinappropriatecases.”TheopinionwaswrittenbyMagistrateJudgeAndrewJ.PeckandaffirmedbyDistrictJudgeAndrewL.Carter.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
DAC(DigitaltoAnalogConverter)
Changesdigitalnumberstoanelectricalwaveform.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 71
DAT(DigitalAudioTape)
Althoughgenerallyusedforaudio,aDAT(120meterslong)canholdupto10gigabytesifusedfordigitaldatastorage.Hasthedisadvantageofbeingaserial,ratherthanarandomaccessdevice.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Usedasastoragemediuminsomebackupsystems.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Recordsaudiosignalsontotapeinadigitalformat.Mayalsobeusedasabackuptapeinsomesystems.
Seealso:
Backup
Backuptape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
Data
Numbers,characters,images,orothermethodofrecording,inaformwhichcanbeassessedbyahumanor(especially)inputintoacomputer,storedandprocessedthere,ortransmittedonsomedigitalchannel.
Source: IbisConsulting,Glossary.
Anyinformationstoredonacomputer.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Ageneralphraseforallinformation(facts,numbers,letters,graphics,etc.)thatcanbeprocessedbyacomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Informationstoredonthecomputersystemandusedbyapplicationstoaccomplishtasks.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
DataAttribute
Adataattributeisacharacteristicofdatathatsetsitapartfromotherdata,suchaslocation,length,ortype.Thetermattributeissometimesusedsynonymouslywith“dataelement”or“property.”
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 72
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
DataCenter
Asecuresiteusedtohousecomputerapplicationsforusebyoneormoreclients.Usuallyincludesahigherlevelofsecurity,powersupply(generators,back-upswitches,etc.)andtelecommunicationsthanisfoundinastandardcomputerroom.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DataCustodian
Personhavingadministrativecontrolofadocumentorelectronicfile;forexample,thedatacustodianofanemailistheownerofthemailboxwhichcontainsthemessage.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Custodian
DarkData
Datathatarestored,perhapsinadatalake,inthehopethatsomedayitmightbeusefultotheorganization.Darkdataistypicallybigdatathatareunstructured,unanalyzed,uncategorized,and,mostimportantly,unusedforanyvaluablebusinessactivity.Hoardeddata,alsocalleddustydata.
Source: HerbRoitblat,Search2020:TheGlossary.
DataEntry
Theprocessofenteringinformationintoadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DataExtraction
Theprocessofremovingfilesandmeta-datafrombackuptapes.
Source: RenewData,Glossary(10/5/2005).
Theprocessofrestoringfilesandmeta-datafrombackuptapesinordertomakethemaccessible.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 73
Theprocessofpullinginformationoutofeitherhardcopyorelectronicdocuments.Theprocessmaybemanual(readandkey)orelectronicviaapatternrecognitionmethodology.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
DataField
Anameforanindividualpieceofstandardizeddatatobeextractedfromanimagecollection.Fieldscanbetheauthorofadocument,arecipient,thedateofadocumentoranyotherpieceofdatacommontomostdocumentsinanimagecollection.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aunitofinformationinadatabase.Databaserecords,forexample,consistofanorderedlistoffieldswhereaspecifickindofinformationisstoredineachfield.Fieldsareoftenprintedascolumnsindatabasereports.
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
DataFieldDefinition
Datafielddefinitionusuallyincludesfieldstructure(sizeofeachfieldandwhetheritisadate,aninteger,oratextfield)andfieldorganization(namesandlocationsofdatafieldswithinadocumentrecord).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 74
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
DataFormat
Theorganizationofinformationfordisplay,storage,orprinting.Dataismaintainedincertaincommonformatssothatitcanbeusedbyvariousprograms,whichmayonlyworkwithdatainaparticularformat.Thistermiscommonlyusedintheindustrywhenaskinganotherpersonaboutthestateinwhichparticularinformationexists.Forexample,"Whatformatisitin,PDForHTML?"
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html↩
Source: RSI,Glossary.
Dataintegrity
Referstothevalidityofdata.Dataintegritycanbecompromisedinanumberofways,including:humanerrorswhendataisentered,errorsthatoccurwhendataistransmittedfromonecomputertoanother,softwarebugsorviruses,hardwaremalfunctions,suchasdiskcrashesandnaturaldisasters,suchasfiresandfloods.Therearemanywaystominimizethesethreatstodataincluding:backingupdataonaregularbasis,controllingaccesstodataviasecuritymechanisms,designinguserinterfacesthatpreventtheinputofinvaliddata,andusingerrordetectionandcorrectionsoftwarewhentransmittingdata.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
DataLake
Astoragerepositoryholdinglargevolumesofrawdata.Awarehousefordataofanysizeortypethatisminimallyprocessedandlargelyunanalyzed.Datalakeseliminatetheup-frontcostofprocessingdatauntilthosedataareneeded.Datalakesareintendedtoremoveinformation
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 75
silosbycombiningdatafrommultiplesourcesintoasinglerepositoryandtoavoidfiltering,structuring,orotherwise“prejudging”thedatabeforetheycanbefullyanalyzed.
Source: HerbRoitblat,Search2020:TheGlossary.
DataMapping
Datamappingfindsorsuggestsassociationsbetweenfileswithinalargebodyofdata,whichmaynotbeapparentusingothertechniques.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
DataMining
“Datamining”generallyreferstotechniquesforextractingsummariesandreportsfromanorganization’sdatabasesanddatasets.Inthecontextofelectronicdiscovery,thistermoftenreferstotheprocessesusedtocullthroughacollectionofelectronicdatatoextractevidenceforproductionorpresentationinaninvestigationorinlitigation.Dataminingcanalsoplayanimportantroleincomplyingwithdataretentionobligationsunderanorganization’sformaldocumentmanagementpolicies.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Theprocessofextractingusefuldatafromavolumeofunstructuredinformation.Dataminingisusedtosearchforpatternsandsystematicrelationshipsinbigdatacollectionstoextractotherusefulpiecesofinformationfromthesecollections.
Source: HerbRoitblat,Search2020:TheGlossary.
DataProtectionAct(DPA)
ThisactimplementsaEuropeanDirectivewhichamongotherthingsprotectsprivacyandsetslimitsonwhatcanbedonewithanindividual'spersonaldata.Inparticular,itplaceslimitationsonthetransferofsuchdatabetweenjurisdictions.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
DataRate
Thespeedofadatacommunicationschannel,measuredinbitspersecond.
Source: RSI,Glossary.
DataSet
Anygroupoffilesprocessedasaunit,andgenerallycontainedinonedirectory.
Source: IbisConsulting,Glossary.
DataStream
Datastreamsallowmultipleformsofdatatobeassociatedwithafile,includinganynumberofgraphicfiles,databases,programs,spreadsheets,wordprocessingdocuments,orotherdata
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 76
typesassociatedwithagivenfiletoaltersomeoftherulesconcerningcomputersecurityissuesandcomputerforensicsinvestigations.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
DataValidation
Asystemforensuringaccuracyindataentryandconsistencyinformattingnamesanddates.Oftenaccomplishedbytheuseofvalidationtablestorestrictentryofinconsistentorinaccuratedata(e.g.,dateenteredas10/2/50,whenitshouldbe01/02/50).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Database
Asetofinterrelatedfilesstoredelectronicallyonacomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acollectionofrelateddataenteredintoindividualrecordsconsistingofanumberofdifferentfields.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Informationarrangedinthecomputerinarigorous,definedformattoalloweaseofrecordingandretrieval.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Acollectionofdataarrangedintablesalongwithreports,queries,andforms.Modernrelationaldatabasesemploycomplexlinkagesamongthedatainthetablessothatinformationcanbeenteredonlyonce,butstillusedtopresentcoherentreports.Atableislikeaspreadsheet,inwhichthecolumnscorrespondtofieldsandtherowstorecords,forexample,individuals.Eachfieldorcolumnoftherecordindicatesonepieceofinformationaboutthatindividual.
Seealso:
Flatfiledatabase
Fulltextdatabase
Relationaldatabase
SQL
WAIS-wideareainformationserver
DatabaseAdministrator
Adatabaseadministrator(shortformDBA)isapersonresponsiblefortheinstallation,configuration,upgrade,administration,monitoringandmaintenanceofdatabasesinanorganization.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 77
Source: http://en.wikipedia.org/wiki/Database_administrator.
DatabaseDesign
Theprocessofdecidingwhatdatabasestructuretouse.Typicallyinvolvestheconstructionofspecificdatafieldsandtheoveralldesignofbowthefieldsaretobeused.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DatabaseManagementSystem(DBMS)
Softwarethatcontrolstheorganizationofadatabaseandprocessesrequestsfordatabaseinformationfromotherapplications.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DateField
Adatafieldinadatabasethatcontainsthedateofthedocument.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
DateFilter
Afilteroptionthatallowsforincluding/excludingspecificdatesordaterangesforapplicationand/ormailusers.
Source: IbisConsulting,Glossary.
Seealso:
Extensions/sizesfilter
Filter
MD5-knownfilter
Sender/recipientfilter
©2016EDRMLLC
DateRangeSearch
Daterangesearchutilizesadocument’smetadatatofindsearchresultswherethecreationdates,accessdates,ormodificationdatesofdocumentsfallwithinaspecifiedrangeofdates.RefertospecifictechnologyutilizedtoprocessESItodeterminetheavailabledatesbasedonfiletypesandconsiderthehandlingoftimezonesduringESIprocessing.
Source: EDRMSearchGlossary.
DBMS
See: DatabaseManagementSystem(DBMS)
DBX
MicrosoftOutlookExpressstoresyourmessagesinafolderthatcontainsseveraldifferent.dbxfiles.Thesefiles(folders.dbx,inbox.dbx,outbox.dbx)containallyourmessages.
Source: ImportmessagesintoWindowsMailfromOutlookExpress,http://windows.microsoft.com/en-us/windows-vista/import-messages-into-windows-mail-from-outlook-express.
ddFile
A"dd"fileisarawimagefilecreatedusingtheddforensicimagingtool,acommandlineprogramthatusescommandlineargumentstocontroltheimagingprocess.
Source: http://www.forensicswiki.org/wiki/Dd
De-Duplication
De-duplication(“de-duping”)istheprocessofcomparingelectronicrecordsbasedontheircharacteristicsandremovingduplicaterecordsfromthedataset.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Theprocessofprovidingoneinstanceofanitemwhentherewasoncetwoormoreidenticalcopies.Thisprocessusuallyinvolveslandingallfilesintoadatabaseandthensearchingforduplicatefiles.
Source: RenewData,Glossary(10/5/2005).
Theprocessofidentifying(orsomevendorsincludesactuallyremoving)additionalcopiesofidenticaldocumentsinadocumentcollection.Therearethreetypesofde-duplication:case,custodian,andproduction.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 79
Theprocessofidentifying(and/orremoving)additionalcopiesofidenticaldocumentsinadocumentcollection.Therearethreetypesofde-duplication:case,custodian,andproduction.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Themethodofdatareductionthatexcludesduplicatemessages(withtheirattachments)andfilesfromfurtherprocessing.
Source: IbisConsulting,Glossary.
Theprocessofremovingduplicaterecordsfromacollectionofdata.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Theprocessofdeterminingwhichdocumentsareduplicates.Filesystemscancontainmanycopiesofthesamedocument,whichneedtobeidentifiedforefficiency’ssake.Everytimeanemailissentittypicallycreatestwoadditionalcopiesoftheemailanditsattachments,oneinthesender’ssent-itemsfolderandonceintherecipient’sinbox.Anemailmayalsobesenttomultiplerecipients,therebycreatingmorecopies.
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
Productionde-duplication
VerticalDeduplication
De-Shade
RemoveshadedareastorenderimagesmoreeasilyrecognizablebyOCR.De-shadingsoftwaretypicallysearchesforareaswitharegularpatternoftinydots.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
De-Skew
Aprocesswherethecomputerdetectsandcorrectstheskewinanimagefile.
Source: RSI,Glossary.
Theprocessofstraighteningskewed(off-center)images.De-skewingisoneoftheimageenhancementsthatcanimproveOCRaccuracy.Documentsoftenbecomeskewedwhentheyarescannedorfaxed.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Skew
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 80
De-Speckle
Removeisolatedspecklesfromanimagefile.Specklesoftendevelopwhenadocumentisscannedorfaxed.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DecisionTree
Astep-by-stepmethodofdistinguishingbetweenRelevantandNon-RelevantDocuments,dependingonwhatcombinationofwords(orotherFeatures)theycontain.ADecisionTreetoidentifyDocumentspertainingtofinancialderivativesmightfirstdeterminewhetherornotaDocumentcontainedtheword“swap.”Ifitdid,theDecisionTreemightthendeterminewhetherornottheDocumentcontained“credit,”andsoon.ADecisionTreemaybecreatedeitherthroughKnowledgeEngineeringorMachineLearning.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Decryption
Decryptionistheprocessofconvertingencrypteddatabackintoitsoriginalform,soitcanbeunderstood.
Source: IbisConsulting,Glossary.
Seealso:
Encryption
Deduplication
AmethodofreplacingmultipleidenticalcopiesofaDocumentbyasingleinstanceofthatDocument.Deduplicationcanoccurwithinthedataofasinglecustodian(alsoreferredtoasVerticalDeduplication),oracrossallcustodians(alsoreferredtoasHorizontalDeduplication).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
DeepLearning
Anapproachtobuildingandtrainingneuralnetworks.Deeplearningtypicallyinvolvesahierarchicalneuralnetworkwhereeachofthelevelsinthehierarchyistrainedseparately.
Source: HerbRoitblat,Search2020:TheGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 81
Default
Avalueoroptionassignedtodatabyasystemwhennospecificvaluehasbeenspecifiedbyanoperator.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DeletedData
Deleteddataisdatathat,inthepast,existedonthecomputeraslivedataandwhichhasbeendeletedbythecomputersystemorend-useractivity.Deleteddataremainsonstoragemediainwholeorinpartuntilitisoverwrittenbyongoingusageor“wiped”withasoftwareprogramspecificallydesignedtoremovedeleteddata.Evenafterthedataitselfhasbeenwiped,directoryentries,pointers,orothermetadatarelatingtothedeleteddatamayremainonthecomputer.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Deleteddataaredatathat,inthepast,existedonthecomputeraslivedataandwhichhavebeendeletedbythecomputersystemorend-useractivity.Deleteddataremainonstoragemediainwholeorinpartuntiltheyareoverwrittenor“wiped.”Evenafterthedataitselfhavebeenwiped,directoryentries,pointersorothermetadatarelatingtothedeleteddatamayremainonthecomputer.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Recoverableinformationfromdeletedfilesanddatamaybestoredinunallocatedorslackspaceonacomputerharddrive.
Source: RenewData,Glossary(10/5/2005).
Datathatatonetimeexistedonacomputersystemaslivedatabutthathasbeendeleted,however,suchdeleteddatainhabitsstoragemediainsomeformuntilitisoverwrittenor"wiped"withasoftwareprogramspecificallydesignedtoremovedeleteddata.Oncethestoragemediahasbeenwiped,directoryentries,pointers,orothermetadataoftenremain.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Datathatonceexistedonacomputerandhassubsequentlybeendeletedbytheuser.Deleteddataactuallyremainsonthecomputeruntilitisoverwrittenbynewdataor“wiped”withaspecificsoftwareprogram.(Evenafterwiping,metadatasuchasdirectoryentriesorpointersmaystillremain.)
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Deletion
Deletionistheprocesswherebydataisremovedfromactivefilesandotherdatastoragestructuresoncomputersandrenderedinaccessibleexceptusingspecialdatarecoverytools
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 82
designedtorecoverdeleteddata.Deletionoccursinseverallevelsonmoderncomputersystems:
1. Fileleveldeletion:Deletiononthefilelevelrendersthefileinaccessibletotheoperatingsystemandnormalapplicationprogramsandmarksthespaceoccupiedbythefile'sdirectoryentryandcontentsasfreespace,availabletoreusefordatastorage.
2. Recordleveldeletion:Deletionontherecordleveloccurswhenadatastructure,likeadatabasetable,containsmultiplerecords;deletionatthislevelrenderstherecordinaccessibletothedatabasemanagementsystem(DBMS)andusuallymarksthespaceoccupiedbytherecordasavailableforreusebytheDBMS,althoughinsomecasesthespaceisneverreuseduntilthedatabaseiscompacted.Recordleveldeletionisalsocharacteristicofmanye-mailsystems.
3. Byteleveldeletion:Deletionatthebyteleveloccurswhentextorotherinformationisdeletedfromthefilecontent(suchasthedeletionoftextfromawordprocessingfile);suchdeletionmayrenderthedeleteddatainaccessibletotheapplicationintendedtobeusedinprocessingthefile,butmaynotactuallyremovethedatafromthefile'scontentuntilaprocesssuchascompactionorrewritingofthefilecausesthedeleteddatatobeoverwritten.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Deletionistheprocesswherebydataisremovedfromactivefilesandotherdatastoragestructuresoncomputersandrenderedinaccessibleexceptusingspecialdatarecoverytoolsdesignedtorecoverdeleteddata.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Removingactivefilesmakingthemunavailable.Specialdatarecoverytoolscanstillretrievethesefiles.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Descender
Theportionofacharacterwhichfallsbelowthemainpartoftheletter(e.g.g,p,q).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Desktop
UsuallyreferstoanindividualPC--auser'sdesktopcomputer.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
DesktopPublishing
PCsystemsusedtopreparedirectprintoutputoroutputsuitableforprintingpresses.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 83
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DestinationFile
Thefilethatalinkedorembeddedobjectisinsertedinto,orthatdataissavedto.Thesourcefilecontainstheinformationthatisusedtocreatetheobject.Whenyouchangeinformationinadestinationfile,theinformationisnotupdatedinthesourcefile.
Source: Glosbe,destinationfile,https://en.glosbe.com/en/en/destination%20file
DIA/DCA(DocumentInterchangeArchitecture)
AnIBMstandardfortransmissionandstorageofvoice,textorvideoovernetworks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DiacriticSpecification
Adiacriticspecificationisaphoneticmarkeraddedtoletter(aboveorbelow)indicatingachangeinthewayitistobepronouncedorstressed.Forlanguagesthatincludediacriticcharactersoncertaincharacters(suchasvowels),specifyingwhetherthediacriticsshouldmatchisasearchoption.
Source: EDRMSearchGlossary.
DigitalAudioTape
See: DAT(DigitalAudioTape)
DigitalLinearTape(DLT)
Digitallineartapeisaformofmagnetictapeanddrivesystemusedforcomputerdatastorageandarchiving.DLTisoneofseveraltechnologiesdevelopedinrecentyearstoincreasethedata-transferratesandstoragecapacitiesofcomputertapedrives.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingSearchStorage.com,http://searchstorage.techtarget.com/sDefinition/0,,sid5_gci759350,00.html.
Adocumentstoragemediuminacartridge.DLTtapesareoftenusedforbackuptapes.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
©2016EDRMLLC
DigitalSignalProcessor(Processing)(DSP)
Aspecialpurposecomputer(ortechnique)whichdigitallyprocessessignalsandelectrical/analogwaveforms.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DigitaltoAnalogConverter
See: DAC(DigitaltoAnalogConverter)
DigitalVersatileDisc(DVD)
Aplasticdisc,likeaCD,onwhichdatacanbewrittenandread.DVDsarefaster,canholdmoreinformation,andcansupportmoredataformatsthanCDs.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AnopticallyencodedremovablediscsimilartoaCD,butwithmuchhighercapacity.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
DigitalVideoDisc(DVD)
See: DigitalVersatileDisc(DVD)
DimensionalityReduction
AFeatureEngineeringmethodusedtoreducethetotalnumberofFeaturesconsideredbyaMachineLearningAlgorithm.SimpleDimensionalityReductionmethodsincludeStemmingandStopWordelimination.MorecomplexDimensionalityReductionmethodsincludeLatentSemanticIndexingandHashing.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 85
Directory
Adirectoryis,ingeneral,anapproachtoorganizinginformation,themostfamiliarexamplebeingatelephonedirectory.
Source: TechTarget,directorydefinition,http://searchwindowsserver.techtarget.com/definition/directory
DirtyOCR
Electronicdocumentsresultingfrominaccurateopticalcharacterrecognitions.
Seealso:
ICR
OCR
OpticalCharacterRecognition
Patternrecognition
DisasterRecoveryPlan
Aplandevisedbyanorganizationtoavoiddatalossorbusinessdisruptionfollowingapowerloss,anaturaldisaster,oranactofterrorism.Agooddisasterrecoveryplancallsfordailybackupfunctionsandcontingencyplanstominimizedatalossandbusinessinterruption.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DisasterRecoveryTape
Portablemediausedtostoredatathatisnotpresentlyinusebyanorganizationtofreeupspacebutstillallowfordisasterrecovery.Mayalsobecalled"backuptapes."
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
Disc
Anopticaldisc.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Round,flatstoragemediawithlayersofmaterialwhichenabletherecordingofdata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 86
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
Disclosure
Disclosuremeansthegivingoutofinformation,eithervoluntarilyortobeincompliancewithlegalregulationsorworkplacerules.
Source: EDRMPresentationGuide.
Discovery
Apre-trialprocessinwhicheachpartytriestofindalltheinformationheldbytheotherpartyandbycertainthirdpartiesthatisrelevant,probativeandcanbeadmittedintoevidenceattrial.Eachpartyisrequiredtocooperatewiththeothertotheextentrequiredbytherelevantrulesofcivilprocedure.
Source: RenewData,Glossary(10/5/2005).
Thepre-trialprocedurebywhicheachpartygainsinformationheldbytheadversepartyconcerningacase.Discoveryisalsothedisclosureoffacts,documents,electronicallystoredinformationandtangibleobjectsbyanadverseparty.
Source: IbisConsulting,Glossary.
TheprocessofprovidingdocumentstoanopposingpartyinUSlitigation.UnliketheUK,theUSprocessisdrivenby"pull"i.e.apartyneedstospecifythedocumentstheywanteachopposingpartytoproduce.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
Seealso:
Computerevidence
Computerforensics
Computerinvestigations
Electronicdiscovery/e-discovery
Electronicevidence
Forensicanalysis
Forensics
Mirroring
©2016EDRMLLC
DiscoveryRequest
Anofficialrequest,bytheopposingattorney,todeliverdocumentsrelevanttoparticularcaseissues.
Seealso:
Documentrequest Interrogatory Requestforadmission
DiscoveryTracking
Theuseofadatabasetomonitortheprogressofdiscoveryaswellasthecontentandconsistencyofdiscoveryresponses.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Disk
Astoragemediumcapableofstoringlargeamountsofdata.Disktypesincludemagneticdisks(bothharddisksandfloppydisks)andopticaldisks.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Amagneticfloppyorharddisk.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Round,flatstoragemediawithlayersofmaterialwhichenabletherecordingofdata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Itmaybeafloppydisk,oritmaybeaharddisk.Eitherway,itisamagneticstoragemediumonwhichdataisdigitallystored.AdiscmayalsorefertoaCD-ROM.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
©2016EDRMLLC
DiskDrive
Thedevicethathousesadiskandcontrolstheconnectionbetweenthecomputerandthemagneticdisk.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Floppydiskdrive
Jazdrive
Magneto-opticaldrive
Portabledrive
Storagedevice
Tapedrive
Zipdrive
DiskMirroring
Whenfilesarestoredonacomputersystem'sharddisk,a"mirror"copyismadeonanadditionalharddiskoraseparatepartofthesamedisktosafeguardinformationincaseofdisaster.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Amethodofdatabackupthatcopiesor“mirrors”eachsavedfileonaharddiskontoasecondharddisk.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DiskOperatingSystem(DOS)
Acronymfordiskoperatingsystem.ThetermDOScanrefertoanyoperatingsystem,butitismostoftenusedasashorthandforMS-DOS(Microsoftdiskoperatingsystem).OriginallydevelopedbyMicrosoftforIBM,MS-DOSwasthestandardoperatingsystemforIBM-compatiblepersonalcomputers.
Source: http://www.webopedia.com/TERM/D/DOS.html
Asetofprogramsthatcontrolsthecomputerandsupportssoftwareapplications.MS-DOS(MicrosoftDiskOperatingSystem)ispopularbecauseitwasthesystemusedintheoriginalIBMPCandsubsequentclones.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Linux
MicrosoftDOS
MicrosoftWindows
Networkoperatingsystem
NOS
Operatingsystem
OS
UNIX
Windows
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 89
Xenix
Diskette
Synonymfor“floppydisk.”
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
DistributedData
Datathatresidesonportablemediaandnon-localdevicessuchaslaptopcomputers,homecomputers,CD-ROMs,floppydisks,zipdrives,wirelesscommunicationdevices,personaldigitalassistants(PDAs),webpages,InternetrepositoriessuchasemailhostedbyInternetserviceprovidersorportals,andthelikethatbelongstotheorganizationandnottheuser.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Informationwhichresidesonnon-localdevicessuchashomecomputers,laptopcomputers,PDAs,orevenInternetrepositories.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Distributeddataisthatinformationbelongingtoanorganizationwhichresidesonportablemediaandnon-localdevicessuchashomecomputers,laptopcomputers,floppydisks,CD-ROMs,personaldigitalassistants(“PDAs”),wirelesscommunicationdevices(i.e.,Blackberry),zipdrives,Internetrepositoriessuchase-mailhostedbyInternetserviceprovidersorportals,Webpages,andthelike.Distributeddataalsoincludesdataheldbythirdpartiessuchasapplicationserviceprovidersandbusinesspartners.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Dithering
Creatingtheillusionofnewcolorsandshadesbyvaryingthepatternofdots.Newspaperphotographs,forexample,aredithered.Ifyoulookclosely(seeexamplebelow),youcansee
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 90
thatdifferentshadesofgrayareproducedbyvaryingthepatternsofblackandwhitedots.Therearenograydotsatall.Themoreditherpatternsthatadeviceorprogramsupports,themoreshadesofgrayitcanrepresent.Inprinting,ditheringisusuallycalledhalftoning,andshadesofgrayarecalledhalftones.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigation
Support(2005).
DLT
See: DigitalLinearTape(DLT)
DMS(DocumentManagementSystem)
Essentiallyadatabasetostoreandretrievefirmdocumentsbyclient/matternumber,title,author,dateand/orkeywords.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Document
Anyfileproducedbyasoftwareapplication.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Anyfileproducedbyasoftwareapplication.Includesbutisnotlimitedtoanyelectronicallystoreddataonmagneticoropticalstoragemediaasan"active"fileorfiles(readilyreadablebyoneormorecomputerapplicationsorforensicssoftware);any"deleted"butrecoverableelectronicfilesonsaidmedia;anyelectronicfilefragments(filesthathavebeendeletedandpartiallyoverwrittenwithnewdata);andslack(datafragmentsstoredrandomlyfromrandomaccessmemoryonaharddriveduringthenormaloperationofacomputer[RAMslack]orresidualdataleftontheharddriveafternewdatahasoverwrittensomebutnotallofpreviouslystoreddata).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 91
Source: RSI,Glossary.
SeeRule34oftheFederalRulesofCivilProcedure.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Oneorseveralsinglepagesofimagesthatmakealogicalsinglecommunicationofinformation.Examplesincludealetter,areport,amemooranairlineticket.A"document"maybeanymeansofcommunicating,informingoreducating,includinghard-copypaper,electronicdocuments,e-mail,voicemail,video,x-rays,drawings,etc.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Adocumentisapageorcollectionofpagesthatarephysicallyorlogically(orboth)linked.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Fed.R.Civ.P.34(a)definesadocumentas“includingwritings,drawings,graphs,charts,photographs,phonorecords,andotherdatacompilations.”Intheelectronicdiscoveryworld,adocumentalsoreferstoacollectionofpagesrepresentinganelectronicfile.E-mails,attachments,databases,worddocuments,spreadsheets,andgraphicfilesareallexamplesofelectronicdocuments.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
InthecontextofElectronicDiscovery,adiscreteitemofElectronicallyStoredInformationthatmaybethesubjectorresultofasearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
DocumentAssembly
Softwarefunctionthatgathersfactsaboutaclient,thenmergesdataandtexttodraftauniquedocumentforthatclientthatvariesdependingonthefactsofeachcase.Typicallyisperformedbyansweringaseriesofquestionsorextractingdatafromadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DocumentBoundaries
Thebeginningandendingpagesofadocument.Afolderfullofpapersmayhavenoobviousindicationwhereonedocumentendsandanotherbegins(e.g.,ifthestaplesorpaperclipsthatheldthedocumentstogetherwereremoved).Whenpaperdocumentsarescanned,theboundariesbetweenthedocumentsusuallyhastobedeterminedandnoted.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 92
DocumentCollection
TheprocessofgatheringElectronicallyStoredInformationforsearch,review,andproduction;thesetofDocumentsresultingfromsuchaprocess.Inmanycases,theDocumentCollectionandDocumentPopulationarethesame;however,itisimportanttonotethatDocumentPopulationreferstothesetofDocumentsoverwhichaparticularStatisticalEstimateiscalculated,whichmaybetheentireDocumentCollection,asubsetoftheDocumentCollection(e.g.,thedocumentswithaparticularfiletypeormatchingparticularSearchTerms),asupersetoftheDocumentCollection(e.g.,theuniversefromwhichtheDocumentCollectionwasgathered),oranycombinationthereof.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
DocumentDate
Theoriginalcreationdateofadocumentusuallynotedonthedocumentitself.Inthecaseofaletter,whentheletterwaswrittenindicatedbythedateoftheletter.Onanemailindicatedbythedate-stampoftheemail.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DocumentDepository
Alibraryofhardcopiesofalldocumentsinaspecificcase,sometimestheoriginals,andoftenrununderguidelinesspecifiedbythecourt.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acentrallibraryofalldocumentsinacase,eitherhardcopiesorimages,withsomeformofelectronicaccess.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DocumentEnhancement
Acontext-sensitiveannotationtoafull-textdocument.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DocumentInterchangeArchitecture
See: DIA/DCA(DocumentInterchangeArchitecture)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 93
DocumentManagementSystem
See: DMS(DocumentManagementSystem)
DocumentMetadata
Datastoredinadocumentaboutthedocument.Oftenthisdataisnotimmediatelyviewableinsoftwareapplicationusedtocreate/editthedocument,butoftencanbeaccessedviaa"properties"view.Contrastwithfilesystemmetadataandemailmetadata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Datastoredinadocumentaboutthedocument.Oftenthisdataisnotimmediatelyviewableinsoftwareapplicationusedtocreate/editthedocument,butoftencanbeaccessedviaa"properties"view.Example:LastAccessedDate,LastEditedBy,Users,etc.
Seealso:
Customer-addedmetadata
Emailmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
File-specificmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
DocumentNumber
SimilartoaBatesnumber,adocumentnumberisauniqueidentifierassignedtoadocumentorfile.Documentnumbersareusedtotrackdocumentsorfilesthroughoutoneormorelawsuitsorsimilarproceedings.
Seealso:
Batesnumber
Batesprefix
Batesstamp
Batesstamping
DocumentPopulation
ThesetofElectronicallyStoredInformationorDocumentsaboutwhichaStatisticalEstimationmaybemade.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 94
DocumentRequest
Inalawsuitorsimilarproceeding,awrittenrequestfromaparttoanotherpartyortoanon-partyaskingtherecipienttoproduceorpermittheinspectionofdocuments,electronicallystoredinformation,ortangibleobjects.
Externallinks:
Rule34.ProducingDocuments,ElectronicallyStoredInformation,andTangibleThings,orEnteringontoLand,forInspectionandOtherPurposes,https://www.law.cornell.edu/rules/frcp/rule_34
Seealso:
DiscoveryRequest Interrogatory Requestforadmission
DocumentRetentionPolicy
Asetofrulesfordetermininghowlongdocumentsofdifferenttypesneedtoberetainedforbusinessorlegalpurposes.Theserulesvarybybusinesstype,documenttype,andcounselstrategies.Adocumentretentionpolicyisonepartofarecordsmanagementscheme.
DocumentSegments
Documentsmaybesplitintomultiplesegments(suchasAbstract,Body,Title,References,Citation,etc.).TheBooleanoperatorsmaybelimitedtoaspecificdocumentsegment.Inthesesituations,youmayneedtospecifythesearchscopeofthedocument.
Source: EDRMSearchGlossary.
DocumentSizes
(U.S.):
• ASize8.5"by11"(A4)• BSize11"by17"(A3)• CSize17"by22"(A2)• DSize24"by36"(A1)• ESize36"by48"(A0)
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DocumentTemplate
Setsofindexfieldsfordocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 95
DocumentType
Atypicalfieldusedinbibliographicalcoding.Typicaldocumenttypeexamplesincludeletter,memo,report,articleandothers.OftenreferredtoasDocType.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DOS
See: DiskOperatingSystem(DOS)
DOSPrompt
Usuallyadiskdriveletterfollowedbythegreaterthan(>)symbol.ItisthepositionfromwhichDOSfunctionsareexecutedmanuallyifthecomputerhasnocommonuserinterface(CUI)orGraphicalUserInterface(GUI).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DotPitch
DistanceofonepixelinaCRTtothenextpixelontheverticalplane.Thesmallerthenumber,thehigherqualitydisplay.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DotsPerInch(DPI)
Ameasurementofscannerresolution.Thenumberofpixelsascannercanphysicallydistinguishineachverticalandhorizontalinchofanoriginalimage.Documentsarenormallyscannedataresolutionofbetween200dpiand400dpi.
SOURCE:RSI,Glossary.
Double-SidedScanner
Double-sidedscanningusesasingle-sidedscannertoscandouble-sidedpages,scanningonecollatedstackofpaper,thenflippingitoverandscanningtheotherside.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Duplexscanner
Flatbedscanner
Scanner
Simplexscanner
©2016EDRMLLC
Download
Totransferdatatotheuser’scomputerfromanothercomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DPA
See: DataProtectionAct(DPA)
DPI
See: DotsPerInch(DPI)
Drag-and-Drop
Acommonwaytomoveorcopyafileorfolderistohighlightitandliterally“drag”acopiedversionofittoanotherlocation.Firstthemousewouldbeusedtohighlightthefile.Thenwhileholdingdowntheleftmousebutton,thenameofthefilewouldbedraggedtoanewlocation.Inthebackground,theoperatingsystemcreatesanewcopyandplacesitinthenewlocation.Forexample,youcandragafiletotheRecycleBintodeletethefile,ortoafoldertocopyormoveittothatlocation.
Source: EDRMCollectionStandards
Themovementofon-screenobjectsbydraggingthemacrossthescreenwiththemouse.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DRAM(DynamicRandomAccessMemory)
Amemorytechnologywhichisperiodically"refreshed"orupdated–asopposedto"static"RAMchipswhichdonotrequirerefreshing.Thetermisoftenusedtorefertothememorychipsthemselves.Varietiesare:
• CDRAM:CacheDRAM(containsstaticcache)• EDODRAM:ExtendeddataoutDRAM• EDRAM:EnhancedDRAM(containsastaticmemorybufferandcachecontroller)• SDRAM:SynchronousDRAM(addedclockandburstaddressingcapability)• SGRAM:SynchronousGraphicsRAM(asingleportSDRAM)• WRAM:WindowRAM(dualportvideoRAM)• VRAM:VideoRAM(adualportedDRAM,goodforgraphics)
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Memory RAM ROM
©2016EDRMLLC
DroppedItems
AnotherformofvalidationutilizedtoensurethatResponsiveitemsarenotbeinginadvertentlyomittedthroughchangestothesearchcriteria.Asthesearchcriteriasetisbeingupdatedandmodifiedduringtheinitialinvestigationandanalysis,acomparisonwouldsampledocumentsthatwereoriginallyresultsofonesearchcriteriasetbutarenolongerresultsofthemodifiedsearchcriteriaset.IfResponsivedocumentsarefounduponreviewofdroppeditems,specialattentionshouldbepaidtodeterminewhetheradditionaltermsneedtobecreatedtocapturetheseitemsorifmodificationsmadetothecriteriashouldbechangedsotheseorsimilaritemswouldbeincludedintheresults.
Source: EDRMSearchGlossary.
DSP
See: DigitalSignalProcessor(Processing)(DSP)
DucesTecum
Latinfor"bringwiththee."Usuallymeanscometoadepositionorcourtappearancewithdocuments.
Source: IbisConsulting,Glossary.
DumbTerminal
Anetworkterminalwithkeyboard,monitor,andnetworkinterfacebutnoharddriveforindependentprocessingcapability.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Duplex
Theabilityofascannertoscanbothsidesofasheetsimultaneously.Requirestwoscannercamerasandoftentwoprocessingboards.
Source: RSI,Glossary.
Two-sidedpage(s).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
DuplexScanner
Duplexscannersautomaticallyscanbothsidesofadouble-sidedpage,producingtwoimagesatonce.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 98
Seealso:
Double-sidedscanner
Flatbedscanner
Scanner
Simplexscanner
Duplicate
Anexactduplicateofanotherdocumentinadatabase.Duplicatestypicallyarisewhenmultipledocumentproductionsfromseparatesourcesarecodedandcontaincopiesofthesamedocuments.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
De-duplication
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
Productionde-duplication
VerticalDeduplication
Duty
Legalobligationformanagingthe“Risk”associatedwithspecificinformation.LegalandRIMhavetheresponsibilityforlegaldutiesandobligations(i.e.,legalholdpreservationandregulatoryretentionobligations).
Source: IGRMWhitePaper
DVD
See: DigitalVersatileDisc(DVD)
DVD-ROM
Digitalversatiledisc-readonlymemory(DVD-ROM)isaread-onlydigitalversatiledisc(DVD)commonlyusedforstoringlargesoftwareapplications.Itissimilartoacompactdisk-readonlymemory(CD-ROM)buthasalargercapacity.ADVD-ROMstoresaround4.38GBofdata.ACD-ROMusuallystores650MBofdata.
Source: Technopedia,DigitalVersatileDisc-ReadOnlyMemory(DVD-ROM),https://www.techopedia.com/definition/24480/digital-versatile-disc-read-only-memory--dvd-rom
Seealso:
CD CD-R CD-ROM
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 99
CD-RW
Disc
Disk
Diskette
DVD
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
DynamicDe-Duplication
Aproprietarydynamicde-duplicationtechnologydesignedtohandlelargevolumesofinformationandensurethatuniquemeta-dataandoriginalcontentarestoredonlyonce.Thisdynamicde-duplicationprocessisexecutedasthedataflowsoffthetapes,inordertoavoidlargeandexpensiveprocessingandstoragerequirements.
Source: RenewData,Glossary(10/5/2005).
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
De-duplication
Duplicate
Productionde-duplication
DynamicRandomAccessMemory
See: DRAM(DynamicRandomAccessMemory)
E
E-Mail(ElectronicMail)
Electronicmail,commonlyreferredtoas"e-mail"or"email,"isanelectronicmeansforcommunicatinginformationunderspecifiedconditions,generallyintheformoftextmessages,throughsystemsthatwillsend,store,process,andreceiveinformationandinwhichmessagesareheldinstorageuntiltheaddresseeaccessesthem.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Asimpletextmessage--apieceoftextsenttoarecipient.Inthebeginningandeventoday,e-mailmessagestendtobeshortpiecesoftext,althoughtheabilitytoaddattachmentsnowmakesmanye-mailmessagesquitelong.Evenwithattachments,however,e-mailmessagescontinuetobetextmessages.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingHowStuffWorks,http://computer.howstuffworks.com/email.htm/printable.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 100
Thewholeofanelectronicdocumentcontainingthemessageenvelopeandmessagecontent(attachments,etc.).
Source: RenewData,Glossary(10/5/2005).
Electronicmail,orcomputer-basedmail.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Anydatasetorgroupoffilesoriginatingfrommailcontainersore-mailsystems.Thisincludessingle-mailitemsoutsideoftheirmailapplications,likeMSG(Outlook)andEMLfiles(RFC822singlemailcontainers),RFC822mailfoldersaswellasmulti-mailarchives(PST,NSF,etc.).
Source: IbisConsulting,Glossary.
EarlyCaseAssessment(ECA)
Anindustry-specifictermgenerallyusedtodescribeavarietyoftoolsormethodsforinvestigatingandquicklylearningaboutaDocumentCollectionforthepurposesofestimatingtherisk(s)andcost(s)ofpursuingaparticularlegalcourseofaction.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Awidelyabusedterminwhichcorporatedataissiftedandcategorisedwithaviewtodetermininganorganisation'sexposureinthecontextofadispute.ThebestECAsystemsallowthesiftingtotakeplacewithinacorporation'sowndatastoreandcanbeusedtodrilldownrapidlytoidentifythemostpertinentevidentiarymaterialandtofacilitatedecisionswhethertolitigateorsettle.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
EB(Exabyte)
1millionterabytes.TheUSbusinesscommunityisestimatedtohavecreated35-50exabytesofelectronicdatain2004.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Bit
Byte
KB-kilobyte
MB-megabyte
GB-gigabyte
TB-terabyte
PB-petabyte
©2016EDRMLLC
EDI-OracleStudy
Anongoinginitiative(asofJanuary2013)oftheElectronicDiscoveryInstitutetoevaluateparticipatingvendors’searchanddocumentrevieweffortsusingaDocumentCollectioncontributedbyOracleAmerica,Inc.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDMS(ElectronicDocumentManagementSystem)
EDMS-electronicdocumentmanagementsystemisasoftwareprogramthatmanagesthecreation,storageandcontrolofdocumentselectronically.TheprimaryfunctionofanEDMSistomanageelectronicinformationwithinanorganizationworkflow.AbasicEDMSshouldincludedocumentmanagement,workflow,textretrieval,andimaging.AnEDMSmustbecapableofprovidingsecureaccess,maintainingthecontext,andexecutingdispositioninstructionsforallrecordsinthesystem.
Source: EDMS-ElectronicDocumentManagementSystem,http://www.edms.net
EDRM
EDRMisanorganizationthatcreatespracticalresourcestoimprovee-discoveryandinformationgovernance.Since2005thee-discoverycommunityhasreliedonEDRMforleadership,standards,bestpractices,tools,guidesandtestdatasetstoimproveelectronicdiscoveryandinformationgovernance.Memberindividuals,lawfirms,corporationsandgovernmentorganizationsactivelycontributetothedirectionofEDRM.
EDRMDiagram
TheEDRMdiagramisconceptual,non-linear,iterativemodelofthee-discoveryprocess.
Itrepresentsaconceptualviewofthee-discoveryprocess,notaliteral,linearorwaterfallmodel.Onemayengageinsomebutnotallofthestepsoutlinedinthediagram,oronemayelecttocarryoutthestepsinadifferentorderthanshownhere.
Thediagramalsoportraysaniterativeprocess.Onemightrepeatthesamestepnumeroustimes,honinginonamoreprecisesetofresults.Onemightalsocyclebacktoearliersteps,refiningone’sapproachasabetterunderstandingofthedataemergesorasthenatureofthematterchanges.
Source: EDRMDiagramElements
EDRMFramework
See: EDRMDiagram
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 102
EDRMFrameworkGuides
TheEDRMframeworkguidesareaseriesofpracticalguidesdevelopedforeachstageofthee-discoveryprocessasdepictedintheEDRMframework.
EDRMPhases
TheElectronicDiscoveryReferenceModel,alsoreferredtoasEDRMortheEDRMdiagram,outlinesthekeyprocessesandstagesofthee-discoveryprocessintheformofnineinterrelatedphases:InformationGovernance,Identification,Preservation,Collection,Processing,Review,Analysis,Production,andPresentation.Eachphaserepresentsacorestageofthee-discoveryprocess.Bybreakingthee-discoveryprocessintophases,practitionerscanleveragecoreresources(i.e.people,technology,andprocesses)inamoreorganizedfashiontoachievedesiredresults.
Source: EDRMMetricsGlossary
EGA(EnhancedGraphicsAdapter)
ShortforEnhancedGraphicsAdapter,EGAisavideostandardmanufacturedbyIBMin1984withahigherresolution(640x350)andmorecolors(16fromapaletteof64)whencomparedtoearlierstandardssuchasCGA.
Source: ComputerHope,EGAdefinition,http://www.computerhope.com/jargon/e/ega.htm
TheEnhancedGraphicsAdapter(EGA)isahistoricalIBMPCcomputerdisplaystandardfrom1984thatsupersededandexceededthecapabilitiesoftheCGAstandardintroducedwiththeoriginalIBMPC,andwasitselfsupersededbytheVGAstandardin1987.
Source: Wikipedia,EnhancedGraphicsAdapter,https://en.wikipedia.org/wiki/Enhanced_Graphics_Adapter
EIA(ElectronicIndustriesAssociation)
Atradeassociation.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EISA(ExtendedIndustryStandardArchitecture)
OneofthestandardbusesusedforPCs.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ElectronicDiscovery/E-Discovery
Discoverydocumentsproducedinelectronicformatsratherthanhardcopy.Theproductionmaybecontainedonharddrives,tapes,CDs,DVDs,externalharddrives,etc.Oncereceived,
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 103
thesedocumentsareconvertedto.tifformat.Itisduringtheconversionprocessthatmetadatacanbeextracted.
Source: RSI,Glossary.
Aprocessthatincludeselectronicdocumentsandemailintoacollectionof"discoverable"documentsforlitigation.Usuallyinvolvesbothsoftwareandaprocessthatsearchesandindexesfilesonharddrivesorotherelectronicmedia.Extractsmetadataautomaticallyforuseasanindex.Mayincludeconversionofelectronicdocumentstoanimageformatasifthedocumenthadbeenprintedoutandthenscanned.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Thediscoveryofelectronicdocumentsanddataincludinge-mail,Webpages,wordprocessingfiles,computerdatabases,andvirtuallyanythingthatisstoredonacomputer.Technically,documentsanddataare“electronic”iftheyexistinamediumthatcanonlybereadthroughtheuseofcomputers.Suchmediaincludecachememory,magneticdisks(suchascomputerharddrivesorfloppydisks),opticaldisks(suchasDVDsorCDs),andmagnetictapes.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Theprocessoffinding,identifying,locating,retrieving,andreviewingpotentiallyrelevantdataindesignatedcomputersystems.
Theprocessofidentifying,preserving,collecting,processing,searching,reviewingandproducingElectronicallyStoredInformationthatmaybeRelevanttoacivil,criminal,orregulatorymatter.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
Computerevidence
Computerforensics
Computerinvestigations
Discovery
Electronicevidence
Forensicanalysis
Forensics
Mirroring
ElectronicDocument
Adocumentthathasbeenscanned,orwasoriginallycreatedonacomputer.Documentsbecomemoreusefulwhenstoredelectronicallybecausetheycanbewidelydistributedinstantly,andallowsearching.HTMLandPDFarewellknownelectronicdocumentformats.
Source: RSI,Glossary.
ElectronicDocumentDiscovery
See: ElectronicDiscovery/E-Discovery
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 104
ElectronicDocumentManagementSystem
See: EDMS(ElectronicDocumentManagementSystem)
ElectronicEvidence
AccordingtoBlack’slawdictionary,evidenceis“anyspeciesofproof,orprobativematter,legallypresentedatthetrialofanissue,bytheactofthepartiesandthroughthemediumofwitnesses,records,documents,exhibits,concreteobjects,etc.forthepurposeofinducingbeliefinthemindsofthecourtorjuryastotheircontention.”Electronicinformation(likepaper)generallyisadmissibleintoevidenceinalegalproceeding.
Source: RenewData,Glossary(10/5/2005).
Anycomputer-generateddatathatisrelevanttoacase.Includedareemail,textdocuments,spreadsheets,images,databasefiles,deletedemailandfilesandback-ups.Thedatamaybeonfloppydisk,zipdisk,harddrive,tape,CDorDVD.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingNorcrossGroupFAQ's,http://norcrossgroup.com/faq.html#5.
Seealso:
Computerevidence
Computerforensics
Computerinvestigations
Discovery
Electronicdiscovery/e-discovery
Forensicanalysis
Forensics
Mirroring
ElectronicIndustriesAssociation
See: EIA(ElectronicIndustriesAssociation)
ElectronicMail
See: E-Mail(ElectronicMail)
ElectronicMailMessage
Commonlyreferredtoas“e-mail”,anelectronicmailmessageisadocumentcreatedorreceivedviaanelectronicmailsystem,includingbriefnotes,formalorsubstantivenarrativedocuments,andanyattachments,suchaswordprocessingandotherelectronicdocuments,whichmaybetransmittedwiththemessage.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
ElectronicRecord
Informationrecordedinaformthatrequiresacomputerorothermachinetoprocessitandthatotherwisesatisfiesthedefinitionofarecord.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 105
Elusion
ThefractionofDocumentsidentifiedasNon-RelevantbyasearchorrevieweffortthatareinfactRelevant.ElusionisestimatedbytakingaRandomSamplefromtheNullSetanddetermininghowmany(orwhatProportionof)DocumentsareactuallyRelevant.AlowElusionvaluehascommonlybeenadvancedasevidenceofaneffectivesearchorrevieweffort(see,e.g.,Kleen),butthatcanbemisleadingbecauseitquantifiesonlythoseRelevantDocumentsthathavebeenmissedbythesearchorrevieweffort;itdoesnotquantifytheRelevantDocumentsfoundbythesearchorrevieweffort(i.e.,Recall).Consider,forexample,aDocumentPopulationcontainingonemillionDocuments,ofwhichtenthousand(or1%)areRelevant.Asearchorrevieweffortthatreturned1,000Documents,noneofwhichwereRelevant,wouldhave1.001%Elusion,belyingthefailureofthesearch.Elusion=100%–NegativePredictiveValue.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Aninformationretrievalmeasureoftheproportionofresponsivedocumentsthathavebeenmissed.Mostoftenusedasaqualityassurancemeasureinwhichasampleofnon-retrieveddocumentsisevaluatedtodeterminewhetherareviewhasmetreasonablecriteriaforcompleteness.
Source: HerbRoitblat,PredictiveCodingGlossary.
Em
Inanyprintfontorsizeisequaltothewidthoftheletter"M"inthatfontandsize.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
See: E-Mail(ElectronicMail)
EmailAddress
Anelectronicmailaddress.Emailaddressesfollowtheformula:[email protected],auser'semailaddressis"aliased"orrepresentedbytheirnaturalnameratherthantheirfullyqualifiedemailaddress.Forexample,[email protected].
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
EmailAttachment
Electronicfilesthataresentalongwithanemail.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 106
EmailMessageStore
Atopmoste-mailmessagestoreisthelocationinwhichane-mailsystemstoresitsdata.Forinstance,anOutlookPST(personalstoragefolder)isatypeoftopmostfilethatiscreatedwhenauser’sMicrosoftOutlookmailaccountissetup.AdditionalOutlookPSTfilesforthatusercanbecreatedforbackingupandarchivingOutlookfolders,messages,formsandfiles.Similartoafilingcabinet,whichisnotconsideredpartofthepaperdocumentscontainedinit,atopmoststoregenerallyisnotconsideredpartofafamily.
Source” KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EmailMetadata
Datastoredinanemailabouttheemail.Oftenthisdataisnotevenviewableinemailclientapplicationusedtocreatetheemail.Theamountofemailmetadataavailableforaparticularemailvariesgreatlydependingontheemailsystem.Contrastwithfilesystemmetadataanddocumentmetadata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary
Datastoredinanemailabouttheemail.Oftenthisdataisnotevenviewableinemailclientapplicationusedtocreatetheemail.Theamountofemailmetadataavailableforaparticularemailvariesgreatlydependingontheemailsystem.Example:Subject,SentDate,To,Attachment,etc.
Seealso:
Customer-addedmetadata
Documentmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
File-specificmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
EmailThreading
Groupingtogetheremailmessagesthatarepartofthesamediscourse,sothattheymaybeunderstood,reviewed,andcodedconsistentlyasaunit.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Embed
Toinsertanobjectinitsnativeformatintoacompounddocument.
Source: IbisConsulting,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 107
EmbeddedChart
Achartoragraphthatwouldnormallybedisplayedwithinaspreadsheet,butthatmayblockunderlyingtextordata.
EmbeddedObject
Adatacontainerwithoutfilepropertiesallocatedwithinanotherfilesuchasagraphic,MSWord,orMSExcelfile(havingthepropertiesofthatobject),typicallyrepresentedbyanapplication-specificiconinthebodyofanyTIFF'edcontent.Thisincludesinformationthatiscontainedinasourcefileandinsertedintoadestinationfile.Onceembedded,theobjectbecomesapartofthedestinationfile.
Source: IbisConsulting,Glossary.
Seealso:
Bibliographiccoding
Linkobject
Linksource
Linkedobject
Object
EML
EMLisafileextensionforane-mailmessagesavedtoafileintheMIMERFC822standardformatbyMicrosoftOutlookExpressaswellassomeotheremailprograms.
Source: EMLFileFormat,http://whatis.techtarget.com/fileformat/EML-Microsoft-Outlook-Express-mail-message-MIME-RFC-822.
AsingleRFC822mailfilemessage.
Source: IbisConsulting,Glossary
Anemailfileformat,usuallycontainingasingleemailmessage.
Seealso:
Container
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
EmptyFile
Afilewithnocontent,butwithfileproperties,structure,andtypicallyalsometadata(includingcommercialapplicationdata).
Source: IbisConsulting,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 108
Emulate
Toimitateadevicewithaseconddeviceusingagraphicaluserinterface.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
En
HalfthewidthofanEm.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EncapsulatedPostScript(EPS)
Uncompressedfilesforimages,textandobjects.OnlyprintonPostScriptprinters.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Encryption
Atechnologythatrendersthecontentsofafileunintelligibletoanyonenotauthorizedtoreadit.Encryptionisusedtoprotectinformationasitmovesfromonecomputertoanother.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Atechnologythatrendersthecontentsofafileunintelligibletoanyonenotauthorizedtoreadit.Encryptionisusedtoprotectinformationasitmovesfromonecomputertoanother,andisanincreasinglycommonwayofsendingcreditcardnumbersovertheInternetwhenconductinge-commercetransactions.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Theconversionofdataintoaformcalledciphertextthatcannotbeeasilyunderstoodbyunauthorizedpeople/applications.
Source: IbisConsulting,Glossary.
Thecodingofmessagestoincreasesecurityandmaketransmissiononlyreadablebyrecipientswiththeabilitytodecodeonlybyusingthesamealgorithms.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aprocedurethatrendersthecontentsofamessageorfileunintelligibletoanyonenotauthorizedtoreadit.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 109
Seealso:
Decryption
EndDocumentNumber
Thelastsinglepageimageofadocument.OftencalledEndDoc#.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
EndofFile(EOF)
Adistinctivecodewhichuniquelymarkstheendofadatafile.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EndUserProgram
Theprogramusedtoperformsearches,viewingandretrievalofascannedand/orcodedcollectionofimages.ExamplesincludeSummation,Concordance,JFSLitigatorsNotebook,Ringtail,Paradox,InMagicDB/Textworksandmanyothers.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Endorser
Alittleprinterinascannerthataddsadocument-controlnumbertoeachscannedsheet.Someformscontrolprocessingsoftwarecancontrolthisprinter.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 110
EnhancedGraphicsAdapter
See: EGA(EnhancedGraphicsAdapter)
EnhancedParallelPort(EPP)
AlsoknownasFastModeParallelPort.Anew,industrystandardparallelport,havinghightransfertimescompetitivewithSCSI.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EnhancedSmallDeviceInterface(ESDI)
Adefined,commonelectronicinterfacefortransferringdatabetweencomputersandperipherals,particularlydiskdrives.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EOF
See: EndofFile(EOF)
EORHB
EORHBv.HOAHoldingsLLC,Civ.ActionNo.7409-VCL,tr.andslipop.(Del.Ch.Oct.19,2012).ThefirstcaseinwhichacourtsuaspontedirectedthepartiestousePredictiveCodingasareplacementforManualReview(ortoshowcausewhythiswasnotanappropriatecaseforPredictiveCoding),absenteitherparty’srequestemployPredictiveCoding.ViceChancellorJ.TravisLasteralsoorderedthepartiestousethesameE-DiscoveryvendorandtoshareaDocumentrepository.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EPP
See: EnhancedParallelPort(EPP)
EPS
See: EncapsulatedPostScript(EPS)
eRecall
AnestimateofRecallcomputedfromprevalenceandproductionfrequency.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 111
Error/ErrorRate
ThefractionofallDocumentsthatareincorrectlycodedbyasearchorrevieweffort.NotethatAccuracy+Error=100%,andthat100%–Accuracy=Error.WhilealowErrorRateiscommonlyadvancedasevidenceofaneffectivesearchorrevieweffort,itsusecanbemisleadingbecauseitisheavilyinfluencedbyPrevalence.Consider,forexample,aDocumentPopulationcontainingonemillionDocuments,ofwhichtenthousand(or1%)areRelevant.AsearchorrevieweffortthatfoundnoneoftherelevantDocumentswouldhave1%Error,belyingthefailureofthesearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EscapingMechanism
Tosearchakeywordwhichcontainsawildcardcharactersuchasaquestionmark,anescapingmechanismisneededtosearch.Availabilityofmulti-characterwildcardsmaybelimitedinsomesystems.Somesearchenginesrequireacertainnumberofleadingcharactersanddonotsupportsearchtermsthatstartwithawildcard.
Source: EDRMSearchGlossary.
ESDI
See: EnhancedSmallDeviceInterface(ESDI)
ESI/ElectronicallyStoredInformation
ElectronicallyStoredInformationorESIisinformationthatisstoredelectronicallyonenumerabletypesofmediaregardlessoftheoriginalformatinwhichitwascreated.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
ElectronicallyStoredInformation:thisisanallinclusivetermreferringtoconventionalelectronicdocuments(e.g.spreadsheetsandwordprocessingdocuments)andinadditionthecontentsofdatabases,mobilephonemessages,digitalrecordings(e.g.ofvoicemail)andtranscriptsofinstantmessages.Allofthismaterialneedstobeconsideredfordisclosure.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
UsedinFederalRuleofCivilProcedure34(a)(1)(A)torefertodiscoverableinformation“storedinanymediumfromwhichtheinformationcanbeobtainedeitherdirectlyor,ifnecessary,aftertranslationbytherespondingpartyintoareasonablyusableform.”AlthoughRule34(a)(1)(A)references“DocumentsorElectronicallyStoredInformation,”individualunitsofreviewandproductionarecommonlyreferredtoasDocuments,regardlessofthemedium.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 112
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Ethernet
AcommonwayofnetworkingPCstocreateaLAN.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EvaluationOrder
Whiletheevaluationordershouldbeimmaterial,somesearchenginesproducedifferentresultsiftheorderisspecifieddifferently.Inotherimplementations,theperformanceofsearchisimpactedbytheorderofspecification.
Source: EDRMSearchGlossary.
Exabyte
See: EB(Exabyte)
ExceptionsReport
Areportlistingdocumentsthatcouldnotbeprocessedwithintheparametersofthenormalelectronicdiscoveryprocessing.Forvariousreasons,thedocumentslistedintheexceptionsreportcouldnotbeopened,theirtextextracted,ortheycouldnotbeproperlyimaged.Effectivesystemsminimizeexceptions,becausethesedocumentsmayrequirespecialprocessingmakingthemmoreexpensiveanddelayingtheproductionprocess.
ExpansionSlot
Aspaceinsideacomputerusedtoconnectaboardthatcontrolsotherfunctions,suchasascannerormodem,tothemotherboard.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ExperimentalDesign
Astandardprocedureacceptedinthescientificcommunityfortheevaluationofcompetinghypotheses.Therearemanyvalidexperimentaldesigns.SomethatcanbeappropriateforevaluatingTechnology-AssistedReviewprocessesincludeCrossoverTrialsandParallelTrials.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 113
ExploratorySearch
Adhocorsinglelogicalquery,likelytobeemployedinknowledgemanagementeffortontheleftside,orasadhocsearchaspartofcaseassessment,revieworpost-reviewwitnessprep.
Source: EDRMSearchGlossary.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
ExtendedIndustryStandardArchitecture
See: EISA(ExtendedIndustryStandardArchitecture)
ExtensibleMarkupLanguage(XML)
Codewhichdescribesthecontentofdata.
AsubsetofSGMLthatisusedtodescribethestructureandcontentofdocuments.The“extensible”partofitsnameindicatesthatitcanbeusedtocreatenewdatastructures,whichmakesitmorepowerfulthanHTML.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
HTML
Java
JavaScript
SGML
SGML/HyTime
©2016EDRMLLC
Extensions/SizesFilter
Afilteroptionthatallowsforincluding/excludingfileswithembeddedobjects,forincludingorexcludingcertainfiletypesornon-maile-mailitems(suchasCalendar,Appointments,Tasks,Contacts,etc),aswellasestablishingthresholdsforfilesize.
Source: IbisConsulting,Glossary.
Seealso:
Datefilter
Filter
MD5-knownfilter
Sender/recipientfilter
ExternalDrive
See: Portabledrive
Extranet
Anintranetconnectionthatismadeaccessibletoauthorizedusersoutsideofthenetwork.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Anintranettowhichtheownersprovidelimitedaccesstooutsideuserssuchasclientsorco-counsel.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AnInternetbasedaccessmethodtoacorporateintranetsitebylimitedortotalaccessthroughasecurityfirewall.Thistypeofaccessistypicallyutilizedincasesofjointventureandvendorclientrelationships.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
ExtrinsicData
Informationaboutafile,suchasfilesignature,author,size,name,path,andcreatingandmodificationdates.Thisdataistheaccumulationofwhatisinthefile,onthemedialabel,discoveredbytheoperator,andcontributedbytheuser.Collectively,itrepresentsoneofthevaluesofexamininganelectronicfileasopposedtotheprintedversion.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Customer-addedmetadata
Documentmetadata
Emailmetadata
Fileparameters
Filesystemmetadata
File-specificmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
©2016EDRMLLC
F
F
vanRijsbergen’sF.Aformulaforcombiningprecisionandrecallintoasinglenumbertomakeiteasiertocomparetheinformationretrievalaccuracyofdifferentsystems.
Source: HerbRoitblat,PredictiveCodingGlossary.
F1
TheHarmonicMeanofRecallandPrecision,oftenusedinInformationRetrievalstudiesasameasureoftheeffectivenessofasearchorrevieweffort,whichaccountsforthetradeoffbetweenRecallandPrecision.InordertoachieveahighF1score,asearchorrevieweffortmustachievebothhighRecallandhighPrecision.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
OneformofvanRijsbergen’sFformulaforcombiningprecisionandrecallintoasinglenumbertomakeiteasiertocomparetheinformationretrievalaccuracyofdifferentsystems.F1istheweightedharmonicmeanofprecisionandrecall=2*precision*recall/(precision+recall).
Source: HerbRoitblat,PredictiveCodingGlossary.
FacetedQuery
Asearchquerywhereinthesystemreturnsasetofalternativevaluesthataretypicallysubcategoriesoftheoriginalquery.Forexample,onawebsitesellingtelevisionsets,ausermightenteraninitialqueryconsistingoftheword“TV.”ThesystemwillthenpresentalistofvarioussubclassesofTVs,categorized,forexample,bythesizeofthescreen.IneDiscovery,asystemmightreturnalistofemailsendersinresponsetoaninitialquery.Afacetedqueryisatypeofqueryexpansion,wheretheexpandedqueriesarecategorizedandcanbeselected.
Source: HerbRoitblat,Search2020:TheGlossary.
Facsimile
Aprocessoftransmittingdocumentsbyscanningthemtodigital,convertingtoanalog,transmittingoverphonelinesandreversingtheprocessattheotherendandprinting."Group3"indicatesthe3rdgenerationoffaxeswhichtransmitsapageat9600baudinaboutaminute–withanormalresolutionof203x98dpiandafineresolutionof203x196.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 116
FalseNegative(FN)
ARelevantDocumentthatisMissed(i.e.,incorrectlyidentifiedasNon-Relevant)byasearchorrevieweffort.AlsoknownasaMiss.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Oneoffourresponsestatesinacategorizationtask.Truepositiveresponsesarethosethataretrulyinthepositivecategoryandareclassifiedasnegative.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
FalsePositive(FP) TrueNegative(TN) TruePositive(TP)
FalseNegativeRate(FNR)
Thefraction(orProportion)ofRelevantDocumentsthatareMissed(i.e.,incorrectlyidentifiedasNon-Relevant)byasearchorrevieweffort.NotethatFalseNegativeRate+Recall=100%,andthat100%–Recall=FalseNegativeRate.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
FalsePositive(FP)
ANon-RelevantDocumentthatisincorrectlyidentifiedasRelevantbyasearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Oneoffourresponsestatesinacategorizationtask.Falsepositiveresponsesarethosethataretrulyinthenegativecategoryandareclassifiedaspositive.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
FalseNegative(FN) TrueNegative(TN) TruePositive(TP)
FalsePositiveRate(FPR)
Thefraction(orProportion)ofNon-RelevantDocumentsthatareincorrectlyidentifiedasRelevantbyasearchorrevieweffort.NotethatFalsePositiveRate+TrueNegativeRate=100%,andthat100%–TrueNegativeRate=FalsePositiveRate.InInformationRetrieval,alsoknownasFallout.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 117
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
FamilyRange
AfamilyrangedescribestherangeofdocumentsfromthefirstBatesproductionnumberassignedtothefirstpageofthetopmostparentdocumentthroughthelastBatesproductionnumberassignedtothelastpageofthelastchilddocument.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
FamilyRelationship
Afamilyrelationshipisformedamongtwoormoredocumentsthathaveaconnectionorrelatednessbecauseofsomefactor.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
FastModeParallelPort
See: EnhancedParallelPort(EPP)
FAT(FileAllocationTable)
AninternaldatatableonDOS-baseddisksthatliststhecontentsandaddressofeachfileonthedisk.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Filesystem NTfilingsystem NTFS
FaxBoard
Anadapterthatisinstalledinsideacomputertoallowdirectfaxing.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Feature
Acharacteristicofanitem.Intext,afeatureisusuallyaword,butitcouldbeaphraseorothergroupingofwords.Inasearchengine,featuresaretheitemsthatarespecificallyindexed.
Source: HerbRoitblat,Search2020:TheGlossary.
FeatureEngineering
TheprocessofidentifyingFeaturesofaDocumentthatareusedasinputtoaMachineLearningAlgorithm.TypicalFeaturesincludewordsandphrases,aswellasmetadatasuchassubjects,
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 118
dates,andfiletypes.OneofthesimplestandmostcommonFeatureEngineeringtechniquesisBagofWords.MorecomplexFeatureEngineeringtechniquesincludetheuseofOntologiesandLatentSemanticIndexing.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Features
TheunitsofinformationusedbyaMachineLearningAlgorithmtoClassifyorPrioritizeDocuments.TypicalFeaturesincludetextfragmentssuchaswordsorphrases,andmetadatasuchassender,recipient,andsentdate.SeealsoFeatureEngineering.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
FiberOptics
Transmittingwithlightpulsesovercablesmadefromthinstrandsofglass.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Field
Anameforanindividualpieceofstandardizeddatatobeextractedfromanimagecollection.Fieldscanbetheauthorofadocument,arecipient,thedateofadocumentoranyotherpieceofdatacommontomostdocumentsinanimagecollection.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aunitofinformationinadatabase.Databaserecords,forexample,consistofanorderedlistoffieldswhereaspecifickindofinformationisstoredineachfield.Fieldsareoftenprintedascolumnsindatabasereports.
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 119
Subjectcategory Summary Text
FieldSeparator
Acode,usuallyacomma,thatseparatesthefieldsinarecord.(Also,adelimiter.)
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
FieldedSearch
Fieldedsearchesarebasedonvaluesstoredasmetadataratherthanactualcontentofanelectronicasset.Searchescanberefinedusingmetadatainformationextractedduringprocessing,suchassenderorreceiver,creationdate,modifieddate,author,filetypeandtitle,aswellassubjectiveuser-definedvaluesthatmaybeascribedtoadocumentaspartofdownstreamreview.SeealsoParametricSearch.
Source: EDRMSearchGlossary.
File
Adocumentorprogramaswellasaunitofstorageorfilemanagement.Eachfileisasetofbytes(eachbytetypicallyconsistsof8bits)thatisstoredonsomemedia,orinsideanarchive.FilescanbetransmittedovercommunicationlinesusingcommunicationprotocolssuchasSMTP/POP3(Mail),FTP,HTTP.Filesmay(ormaynot)havedifferentattributes(metadata).Therearemanydifferenttypesoffiles:datafiles,textfiles,programfiles,directoryfiles,andsoon.Differenttypesoffilesstoredifferenttypesofinformation.Forexample,programfilesstoreprograms,whereastextfilesstoretext.
Source: IbisConsulting,Glossary.
Anelementofdatastorageinafilesystem.Acollectionofdataorinformationthathasaname,calledthefilename.Almostallinformationstoredinacomputermustbeinafile.Therearemanydifferenttypesoffiles:datafiles,textfiles,programfiles,directoryfiles,andsoon.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Inwordprocessing,apieceoftextthatisusuallyonedocumentlong.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Inadatabase,acompletecollectionofrecordstreatedasoneunit.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acollectionoflogicallyrelateddatarecords.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 120
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acollectionofdataofinformationstoredunderaspecifiednameonadisk.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
FileAllocationTable
See: FAT(FileAllocationTable)
FileCompression
Atechnologyforstoringdatainfewerbits,itmakesdatasmallersolessdiskspaceisneededtorepresentthesameinformation.CompressionprogramslikeWinZipandUNIXcompressarevaluabletonetworkusersbecausetheysavebothtimeandbandwidth.Datacompressionisalsowidelyusedinbackuputilities,spreadsheetapplications,anddatabasemanagementsystems.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Atechnologyforstoringdatainfewerbits,itmakesdatasmallersolessdiskspaceisneededtorepresentthesameinformation.Datacompressioniswidelyusedtobackuputilities,spreadsheetapplications,anddatabasemanagementsystems.Compressedfilesmustbedecompressedinordertobeuseable.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Atechnologythatreducesthesizeofafile.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anymethodwhichreducestheamountofdatanecessarytotransmitinformationfromonepointtoanother.Compressiongenerallyeliminatesredundantinformationand/orpredictswherechangeswilloccur."Lossless"compressiontechniquestotallypreservetheintegrityoftheinput."Lossy"methodsdisregardsomeoftheoriginals.Theratioofthefilesizesofacompressedfiletoanuncompressedfile,e.g.,witha20:1compressionratio,anuncompressedfileof1MBiscompressedto50KB.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Atechnologythatreducesthesizeofafile.Compressionprogramsarevaluabletonetworkusersbecausetheyhelpsavebothtimeandbandwidth.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 121
FileExtension
Distinguishesafile'sformatfortheapplicationusedtocreatethefileandcanbeusedtosimplifytheprocessoflocatingdata.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Atagofthreeorfourletters,precededbyaperiod,whichidentifiesadatafile'sformatortheapplicationusedtocreatethefile.Fileextensionscanstreamlinetheprocessoflocatingdata.Forexample,ifoneislookingforincriminatingpicturesstoredonacomputer,onemightbeginwiththe.gifand.jpgfiles.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Thelast(typically3characters)followingaperiodinafilenamethatindicateswhatkindoffileitis.MSworddocumentstypicallyendwith".doc,"etc.Fileextensionsareusedbytheoperatingsystemtodeterminethedefaultapplicationtousetoopenafile.
InDOSandsomeotheroperatingsystems,oneorseverallettersattheendofafilename.Filenameextensionsusuallyfollowaperiod(dot)andindicatethetypeofinformationstoredinthefile.Forexample,inthefilenameLETTER.DOC,theextensionisDOC,whichindicatesthatthefileisawordprocessingfile.
FileParameters
Filedatawhichcanbereadfromthefilesystem,includingfolderlocationname,filename,creationdate,lastmodifieddate,lastaccesseddate,andfilesize.
Source: IbisConsulting,Glossary.
Seealso:
Customer-addedmetadata
Documentmetadata
Emailmetadata
Extrinsicdata
Filesystemmetadata
File-specificmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
FileServer
UtilizedwhenmanycomputersystemsareconnectedtogetheraspartofaLAN,afileservercanretainemailmessages,financialdata,wordprocessinginformation,orbeusedtobackupthenetwork.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Acomputerthatisthecentralstorageunitforalocalareanetwork(LAN).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
WhenseveralormanycomputersarenetworkedtogetherinaLANsituation,onecomputermaybeutilizedasastoragelocationforfilesforthegroup.Fileserversmaybeemployedtostoree-mail,financialdata,wordprocessinginformationortoback-upthenetwork.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 122
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Computer
Laptopcomputer
Microcomputer
Minicomputer
Notebookcomputer
Personalcomputer
Workstation
FileSet
Anygroupoffilestobeprocessed,eitherinoroutsideofacontainer.
Source: IbisConsulting,Glossary.
FileSharing
Sharingofcomputerdataorspaceonanetwork.Filesharingallowsmultipleuserstousethesamefilebybeingabletoread,modify,copyand/orprintit.Filesharingusersmayhavethesameordifferentlevelsofaccessprivilege.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Oneofthekeybenefitsofanetworkistheabilitytosharefilesstoredontheserveramongseveralusers.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
FileSignature
Withinafile,thefilesignatureistheinformationaboutthetrueprogram-relatedoriginofthefile,andtherefore,itstype.Toolsforreadingfilesignaturesidentifythetrueprogramsource,evenifthefileextensionhasbeenchanged.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
FileSystem
Thesystemthatanoperatingsystemorprogramusestoorganizeandkeeptrackoffiles.Forexample,ahierarchicalfilesystemisonethatuses[Directory|directories}]toorganizefilesintoatreestructure.Typesoffilesystemsincludefileallocationtable(FAT)andWindowsNTfilesystem(NTFS).
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Seealso:
FAT NTfilingsystem NTFS
©2016EDRMLLC
FileSystemMetadata
Datathatcanbeobtainedorextractedaboutafilefromthefilesystemstoringthefile.Contrastwithdocumentmetadataandemailmetadata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Seealso:
Customer-addedmetadata
Documentmetadata
Emailmetadata
Extrinsicdata
Fileparameters
File-specificmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
FileTransferProtocol(FTP)
TheprotocolforexchangingfilesovertheInternet.FTPworksinthesamewayasHTTPfortransferringWebpagesfromaservertoauser'sbrowserandSMTPfortransferringelectronicmailacrosstheInternet--inthat,likethesetechnologies,FTPusestheInternet'sTCP/IPprotocolstoenabledatatransfer.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingWebopediaComputerDictionary,http://www.pcwebopedia.com/TERM/F/FTP.html.
AnInternetprotocolthatenablesyoutotransferfilesbetweencomputersontheInternet.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingAppliedDiscovery'sGlossary,http://www.nysd.uscourts.gov/courtweb/pdf/D02NYSC/03-04265.PDF#page=24
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Aprotocolusedontheinternetforexchangingfiles.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
AnInternetprotocoltomovefilesfromonecomputertoanother.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
File-SpecificMetadata
File-specificmetadataisdefinedasdataaboutthefileitself,suchastitle,subject,author,keywords,comments,etc.forapplicationfilesandMessageID,header,text,timereceived,numberofattachments,etc.formailitems.
Source: IbisConsulting,Glossary.
Seealso:
©2016EDRMLLC
Customer-addedmetadata
Documentmetadata
Emailmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
Generalmetadata
Metadata
Vendor-addedmetadata
Filename
Thenamegiventoacomputerfileinordertodistinguishitfromotherfiles;maycontainanextensionthatindicatesthetypeoffile.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Thenameofafile.Allfileshavenames.Differentoperatingsystemsimposedifferentrestrictionsonfilenames.Mostoperatingsystems,forexample,prohibittheuseofcertaincharactersinafilenameandimposealimitonthelengthofafilename.Inaddition,manysystems,includingDOSandUNIX,allowafilenameextensionthatconsistsofoneormorecharactersfollowingtheproperfilename.Thefilenameextensionusuallyindicateswhattypeoffileitis.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Thenameofafile.Allfileshavenames.Differentoperatingsystemsimposedifferentrestrictionsonfilenames.Mostoperatingsystems,forexample,prohibittheuseofcertaincharactersinafilenameandimposealimitonthelengthofafilename.Inaddition,manysystems,includingDOSandUNIX,allowafilenameextensionthatconsistsofoneormorecharactersfollowingtheproperfilename.Thefilenameextensionusuallyindicateswhattypeoffileitis.ThelineofusuallyRomanalphabetcharactersplusafileattribute'sFILEEXTENSION(zip,doc,info,xwy)separatedbythesymbol'.'orafileattribute'sFILEEXTENSION(zip,doc,info,xwy).Thismay(orMAYNOT)beanindicatorofFILEFORMATFILEFORMAT.Fileformatspecificationallowsdifferentapplicationstounderstandthesamefiles.
Source: IbisConsulting,Glossary.
FilenameExtension
InDOSandsomeotheroperatingsystems,oneorseverallettersattheendofafilename.Filenameextensionsusuallyfollowaperiod(dot)andindicatethetypeofinformationstoredinthefile.Forexample,inthefilenameLETTER.DOC,theextensionis.DOC,whichindicatesthatthefileisawordprocessingfile.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 125
Filter
Variousmethodsofreducingadataset.
Source: IbisConsulting,Glossary.
Seealso:
Datefilter
Extensions/sizesfilter
MD5-knownfilter
Sender/recipientfilter
Filtering
Themethodofreducingadatasetbyapplyingoneormorefilters.
Source: IbisConsulting,Glossary.
Electronicfilteringofemailsandfilesforprivilegeorbykeyword,file,typeorname.Filteringremovesfilesthatdonotfitthesearchcriteriaandreducesthevolumeofdatathatrequiresfurtherinvestigation.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
FindSimilar
AsearchmethodthatidentifiesDocumentsthataresimilartoaparticularexemplar.FindSimilariscommonlymisconstruedtobethemechanismbehindTechnology-AssistedReview.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Fingerprinting
See: Hash
Firewall
Asystemintendedtothwartunauthorizedaccesstoorfromaprivatenetworkthatisoftenusedtopreventunauthorizedusersfromaccessingprivatenetworksconnectedtotheinternet.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Asetofrelatedprogramsthatprotecttheresourcesofaprivatenetworkfromusersfromothernetworks.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
FlashDrive
See: JumpDrive
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 126
FlatFileDatabase
Adatabasewithalldatainasinglelist,similartoatelephonebookoraRolodex.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Database
Fulltextdatabase
Relationaldatabase
SQL
WAIS-wideareainformationserver
FlatbedScanner
Ascannerdesigninwhichthedocumentisplacedinthescanner'sbed,eithermanuallyorbyanautomaticdocumentfeeder,andremainsstationaryduringscanning.Asaresult,flatbedscannersprovideamorestabletargetthanotherscannerdesigns,buttheyaregenerallyslower.
Source: RSI,Glossary.
Aflat-surfacescannerthatallowsuserstoinputbooksandotherdocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Double-sidedscanner
Duplexscanner
Scanner
Simplexscanner
FloppyDisk
Athinmagneticfilmdiskthatisusedasanoldermethodforstoringdata.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Smallremovabledisks,alsoknownasdiskettes,thatcomeintwosizes,3.5”and5.25”.Theamountofdatathatcanbestoredonadiskettedependsonthesize,andcanbe360kilobytesto1.4megabytes.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anincreasinglyrarestoragemediumconsistingofathinmagneticfilmdiskhousedinaprotectivesleeve.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 127
Diskette
DVD
DVD-ROM
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
FloppyDiskDrive
AFloppyDiskDrive,alsocalledFDDorFDforshort,isacomputerdiskdrivethatenablesausertosavedatatoremovablediskettes.Although8"diskdriveswerefirstmadeavailablein1971,thefirstrealdiskdrivesusedwerethe51/4"floppydiskdrives,whichwerelaterreplacedwiththe31/2"floppydiskdrives.
Source: ComputerHope,FDDdefinition,http://www.computerhope.com/jargon/f/fdd.htm
Seealso:
Diskdrive
Jazdrive
Magneto-opticaldrive
Portabledrive
Storagedevice
Tapedrive
Zipdrive
FN
See: FalseNegative(FN)
FogComputing
Usingacombinationoflocalandcloudcomputingresources.Datamaybekeptlocallyforsecurityreasons,toavoidtheexpenseandtimeofmovingthemtocentralizedcloudstorage,ortomeetcompliancerequirements.
Source: HerbRoitblat,Search2020:TheGlossary.
FolderBrowser
Asystemofon-screenfolders(usuallyhierarchicalor“stacked”)usedtoorganizedocuments.Forexample,theFileManagerprograminMicrosoftWindowsisatypeoffolderbrowserthatdisplaysthedirectoriesonyourdisk.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Font
Acompletesetofcharactersinadistinctivetypestyleandsize.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 128
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ForensicAnalysis
Thescientificanalysisofcomputermediaforlegalreasons.Typicallyforensicanalysisisintendedtodiscoverwhetherresponsivedocumentsorotherdatahavebeendeletedfromamachine.Specialsoftware,equipment,andtechniquesareusedtodetecthiddeninformation.Suchinvestigationshavebeenmorecommonincriminalproceedingsthaninlitigation.Strictprotocolsmustbeemployedtoavoidevidencespoliation.
Seealso:
Computerevidence
Computerforensics
Computerinvestigations
Discovery
Electronicdiscovery/e-discovery
Electronicevidence
Forensics
Mirroring
ForensicCopy
Anexactbit-by-bitcopyoftheentirephysicalharddriveofacomputersystem,includingslackandunallocatedspace.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Aprecisebit-by-bitcopyofacomputersystem'sharddrive,includingslackandunallocatedspace.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Bitstreamcopy
Image
Imagedcopy
Mirrorimage
ForensicallySoundProcedures
Proceduresusedforacquiringelectronicinformationinamannerthatensuresitis“asoriginallydiscovered”andisreliableenoughtobeadmittedintoevidence.SuchproceduresaredefinedinpartbytheUSDepartmentofJusticepublication“SearchingandSeizingComputersandObtainingElectronicEvidenceinCriminalInvestigations,”http://www.usdoj.gov/criminal/cybercrime/s&smanual2002.htm.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Chainofcustody Chainofevidence
©2016EDRMLLC
Forensics
Forelectronicdata,thediscoverydisciplinethatincludesthephysicalacquisitionofdigitaldatausingmethodologythatsatisfiesevidentiaryrequirementsofchain-of-custodyandauthentication.Preservingtheevidenceincludesperformingcodeandencryptioncracking,searchingandretrievingelusivedata,determiningiffileshaveorhavenotbeendeleted,recoveringdeletedfiles,anddetermininguse,includingInternet,networkaccess,printing,filing,andcopying.
Source: IbisConsulting,Glossary.
Indocumentmanagementterms,forensicworkiscomprisedof:
• Recreating“deleted”ormissingfilesfromharddrives• Validatingdatesandloggedinauthors/editorsofdocuments• Certifyingkeyelementsofdocumentsand/orhardwareforlegalpurposes
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Computerevidence
Computerforensics
Computerinvestigations
Discovery
Electronicdiscovery/e-discovery
Electronicevidence
Forensicanalysis
Mirroring
FormalSearch
Formalsearchincludesexecuting,tracking.reportingandmeasureimpact,anditeratethroughsetsofmultiplelogicalqueries.Seealso,IterativeSearch.
Source: EDRMSearchGlossary.
Format
Theinternalstructureofafile,whichdefinesthewayitisstoredandused.Specificapplicationsmaydefineuniqueformatsfortheirdata(i.e.,“MSWorddocumentfileformat”).Manyfilesmayonlybeviewedorprintedusingtheiroriginatingapplicationoranapplicationdesignedtoworkwithcompatibleformats.Computerstoragesystemscommonlyidentifyfilesbyanamingconventionthatdenotestheformat(andthereforetheprobableoriginatingapplication)(i.e.,“DOC”forMicrosoftWorddocumentfiles;“XLS”forMicrosoftExcelspreadsheetfiles;“TXT”fortextfiles;and“HTM”(forHypertextMarkupLanguage(HTML)filessuchasWebpages).Usersmaychoosealternatenamingconventions,butthismayaffecthowthefilesaretreatedbyapplications.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Theorganizationofdataonadisk.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 130
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Toprepareadiskforuse.Formattingadiskconsistsoferasingoldinformationonthediskandaddingnewcodestocontrolinformationrecording.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
FormsProcessing
Aspecializedimagingapplicationdesignedforhandlingpre-printedforms.Formsprocessingsystemsoftenusehigh-end(ormultiple)OCRenginesandelaboratedatavalidationroutinestoextracthand-writtenorpoorqualityprintfromformsthatgointoadatabase.Thistypeofimagingapplicationfacesmajorchallenges,sincemanyofthedocumentsscannedwereneverdesignedforimagingorOCR.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
FormsRouting
Theprocessofroutingaformthroughoutanorganizationelectronically–withoutanypapercopies.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
FormulaReport
Areportlistingthespreadsheetformulascellbycell.Theseformulasdescribehowacomputationwasperformedincalculatingthevaluesdisplayedinaspreadsheet.
FP
See: FalsePositive(FP)
FPR
See: FalsePositiveRate(FPR)
FragmentedData
Livedatathathasbeendisseminatedandstoredinmultipleareasonasingleharddriveordisk.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Fragmenteddataislivedatathathasbeenbrokenupandstoredinvariouslocationsonasingleharddriveordisk.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 131
“Live”datathathasbeenbrokenupandstoredinvariouslocationsonasingleharddrive.Mostfilesarestoredthisway.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Ambientdata
Freespace
Residualdata
Slackspace
Swapfile
Unallocatedspace
FreeSpace
Unusedclustersonaharddisk.
Source: PCMag,definitionoffreespace,http://www.pcmag.com/encyclopedia/term/56700/free-space
Seealso:
Ambientdata
Fragmenteddata
Residualdata
Slackspace
Swapfile
Unallocatedspace
FrequencyAnalysis
Utilizediterativelythroughoutthelifecycleofaprojectassearchcriteriaaremodified,frequencyanalysismaybeusedtoevaluatetheeffectivenessoftheinitialsearchcriteria.Thesearchtermsaretestedtodeterminewhethertheyeffectivelydiscriminatebetweenpotentiallyrelevantandclearlynon-relevantdata.Frequencyanalysisisarealitycheckonthesearchresultsversustheoverallcollectionsizeandthereasonablyexpectedproportionofrelevantresults.Itdoesnotaddresstherecallorcompletenessofrelevantitemsoutofthecollection.
Source: EDRMSearchGlossary.
FRN
See: FalseNegativeRate(FNR)
Front-End/Back-End
Expressionsthatdescribeprogramsrelativetotheuser.Afront-endprogramisonethatusersinteractwithdirectly,whileaback-endprogramsupportsthefront-endservices.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
FTP
See: FileTransferProtocol(FTP)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 132
FullDuplex
Datacommunicationsdeviceswhichallowfullspeedtransmissioninbothdirectionsatthesametime.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
FullTextDatabase
AdatabaseinwhichtheentiretextofdocumentsiselectronicallyavailableforsearchingbykeywordsorphrasesusingBooleanlogic.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Database
Flatfiledatabase
Relationaldatabase
SQL
WAIS-wideareainformationserver
FullTextIndexing
Enablestheretrievalofdocumentsbyeithertheirwordorphrasecontent.Everywordinthedocumentisindexedintoamasterwordlistwithpointerstothedocumentsandpageswhereeachoccurrenceofthewordappears.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
FullTextSearch
Afilteroptionthatallowsforincluding/excludingfilesthathavebeensearchedfordesignatedtermsorphrases(eitheruploadingalistinanytextfileformatorcreatedon-the-spot).Generalmetadata,filenames,andbodytextaresearched.
Source: IbisConsulting,Glossary.
Theabilitytosearchadatafileforspecifiedkey(s)definedbytheoccurrenceofwords,numbersand/orcombinationsorpatternsthereof.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Theabilitytosearchallofthewordsofadocument,notjustthosecontainedinspecialfields,metadata,codes,orsummaries.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 133
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
FunctionKey
Akeyonthekeyboardthatcontrolsspecializedfunctionsotherthannormaltyping.FunctionkeysincludeF1throughF10,CTRL,ALT,SHIFT,PAGEUP,PAGEDOWN,DELETE,andINSERT.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
FuzzySearch
Fuzzysearchallowssearchingforwordvariationssuchasinthecaseofmisspellings.Typically,suchsearchingincludessomeformofdistanceandscorecomputationsbetweenthespecifiedwordandthewordsinthecorpus.
Source: EDRMSearchGlossary.
AsearchtechniquethatidentifiesESIbasedontermsclosetoanotherterm,withclosenessdefinedasatypographicaldifferenceand/orchange.Forexample,snitch,switch,andswankycanallmatchswatch,dependingonhowmanyincorrectlettersareallowedwithinthesearchthreshold.
Source: EDRMSearchGuideGlossary.
Searchthatlocateswordscloselymatchthespellingoftheprimaryword.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Afull-textsearchprocedurethatlooksforexactmatchesaswellassimilaritiestothesearchcriteria,inordertocompensateforspellingorOCRerrors.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 134
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
G
GainCurve
AgraphthatshowstheRecallthatwouldbeachievedforaparticularCutoff.TheGainCurvedirectlyrelatestheRecallthatcanbeachievedtotheeffortthatmustbeexpendedtoachieveit,asmeasuredbythenumberofDocumentsthatmustbereviewedandCoded.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
GarbageIn,GarbageOut(GIGO)
Well-knowncomputeradagewhichreferstothefactthatthecontentsofadatabaseareonlyasgoodasthedataoriginallyentered.Dataenteredincorrectlywillnotprovideaccuratesearchresultsandwillleaduserstorelyonincorrectinformation.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
GaussianDistribution
Anormaldistribution.Abellshapedcurverepresentingthelikelihoodofdifferentvaluesastheydepartfromtheaverage.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
NormalDistribution
GaussianEstimate
AStatisticalEstimateofaPopulationcharacteristicusingGaussianEstimation.ItisgenerallyexpressedasaPointEstimateaccompaniedbyaMarginofErrorandaConfidenceLevel,orasaConfidenceIntervalaccompaniedbyaConfidenceLevel.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 135
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
GB(Gigabyte)
Approximatelyonebillionbytes.Oftenshortenedto"gigs"orGB.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Onebillionbytes.Alsoexpressedasonethousandmegabytes.Intermsofimagestoragecapacity,onegigabyteequalsapproximately17,00081/2"x11"pagesscannedat300dpi,storedasTIFFGroupIVimages.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Theequivalentofonebillion(actually1,073,741,824)bytes;oronemillionkilobytes,oronethousandmegabytes.
Equalto1,000megabytes(MB)or1,073,741,824bytes.
Seealso:
Bit
Byte
KB-kilobyte
MB-megabyte
TB-terabyte
PB-petabyte
EB-exabyte
GeneralMetadata
Generalmetadataisnotrealmetadata,butisdataaboutafileotherthanthecontents.Thisincludesdatasuchasoriginalname,datecreated,datelastmodified,fileextension,filetype,filesize,MD5,etc.forapplicationfilesandfoldername,size,subject,timecreated,MD5,etc.formailitems.File-specificmetadataisdefinedasdataaboutthefileitself,suchastitle,subject,author,keywords,comments,etc.forapplicationfilesandMessageID,header,text,timereceived,numberofattachments,etc.formailitems.
Source: IbisConsulting,Glossary.
Seealso:
Customer-addedmetadata
Documentmetadata
Emailmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
File-specificmetadata
Metadata
Vendor-addedmetadata
GHz(Gigahertz)
WhenreferringtoacomputerprocessororCPU,GHzisaclockfrequency,alsoknownasaclockrateorclockspeed,representingacycleoftime.AnoscillatorcircuitsuppliesasmallamountofelectricitytoacrystaleachsecondthatismeasuredinKHz,MHz,orGHz."Hz"isan
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 136
abbreviationofHertz,and"K"representsKilo(thousand),"M"representsMega(million),and"G"representsGiga(thousandmillion).
Source: ComputerHope,GHzdefinition,http://www.computerhope.com/jargon/g/ghz.htm
Seealso:
Hz KHz MHz
GIF(GraphicInterchangeFile)
Abit-mappedgraphicsfileformatusedbytheontheInternet.GIFsupportscolorandvariousresolutions.Italsoincludesdatacompression,butbecauseitislimitedto256colors,itismoreeffectiveforscannedimagessuchasillustrationsratherthancolorphotos.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Animagestorageformatthatiswidelyusedontheweb.
Source: RSI,Glossary.
AcompressedfileformatusedbytheCompuServesystemforphotographs.Limitedto256colors.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Acomputercompressionformatforpictures.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Imagefileformat
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PNG
PortableDocumentFormat
Portablenetworkgraphic
SearchableTIFF
Single-pageTIFF
TIFF
Gigabyte
See: GB(Gigabyte)
Gigahertz
See: GHz(Gigahertz)
GIGO
See: GarbageIn,GarbageOut(GIGO)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 137
GlobalAerospace
GlobalAerospaceInc.v.LandowAviation,Consol.CaseNo.CL61040,2012WL1431215(Va.Cir.Ct.Apr.23,2012).ThefirstStateCourtOrderapprovingtheuseofPredictiveCodingbytheproducingparty,overtheobjectionoftherequestingparty,withoutprejudicetotherequestingpartyraisinganissuewiththeCourtastothecompletenessorthecontentsoftheproduction,ortheongoinguseofPredictiveCoding.TheorderwasissuedbyLoudounCountyCircuitCourtJudgeJamesH.Chamblin.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
GlobalDeduplication
DeduplicationofDocumentsacrossmultiplecustodians.AlsoreferredtoasHorizontalDeduplication.(Cf.VerticalDeduplication.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
De-duplication
Duplicate
Dynamicde-duplication
HorizontalDeduplication
Productionde-duplication
VerticalDeduplication
GoldStandard
ThebestavailabledeterminationoftheRelevanceorNon-Relevanceofall(orasample)ofaDocumentPopulation,usedasbenchmarktoevaluatetheeffectivenessofasearchandrevieweffort.AlsoreferredtoasGroundTruth.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Goodhart’sLaw
Anobservationmadein1975byCharlesGoodhart,ChiefAdvisertotheBankofEngland,thatstatisticaleconomicindicators,whenusedforregulation,becomeunreliable.Restatedandgeneralizedin1997byUniversityofCambridgeProfessorMarilynStrathernas“Whenameasurebecomesatarget,itceasestobeagoodmeasure.”WithinthecontextofElectronicDiscovery,Goodhart’sLawsuggeststhatthevalueofInformationRetrievalmeasuressuchasRecallandPrecisionmaybecompromisediftheyareprescribedasthedefinitionofthereasonablenessofasearchorrevieweffort.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 138
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
GraphicInterchangeFile
See: GIF(GraphicInterchangeFile)
GraphicalUserInterface(GUI)
AbbreviatedGUI(pronouncedGOO-ee).Aprograminterfacethattakesadvantageofthecomputer'sgraphicscapabilitiestomaketheprogrameasiertouse.Well-designedgraphicaluserinterfacescanfreetheuserfromlearningcomplexcommandlanguages.
Source: http://www.webopedia.com/TERM/G/Graphical_User_Interface_GUI.html
Elementsincludesuchthingsaswindows,icons,buttons,cursors,andscrollbars.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Softwareprogramsthatusespecialiconsandothersymbolstoassistinperformingfunctions,decreaserelianceonkeyboardskills,andreducetrainingtime.ThetwomostprominentexamplesaretheAppleinterfaceandMicrosoftWindows.Pronounced"gooey."
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
GraphicalUserInterface,or"gooey".Presentinganinterfacetothecomputerusercomprisedofpicturesandicons,ratherthanwordsandnumbers.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Asetofscreenpresentationsandmetaphorsthatutilizegraphicelementssuchasiconsinanattempttomakeanoperatingsystemeasiertouse.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
GraphicsBoard
Aboardthatallowsthescreentodisplaygraphicimages.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Grayscale
Animagetypethatusesblack,white,andarangesofshadesofgray.Thenumberofshadesofgraydependsonthenumberofbitsperpixel.Thelargerthenumberofshadesofgray,thebettertheimagewilllook,andthelargerthefilewillbe.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 139
Theuseofmanyshadesofgraytorepresentanimage.Continuous-toneimages,suchasblack-and-whitephotographs,useanalmostunlimitednumberofshadesofgray.Conventionalcomputerhardwareandsoftware,however,canonlyrepresentalimitednumberofshadesofgray(typically16or256).Gray-scalingistheprocessofconvertingacontinuous-toneimagetoanimagethatacomputercanmanipulate.Thebinaryrangeofagraphicrepresentationbetweenpureblackandpurewhite.Ascaleof256shadesofgraywillbeabetterrepresentationthan16shades.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Groupware
Softwaredesignedtopromoteactionamongmembersofspecificgroupswithinanorganization.Thebest-knowngroupwareisLotusNotes.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Softwaredesignedtooperateonanetworkandallowseveralpeopletoworktogetheronthesamedocumentsandfiles.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
GUI
See: GraphicalUserInterface(GUI)
H
Hadoop
Asoftwarelibrarythatprovidesaframeworkfordistributedprocessingoflargedatasets.Hadoopallowscomplexprocessestobebrokendownintobasiccomputingtasks,whichcanbedistributedamongapotentiallylargenumberofindividualcomputers.Theresultsofthesedistributedcomputationscanthenbemergedtoobtainthefinalproduct.
Source: HerbRoitblat,Search2020:TheGlossary.
HalfDuplex
Transmissionsystemswhichcansendandreceive,butnotatthesametime.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Halftone
Thegraphicrepresentationofanobjectbydots,whichsimulatecontinuoustones.Usuallyusedtorepresentorreplicateanoriginalphotographinput.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 140
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HalftoneDot
Varyinsize;largerappeardarker,smallerappearlighter.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HardDisk
Internalhardwarewhichstoresandprovidesaccesstolargeamountsofinformation.Mostnewcomputersincludeaninternalharddiskthatcontainsseveralgigabytesofstoragecapacity.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Ahigh-capacitymagneticmediastoragedevice,alsoknownasthe“fixeddisk.”Harddisksareeitherinternalorexternal.Aninternalharddiskcanbeusedonlywiththecomputerinwhichitisinstalled,whileanexternalharddiskcanbemovedfromonecomputertoanother.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
HardDrive
Theprimarycomputerstoragemediumindesktopandlaptopcomputers.
Source: RenewData,Glossary(10/5/2005).
Theprimaryhardwarethatacomputerusestostoreinformation,typicallymagnetizedmediaonrotatingdiscs.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 141
TheprimarystorageunitonPCs,consistingofoneormoremagneticmediaplattersonwhichdigitaldatacanbewrittenanderasedmagnetically.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Amagneticstoragedeviceusuallyinsideacomputerthatstoresfiles.Harddrivecapacitiesareusuallymeasuredingigabytes(GB).Inadditiontoharddrives,computersoftencontainCDdrivesandfloppydiskdrives.Whenyousaveafile,itisusuallystoredonthecomputer’sharddrive.
Ahigh-capacitymagneticmediastoragedevice,alsoknownasthe“fixeddisk.”Harddisksareeitherinternalorexternal.Aninternalharddiskcanbeusedonlywiththecomputerinwhichitisinstalled,whileanexternalharddiskcanbemovedfromonecomputertoanother.
Internalhardwarewhichstoresandprovidesaccesstolargeamountsofinformation.Mostnewcomputersincludeaninternalharddiskthatcontainsseveralgigabytesofstoragecapacity.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
Hardcopy
Thepaperversionofadocument.
Hardware
Allthemechanicalandelectricalpartsofacomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
HardwareKey
Externalsecurityusedwithsomesoftware.Withoutthiskey,thesoftwarewillnotfunction.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 142
HarmonicMean
Thereciprocaloftheaverageofthereciprocalsoftwoormorequantities.Ifthequantitiesare
namedaandb,theirHarmonicMeanis .InInformationRetrieval,F1istheHarmonicMeanofRecallandPrecision.TheHarmonicMean,unlikethemorecommonarithmeticmean(i.e.,average),fallsclosertothelowerofthetwoquantities.Asasummarymeasure,aHarmonicMeanmaybepreferabletoanarithmeticmeanbecauseahighHarmonicMeandependsonbothhighRecallandhighPrecision,whereasahigharithmeticmeancanbeachievedwithhighRecallattheexpenseoflowPrecision,orhighPrecisionattheexpenseoflowRecall.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Hash
Analgorithmthatcreatesavaluetoverifyduplicateelectronicdocuments.Ahashmarkservesasadigitalthumbprint.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Hashvalue Hashing/Hash/HashValue
MD5
SHA-1
HashValue
Aunique,identifyingnumberofafilecalculatedbyahashalgorithm,e.g.MD5.
Source: IbisConsulting,Glossary.
Acomputednumericalvaluethatrepresentsa“digest”ofthecontentofafile.Ifandonlyiftwodocumentsareidenticaltotheletterwilltheyreturnthesamehashvalue.TheHashvalueisusedaspartofadigitalsignatureandtocomparedocumentcontentinthede-dupingprocess.
Seealso:
Hash Hashing/Hash/HashValue
MD5
SHA-1
Hashing/Hash/HashValue
AstatisticalmethodusedtoreducethecontentsofaDocumenttoasingle,fixed-size,alphanumericvalue,whichis,forallintentsandpurposes,uniquetoaparticularDocument;thesingle,fixed-sizealphanumericvalueresultingfromHashingaparticularDocument.Common
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 143
HashingAlgorithmsinclude,butarenotlimitedto,MD5,SHA-1,andSHA-2.HashingandHashValuesaretypicallyusedforDocumentidentification,Deduplication,orensuringthatDocumentshavenotbeenaltered.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
Hash
Hashvalue
MD5
SHA-1
HD(HighDensity)
Highdensityfloppydisks;a5.25"holds1.2MBanda3.5"holds1.4MB.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Help
Instructionsthatassistauseronhowtosetupanduseaproductincludingbutnotlimitedtosoftware,manualsandinstructionfiles.
Source: RSI,Glossary.
Hertz(Hz)
Cyclespersecond.Oftenusedwithmetricprefixes,asinkiloHertz(kHz).
Source: RSI,Glossary.
Seealso:
KHz MHz GHz
Heuristic
Ageneralpracticalapproachtosolvingaproblemthatisusefultoaddresstheproblem,butwhoseresultisnotguaranteed.Examplesofheuristicsincludementalshortcuts,rulesofthumb,andgeneralstrategies.Unlikeanalgorithm,aheuristicisnotguaranteedtoproduceaspecificresult.
Source: HerbRoitblat,Search2020:TheGlossary.
Hexadecimal
Anumbersystemwithabaseof16(24),4bits.Thepositiondigitsare0-9,A-F,whereFequalsthedecimalvalue,15.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 144
HierarchicalStorageManagement(HSM)
Softwarethatautomaticallymigratesfilesfromon-linetonear-linestoragemedia,usuallyonthebasisoftheageorfrequencyofuseofthefiles.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HighDensity
See: HD(HighDensity)
Hit
Atermtodescribetheresultsofasearchquery.Asearchforaspecificnamemayproducetwenty“hits,”whichmeansthenameappearstwentytimesinthedatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Hit/HitLevelResults
Toarrangeordesignateaccordingtocategorizationsuchaspotentiallyresponsiveorprivilegedversusnon-responsiveornot-privileged.
Source: EDRMSearchGlossary.
Hold
See: LegalHold
Holorith
Encodeddataonaperturecardsorold-stylepunchcardsthatcontainedencodeddata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HorizontalDeduplication
SeeGlobalDeduplication.(Cf.VerticalDeduplication.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
De-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
Productionde-duplication
VerticalDeduplication
©2016EDRMLLC
Host
Computeronwhichanapplicationordatabaseresides.
Source: RSI,Glossary.
Inanetwork,thecentralcomputerwhichcontrolstheremotecomputersandholdsthecentraldatabases.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Hosting
Providinganapplicationon-line(seeApplicationServiceProvider)foroneormoreclients,usuallyhousedinadatacenter.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HotKey
Anindividualkeythatisprogrammedtoperformaspecificfunction.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
HP-PCL
AHewlett-Packardgraphicsfileformat.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HPGL
AHewlett-Packardgraphicsfileformat.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HSM
See: HierarchicalStorageManagement(HSM)
HTML(HypertextMarkupLanguage)
AsetofcodesinsertedintoafileordocumentthatisintendedtodisplaythroughaWebbrowser.HTMLtellsthebrowserhowtodisplayadocument’swordsandimagesasaWebpage.Eachmarkupsymbolisreferredtoasanelementoratag.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 146
Alanguagethatusestagstostructuretextintoheadings,paragraphs,listsandlinks.IttellsaWebbrowserhowtodisplaytextandimages.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
TheunderlyingprogramstructureoftextontheWorldWideWeb.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
DevelopedbyCERNofGeneva,Switzerland.ThedocumentstandardofchoiceofInternet.(HTML+addssupportformulti-media.)Useininternetapplication.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Thetag-basedASCIIlanguageusedtocreatepagesontheWeb.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Thestandardcodeusedtotellabrowserhowawebpageshouldbedisplayed.
Seealso:
Java
JavaScript
SGML
SGML/HyTime
XML
Hub
Acentralunitthatrepeatsand/oramplifiesdatasignalsbeingsentacrossanetwork.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
HypertextMarkupLanguage
See: HTML(HypertextMarkupLanguage)
Hz
See: Hertz(Hz)
I
IBMWatsonProject
ApioneeringprojectfromIBMthatimplementswhattheycallcognitivecomputing.Watsonwasoriginallybuilttodemonstrateitsadvancednaturallanguageprocessing,knowledgerepresentation,machinelearning,andinformationretrievalprocessesinthecontextofthe
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 147
gameJeopardy.Itsspeedanextensiveknowledgebase,alongwithitsautomatedreasoningallowedittowinthecompetitionagainsthumanJeopardyplayers.Thesamecomputationalframeworkhassincebeenappliedonacommercialbasistootheropen-endedquestionansweringtasks,includingmedicaldiagnosis,andrecipeconstruction.
Source: HerbRoitblat,Search2020:TheGlossary.
I/O(Input/Output)
Thetransferofinformationinandoutofacomputer’smemory.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Icon
Agraphicimageorpictureofaprogramortaskdesignedtorepresentthatprogramortask.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
InaGUI,apictureordrawingwhichisactivatedby"clicking"amousetocommandthecomputerprogramtoperformapredefinedseriesofevents.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ICR(IntelligentCharacterRecognition)
Theconversionofscannedimages(barcodesorpatternsofbits)tocomputerrecognizablecodes(ASCIIcharactersandfiles)bymeansofsoftware/programswhichdefinetherulesofandalgorithmsforconversion.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
DirtyOCR
OCR
OpticalCharacterRecognition
Patternrecognition
IDE(IntegratedDriveElectronics)
AnengineeringstandardforinterfacingPC'sandharddisks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 148
IEEE(InstituteofElectricalandElectronicEngineers)
Aninternationalassociationwhichsponsorsmeetings,publishesanumberofjournalsandestablishesstandards.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
IM(InstantMessaging)
InstantMessagingisaformofelectroniccommunicationwhichinvolvesimmediatecorrespondencebetweentwoormoreuserswhoareallonlinesimultaneously.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Electroniccommunicationallowingforinstantcorrespondencebetweentwoormoreuserswhoareonlineatthesametime.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Image
Abit-by-bitduplicateofabackuptapeorharddrivethatisforensicallysound.Alsoknownasanimagecopy,aforensiccopyoramirrorimage.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Asdistinctfromdocumentimaging,electronicevidenceismakinganidenticalcopyofaharddrive.Alsoknownasa“mirrorimage”or“mirroring”.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anelectronic“picture”ofhowthedocumentwouldlookifprinted.Imagescanbestoredinvariousfileformats,themostcommonofwhichareTIFFandPDF.
Seealso:
Bitstreamcopy
Forensiccopy
Imagedcopy
Mirrorimage
ImageCompressionBoard
Animaging-dedicatedprocessor.RelievestheCPU(CentralProcessorUnit-thecomputer'smainchip)frommanyimaging-specifictasks-compression,decompression,display,zooming,shrinking,scale-to-gray.Infact,doesthembetterthantheCPU.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 149
ImageEnable
Asoftwarefunctionthatcreateslinksbetweenexistingapplicationsandstoredimages.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ImageFileFormat
Whenapageisscanned,thepagecanbestoredinanumberoffiletypes.Thetypeshouldbechosenbasedonthedesireduseoftheimage,andthesoftwarethatwillbeused.Differentfileformatscommonlyusedifferentmethodsofcompressionaswell,andsometypesofimagescompressbetterusingsomeformatsratherthanothers.
Source: RSI,Glossary.
Seealso:
GIF
GraphicInterchangeFile
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PNG
PortableDocumentFormat
Portablenetworkgraphic
SearchableTIFF
Single-pageTIFF
TIFF
ImageKey
Thenameofafilecreatedwhenapageisscannedinacollection.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ImageProcessing
Tocaptureanimageorrepresentation,enterinacomputerandprocessandmanipulateit.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Thinkof"dataprocessing":itreferstothemanipulationofrawdatatosolvesomeproblemorenlightentheuserinsomewaynotpossiblewithoutmanipulation.
Source: RSI,Glossary.
ImageProcessingCard(IPC)
Aboardmountedineitherthecomputer,scannerorprinterthatfacilitatestheacquisitionanddisplayofimages.TheprimaryfunctionofmostIPCsistherapidcompressionanddecompressionofimagefiles.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 150
ImageResolution
Thefinenessorcoarsenessofanimageasitwasdigitized,measuredasdotsorpixelsperinch.Allotherthingsbeingequal,thehighertheresolution,thebetteristheimage.Resolutionsof200dotsperinch(dpi).Thehighertheresolution,thegreatertheamountofdetailthatcanbeshown.
ImagedCopy
A"mirrorimage"bit-by-bitcopyofaharddrive,i.e.acompletereplicationofthephysicaldriveregardlessofhowthedriveisorganizedorwhethertheimagecreatedcontainsmeaningfuldatainwholeorinpart.Fromanimagedcopyofaharddriveitispossibletoreconstructtheentirecontentsandorganizationofthesourcedrivefromwhichitwastaken.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html↩
Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Bitstreamcopy
Forensiccopy
Image
Mirrorimage
Imaging
Theprocessofscanningpicturesordocumentsintoacomputerinordertobettermanagedocuments.
Source: RSI,Glossary.
Theprocessoftakinganelectronic“picture”ofadocumentandstoringitonadiskforlaterretrieval.Thestoredimagescannotbesearched,sotheyaretypicallylinkedtorecordsinadatabaseandretrievedwhentheassociatedrecordislocatedthroughadatabasesearch.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
TheprocessofcreatingaTIFForPDFimageofdocuments.Animageiscreatedthatrepresentshowthepagewouldlookifthedocumentwereprintedtopaper.
InRe:Actos
InRe:Actos(Pioglitazone)ProductsLiabilityLitigation,MDLNo.6:11-md-2299(W.D.La.July27,2012).AproductliabilityactionwithaCaseManagementOrder(“CMO”)thatmemorializestheparties’agreementona“searchmethodologyproofofconcepttoevaluatethepotentialutilityofadvancedanalyticsasaDocumentidentificationmechanismforthereviewandproduction”ofElectronicallyStoredInformation.ThesearchprotocolprovidesfortheuseofaTechnology-
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 151
AssistedReviewtoolontheemailoffourkeycustodians.TheCMOwassignedbyDistrictJudgeRebeccaF.Doherty.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
InaccessibleData
Or"relativelyinaccessibledata."Incontrastwithactivedata,datathathastoundergoarestorationprocessinordertobedisplayedoncomputerscreen.Thetwosubsets,inorderfrommoreaccessibletolessaccessible,are“backuptapes”and“erased,fragmentedordamageddata."Accordingtoanewlineofcaselaw,relativelyinaccessibleelectronicdataisthecategoryastowhichacourtshouldconsidercost-shifting.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingAppliedDiscovery'sGlossary,http://www.nysd.uscourts.gov/courtweb/pdf/D02NYSC/03-04265.PDF#page=24
InactiveRecord
InactiverecordsarethoseRecordsrelatedtoclosed,completed,orconcludedactivities.InactiveRecordsarenolongerroutinelyreferenced,butmustberetainedinordertofulfillreportingrequirementsorforpurposesofauditoranalysis.Inactiverecordsgenerallyresideinalong-termstorageformatremainingaccessibleforpurposesofbusinessprocessingonlywithrestrictionsonalteration.Insomebusinesscircumstances,inactiverecordsmaybereactivated.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Index
Alistofallwordsinadatabase(codedorfulltext)thatisusedbythesoftwaretoprovidefastaccesstoinformation.Ratherthansearchtheentiredatabaseforawordorphrasewhenaqueryisbuilt,thesoftwaresearchestheindexinstead.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Outputfromadatabasesuchasanindextoexhibitsordocumentsresponsivetoadiscoveryrequest.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
(n.)Indatabasedesign,alistofkeys(orkeywords),eachofwhichidentifiesauniquerecord.Indicesmakeitfastertofindspecificrecordsandtosortrecordsbytheindexfield--thatis,thefieldusedtoidentifyeachrecord.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 152
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingWebopediaComputerDictionary,http://www.pcwebopedia.com/TERM/I/FTP.html.
(v.)Tocreateanindexforadatabase,ortofindrecordsusinganindex.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingWebopediaComputerDictionary,http://www.pcwebopedia.com/TERM/I/FTP.html.
Creatingasetofrulesanddatafileswhichdefinescanneddocumentsetsandalloweasyandcompleteretrieval.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AlistofKeywordsinwhicheachKeywordisaccompaniedbyalistoftheDocuments(andsometimesthepositionswithintheDocuments)whereitoccurs.Manualindiceshavebeenusedinbooksforcenturies;automaticindicesareusedinInformationRetrievalsystemstoidentifyDocumentsthatcontainparticularSearchTerms.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Index/CodingField
Adatabasefieldusedtocategorizeandorganizedocuments.Oftenuser-defined,thesefieldscanbeusedforsearches.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
©2016EDRMLLC
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Indexing
Indexing(whichissometimesinterchangedwiththeterm“coding”)referstotheinformationthatisaddedtoanimagetoallowittobefoundafteritisscanned.Objectiveindexingorcodingisoneoftwotypesofindexesusedinimaging.Atemplate,somethinglikeanindexcard,isattachedtotheimageinthecomputerandpertinentinformationistypedintothetemplate,taggingthedocumentforretrievalpurposes.Authorofthedocument,boxnumber,date,subjectandtypeofdocumentareallcommonindexfields.
Source: RSI,Glossary.
Universaltermforcodinganddataentry.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ThemanualorautomaticprocessofcreatinganIndex.InElectronicDiscovery,IndexingtypicallyreferstotheautomaticconstrictionofanelectronicIndexforuseinanInformationRetrievalSystem.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
BibliographicCoding
Coding
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
©2016EDRMLLC
IndustrialInternet
Theintegrationofphysicalmachinery(suchasgenerators,refrigerators,industrialequipment)withnetworkedsensors,software,andbigdatarepositories.Thegoaloftheindustrialinternetistointegrateobjectswithpeople,processes,anddata.Theindustrialinternetjoinstheinternetofthings,machine-to-machinecommunicationandothersourcestoingestdatafromdevices,analyzeit,anduseittoguidethefurtheroperationofthesesystems.
Source: HerbRoitblat,Search2020:TheGlossary.
IndustryStandardArchitecture(ISA)
ISA(IndustryStandardArchitecture)isastandardbus(computerinterconnection)architecturethatisassociatedwiththeIBMATmotherboard.Itallows16bitsatatimetoflowbetweenthemotherboardcircuitryandanexpansionslotcardanditsassociateddevice(s).
Source: TechTarget,ISA(IndustryStandardArchitecture)definition,http://searchwindowsserver.techtarget.com/definition/ISA-Industry-Standard-Architecture
InformationGovernance
Thespecificationofdecisionrightsandanaccountabilityframeworktoencouragedesirablebehaviorinthevaluation,creation,storage,use,archivalanddeletionofinformation.Itincludestheprocesses,roles,standardsandmetricsthatensuretheeffectiveandefficientuseofinformationinenablinganorganizationtoachieveitsgoals(asdefinedbyGartner).
Source: IGRMWhitePaper
InformationGovernanceReferenceModel(IGRM)
Aframeworkandresponsibilitymodelforcross-functionalandexecutivedialoguethatservesasacatalystfordefiningaunifiedgovernanceapproachtoinformationbylinkingbusinessvalueandlegaldutiestotheinformationassets.
Source: IGRMWhitePaper
InformationNeed
InInformationRetrieval,theinformationbeingsoughtinasearchorrevieweffort.InE-Discovery,theInformationNeedistypicallytoidentifyDocumentsresponsivetoarequestforproduction,ortoidentifyDocumentsthataresubjecttoprivilegeorwork-productprotection.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 155
InformationRetrieval
ThescienceofhowtofindinformationtomeetanInformationNeed.WhilemodernInformationRetrievalreliesheavilyoncomputers,thedisciplinepredatestheinventionofcomputers.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Theprocessofidentifyingdocumentsorotherrecordsinacorpusthatarerelevanttotheuser’sinterestorinformationneed.Informationretrievalisatermofartincomputersciencethatistypicallybroaderthansearch(ahypernym),butisalsofrequentlyusedasasynonymforsearch.
Source: HerbRoitblat,Search2020:TheGlossary.
Input
Thetransferofdatafromkeyboardorexternalstoragedevicetocomputermemory.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
InputDevice
Anyobjectwhichallowsausertocommunicatewithacomputerbyenteringinformationorissuingcommands(e.g.keyboard,mouseorjoystick).
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Input/Output
See: I/O(Input/Output)
InstantMessaging
See: IM(InstantMessaging)
InstituteofElectricalandElectronicEngineers
See: IEEE(InstituteofElectricalandElectronicEngineers)
IntegratedDriveElectronics
See: IDE(IntegratedDriveElectronics)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 156
IntegratedServicesDigitalNetwork(ISDN)
Analldigitalnetworkwhichcancarrydata,videoandvoice.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Integration
Theabilityoftwosystems,whetherhardwareorsoftware,tointerfacewithoneanother.Integratedsystemsareoftendesignedtosharedatainawayspecificallyintendedtoreduceredundantdataentry.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
IntelligentCharacterRecognition
See: ICR(IntelligentCharacterRecognition)
Interface
Amechanicalorelectricallinkconnectingtwoormorepiecesofequipmenttogether.
Source: RSI,Glossary.
Apointofdemarcationbetweentwodeviceswheretheelectricalsignals,connectors,timingandhandshakingaredefined.
Source: RSI,Glossary.
Aconnectionbetweenanytwoelementsinacomputersystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Interlaced
TV&CRTpicturesmustconstantlybe"refreshed".Interlaceistorefresheveryotherlineonce/refreshcycle.Sinceonlyhalftheinformationdisplayedisupdatedeachcycle,interlaceddisplaysarelessexpensivethan"non-interlaced."However,interlaceddisplaysaresubjecttojitters.Thehumaneye/braincanusuallydetectdisplayedimageswhicharecompletelyrefreshedatlessthan30timespersecond.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
InternalInquiry
Acloseexaminationofamatterinasearchforinformationortruththatisinternaltoacompany.
Source: RenewData,Glossary(10/5/2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 157
InternalResponseCurve
FromSignalDetectionTheory,atoolforestimatingthenumberofRelevantandNon-RelevantDocumentsinaPopulation,orthenumberofDocumentsthatfallaboveandbelowaparticularCutoff.TheuseofInternalResponseCurvesforthispurposeassumesthatthescoresyieldedbyaMachineLearningAlgorithmforRelevantDocumentsobeyaGaussianDistribution,andthescoresforNon-RelevantdocumentsobeyadifferentGaussianDistribution.ThesedistributionsarethenusedtopredictthenumberofRelevantandNon-RelevantDocumentsinanygivenrangeofscores.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
InternationalStandardsOrganization(ISO)
ISOisanindependent,non-governmentalinternationalorganizationwithamembershipof161nationalstandardsbodies.Throughitsmembers,itbringstogetherexpertstoshareknowledgeanddevelopvoluntary,consensus-based,marketrelevantInternationalStandardsthatsupportinnovationandprovidesolutionstoglobalchallenges.
Source: AboutISO,http://www.iso.org/iso/home/about.htm
Internet
Theworld-widecollectionofinter-connectednetworksthatallusetheTCP/IPprotocolsandthatevolvedfromtheARPANETofthelate1960’sandearly1970’s.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AworldwidecomputernetworkcontainingabroadarrayofservicesandinformationavailabletoanyindividualwithaPCandthepaidconnection.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Theinterconnectingglobalpublicnetworkmadebyconnectingsmallersharedpublicnetworks.Themostwell-knownInternetistheInternet,theworldwidenetworkofnetworkswhichusetheTCP/IPprotocoltofacilitateinformationexchange.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Intranet Extranet
InternetofEverything
See: InternetofThings(IoT)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 158
InternetofThings(IoT)
Theemergingideathatevery-daythingslikerefrigerators,garagedooropeners,andthermostatscanandshouldbeconnectedtotheinternetwheretheycancommunicatewithoneanother,beremotelycontrolled,anddoothertaskssuchasorderingicecreamwhenthecurrentsupplyinthefreezerisnearlygone.Inordertobeconnectedusefullytotheinternet,thingsneedsensors,networkconnectivity,software,andprocessingcapabilitytocollectandexchangedata.
Source: HerbRoitblat,Search2020:TheGlossary.
InternetProtocolAddress(IPAddress)
Asetofnumberswhichuniquelyidentifiesanaddressonanetwork.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Theinternationallyrecognizedlocationofaspecificcomputerorserver;usedforinternet.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AstringoffournumbersseparatedbyperiodsusedtorepresentacomputerontheInternet.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
InternetPublishing
SpecializedimagingsoftwarethatallowslargevolumesofpaperdocumentstobepublishedontheInternetorintranet.Thesefilescanbemadeavailabletootherdepartments,offsitecolleaguesorthepublicforsearching,viewingandprinting.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
InternetServiceProvider(ISP)
Acompanythatprovidesaccesstotheinternetthroughitsownequipmenttousersandchargesamonthlyorhourlyrateforprovidingthatservice.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AbusinessthatdeliversaccesstotheInternet.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
InternetworkPacketExchange(IPX)
AcommunicationsprotocolusedbyNovellnetworks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 159
Externallinks:
WebopediaComputerDictionary,http://www.webopedia.com/TERM/I/IPX.html
Interpolated/Uninterpolated
Mostscannershaveamaximumpixel-per-inchresolutionbeforetheystartguessingorinterpolatingthedata.Interpolatedfilesrequirethecomputertosimulatedatainanimagefile,whileuninterpolatedfilesholdonlydatathatisaccuratetotheoriginal.Uninterpolatedresolutionis,therefore,preferredforaccuratescanning.
Source: RSI,Glossary.
Interrogatory
Inacivilaction,aninterrogatoryisalistofquestionsonepartysendstoanotheraspartofthediscoveryprocess.Therecipientmustanswerthequestionsunderoathandaccordingtothecase'sschedule.Becauseattorneysmayhelptheirclientsanswerinterrogatories,interrogatoryresponsestendtobemorefinelycraftedthananswerstodepositionquestions.Thenumberofquestionsincludedinaninterrogatoryisusuallylimitedbycourtrule.Forexample,undertheFederalRulesofCivilProcedure,eachpartymayonlyaskeachotherparty25questionsviainterrogatoryunlessthecourtgivespermissiontoaskmore.SeeRule33.
Source: LegalInformationInstitute,Interrogatory,https://www.law.cornell.edu/wex/interrogatory
Seealso:
Discoveryrequest Documentrequest Requestforadmission
Intranet
AprivateorinternalnetworkthatusesstandardinternetprotocolssoithastheappearanceofaWebsite.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AnetworkofinterconnectingsmallerprivatenetworksthatareisolatedfromthepublicInternet.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Acomputernetworkusuallythatusuallyrestrictsaccessonlytothosewithinthefirmorcorporation.Thetermisoftenusedtodiscussdocumentsstoredaswebpages,butitcanbeextendedtoanykindofcomputerresource.Aninternal(tothecompany)versionoftheinternet.
Seealso:
Internet Extranet
©2016EDRMLLC
InvertedIndex
Anindexthatmapsakeywordtothelistofdocumentsthatcontainthekeyword.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
Investigation
Aninquiryusuallyinitiatedbyagovernmentalagency.
Source: RenewData,Glossary(10/5/2005).
IoT
See: InternetofThings(IoT)
IPAddress
See: InternetProtocolAddress(IPAddress)
IPC
See: ImageProcessingCard(IPC)
IPv6
ThelatestversionoftheInternetProtocol,version6(IPv6),whichprovidesanidentificationandlocationsystemforcomputersandotherdevicesonnetworks.IPv4representedtheselocationsasastringoffournumbers:000.000.000.000,whereeachnumbercouldbebetween0and256.IPv6insteadcreatesaddressesoutoflongerdigitstringsrepresentedaseightgroupsoffourdigits(0to65535)writteninhexadecimal(base16),withacolonbetweengroups,forexample2001:0db8:0000:0042:0000:8a2e:0370:7334.ManymorethingscanberepresentedusingIPv6(2128or3.4×1038)thanwerepossibleunderIPv4(232,approximately4.3billionaddresses).IPv4allowedonly4.3billiondevicestobeconnecteddirectlytotheInternet,butmanytimesthatwouldberequiredwhentheInternetofThingsisfullyimplemented,forexample.
Source: HerbRoitblat,Search2020:TheGlossary.
IPX
See: InternetworkPacketExchange(IPX)
ISA
See: IndustryStandardArchitecture(ISA)
ISDN
See: IntegratedServicesDigitalNetwork(ISDN)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 161
ISISScannerDriver
Aspecializedapplicationusedforcommunicationbetweenscannersandcomputers.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ISO
See: InternationalStandardsOrganization(ISO)
ISO9660CDFormat
TheInternationalStandardsOrganizationformatforcreatingCD-ROMsthatcanbereadworldwide.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ISP
See: InternetServiceProvider(ISP)
IssueCode
Termforacodeusedtodesignateacase-specificissue.Issuecodesareusedtomaintainconsistency,eliminatespellingerrors,andspeedupsearchqueries.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Issuecodesareusedtoclassifydocumentsasbeingrelevanttooneormoreofthecaseissues.Caseissuesarethethingsaboutthecasethatneedtoprovedduringthelitigation.
Seealso:
BibliographicCoding
Coding
Indexing
Issuecoding
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
IssueCode(s)/IssueCoding
OneormoresubcategoriesoftheoverallInformationNeedtobeidentifiedinasearchorrevieweffort;theactofgeneratingsuchsubcategoriesoftheoverallInformationNeed.Examplesincludespecificationofthereason(s)foradeterminationofRelevanceorNon-Relevance,Codingofparticularsubcategoriesofinterest,andCodingofprivileged,confidential,orsignificant(“hot”)Documents.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 162
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
IssueCoding
Aprocesswherecontentisevaluatedtodeterminewhetheritrelatestotopicsofinterestinalawsuitorsimilarproceedingandthentheresultsoftheevaluationarelogged.
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
IStorage
ThestandardforstoringadditionalfileinformationinMSOfficefiles.
Source: IbisConsulting,Glossary.
IterativeSearch
Formalsearchthatincludesexecuting,tracking.reportingandmeasureimpact,anditeratethroughsetsofmultiplelogicalqueries.Seealso,FormalSearch.
Source: EDRMSearchGlossary.
IterativeTraining
TheprocessofrepeatedlyaugmentingtheTrainingSetwithadditionalexamplesofCodedDocumentsuntiltheeffectivenessoftheMachineLearningAlgorithmreachesanacceptablelevel.TheadditionalexamplesmaybeidentifiedthroughJudgmentalSampling,RandomSampling,orbytheMachineLearningAlgorithm,asinActiveLearning.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
J
JaccardIndex
Ameasureoftheconsistencybetweentwosets(e.g.,DocumentsCodedasRelevantbytwodifferentreviewers).Definedmathematicallyasthesizeoftheintersectionofthetwosets,dividedbythesizeoftheunion(e.g.,thenumberofDocumentscodedasRelevantbybothreviewers,dividedbythenumberofDocumentsidentifiedasRelevantbyoneortheother,orbothreviewers).Itistypicallyusedasameasureofconsistencyamongreviewefforts,butalso
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 163
maybeusedasameasureofsimilaritybetweentwoDocumentsrepresentedastwoBagofWords.JaccardIndexisalsoreferredtoasOverlaporMutualF1.EmpiricalstudieshaveshownthatexpertreviewerscommonlyachieveJaccardIndexscoresofabout50%,andthatscoresexceeding60%arerare.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Ameasureofagreementorefficacy.TheJaccardindexcomparesthenumberofdocumentsselectedasresponsivebybothassessorsdividedbythenumberofdocumentsthatareselectedasresponsivebyeitherassessor.IfassessorAidentifies20documentsasresponsiveandassessorBidentifies25documentsasresponsive,andtheyagreeontheiridentificationof10documentsasresponsive,thenthenumeratorwouldbe10andthedenominatorwould20+25–10,or10/35or28.6%.
Source: HerbRoitblat,PredictiveCodingGlossary.
JASISTStudy
A2009study(HerbertL.Roitblat,AnneKershaw&PatrickOot,DocumentCategorizationinLegalElectronicDiscovery:ComputerClassificationvs.ManualReview,61J.AM.SOC’Y.FORINFO.SCI.&TECH.70(2010)),showingthatthePositiveAgreementbetweeneachoftwoTechnology-AssistedReviewmethods,andapriorproductiontotheDepartmentofJustice,exceededthePositiveAgreementbetweeneachoftwoManualReviewprocessesandthesameproduction.AlsoreferredtoastheEDIStudy.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Java
Javaisacomputerprogramminglanguagethatisdesignedforuseinthemultiple-computerenvironmentoftheinternet.Itcanbeusedtomakesingle-machineapplicationsorbedistributedamongmanycomputersonanetwork.Inaddition,JavacanbeusedtocreateappletsforusewithinaWebpage,whichallowsuserstointeractdirectlywiththepage.BecauseJavarequiresnooperatingsystemspecificextensions,Javaappletsrunonmostoperatingsystems.JavaisnotthesameasJavaScript,aNetscapecreationthatiseasiertolearnthanJava,butlackssomeofthespeedandportabilityofJava.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
HTML
JavaScript
SGML
SGML/HyTime
XML
©2016EDRMLLC
JavaScript
WhenreferringtoacomputerprocessororCPU,GHzisaclockfrequency,alsoknownasaclockrateorclockspeed,representingacycleoftime.AnoscillatorcircuitsuppliesasmallamountofelectricitytoacrystaleachsecondthatismeasuredinKHz,MHz,orGHz."Hz"isanabbreviationofHertz,and"K"representsKilo(thousand),"M"representsMega(million),and"G"representsGiga(thousandmillion).
Source: JavaScript,https://developer.mozilla.org/en-US/docs/Web/JavaScript
Seealso:
HTML
Java
SGML
SGML/HyTime
XML
JazDrive
AJazdriveisasmall,portableharddiskdriveusedprimarilyforbackingupandarchivingpersonalcomputerfiles.TheJazdriveissoldbyIomegaCorporation,thesamecompanythatdevelopedtheZipdrive.BoththeJazdriveandthediskscomeintwosizes,1GBand2GB.Thetwosizeslooksimilar,buta2GBdiskisnotcompatiblewitha1GBJazdrive.The2GBJazdrivecanusebothdisksizes.InternalandexternalJazdrivesareavailable.TheJazdriveusestheSmallComputerSystemInterface(SmallComputerSystemInterface)andrequiresaSCSIcontroller.
Source: TechTarget,JazDrivedefinition,http://searchstorage.techtarget.com/definition/Jaz-drive
Seealso:
Diskdrive
Floppydiskdrive
Magneto-opticaldrive
Portabledrive
Storagedevice
Tapedrive
Zipdrive
JMS(JukeboxManagementSoftware)
Themostfundamentalpurposeofjukeboxmanagementsoftwareistoprovidedriveletteraccesstoajukebox.Thisletsapplicationsandusersdirectlyaccessthejukeboxwithouthavingtousesomeexoticprogramming.Jukeboxesaretoocomplexforthestandardcomputerdesktop,somanagementsoftwaresimplifiesthemannerinwhichusersgainaccesstojukeboxesforstoringandretrievingfiles.
Source: KOMSoftware,https://www.komsoftware.com/news/news-reviews/jukebox-management-software.html
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 165
JointPhotographicExpertGroup(JPEG)
Oneformatofelectronicgraphicimagefilesupportedbytheweb.JPEGfilesendwiththesuffixjpg.OtherimageformatsincludeGraphicInterchangeFormat(GIF)andPortableNetworkGraphic(PNG).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Acompressionalgorithmforcondensingthesizeofimagefiles.JPEGsareveryhelpfulinallowingaccesstofull-screenimagefileson-linebecausetheyrequirelessstorageandthereforearequickertodownloadintoawebpage.
Source: RSI,Glossary.
Animagecompressionformatusedforstoringcolorphotographsandimages.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Animagecompressionstandardforphotographs.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Astandardalgorithmforthecompressionofdigitalimagesandthefilesthatresultfromencodinganimageusingthisalgorithm.
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Multi-pageTIFF
PNG
PortableDocumentFormat
Portablenetworkgraphic
SearchableTIFF
Single-pageTIFF
TIFF
JOLTStudy
A2011study(MauraR.Grossman&GordonV.Cormack,Technology-AssistedReviewinE-DiscoveryCanBeMoreEffectiveandMoreEfficientThanExhaustiveManualReview,XVIIRICH.J.L.&TECH.11(2011)),availableathttp://jolt.richmond.edu/v17i3/article11.pdf,thatuseddatafromTREC2009toshowthattwoTechnology-AssistedReviewprocesses(oneusingMachineLearningandoneusingaRuleBase)generallyachievedbetterRecall,betterPrecision,andgreaterefficiencythantheTRECManualReviewprocess.AlsoknownastheRichmondJournalStudy,ortheRichmondStudy.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 166
JPEG
See: JointPhotographicExpertGroup(JPEG)
JudgmentalSample/JudgmentalSampling
AmethodinwhichaSampleoftheDocumentPopulationisdrawn,basedatleastinpartonsubjectivefactors,soastoincludethe“mostinteresting”Documentsbysomecriterion;theSampleresultingfromsuchmethod.UnlikeaRandomSample,thestatisticalpropertiesofaJudgmentalSamplemaynotbeextrapolatedtotheentirePopulation.However,anindividual(suchasaQualityAssuranceauditororanadversary)mayuseJudgmentalSamplingtoattempttouncoverdefects.Thefailuretoidentifydefectsmaybetakenasevidence(albeitnotstatisticalevidence,andcertainlynotproof)oftheabsenceofdefects.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Asamplingprocesswheretheobjectsareselectedonthebasisofsomeperson’sjudgmentsabouttheirrelativeimportanceratherthanonarandombasis.Judgmentalsamplingsometimesreferstotheuseofaseedsetorpreselecteddocumentsusedtotrainpredictivecodingsystems.Unlikerandomsamples,judgmentalsamplesarenottypicallyrepresentativeofthecollectionorpopulationfromwhichtheyaredrawn.Itisnotpossibletoextrapolatefromthecharacteristicsofajudgmentalsampletothecharacteristicsofthepopulationorcollection.
Source: HerbRoitblat,PredictiveCodingGlossary.
Jukebox
Automateddiskchangerforhigh-performance,centralizedstorageformultifunctionCD-ROM's&opticaldisks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
JukeboxManagementSoftware
See: JMS(JukeboxManagementSoftware)
JumpDrive
Alsoknownaskeychaindrive,thumbdriveandUSBflashdrive.
K
KB(Kilobyte)
Onekilobyteofdataisequaltoonethousandbytes.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 167
Theequivalentof1,000(actually1,024)bytes.Indicates(1)sizeofthestorageareaonadisk,suchas32KB=32,768bytes,or(2)amountofmainmemory(RAM)inthecomputer,suchas640K=roomtostore640,000bytesofinstructions.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Onethousandbytesofdatais1Kofdata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Bit
Byte
MB-megabyte
GB-gigabyte
TB-terabyte
PB-petabyte
EB-exabyte
Kerning
Adjustingthespacingbetweentwolettersfromthe"normal"spacing.Oftendonetoenhancethequalityofthetypography–forinstanceinaheadline.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Key
Akeyisavalueappliedusinganalgorithmtoastringofunencryptedtexttoproduceencryptedtext,orviceversa.Keylengthisafactorindeterminingthestrengthoftheencryption.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
KeyField
Adatabasefieldusedfordocumentsearchesandretrieval.Synonymouswith“indexfield.”
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 168
Productionsource
Recipient
Subjectcategory
Summary
Text
Keyboard
Thedevicethatallowscommandstobetypeddirectlyintothecomputer.Similartoatypewriterkeyboardbutwithspecialfunctionkeysaddedalongthetop.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
KeychainDrive
See: JumpDrive
KeystrokeMonitoring
Aformofusersurveillanceinwhichtheactualcharacter-by-charactertraffic(thatuser'skeystrokes)aremonitored,analyzed,and/orloggedforfuturereference.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Keyword
Aspecificwordusedtosearchadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Wordsrelatedtothecaseorspecificissues,designatedbythelawfirmandgenerallyhavingtheirownfieldinthedatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Usedinbibliographicalcodingtoindicatethateachpageinacollectionmustbereviewedforcertainimportantwordsandwherevertheyoccurthedatabasemustreferencethepagewheretheyoccur.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aword(orSearchTerm)thatisusedaspartofaQueryinaKeywordSearch.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
AdHocSearch Adaptivepatternrecognition
Associativeretrieval
Booleansearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 169
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
KeywordIndex/Indexing
Indexingisaprocessthatinventoriesthetotalcontentofafileandbuildsasearchableelectronicindex.Thisindextypicallymapsfromakeywordtoallthedocumentsthatcontainthekeyword.Searchindexesservetofunctionastoolsdesignedtofacilitateandexpeditetheretrievalofinformation.Searchengineswillusebothcommonandproprietarytechnologytobuildindexesandservicesearchqueries.
Source: EDRMSearchGlossary.
AtechniquethatexaminestheESIandbuildsasearchableelectronicindex.Thisindextypicallymapsfromakeywordtoallthedocumentsthatcontainthekeyword.
Source: EDRMSearchGuideGlossary.
KeywordOccurrence
Keywordoccurrencesarethecountsofkeywordsthatappearwithintheentiresearchresults.Whenasearchqueryinvolvesmultiplekeywordsorwhenoneormoreofthequeriesproducesstemming,wildcardorfuzzy-basedvariations,acompletecountoftotaloccurrencesforeachkeywordisusefulforevaluatingthevalueofsearchingusingcertainkeywords.Insomeinstances,thekeywordcountsbothatanaggregatelevel(totaledoverallthevariations)aswellascountsbasedonanindividualvariationlevelwouldeachbehelpful.
Source: EDRMSearchGuideGlossary.
KeywordSearch
Acommonsearchtechniquethatusesquerywords(“keywords”)andlooksfortheminESI,usinganindex.Akeywordsearchisabasicsearchtechniquethatinvolvessearchingforoneormorewordswithinacollectionofdocumentsandreturnsonlythosedocumentsthatcontainthesearchtermsentered.Thedocumentsreturnedbythesearchenginearecalledthesearchresults.Keywordsoftenformabasicbuildingblockforconstructingothermorecomplexcompoundsearches.SuchcompoundsearchesuseothersearchelementssuchasBooleanlogic.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 170
Source: EDRMSearchGlossary.
Averycommonsearchtechniquethatusesquerywords(“keywords”)andlooksfortheminESI,usinganindex.
Source: EDRMSearchGuideGlossary.
AsearchinwhichallDocumentsthatcontainoneormorespecificKeywordsarereturned.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Amethodofsearchingfordocumentsthatpossesskeywordsspecifiedbyauser.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Asearchusingafulltextsearchfilter.Aclientsearchtermlistisappliedtoafulltextindextofindresponsivefiles.
Source: IbisConsulting,Glossary.
Asearchfordocumentscontainingoneormorewordsthatarespecifiedbyauser.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
KHz(Kilohertz)
Thekilohertz,abbreviatedkHzorKHz,isaunitofalternatingcurrent(AC)orelectromagnetic(EM)wavefrequencyequaltoonethousandhertz(1,000Hz).Theunitisalsousedinmeasurementsorstatementsofsignalbandwidth.
Source: TechTarget,kHz(kilohertz)definition,http://searchnetworking.techtarget.com/definition/kHz
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 171
Seealso:
Hz MHz GHz
Kilobyte
See: KB(Kilobyte)
Kilohertz
See: KHz(Kilohertz)
Kleen
KleenProds.LLCv.PackagingCorp.ofAm.,CaseNo.1:10-cv-05711,variousPleadingsandTr.(N.D.Ill.2012).AfederalcaseinwhichplaintiffssoughttocompeldefendantstouseContentBasedAdvancedAnalytics(CBAA)fortheirproduction,afterdefendantshadalreadyemployedacomplexBooleanSearchestoidentifyResponsiveDocuments.DefendantsadvancedElusionscoresof5%,basedonaJudgmentalSampleofcustodians,todefendthereasonablenesstheBooleanSearch.Aftertwodaysofevidentiaryhearingsbefore(andmanyconferenceswith)MagistrateJudgeNanR.Nolan,plaintiffswithdrewtheirrequestforCBAA,withoutprejudice.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
KM(KnowledgeManagement)
Controlofandaccesstocontentinallanorganization'svariousdatabases(CMS,DMS,WP,etc.).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
KnowledgeManagement
See: KM(KnowledgeManagement)
KofaxBoard
ThegenerictermforaseriesofimageprocessingboardsmanufacturedbyKofaxImagingProcessing.Theseareusedbetweenthescannerandthecomputer,andperformrealtimeimagecompressionanddecompressionforfasterimageviewing,imageenhancement,andcorrectionstotheinputtoaccountforconditionssuchasdocumentmisalignment,"speckles,"etc.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 172
L
L600CodeSeries
AUTBMSCodeSetexclusivelyfore-discoverypurposescreatedbytheLEDESOversightCommittee(“LOC”)Board.
Source: EDRMMetricsGlossary
LAN(LocalAreaNetwork)
ALANisagroupofassociatedcomputerswhichshareacommoncommunicationslineandserverwithinthesamegeographicarea.Typically,LANusersshareapplicationsanddatastorageonthesameserver.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Asystemofinterconnectedcomputerswithacentralstorageunit(thefileserver),cablingsystem(thetopology),andspecificnetworksoftware(theNOS).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
UsuallyacollectionofPC's,connectedbycable.LandscapeModeTheimageisrepresentedonthepageormonitorsuchthatthewidthisgreaterthantheheight.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Usuallyreferstoanetworkofcomputersinasinglebuildingorotherdiscretelocation.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Acomputernetworkthatconnectsseveralcomputerslocatednearby,allowingthemtosharefilesanddevicessuchasprinters.
Seealso:
Client/servernetwork
MAN-metropolitanareanetwork
Network
Peer-to-peernetwork
SAN-storageareanetwork
Standalonecomputer
WAN-wideareanetwork
LandscapeOrientation
Inwordprocessinganddesktoppublishing,thetermsportraitandlandscaperefertowhetherthedocumentisorientedverticallyorhorizontally.Apagewithlandscapeorientationiswiderthanitistall.
Source: Webopedia,LandscapeOrientation,http://www.webopedia.com/TERM/L/landscape.html
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 173
Seealso:
Portraitorientation
LanguageModeling
Computingamodeloftherelationshipsamongwordsinacollection.Languagemodelingisusedinspeechrecognitiontopredictwhatthenextwordwillbebasedonthepatternofprecedingwords.Languagemodelingisusedininformationretrievalandpredictivecodingtorepresentthemeaningofwordsinthecontextofotherwordsinadocumentorparagraph.
Source: HerbRoitblat,PredictiveCodingGlossary.
LaptopComputer
Aportablecomputer,usuallyweighinglessthan15pounds.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Computer
Fileserver
Microcomputer
Minicomputer
Notebookcomputer
Personalcomputer
Workstation
LaserDisc
SameasanopticalCD,except12"indiameter.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
Latency
Thetimeittakestoreadadisk(orjukebox),includingthetimetophysicallypositionthemediaundertheread/writehead,seekthecorrectaddressandtransferit.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 174
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
LatentSemanticAnalysis(LSA)
(LSA)astatisticalmethodforfindingtheunderlyingdimensionsofcorrelatedterms.Forexample,wordslikelaw,lawyer,attorney,lawsuit,etc.Allsharesomemeaning.Thepresenceofanyoneoftheminadocumentcouldberecognizedasindicatingsomethingconsistentaboutthetopicofthedocument.LatentSemanticAnalysisusesstatisticstoallowthesystemtoexploitthesecorrelationsforconceptsearchingandclustering.
Source: HerbRoitblat,PredictiveCodingGlossary.
LatentSemanticIndexing(LSI)
Theuseoflatentsemanticanalysistoindexacollectionofdocuments.
Source: HerbRoitblat,PredictiveCodingGlossary.
LatentSemanticIndexing/LatentSemanticAnalysis
Latentsemanticindexing(sometimesalsoreferredtoasLatentSemanticAnalysis)isatechnologythatanalyzesco-occurrenceofkeywordtermsinthedocumentcollection.Intextualdocuments,keywordsexhibitpolysemyaswellassynonymy.LatentSemanticIndexingreferstotheadditionalfactorthatcertainkeywordsarerelatedtotheconceptinthattheyappeartogether.Theserelationshipscanbe“is-a”relationshipsuchas“motorcycleisavehicle”oracontainmentrelationshipsuchas“wheelsofamotorcycle”.SupportVectorMachines,ProbabilisticLatentSemanticAnalysis,LatentDirichletAllocation,andothers.
Source: EDRMSearchGlossary.
AFeatureEngineeringAlgorithmthatuseslinearalgebratogrouptogethercorrelatedFeatures.Forexample,"Windows,Gates,Ballmer"mightbeonegroup,while"Windows,Gates,Doors"mightbeanother.LatentSemanticIndexingunderliesmanyConceptSearchtools.WhileLatentSemanticIndexingisusedforFeatureEngineeringinsomeTechnology-AssistedReviewtools,itisnot,perse,aTechnology-AssistedReviewmethod.AlsoreferredtoasLatentSemanticAnalysis.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Leading
Pronounced"ledding,"theamountofspacebetweenlinesofprintedtext.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 175
LegacyData
Informationthedevelopmentofwhichanorganizationmayhaveinvestedsignificantresourcesandhasretaineditsimportance,buthasbeencreatedorstoredbytheuseofsoftwareand/orhardwarethathasbeenrenderedoutmodedorobsolete.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Informationinthedevelopmentprocessthatmayhavesignificantresourcesinvestedintoitthathasbeenproducedand/orstoredonsoftwareorhardwarethathasbecomeobsolete.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Informationcreatedorstoredonsoftwareand/orhardwarethatisoutmodedorobsolete.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
LegalHold
Alegalholdisacommunicationissuedasaresultofcurrentoranticipatedlitigation,audit,governmentinvestigationorothersuchmatterthatsuspendsthenormaldispositionorprocessingofrecords.ThespecificcommunicationtobusinessorITorganizationsmayalsobecalleda“hold,”“preservationorder,”“suspensionorder,”“freezenotice,”“holdorder,”or“holdnotice.”
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Lempel-Zif&Welch(LZW)
Acommon,losslesscompressionstandardforcomputergraphics–usedforthemajorityofTIFFfiles.Typicalcompressionratiosare4/1.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
LevelCoding
Usedinbibliographiccodingtoindicatethatcertaindocumenttypeswillgetamorethoroughextractionofdatathanothers.Thustheygetadeeper“level”ofcoding.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
©2016EDRMLLC
LineScreen
Thenumberofhalftonedotsthatcanbeprintedperinch.Asageneralrule,newspapersprintat65to85lpi,largecitynewspapersat100or120lpi;magazinesat133or150lpi;and,glossy,"coffeetable"booksat175to200.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
LinearReview
ADocument-by-DocumentManualReviewinwhichtheDocumentsareexaminedinaprescribedorder,typicallychronologicalorder.
SourceMauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
LinkObject
Anobjectthatspecifiesandmaintainstherelationshipbetweenalinkedobjectandalinksource.Seeembeddedobject.
Source: IbisConsulting,Glossary.
Seealso:
Bibliographiccoding
Embeddedobject
Linksource
Linkedobject
Object
LinkSource
Adataobjectstoredinaseparatelocationfromthecontainerandwhosedataisrepresentedinthecontainerbyalinkedobject.
Source: IbisConsulting,Glossary.
Seealso:
Bibliographiccoding
Embeddedobject
Linkobject
Linkedobject
Object
LinkedObject
Anobjectthatiscreatedinasourcefileandinsertedintoadestinationfile.
Source: IbisConsulting,Glossary.
Seealso:
Bibliographiccoding Embeddedobject Linkobject
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 177
Linksource Object
Listserv
Anautomaticmailinglisttowhichpeoplemaysubscribeandthensendandreceivee-mailmessagestoandfromeachother.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
LitigationSupport
See: ALS(AutomatedLitigationSupport)
LitigationSupportManager
Theindividualwhoadministerstheautomatedlitigationsupporteffortswithinalawfirmorcorporatelegaldepartment.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
LitigationSupportSystem
Oneofseveraltypesofdatabasewhichholdsbothacopydocumentandinformationaboutthatdocument.Mostsystemswillholdthefulltextofthedocumentandallowsearchestobeconductedagainstthetextand/oranyadditionalinformationthatmightbepresentinthedatabase.Themostsophisticatedsystemscangroupdocumentsintocategoriesbasedontheircontentandevenpredicttheircodingbyreferencetospecimencodingprovidedbyalawyer.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
LiveMachine
Acomputerthatispoweredupandactivelyloggedin.
Source: EDRMCollectionStandards
LoadFile
Aloadfileisusedtoimportimagesorcoding(thebibliographicinformationaboutadocument(e.g.,To,From,CC,BCC,andSubjectfieldswithinanemail)intoadatabase.Itsetsoutlinksbetweentherecordsinadatabaseandthedocumentimagefilestowhicheachrecordpertains.Thisisacriticaldeliverableofanyprocessing,scanning,orcodingjob.Withoutacorrectlystructuredloadfile,documentswillnotproperlylinktotheirrespectivedatabaserecords.
Source: EDRMMetricsGlossary
Adatafilethatsetsoutlinksbetweentherecordsinadatabaseandthedocumentimagefilestowhicheachrecordpertains.Thisisacriticaldeliverableofanyscanningandcodingjob.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 178
Withoutacorrectlystructuredloadfile,documentswillnotproperlylinktotheirrespectivedatabaserecords.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingCommonwealthLegal'sLitigationSupportGlossary,http://commonwealthlegal.com/resources/glossary.html#l.
Afileaccompanyingoutputdatadeliveredtoaclient,containingalogoffilesandimagesinaformatrequiredbytheclient’sdocumentmanagementsystem.
Source:IbisConsulting,Glossary.
Atextfilewithentriesforapplicationinformationandcomments.Typicallyusedinautomatedlitigationsupporttocarryinstructionsaboutadocumentimagecollectionforlinkingtoadatabaseprogram.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Afilethatrelatestoasetofscannedimagesandindicateswhereindividualpagesbelongtogetherasdocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AstandardisedfileusedforloadingmetadataandotherinformationintoaLitigationSupportSystem.Suchfilesgenerallycontainmetadatainadelimitedformattogetherwithinformationusedtoloadacopydocumentintothesystem.AstandardisedfileusedforloadingmetadataandotherinformationintoaLitigationSupportSystem.Suchfilesgenerallycontainmetadatainadelimitedformattogetherwithinformationusedtoloadacopydocumentintothesystem.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
LoadFileFormat
Thespecificformatforloadfiledata,includingtheloadfile,CDdirectorystructure,CDcontentandCDcontentrequirementsspecifictoaparticularclientorproject.
Source: IbisConsulting,Glossary.
LocalAreaNetwork
See: LAN(LocalAreaNetwork)
Log
Ahardcopyrecordbook,usuallyofentriesintoadatabasebutalsoofdocumentsreceived,documentsundergoingqualitycontrol,ordocumentsshippedout.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 179
LogicalEvidenceFile
Withalogicalevidencefile,youcanselectivelychoosewhichfilesorfoldersyouwanttopreserve,insteadofacquiringtheentiredrive.Unlikecopyingfilesfromadeviceandalteringcriticalmetadata,logicalevidencefilespreservetheoriginalfilesastheyexistedonthemediaandincludeadditionalinformationsuchasfilename,fileextension,lastaccessed,filecreated,lastwritten,entrymodified,logicalsize,physicalsize,MD5hashvalue,permissions,startingextent,andoriginalpathofthefile.
Source: EDRMCollectionStandards
LogicalTarget
WhenforensicimagingprocesstargetsalogicalportionofthemediasuchastheC:\driveorotherlogicalvolumeorpartition.
Source: EDRMCollectionStandards
LogicalUnitization
Theassemblyofindividuallyscannedpagesintodocuments:
• Physicalunitizationutilizesactualobjectssuchasstaples,paperclipsandfolderstodeterminepagesthatbelongtogetherasdocumentsforarchivalandretrievalpurposes.
• Logicalunitizationistheprocessofhumanreviewofeachindividualpageinanimagecollectionusinglogicalcuestodeterminepagesthatbelongtogetherasdocuments.Suchcuescanbeconsecutivepagenumbering,reporttitles,similarheadersandfootersandotherlogicalcues.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
LogisticRegression
Astate-of-the-artSupervisedLearningAlgorithmthatestimatestheProbabilitythataDocumentisRelevant,basedontheFeaturesitcontains.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
LookupTable
Apredefinedsetofentriesfromwhichausermaypickanameratherthanenterthenamedirectlyintoadatabasefield.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 180
LosslessCompression
Exactconstructionofimage,bit-by-bit,withnolossofresolutionorcolorfidelity.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
LossyCompression
Reducesstoragesizeofimagebyreducingtheresolutionandcolorfidelitywhilemaintainingminimumacceptablestandardforgeneraluse.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
LSA
See: LatentSemanticAnalysis(LSA)
LZW
See: Lempel-Zif&Welch(LZW)
M
MachineLearning
TheuseofacomputerAlgorithmtoorganizeorClassifyDocumentsbyanalyzingtheirFeatures.InthecontextofTechnology-AssistedReview,SupervisedLearningAlgorithms(e.g.,SupportVectorMachines,LogisticRegression,NearestNeighbor,andBayesianClassifiers)areusedtoinferRelevanceorNon-RelevanceofDocumentsbasedontheCodingofDocumentsinaTrainingSet.InElectronicDiscoverygenerally,UnsupervisedLearningAlgorithmsareusedforClustering,Near-DuplicateDetection,andConceptSearch.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Aprocessforusingcomputeralgorithmsandmethodstoimplementadecision,prediction,orcategorizationprocess.Machinelearningprocessestypicallyapplyinformationderivedfromexamplestopredict,categorize,ordecideaboutpreviouslyunseenobjects.Machinelearningmethodshavelargelybeenderivedfromthescienceofpatternrecognition,brainsimulation,learningtheory,anddecisiontheory.Machinelearningiscloselyrelatedtostatisticalmodeling.
Source: HerbRoitblat,Search2020:TheGlossary.
Abranchofcomputersciencethatdealswithdesigningcomputerprogramstoextractinformationfromexamples.Forexample,propertiesthatdistinguishbetweenresponsiveandnonresponsivedocumentsmaybeextractedfromexampledocumentsineachcategory.Thegoalistopredictthecorrectcategoryforfutureuntaggedexamplesbasedontheknowledge
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 181
extractedfromthepreviouslyclassifiedexamples.Exampleapproachesincludeneuralnetworks,supportvectormachines,Bayesianclassifiersandothers.
Source: HerbRoitblat,PredictiveCodingGlossary.
Macro
Apre-programmedkeystrokeorcombinationofkeystrokestoactivateasequenceofinstructions.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Magenta
Usedinfourcolorprinting.Reflectsblue&redandabsorbsgreen.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MagneticDiskEmulation(MDE)
Softwarethatmakesajukeboxlookandoperatelikeahard-drivesuchthatitwillrespondtoalltheI/Ocommandsordinarilysenttoaharddrive.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MagneticInkCharacterRecognition(MICR)
Theprocessusedbybankstoencodechecks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MagneticStorageMedia
Includes,butisnotlimitedto,harddrives(alsoknownas"harddisks"),backuptapes,CD-ROMs,DVD-ROMs,JazandZipdrives,andfloppydiscs,allusedsinglyorincombinationin,orinconjunctionwith,yourcomputersandanyandallbackupandarchivesystemsforthesame.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 182
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Media
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
Magneto-Optical
Adiskstoragetechnologywhichcompeteswithtraditionalmagneticharddisks.Formfactorsare3.5",5.25"and12".Advantagesarethatone5.25"magneto-opticaldrivecanstoreabout1.3GB(31/2"holdupto230MB);mediaisremovableandportable;and,canlastfor20years–idealforarchivalstorage.Thedisadvantagesarecost,traditionallyslowerdiskaccessandlongerdiskwritetimes.Theinformationiswrittenonthediskbychangingthepolaritywithstrongmagnetsandreadbyalaserbysensingthemagneticfluxchanges(1'sor0's).Thistechnologyisre-usable.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Magneto-OpticalDrive
Adrivethatcombineslaserandmagnetictechnologytocreatehigh-capacityerasablestorage.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Diskdrive
Floppydiskdrive
Jazdrive
Portabledrive
Storagedevice
Tapedrive
Zipdrive
See: E-Mail(ElectronicMail)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 183
MailApplicationProgramInterface(MAPI)
AWindowssoftwarestandardthathasbecomeapopularemailinterfaceusedbyMSExchange,GroupWise,andotheremailpackages.
Source: IbisConsulting,Glossary.
ThisWindowssoftwarestandardhasbecomeapopulare-mailinterfaceandisusedbyMSExchange,GroupWise,andothere-mailpackages.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MailContainer
Anareainmemoryoronastoragedevicewhereemailisplaced.Inemailsystems,eachuserhasaprivatemailbox.Whentheuserreceivesemail,themailsystemautomaticallyputsitinthemailbox.Themailsystemallowsyoutoscanmailthatisinyourmailbox,copyittoafile,deleteit,printit,orforwardittoanotheruser.ThemailboxformatusedbyMicrosoftExchangeemailsystemsisPST,whileLotusNotesusesNSFfiles.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Acontainerforelectronicmaildata(messagesandattachments)thatcontainsoneormoremailmessages.Therearemulti-mailcontainerslikePST,NSF,Netscapemailcontainers,etc.andsinglemailcontainerslikeEMLandMSG3.
Source: IbisConsulting,Glossary.
Seealso:
Container
EML
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
Mailbox
See: MailContainer
MAN(MetropolitanAreaNetwork)
Ametropolitanareanetwork(MAN)isanetworkthatinterconnectsuserswithcomputerresourcesinageographicareaorregionlargerthanthatcoveredbyevenalargelocalareanetwork(LAN)butsmallerthantheareacoveredbyawideareanetwork(WAN).Thetermisappliedtotheinterconnectionofnetworksinacityintoasinglelargernetwork(whichmay
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 184
thenalsoofferefficientconnectiontoawideareanetwork).Itisalsousedtomeantheinterconnectionofseverallocalareanetworksbybridgingthemwithbackbonelines.Thelatterusageisalsosometimesreferredtoasacampusnetwork.
Source: TechTarget,metropolitanareanetwork(MAN)definition,http://searchnetworking.techtarget.com/definition/metropolitan-area-network-MAN
Seealso:
Client/servernetwork
LAN-localareanetwork
Network
Peer-to-peernetwork
SAN-storageareanetwork
Standalonecomputer
WAN-wideareanetwork
ManagementInformationSystems(MIS)
Amanagementinformationsystem(MIS)focusesonthemanagementofinformationsystemstoprovideefficiencyandeffectivenessofstrategicdecisionmaking.Theconceptmayincludesystemstermedtransactionprocessingsystem,decisionsupportsystem,expertsystem,orexecutiveinformationsystem.Thetermisoftenusedintheacademicstudyofbusinessesandhasconnectionswithotherareas,suchasinformationsystems,informationtechnology,informatics,e-commerceandcomputerscience;asaresult,thetermisusedinterchangeablywithsomeoftheseareas.
Managementinformationsystems(plural)asanacademicdisciplinestudiespeople,technology,organizations,andtherelationshipsamongthem.Thisdefinitionrelatesspecificallyto"MIS"asacourseofstudyinbusinessschools.Manybusinessschools(orcollegesofbusinessadministrationwithinuniversities)haveanMISdepartment,alongsidedepartmentsofaccounting,finance,management,marketing,andmayawarddegrees(atundergraduate,master,anddoctorallevels)inManagementInformationSystems.
Source: Wikipedia,Managementinformationsystem,https://en.wikipedia.org/wiki/Management_information_system
ManualReview
ThepracticeofhavinghumanreviewersindividuallyreadandCodetheDocumentsinaCollectionforResponsiveness,particularissues,privilege,and/orconfidentiality.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Map-Reduce
Acomputationalpatterninwhichcomplexcomputationsarebrokendownintotwokindsofsteps.IntheMapstep,thedataareprocessedinparallel,typicallyonalargenumberofprocessors.TheresultsoftheMapsteparethencombinedintheReducesteptoyieldafinalresult.TheMap-ReducepatternistypicallyusedontopofHadooptoprocessbigdata.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 185
Source: HerbRoitblat,Search2020:TheGlossary.
MAPI
See: MailApplicationProgramInterface(MAPI)
MAPIMailNear-Line
DocumentsstoredonopticaldisksorcompactdisksthatarehousedinthejukeboxorCDchangerandcanberetrievedwithouthumanintervention.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MarginofError
ThemaximumamountbywhichaPointEstimatemightlikelydeviatefromthetruevalue,typicallyexpressedas“plusorminus”apercentage,withaparticularConfidenceLevel.Forexample,onemightexpressaStatisticalEstimateas“30%oftheDocumentsinthePopulationareRelevant,plusorminus3%,with95%confidence.”ThismeansthatthePointEstimateis30%,theMarginofErroris3%,theConfidenceIntervalis27%to33%,andtheConfidenceLevelis95%.UsingGaussianEstimation,theMarginofErrorisone-halfofthesizeoftheConfidenceInterval.ItisimportanttonotethatwhentheMarginofErrorisexpressedasapercentage,itreferstoapercentageofthePopulation,nottoapercentageofthePointEstimate.Inthecurrentexample,ifthereareonemillionDocumentsintheDocumentPopulation,theStatisticalEstimatemayberestatedas“300,000DocumentsinthePopulationareRelevant,plusorminus30,000Documents,with95%confidence”;or,alternatively,“between270,000and330,000DocumentsinthePopulationareRelevant,with95%confidence.”TheMarginofErroriscommonlymisconstruedtobeapercentageofthePointEstimate.However,itwouldbeincorrecttointerprettheConfidenceIntervalinthisexampletomeanthat“300,000DocumentsinthePopulationareRelevant,plusorminus9,000Documents.”ThefactthataMarginofErrorof“plusorminus3%”hasbeenachievedisnot,byitself,evidenceofapreciseStatisticalEstimatewhenthePrevalenceofRelevantDocumentsislow.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Thelikelyrangeinwhichthetruepopulationvaluewillbefound.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
ConfidenceInterval
Marginalia
Adatafieldrecordingtheexistenceofhandwritinginthemarginsofadocument.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 186
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Handwrittennotesinthemarginofthepageindocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
Mastering
MakingmanycopiesofaCD-ROMfromasinglemaster.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MB(Megabyte)
Onemegabyteofdataisequaltoonemillionbytes.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Theequivalentof1,000,000bytesor700double-spacedpagesoftypedmaterial,eachpageholdingapproximately1,500characters(bytes).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Amillionbytesofdataisamegabyte,orsimplyameg.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
1,000kilobytes(KB)or1,048,576bytes.
Seealso:
Bit
Byte
KB-kilobyte
GB-gigabyte
TB-terabyte
PB-petabyte
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 187
EB-exabyte
Mbox
mboxisacommonformatforstoringemailmessages.Anmboxisasinglefilecontainingzeroormoreemailmessages.
Source: http://www.qmail.org/qmail-manual-html/man5/mbox.html.
MCA(MicroChannelArchitecture)
AnIBMbusstandard.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MD5
MD5isanalgorithmthatisusedtoverifydataintegritythroughthecreationofa128-bitmessagedigestfromdatainput(whichmaybeamessageofanylength)thatisclaimedtobeasuniquetothatspecificdataasafingerprintistothespecificindividual.Thehashvalueofafileistheunique,identifyingnumbercalculatedbyMD5.
Source: IbisConsulting,Glossary.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingSearchSecurity.com,http://searchsecurity.techtarget.com/sDefinition/0,,sid14_gci527453,00.html.
Seealso:
Hash
Hashvalue
Hashing/Hash/HashValue
SHA-1
MD5-KnownFilter
Afilteroptionthatallowsforexcludingknown,commerciallyavailablefiles(suchasexecutablefilesorcommercialsoftware).
Source: IbisConsulting,Glossary.
Seealso:
Datefilter
Extensions/sizesfilter
Filter
Sender/recipientfilter
MDE
See: MagneticDiskEmulation(MDE)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 188
MeanTimeBetweenFailure(MTBF)
Averagetimebetweenfailures.Usedtocomputethereliabilityofdevices/equipment.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MeanTimeToRepair(MTTR)
Averagetimetorepair.Thehigherthenumber,themorecostlyanddifficulttofix.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MeasureofSuccess
Reachcomfortlevelthatreasonablestepsweretakentofinddocument(s),allowingforreasonabledeterminationthatdocumentdoesnotexist.
Source: EDRMSearchGlossary.
MeasurementBias
MeasurementBiasoccurswhentheactofsamplingcausesthemeasurementtobeimpacted.Ine-discovery,measurementbiascouldoccurifthecontentofthesampleisknownbeforethesamplingisdone.Forexample,ifoneweretosampleforresponsivedocumentsandduringthesamplingstage,contentisreviewed,thereispotentialforhigher-levellitigationstrategytoimpacttheresponsivedocuments.Ifaprojectmanagerhascommunicatedthecostofreviewingresponsivedocuments,anditisunderstoodthatresponsivedocumentsshouldsomehowbeassmallaspossible,thatcouldimpactyoursampleselection.Toovercomethis,thepersonimplementingthesampleselectionshouldnotbeprovidedaccesstothecontent.
Source: EDRMSearchGlossary.
Media
Thephysicalmaterialusedtostoreelectronicdata.Mediaincludesharddrives,backuptapes,computerdisks,CDs,DVDs,PDAs,memory,etc.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Anyexternaldatastoreformat,suchasCDs,Jazdrives,DLTtapes,DVDs,ordiskettesreceivedfromclientscontainingsourcedata.
Source: IbisConsulting,Glossary.
Thematerial(diskdrive,tape,floppydisk,paper,etc.)onwhichelectronicdocumentshavebeenrecorded.
Seealso:
©2016EDRMLLC
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
Tape
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Opticaldisk
Storagemedia
WORMdisk
Zipdisk
MediaConversion
MovingdatafromonetypeofmediatoanothersuchastapetoCD.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Megabyte
See: MB(Megabyte)
Megahertz(MHz)
Aunitofelectricalfrequencyequaltoamillioncyclespersecond.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Hz KHz GHz
Memory
Internalstorageareasinthecomputer.Thetermmemoryidentifiesdatastoragethatcomesintheformofchips,andthewordstorageisusedformemorythatexistsontapesordisks.Moreover,thetermmemoryisusuallyusedasashorthandforphysicalmemory,whichreferstotheactualchipscapableofholdingdata.Somecomputersalsousevirtualmemory,whichexpandsphysicalmemoryontoaharddisk.Seethedefinitionsfortwotypesofphysicalmemory:RAMandROM.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 190
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Internalstorageareasinthecomputer.Thetermmemoryidentifiesdatastoragethatcomesintheformofchips,andthewordstorageisusedformemorythatexistsontapesordisks.Moreover,thetermmemoryisusuallyusedasashorthandforphysicalmemory,whichreferstotheactualchipscapableofholdingdata.Somecomputersalsousevirtualmemory,whichexpandsphysicalmemoryontoaharddisk.Seethedefinitionsfortwotypesofphysicalmemory:RAMandROM.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
DRAM RAM ROM
Merge
Theprocessofmergingvariouse-mailfiles(i.e.MicrosoftOutlook’s.pst)intoonefileforde-duplicationpurposes.
Source: RenewData,Glossary(10/5/2005).
Tocombinedatafromtwoseparatedatabasesintoone.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
MetaTag
AnelementofHTMLthatoftendescribesthecontentsofaWebpage,andisplacednearthebeginningofthepage'ssourcecode.Searchenginesuseinformationprovidedinametatagtoindexpagesbysubject.
Metadata
Thetermmetadatarefersto"dataaboutdata".Thetermisambiguous,asitisusedfortwofundamentallydifferentconcepts(types).Structuralmetadataisaboutthedesignandspecificationofdatastructuresandismoreproperlycalled"dataaboutthecontainersofdata";descriptivemetadata,ontheotherhand,isaboutindividualinstancesofapplicationdata,thedatacontent.Inthiscase,ausefuldescriptionwouldbe"dataaboutdatacontent"or"contentaboutcontent"thusmetacontent.
Source: http://en.wikipedia.org/wiki/Metadata
Dataaboutdata.Metadatacapturesdataelementsorattributes(name,size,date,type,etc.),dataaboutrecordsordatastructures(length,fields,columns,etc.)anddataaboutdata(whereitislocated,howitisassociated,ownership,etc.).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 191
Source: RenewData,Glossary(10/5/2005).
Metadataisinformationaboutaparticulardatasetwhichdescribeshow,whenandbywhomitwascollected,created,accessed,modifiedandhowitisformatted.Somemetadata,suchasfiledatesandsizes,caneasilybeseenbyusers;othermetadatacanbehiddenorembeddedandunavailabletocomputeruserswhoarenottechnicallyadept.Metadataisgenerallynotreproducedinfullformwhenadocumentisprinted.(Typicallyreferredtobythenothighlyinformative“shorthand”phrase“dataaboutdata,”describingthecontent,quality,condition,history,andothercharacteristicsofthedata.)
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Metadataisinformationaboutaparticulardatasetwhichmaydescribe,forexample,how,when,andbywhomitwasreceived,created,accessed,and/ormodifiedandhowitisformatted.Somemetadata,suchasfiledatesandsizes,caneasilybeseenbyusers;othermetadatacanbehiddenorembeddedandunavailabletocomputeruserswhoarenottechnicallyadept.Metadataisgenerallynotreproducedinfullformwhenadocumentisprinted.(Typicallyreferredtobythelessinformativeshorthandphrase“dataaboutdata,”itdescribesthecontent,quality,condition,history,andothercharacteristicsofthedata.)
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Datathatdescribeshow,whenandbywhomaparticularsetofdatawascreated,edited,formatted,andprocessed.Accesstometa-dataprovidesimportantevidence,suchasblindcopy(bcc)recipients,thedateafileoremailmessagewascreatedand/ormodified,andothersimilarinformation.Suchinformationislostwhenanelectronicdocumentisconvertedtopaperformforproduction.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingAppliedDiscovery'sGlossary,http://www.lexisnexis.com/applieddiscovery/clientResources/glossary_M.asp
Adescriptionordefinitionofelectronicdata,ordataaboutdata.Often,metadatacanonlybeassessedincertainviewingmodes.MetadatacanincludedescriptiveHTMLtagsandinformationaboutwhenadocumentwascreated,andwhatchangeshavebeenmadeonthatdocument.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Dataaboutdata.Indataprocessing,metadataprovidesinformationaboutadocumentorotherdatamanagedwithinanapplicationorenvironment.Therearefivetypesofmetadata:filesystem,document,email,vendor-added,andcustomer-added.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Dataaboutdata.Indataprocessing,metadataprovidesinformationaboutadocumentorotherdatamanagedwithinanapplicationorenvironment.Therearefivetypesofmetadata:filesystem,document,email,vendor-added,andcustomer-added.Traditionally,theOCRbasewastheonlydataextractedfromthedocuments.Withe-discovery,themetadatacanalsobe
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 192
obtained.OCRbaseintheinformationthatisculledfromtheimagescontainedwithineachpage.(Read:Whatevertextisdisplayedintheimage).Contrastingthisisthemetadata.Themetadataisthe“footprint”ofthedocument:ittheusertoreviewinformationobtainedabouttheactualdocumentratherthanthecontent.
Source: RSI,Glossary.
Indataprocessing,metadataisdatathatprovidesinformationaboutordocumentationofotherdatamanagedwithinanapplicationorenvironment.Therearetwotypesofmetadata:generalandfile-specificmetadata.MetadataisavailableforanyparticularMicrosoftfileinWindowsbyright-clickingonafileandviewingfileProperties.IntheSummarytabtheAdvancedoptionbringsupthelistofallpossiblemetadataforthatfile.Seealsogeneralmetadataandfile-specificmetadata.
Source: IbisConsulting,Glossary.
Informationaboutdatawhichdescribeshow,when,andbywhomitwasreceived,created,accessed,and/ormodifiedandhowitisformatted.Somemetadataisvisiblesuchasfilesizeanddateofcreation;mostisnotvisibleevenwhenthedocumentisprinted.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thedatathatisattachedtofilesinacomputerizedfilingsystem.Forinstance,inawordprocessingdocument,themetadataincludes:theauthor,datecreated,personanddateeditingthedocument,thenameofthedocument,thelocationstoredonaharddrive,howmanytimesandwhenithasbeenaccessed,changedoraltered,etc.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Dataaboutafileitself,suchaswhenitwascreated,modified,andwhichcomputeruserauthoredit.Foremailsthiscouldalsoinclude:bcc,datereceived,openedstatus,undeliverable,etc.Differentmetadataareavailablefordifferenttypesofelectronicfiles.Metadatacanbeusefultounderstandingmoreaboutthedocumentanditsrelevancetothecase.
Propertiesofanelectronicfile,someofwhichwillbeinternalandsomeexternal,notallofwhicharenecessarilyvisiblewhenviewingthatfile.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
Seealso:
Customer-addedmetadata
Documentmetadata
Emailmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
File-specificmetadata
Generalmetadata
Vendor-addedmetadata
©2016EDRMLLC
MetadataSearch
Metadatasearchallowssearchingtobeconstrainedbasedoncertainmetadataelementsofadocument.Ageneralsearchspecificationallowsfornamingthemetadatafields,specifyingtheinherenttypeofthatmetadata,andthevaluetosearchfor.
Source: EDRMSearchGlossary.
MetricsDB:ContainerFiles
Containerfilesstoreoneormorefilesinacompressedform(e.g.RARorZIPformat).
Source: EDRMMetricsGlossary
MetricsDB:CullingMethods
Proceduresusedtoselectaparticularsetofdocumentsfromalargercorpusbasedonspecifically-definecriteria.Cullingmethodsaremostcommonlyusedasameanstoeliminatenon-responsivematerialfromadocumentcollectioninordertonarrowthescopeofpotentiallyresponsivematerialsrequiringattorneyreview.Commoncullingmethodsincludecustodialculling,datasourceculling,dateculling,filetypeculling,domainculling,keywordculling,anddeduplication.
Source: EDRMMetricsGlossary
MetricsDB:CustodialCulling
Acullingmethodbywhichspecificdataiseitherselectedorremovedfromalargersetsolelybasedonwhethersaiddataisstoredand/ormaintainedbyaparticularindividualonarepositorywithintheiradministrativecontrol.
Source: EDRMMetricsGlossary
MetricsDB:DataSourceCulling
Acullingmethodbywhichspecificdataiseitherselectedorremovedfromalargersetsolelybasedonwhethersaiddataoriginatesfromorisstoredwithinaparticularrepositoryoroncertainmedia.
Source: EDRMMetricsGlossary
MetricsDB:DataVolumePost-Culling
Thevolumeofdataremainingafterspecificcullingmethodshavebeenappliedtoalargerdataset.
Source: EDRMMetricsGlossary
MetricsDB:DataVolumePost-Deduplication
Theamountofdataremaininginadatasetafterduplicatefileshavebeenremoved.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 194
Source: EDRMMetricsGlossary
MetricsDB:DataVolumePost-Processing
Theamountofdatainadatasetaftertheextractionofdatafromcontainedfilesandresultantexpansion,theapplicationofcullingfiltersandotherdatareductionmethodologies,textextractionandopticalcharacterrecognition,andothermanipulationofnativedata.
Source: EDRMMetricsGlossary
MetricsDB:DataVolumePre-Processing
Theamountofdatacollectedfromvarioussourcespriortotheapplicationofanycullingmethodologiesormanipulationorconversionoffilesintheirnativeformat.
Source: EDRMMetricsGlossary
MetricsDB:DataVolumeProduced
Thetotalamountofdataeitherdeliveredtoorreceivedfromathirdpartyinthecontextofalegalproceeding.Typically,anyamountofdatadeliveredtoanopposingorthirdpartyhasbeenreviewedforresponsiveness,confidentialityandprivilegeasapreconditionofproduction.
Source: EDRMMetricsGlossary
MetricsDB:DataVolumeReviewed
Thetotalamountofdataexaminedbycounselandclassifiedasresponsivetocertainclaimsorissues,asattorney-clientcommunicationorprivilegedworkproduct,confidential,orsomeotherdesignationpriortoitbeingproducedtoanythirdparty.Arevieweddatavolumemayalsoincludedatathathasbeenclassifiedthroughtheuseofadvancedanalyticstechnologiesaccordingtoadefinedassistedreviewprocess.
Source: EDRMMetricsGlossary
MetricsDB:DateCulling
Acullingmethodbywhichspecificdataiseitherselectedorremovedfromalargersetsolelybasedoncriteriarelatedtodateortime.Suchcriteriaincludethespecificdateonwhichadocumentwascreated,modified,oraccessed,or,thecaseofemail,thedateonwhichamessagewassentorreceived.Typically,adatecullingmethodologyleveragesarangeofdateswithinwhichdocumentsmatchspecificcriteria.
Source: EDRMMetricsGlossary
MetricsDB:DedupeMethod
(1)Global/Case.(2)Custodian.
Source: EDRMMetricsGlossary
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 195
MetricsDB:Deduplication
Inthecontextofe-discovery,deduplicationreferstothereductionofduplicatefilesbasedonidenticalfilefingerprintsoracombinationoffilefingerprintsandmetadataattributes.Deduplicationisusedinlegalreviewtoreducetheamountofdatarequiredforreview.Exactduplicatesareidentifiedbycomparingthehashvaluesoftwoormoredocuments.Ahashvalueisauniqueidentifierassociatedwithaparticulardocumentgeneratedbyaspecificmathematicalalgorithmbasedonadocument’scontentandattributes.MD5,SHA-1,andSHA-180areexamplesofdifferenthashingalgorithms.Deduplicationisappliedtodatasetsindifferentwaysincludingglobally(i.e.toanentiredatasetacrosscustodians–oftenreferredtoas“horizontaldeduplication”),bycustodian(i.e.withineachcustodian’sdocuments–oftenreferredtoas“verticaldeduplication”).Anearduplicateisadocumentthatismateriallysimilartoanotherbutisdifferentonabit-level.Itisimportanttonotethatneardeduplicationisbasedonthecontentofadocument,notthehashvalue,andcanbeimpactedbutsuchfactorsasstandardlanguage,documentheadersandfooters(e.g.emailsignaturesordisclaimers),andOCRquality.Finally,thedefinitionofdeduplicationwithinthecontextofe-discoveryisslightlydifferentthanthatusedwithindatastoragemanagement.Storagemanagementoftenleveragesdeduplicationtostoreasingleinstanceofafile.
Source: EDRMMetricsGlossary
MetricsDB:FamilyCountPost-Culling
Representsthenumberofparentorfamilyfilesthatremainaftervariousfilteringandcullingmethodshavebeenapplied.
Source: EDRMMetricsGlossary
MetricsDB:FamilyCountPost-Deduplication
Representsthenumberofparentorfamilyfilesremainingafterdeduplicationprocesshasbeenapplied.
Source: EDRMMetricsGlossary
MetricsDB:FamilyCountPost-Processing
Typically,thisisthefirststepinpreparingdataforfurtherfilteringandculling.Referstothetotalpre-deduplicationcountofparentorfamilyfiles/documentsaftercontainersandembeddingsareextracted.Doesnotincludetheoriginalcontainerfiles(e.g..zip,.rar,.jar,.tar,etc.),inthefilecount.
Source: EDRMMetricsGlossary
MetricsDB:FamilyCountProduced
Totalcountofparentorfamilyfilesproduced.
Source: EDRMMetricsGlossary
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 196
MetricsDB:FamilyCountReviewed
Thetotalcountofparentorfamilyfilesthathavebeenreviewed.
Source: EDRMMetricsGlossary
MetricsDB:FileCountPre-Processing
Representsthenumberoffilesgoingintoprocessing.Thisprocesswilltypicallycountthenumberofcontainerfilesbutnotnumberoffilesextractedfromcontainers.
Source: EDRMMetricsGlossary
MetricsDB:FileTypeCulling
Referstoaprocessingmethodbywhichspecificdataiseitherselectedorremovedfromalargersetsolelybasedontheformat.Thoughitcanbemanipulated,ausercanoftenidentifythekindofdatastoredinafilethroughthefilenameextension(e.g..pdf,.ppt,.docx).
Source: EDRMMetricsGlossary
MetricsDB:IndividualCountPost-Culling
Representsthenumberoffilesincludingparentsandchildrenremainingaftercullingmethodshavebeenapplied.
Source: EDRMMetricsGlossary
MetricsDB:IndividualCountProduced
Totalcountofallfilesproducedincludingparentsandchildren,countedseparately.
Source: EDRMMetricsGlossary
MetricsDB:IndividualCountReviewed
Totalcountofallfilesreviewedincludingparentsandchildren,countedseparately.
Source: EDRMMetricsGlossary
MetricsDB:OtherCulling
Anyothercullingmethodbywhichcertaincriteriaisusedtoeitherselectorremovecertaindatafromalargercorpus.Examplesinclude:daterange,filetypeorcustodian/source.
Source: EDRMMetricsGlossary
MetricsDB:ThreadingCulling
Anemailthreadisafilethatcontainsanoriginalemailalongwiththesubsequentrepliestoand/orforwardsofthatoriginalemail.Threadingcullingallowsuserstoreviewalloftheindividualrepliesandforwardedmessagesrelatingtoanoriginalemailasoneinclusiverecord
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 197
orgroupedsetofrecords.Userscanreviewemailsaccordingtoconversationsasopposedtoviewingfragmentedandduplicativeemailsmessagescontainedwithinathreadinisolation.
Source: EDRMMetricsGlossary
MetricsDB:TotalHoursReviewed
Thetotalnumberofhoursspentonreviewbyallreviewers(combined).Thisshouldincludebothattorneyandlitigationsupportteam(paralegal,reviewspecialist,etc)hours.Thisshouldnotincludetimespentonprocessingorcullingthedatainpreparationforreview,ortheestablishmentofbatchesorgroupsofrecordsforreview.
Source: EDRMMetricsGlossary
MetropolitanAreaNetwork
See: MAN(MetropolitanAreaNetwork)
MHz
See: Megahertz(MHz)
MICR
See: MagneticInkCharacterRecognition(MICR)
MicroChannelArchitecture
See: MCA(MicroChannelArchitecture)
Microcomputer
ThenextlevelofcomputerafterthePC,theminicomputerisdesignedtooperateinamulti-userenvironment.“Mini’s”oftenuseseveralcomputerprocessorsincombination.
Seealso:
Computer
Fileserver
Laptopcomputer
Minicomputer
Notebookcomputer
Personalcomputer
Workstation
Microfiche
Reducedsizeddocument(s)filedonsheetmicrofilm(4"by6"),containingreducedimagesof270pagesormoreinagridpattern.Usuallywithahuman-readabletitle.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 198
Microfilm
Filmonwhichdocumentsetc.arephotographicallygreatlyreducedinsize.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Microprocessor
Acomputerprocessorononechip.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
TheCPUofaPC,themostprevalentofwhichistheIntelchip(286,386,486,andPentium).
MicrosoftDiskOperatingSystem(MS-DOS)
Acronymfordiskoperatingsystem.ThetermDOScanrefertoanyoperatingsystem,butitismostoftenusedasashorthandforMS-DOS(Microsoftdiskoperatingsystem).OriginallydevelopedbyMicrosoftforIBM,MS-DOSwasthestandardoperatingsystemforIBM-compatiblepersonalcomputers.
Source: http://www.webopedia.com/TERM/D/DOS.html
Microsoft'sdiskoperatingsystem;usedinPC'sasthecontrolsystem.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
DOS
Linux
MicrosoftWindows
Networkoperatingsystem
NOS
Operatingsystem
OS
UNIX
Windows
Xenix
MicrosoftDOS
See: MicrosoftDiskOperatingSystem(MS-DOS)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 199
MicrosoftWindows
AsoftwareproductthatprovidesanoperatingenvironmentthatrunsunderMS-DOS,usingaGUIthatcanrundifferentprogramsatthesametimeindifferentwindows.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
DOS
Linux
MicrosoftDOS
Networkoperatingsystem
NOS
Operatingsystem
OS
UNIX
Windows
Xenix
MigratedData
Migrateddataisinformationthathasbeenmovedfromonedatabaseorformattoanother,usuallyasaresultofachangefromonehardwareorsoftwaretechnologytoanother.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Datathathasbeentransferredfromonedatabaseorformattoanotherthatisgenerallydonewhenmigratedfromoneformofhardwareorsoftwaretechnologytoanother.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Informationthathasbeenmovedfromonedatabaseorformattoanother.
Migration
Theprocessofmovingacomputersystemand/oritscomponentsfromoneoperatingenvironmenttoanotheroperatingenvironment.Migrationalsoreferstomovingdatafromonestoragemediumordevicetoanother,asinhardwareandsoftwareupgrades.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
MIME(MultipurposeInternetMailExtensions)
Astandardforencodingattachmentsinmailmessages.FileswithMIMEcanhavenumerous,unessentialpages.
Source: IbisConsulting,Glossary.
TheuniqueidentifierusedtodescribewhichfiletypeisconveyedacrossaMIME-basedprotocolsuchasMIMEe-mailorHTTP.TheMIMEtype,containedincertainfieldsofanemail,indicateswhatkindofcomputerfileisattachedtotheemailsothatthesystemknowshowtoopenthefileorotherwiseprocessit.Mimetypesnamesconformtoaninternationalstandard.RegistrationofMIMEtypesisexplainedinRFC2048.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 200
Minicomputer
ThenextlevelofcomputerafterthePC,theminicomputerisdesignedtooperateinamulti-userenvironment.“Mini’s”oftenuseseveralcomputerprocessorsincombination.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Computer
Fileserver
Laptopcomputer
Microcomputer
Notebookcomputer
Personalcomputer
Workstation
MirrorImage
Usedincomputerforensicinvestigationsandsomeelectronicdiscoveryinvestigations,amirrorimageisabit-by-bitcopyofacomputerharddrivethatensurestheoperatingsystemisnotalteredduringtheforensicexamination.Mayalsobereferredtoas“discmirroring,”orasa“forensiccopy.”
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Bitstreamcopy
Forensiccopy
Image
Imagedcopy
Mirroring
Duplicationofdataforpurposesofbackupordatadistribution.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Computerevidence
Computerforensics
Computerinvestigations
Discovery
Electronicdiscovery/e-discovery
Electronicevidence
Forensicanalysis
Forensics
MIS
See: ManagementInformationSystems(MIS)
Miss/Missed
ARelevantDocumentthatisnotidentifiedasRelevantbyasearchorrevieweffort.AlsoreferredtoasaFalseNegative.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 201
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
MissRate
Thefraction(orProportion)oftrulyRelevantDocumentsthatarenotidentifiedasRelevantbyasearchorrevieweffort.MissRate=100%–Recall.AlsoreferredtoastheFalseNegativeRate.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Modem(Modulator-Demodulator)
Adevicethatmodulatesdigitalsignalstoallowtheirtransmissionoveranalogcommunicationfacilities.Typicallyusedtoallowtwocomputerstocommunicateoverphonelines.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Adevicewhichcantakedigitaldatafromacomputer,translateitintoanalogsignals(tones)andtransmittheinformationovertelephoneslines.Anothermodematthereceivingcomputerwillreceivetheinformation,translateitbackfromanalogtodigitalandstoreit.Typicalspeedsarefrom1,200to14,400bitspersecond.Somemodemsalsocorrectanyerrorswhichoccurinthetransmissionprocess.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Apieceofhardwarethatletsacomputertalktoanothercomputeroveraphoneline.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Externallinks:
WebopediaComputerDictionary,http://www.webopedia.com/TERM/M/modem.html
Modulator-Demodulator
See: Modem(Modulator-Demodulator)
Monitor
Adedicateddevicethatplugsintoagraphicsboardandthendisplayscomputer-generatedinformation.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thescreenthatdisplaysdatafromthecomputer.Monitorsmaybemonochromeorcolor.Onnotebookcomputers,theymayalsobe“backlit”or“gasPlasma."
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 202
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Monochrome
Adisplaycapableofonlytwocolors,usuallyblack&white.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Mosaic
AprogramusedforfindingandreadingdocumentsontheWorld-Wide-Web.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Motherboard
Themainboardintowhichprintedcircuitboardsorcardsareattachedtothemicroprocessor.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Mount
Theprocessofmakingoff-linedataavailableforon-lineprocessing.Forexample,placingamagnetictapeinadriveandsettingupthesoftwaretorecognizeorreadthattape.Theterms“load”and“loading”areoftenusedinconjunctionwith,orsynonymouslywith,“mount”and“mounting”(asin“mountandloadatape”).“Load”mayalsorefertotheprocessoftransferringdatafrommountedmediatoanothermediaortoanon-linesystem.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Mouse
Ahand-helddevicethatisrolledonthedesktopandcontrolsthecursorpositiononthemonitor.Commonlyusedwithsoftwarethathasagraphicaluserinterface.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
MPEG
MPEG-1andMPEG-2aretwodifferentstandardsforfullmotionvideotodigitalcompression/decompressiontechniquesadvancedbytheMovingPicturesExpertsGroup.MPEG-1compressesthebandwidthneededfor30frames/secondoffull-motionvideo(severalhundredmegabytes)downtoabout1.5Mbits/sec.MPEG-2onlycompressestoabout3Mbitsandprovidesforbetterimagequalitywhencomparingcompressedfilesofthesamesize.Thisindustryapplicationcompeteswithothercompressiontechniques,knowasJPEG,CaptainCrunch,CinepakandIndeo.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 203
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
MS-DOS
See: MicrosoftDiskOperatingSystem(MS-DOS)
MSG
TheMicrosoftOutlookItem(.msg)FileFormatisusedtoformataMessageobject,suchasane-mailmessage,anappointment,acontact,atask,andsoon,forstorageinthefilesystem.
Source: [MS-OXMSG]:OutlookItem(.msg)FileFormat-Introduction,http://msdn.microsoft.com/en-us/library/ee160779(v=exchg.80).aspx.
Thefileformatofstand-alone,single-mailmessagecontainernotcontainedinmulti-mailcontainers.
Source: IbisConsulting,Glossary.
Messagefile.Typicallycontainsanemailmessage.
Seealso:
Container
EML
Mailcontainer
Mailbox
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
MTBF
See: MeanTimeBetweenFailure(MTBF)
MTTR
See: MeanTimeToRepair(MTTR)
Multi-MailContainer
Anaggregationofe-mailmessagesandattachmentssavedwithincontainers(forexample,.PSTor.NSF).
Source: IbisConsulting,Glossary.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
NSF
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 204
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
Multi-PageText
Extracted,multi-pagetextfiles,withorwithoutpagebreakcharacters.
Multi-PageTIFF
Multi-pageTIFFimages(asingleTIFFfilewithmultiplepages).TheBatesnumbernameassignedtoeachoftheseTIFFfilesistheBatesnumberofthefirstpageofthefile.
Source: IbisConsulting,Glossary.
A.tiffilecomprisedofallofthepagescontainedintheunderlyingelectronicfileorhardcopydocumentpriortoitsconversiontoorscanninginto.tifformat.Asdistinguishedfromthesituationwhereeachpageofanunderlyingmulti-pagedocumentbecomesaseparate.tiffile.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Jointphotographicexpertgroup
JPEG
PNG
PortableDocumentFormat
Portablenetworkgraphic
SearchableTIFF
Single-pageTIFF
TIFF
Multi-Task
Theabilitytoaccessmorethanonesoftwareapplicationatatime.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thecapabilitytocarryoutmultipletasksatthesametime.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Multi-Thread
Multi-taskingwithinthesameapplicationatthesametime.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 205
Multi-User
Thecapabilitytohavemorethanonepersonusingacomputersystematthesametime.Amulti-usersystemallowsthesharingofdataandperipheralequipmentamongallusers.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
MultipurposeInternetMailExtensions
See: MIME(MultipurposeInternetMailExtensions)
Multisynch
Analogvideomonitorswhichcanreceiveawiderangeofdisplayresolutions,usuallyincludingTV(NTSC).Coloranalogmonitorsacceptseparatered,green&blue(RGB)signals.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
N
N-Gram
NconsecutivewordsorcharacterstreatedasaFeature.Inthephrase,“Tobeornottobe,”awordBigram(i.e.,2-gram)wouldbe“tobe”;awordAnN-GramwhereN=3(i.e.,a3-gram).(i.e.,3-gram)wouldbe“tobeor”;aQuad-Gram(i.e.,4-gram)wouldbe“tobeornot”;andsoon.SeealsoShingling.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
NamesMentionedinText
Adatafieldusedtoclassifynamesthatappearinadocumentotherthanastheauthor,recipient,orrecipientofacarboncopy.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Notefield
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 206
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
NativeApplication
Anyapplicationusedtocreateandviewaparticularapplicationfiletype.
Source: IbisConsulting,Glossary.
NativeEnvironment
Theoriginalconfiguration(software,passwords,serverconfiguration,etc.)ofabackuptapeore-mailsystem(i.e.MicrosoftExchange).
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
NativeFile
Applicationfilesintheiroriginalfileformat.Alsousedinthecontextofdeliveringnativefileprocessing.
Source: IbisConsulting,Glossary.
Afilesavedintheformatoftheoriginalapplicationusedtocreatethefile.Dealingwithnativefilescanminimizeexpensiveper-pagecostsforthetraditionalTIFFand/orPDFprocessingandwillmaximizetherelevantinformationavailablefromthefile.
Source: RenewData,Glossary(10/5/2005).
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingRenewData'sElectronicEvidenceReferenceChart,http://www.renewdata.com/wall-chart-signup.html
Thesourcedocument,ascollectedfromthesourcecomputerorserver,beforeanyconversionorprocessingofthedocument.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Adocumentproducedintheformatinwhichitwasoriginallycreated.
NativeFormat
Electronicdocumentshaveanassociatedfilestructuredefinedbytheoriginalcreatingapplication.Thisfilestructureisreferredtoasthe“nativeformat”ofthedocument.Becauseviewingorsearchingdocumentsinthenativeformatmayrequiretheoriginalapplication(i.e.,viewingaMicrosoftWorddocumentmayrequiretheMicrosoftWordapplication),documents
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 207
areoftenconvertedtoastandardfileformat(i.e.,tiff)aspartofelectronicdocumentprocessing.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
NativeProduction
Producingfilesintheformattheywerecreatedandmaintainedisknownasanativeproduction.
Source: EDRMProductionGuide.
Adocumentproducedintheformatinwhichitwasoriginallycreated.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
NaturalLanguageSearch
Anon-Booleanretrievalmethod,which,insteadofusing“and/or”connectors,preparesthesearchrequestinordinarylanguageandisautomaticallyconvenedbythecomputerintoalgorithms.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
NaïveBayes
ASupervisedLearningAlgorithminwhichtherelativefrequencyofwords(orotherFeatures)inRelevantandNon-RelevantTrainingExamplesisusedtoestimatethelikelihoodthatanewDocumentcontainingthosewords(orotherFeatures)isRelevant.NaïveBayesreliesonthesimplisticassumptionthatthewordsinaDocumentoccurwithindependentProbabilities,withtheconsequencethatittendstoyieldextremelyloworextremelyhighestimates.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 208
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
NaïveBayesianClassifier
Asystemthatexaminestheprobabilitythateachwordinanewdocumentcamefromtheworddistributionderivedfromtrainedresponsivedocumentsorfromtrainednon-responsivedocuments.Thesystemisnaïveinthesensethatitassumesthatallwordsareindependentofoneanother.
Source: HerbRoitblat,PredictiveCodingGlossary.
NDLON
NationalDayLaborerOrganizingNetworkv.U.S.ImmigrationandCustomsEnforcementAgency,CaseNo.10-Civ-3488(SAS),2012WL2878130(S.D.N.Y.July13,2012),aFreedomofInformationAct(FOIA)caseinwhichDistrictJudgeShiraA.Scheindlinheldthat“mostcustodianscannotbe‘trusted’toruneffectivesearchesbecausedesigninglegallysufficientelectronicsearchesinthediscoveryorFOIAcontextsisnotpartoftheirdailyresponsibilities,”andstated(indicta)that“beyondtheuseofkeywordsearch,partiescan(andfrequentlyshould)relyonlatentsemanticindexing,statisticalprobabilitymodels,andmachinelearningtofindresponsivedocuments.Throughiterativelearning,thesemethods(knownas‘computer-assisted’or‘predictive’coding)allowhumanstoteachcomputerswhatdocumentsareandarenotresponsivetoaparticularFOIAordiscoveryrequestandtheycansignificantlyincreasetheeffectivenessandefficiencyofsearches.”
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Near-DuplicateDetection
Anindustry-specifictermgenerallyusedtodescribeamethodofgroupingtogether“nearlyidentical”Documents.Near-DuplicateDetectionisavariantofClusteringinwhichthesimilarityamongDocumentsinthesamegroupisverystrong.Itistypicallyusedtoreducereviewcosts,andtoensureconsistentCoding.AlsoreferredtoasNear-Deduplication.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
NearestNeighbor
ASupervisedLearningAlgorithminwhichanewDocumentisClassifiedbyfindingthemostsimilarDocumentintheTrainingSet,andassumingthatthecorrectCodingforthenewDocumentisthesameasthemostsimilaroneintheTrainingSet.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 209
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Astatisticalprocedurethatclassifiesobjects,suchasdocuments,accordingtothemostsimilaritemthathasalreadybeenassignedacategorylabel.Thisapproachusesasetoflabeledexamplestoclassifysubsequentunlabeleditems,bychoosingthecategoryassignedtothemostsimilarlabeledexample(itsnearestneighbor)orexamples.K-nearestneighborclassificationusesthekmostsimilarclassifiedobjectstodeterminetheclassificationofanunknownobject.
Source: HerbRoitblat,PredictiveCodingGlossary.
NegativePredictiveValue(NPV)
Thefraction(Proportion)ofDocumentsthatareidentifiedasNon-Relevantbyasearchorrevieweffort,thatareinfactNon-Relevant.ThecomplementofPrecision;thatis,NegativePredictiveValueiscomputedthesamewayasPrecisionwhenthedefinitionsofRelevantandNon-Relevantaretransposed.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Nesting
Documentnestingoccurswhenonedocumentisinsertedwithinanotherdocument(i.e.,anattachmentisnestedwithinanemail;graphicsfilesarenestedwithinaMicrosoftWorddocument).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
NetWareLoadableModule(NLM)
Anapplicationthatrunsaspartofthenetworkoperatingsystem(NOS)ofaNovellNetWareserver.
Source; FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Network
Agroupofconnectedcomputersthatallowpeopletoshareinformationandequipment(e.g.localareanetwork(LAN),wideareanetwork(WAN),metropolitanareanetwork(MAN),storageareanetwork(SAN),peer-to-peernetwork,client-servernetwork).
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 210
Multiplecomputersconnectedtogethersothattheyfunctionasamulti-usersystem.Anetworkmaybealocalareanetwork(LAN)orawideareanetwork(WAN).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Agroupofcomputersordevicesthatisconnectedtogetherfortheexchangeofdataandsharingofresources.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Client/servernetwork
LAN-localareanetwork
MAN-metropolitanareanetwork
Peer-to-peernetwork
SAN-storageareanetwork
Standalonecomputer
WAN-wideareanetwork
NetworkInterfaceCard(NIC)
Thecardinsideacomputerthatenablestheestablishmentofanetworkconnection.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
NetworkOperatingSystem(NOS)
Softwarewhichdirectstheoverallactivityofnetworkedcomputers.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Theoperatingsystemthatsupportsnetworkoperations.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
DOS
Linux
MicrosoftDOS
MicrosoftWindows
Operatingsystem
OS
UNIX
Windows
Xenix
NetworkTopology
Thewiring,connections,andadapterboardsthatinterconnectcomputersonanetwork.ThethreestandardtopologiesforPCsareEthernet,IBMTokenRing,andARCnet.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 211
NeuralNetwork
Anapproachtomachinelearningwheretheelementsoftheprocessresemblesimulatedneurons.Neurons,inturn,arethoughttobetheprimarycomputationalelementsinthebrain.Eachelementinaneuralnetworkreceivessomesetofinputs,eitherfromtheenvironmentorfromotherneurons.Itthencomputesanoutputbasedonitsinputs.Networksoftheseelementsarecapableofquitesophisticatedcomputations.Computingwithneuralnetworksisalsocalledbrain-stylecomputation.
Source: HerbRoitblat,Search2020:TheGlossary.
NIC
See: NetworkInterfaceCard(NIC)
NLM
See: NetWareLoadableModule(NLM)
Node
Anydeviceconnectedtonetwork.PCs,servers,andprintersareallnodesonthenetwork.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
NoiseWordFilter
Toavoidcreatinganoverlyinclusiveindex,mostindicesutilizeanoisewordfilter.Noisewordfiltersincludesacustomizedlistoftermsthatareoverlookedorignoredduringindexing.Somecommonnoisewordsinclude‘a’,‘and’,‘the’,‘from’,and‘because’.
Source: EDRMSearchGlossary.
Non-Interlaced
Wheneachlineofthevideoimageisscannedseparately.Computermonitorsusenon-interlacedvideo.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Non-MailData
Extractable,standardand(somenon-standard)mailarchiveitemsbesidese-mailmessagesandattachments,suchasCalendar,Tasks,Notes,Persons,Meetings,etc.
Source: IbisConsulting,Glossary.
Non-NativeEnvironment
Aproprietaryprocessinwhichelectronicdataisobtaineddirectlyfrombackuptapeswithouttheneedtorecreateanativeenvironment.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 212
Source: RenewData,Glossary(10/5/2005).
Non-NegativeMatrixFactorization
Amathematicaltechniquethatsummarizesthecorrelationbetweenitems.OneofthetechniquesusedineDiscoveryasthebasisforconceptsearch,wheretheitemsarewords.
Source: HerbRoitblat,Search2020:TheGlossary.
Non-PrintableFiles
Filesthatcan’tbeprinted,suchasDLL,EXE,AVIfiles.
Source: IbisConsulting,Glossary.
Non-Relevant/NotRelevant
InInformationRetrieval,aDocumentisconsideredNon-Relevant(orNotRelevant)ifitdoesnotmeettheInformationNeedofthesearchorrevieweffort.Thesynonym“irrelevant”israrelyusedinInformationRetrieval.
Source; MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Non-ResponseBias
Non-ResponseBiasoccurswhenaportionofpotentialsamplesisnotavailableforsampling.Asanexample,ifane-discoveryeffortisidentifyingpotentialresponsiveengineeringdocuments,andifthedocumentsareinadocumentformatand/orprogramminglanguagethatcouldnotbesampledorunderstood,therecouldbeasignificantnon-responseBias.Seealso,ResponseBias.
Source: EDRMSearchGlossary.
Seealso:
Non-ResponseBias ResponseBias
NormalDistribution
The“bellcurve”ofclassicalstatistics.ThenumberofRelevantDocumentsinaSampletendstoobeyaNormal(Gaussian)Distribution,providedtheSamplesizeislargeenoughtocaptureasubstantialnumberofRelevantandNon-RelevantDocuments.Inthissituation,GaussianEstimationisreasonablyaccurate.IftheSamplesizeisinsufficientlylargetocaptureasubstantialnumberofbothRelevantandNon-RelevantDocuments(asaruleofthumb,atleast12ofeach),theBinomialDistributionbettercharacterizesthenumberofRelevantDocumentsintheSample,andBinomialEstimationismoreappropriate.AlsoreferredtoasaGaussianDistribution.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 213
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Inprobabilitytheory,thenormal(orGaussian)distributionisacontinuousprobabilitydistribution.Ithasabell-shapedprobabilitydensityfunction,knownastheGaussianfunctionorinformallyasthebellcurve,theheightofthecurveshowstherelativelikelihoodofvariousvalues.Theareaunderthecurvesumsto1.0,sosectionsofthecurverepresentprobabilities.Thenormaldistributionderivesfromthecentrallimittheorem,whichsaysthattheaverageofalargenumberofrandomvariablesisdistributedasthenormaldistribution,howeverthevariableswereoriginallydistributed.Thenormaldistributionhaswideapplicationinstatistics,forexample,insampling.
Agraphofthenormaldistribution.Theconfidenceintervalisinthemiddleinwhite.The"tails"areshowninyellow.The95%confidenceintervalrepresents95%oftheareaunderthecurve.Inatwo-taileddistribution,this95%areaissymmetricallyalignedaroundtheaverageofthedistribution.Imagefromhttps://en.wikipedia.org/wiki/One-_and_two-tailed_tests
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
GaussianDistribution
NOS
See: NetworkOperatingSystem(NOS)
NoSQL
Generallyinterpretedtomean“NotOnlySQL,”referstodatabasesthatarebuiltusingstructuresotherthantablesandrelations.NoSQLdatabasesaretypicallydistributedovermanyphysicalmachines,horizontallyscalable,andareoftendistributedasopen-sourcesoftware.
Source: HerbRoitblat,Search2020:TheGlossary.
NoteField
Adatafieldthatallowstheentryoftextinamannersimilartowordprocessingsoftware,whichisnotlimitedtoaspecificnumberofcharacters.Typicallyusedforattorneys’notesorcomments.Anotefieldcannotbesorted.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
©2016EDRMLLC
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
Text
NotebookComputer
Asmalllaptopcomputer,usuallyweighinglessthan8pounds.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Computer
Fileserver
Laptopcomputer
Microcomputer
Minicomputer
Personalcomputer
Workstation
NotesStorageFacility(NSF)
DatabasesinIBMNotes,formerlyLotusNotes,areNotesStorageFacility(.nsf)files,containingbasicunitsofstorageknownasa"note".
Source: http://en.wikipedia.org/wiki/IBM_Notes.
ALotusNotesmailcontainer.
Source: IbisConsulting,Glossary.
ALotusNotes/Dominodatabase,includingemailcollections.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
NSF
See: NotesStorageFacility(NSF)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 215
NT
ReferstoMicrosoftWindowsNTserverandworkstationsoftware.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
NTFilingSystem(NTFS)
NTFS(NTfilesystem;sometimesNewTechnologyFileSystem)isthefilesystemthattheWindowsNToperatingsystemusesforstoringandretrievingfilesonaharddisk.NTFSistheWindowsNTequivalentoftheWindows95fileallocationtable(FAT)andtheOS/2HighPerformanceFileSystem(HPFS).However,NTFSoffersanumberofimprovementsoverFATandHPFSintermsofperformance,extendibility,andsecurity.
Source: TechTarget,NTFS(NTfilesystem;sometimesNewTechnologyFileSystem)definition,http://searchwindowsserver.techtarget.com/definition/NTFS
Seealso:
FAT Filesystem NTfilingsystem
NTFS
See: TFilingSystem(NTFS)
NullSet
ThesetofDocumentsthatarenotreturnedbyasearchprocessorthatareidentifiedasNotRelevantbyareviewprocess.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
NumericRangeSearch
Anumericrangesearchisasearchforanynumbersthatfallwithinarange.
Source: dtSearchSupport,NumericRangeSearching,https://support.dtsearch.com/webhelp/dtsearch/default.htm#numeric_.htm
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Phonicsearch
Phrasesearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 216
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
O
Object
Acombinationofcodeanddatacreatedatruntimethatcanbetreatedasaunit.Atable,chart,graphic,equation,orotherformofinformation.Seeembeddedobject.
Source: IbisConsulting,Glossary.
Seealso:
Bibliographiccoding
Embeddedobject
Linkobject
Linksource
Linkedobject
ObjectLinkingandEmbedding(OLE)
AfeatureinMicrosoft'sWindowswhichallowseachsectionofacompounddocumenttocallupitsowneditingtoolsorspecialdisplayfeatures.Thisallowsforcombiningdiverseelementsincompounddocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ObjectiveCoding
Therecordingofbasicdatasuchasdate,author,ordocumenttype,fromdocumentsintoadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Extractinginformationfromelectronicdocumentssuchasdatecreated,authorrecipient,CCandlinkingeachimagetotheinformationinpre-definedobjectivefields.IndirectoppositiontoSubjectivecodingwherelegalinterpretationsofdatainadocumentarelinkedtoindividualdocuments.Alsocalledbibliographiccoding.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Extractingvarioussegmentsofinformationfromadocumentsuchasitsauthor,recipient,mailingdate,orotherfields,etc.ObjectiveCodingisusuallydonefromthedocumenttextorimagebecausemetadataorsearchabletextmaybeunavailable(e.g.ahandwrittendocumentthathasbeenscanned),orthedocumentmaycontaininaccuratemetadata(e.g.metadata
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 217
associatedwithadocumentwrittenandsignedbyapartnermightreflecttheadministrativeassistantastheauthorwherethedocumentwasoriginallytypedontheassistant’scomputer).
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
ObstructionofJustice
AccordingtoBlack’slawdictionary,obstructionofjusticemeans“impedingorobstructingthosewhoseekjusticeinacourt,orthosewhohavedutiesorpowersofadministeringjusticetherein.”
Source: RenewData,Glossary(10/5/2005).
OccurrenceCount
OccurrencecountsearchallowsalegalprofessionaltospecifyOccurrencecountsearchallowsalegalprofessionaltospecifythatawordappearacertainnumberoftimesforthedocumenttobeselected.
Source: EDRMSearchGlossary.
OCR(OpticalCharacterRecognition)
Opticalcharacterrecognitionistheconversionofascanneddocumentintosearchabletextandtherenderingofitstextsusceptibletocopyingforpastingintoanewfile.Followingthescanningofagivendocument,OCRsoftwareevaluatesthescanneddataforshapesitrecognizesaslettersornumerals.OCRtechnologyreliesuponthequalityoftheprintedcopyandtheconversionaccuracyofthesoftware.Generallyacknowledgedtobeonly80-85percentaccurate.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingRenewData'sElectronicEvidenceReferenceChart,http://www.renewdata.com/wall-chart-signup.html
Amethodoftranslatingprintedtextandimagesintoaformthatacomputercanmanipulate(intoASCIIcodes,forexample).AnOCRsystemenablesyoutoscanaprinteddocumentdirectlyintoacomputerfile.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 218
Amethodofscanningprintedmaterialandconvertingitintoanelectronicfile,suchasaword-processingfile,whichcanthenbesearchedforspecificwordsorphrases.OCRisdistinguishablefrom“imaging”inthatitrecognizesonlyalphanumericcharactersandnothandwrittenorothergraphicmaterial.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Softwarethat,inconjunctionwithascanner,isableto“recognize”writtentextandconvertittoanASCIIfileorimportitintoawordprocessorsomayperformoneofthefulltextsearches.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thecomputerconversionofscannedinputimages(barcodesorpatternsofbits)tocomputerrecognizablecodes(ASCIIletters,numbersandcharacters).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Opticalcharacterrecognitionisatechnologywhichtakesdatafromapaperdocumentandturnsiteditabletextdata.Thedocumentisfirstscanned.ThenOCRsoftwaresearchesthedocumentforletters,numbers,andothercharacters.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Whenapaperdocumentisscannedintoacomputer,animageiscreated.ThecomputerdoesnotrecognizethecharactersofthedocumentastextuntilOCRsoftwareconvertstheimageintotext.OCRsystemsvarywidelyintheaccuracyoftheirconversion.Evenseeminglyhighaccuracyratescan,however,stillresultinsignificantnumbersofwordsbeingmisrepresented.A99%accuracy,forexample,wouldstillresultinonewordoutof20beingmisspelled.
Seealso:
DirtyOCR ICR Patternrecognition
ODBC(OpenDatabaseConnectivity)
AnapplicationinterfacefromMicrosoftthatprovidesacommonlanguagebetweenapplicationsanddatabasesonanetwork.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
AdatabaseprogramminginterfacethatprovidesacommonlanguageforWindowsapplicationstoaccessdatabasesonanetwork.
OEM(OriginalEquipmentManufacturer)
Classically,acompanywhobuysproductsfromanothercompany,re-labelstheproductsunderitsownnameandre-sells(usuallyinlargequantities).Hascometodefinenearlyanylargecustomerwhore-sellsproducts,brandedornot.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 219
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Offline
Whencomputersandotherdevicesarenotconnectedtothenetwork.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Archivaldocumentsstoredonopticaldisksorcompactdisksthatarenotconnectedorinstalledinthecomputer,butinsteadrequirehumaninterventiontobeaccessed.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Notconnected(toanetwork).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
OfflineStorage
Thestorageofelectronicdataoutsidethenetworkindailyuse(i.e.,onbackuptapes)thatisonlyaccessiblethroughtheoff-linestoragesystem,notthenetwork.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
OLE
See: ObjectLinkingandEmbedding(OLE)
On-SiteExtraction
Theextractionofhighvolumesofdatafrombackuptapesataclientsite.
Source: RenewData,Glossary(10/5/2005).
One-TailedTest
Inhypothesistesting,wecanbeinterestedinadeviationineitherdirectionorinonlyonedirection.Ifweareinterestedineitherdirection(onescoreisdifferentfromanother),weuseatwo-tailedtest.Ifweareinterestedinonlyonedirection(onescoreislessthananother),andwedon'tcareifitisgreater,thenweuseaone-tailedtest.Forexample,ifwewanttoknowwhetherapredictivecodingsystemhasperformedbetterthanchance,thenwecanuseaone-tailedtest.Wedon'tcareifthepredictivecodingsystemisworsethanchance(thatwouldnotbeparticularlyuseful),onlyifitisbetter.Confidenceintervalscanbeone-sidedortwo-sidedaswell.Thetailreferstotheyellowregionsinthefigure.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 220
Agraphofthenormaldistribution.Theconfidenceintervalisinthemiddleinwhite.The"tails"areshowninyellow.The95%confidenceintervalrepresents95%oftheareaunderthecurve.Inatwo-taileddistribution,this95%areaissymmetricallyalignedaroundtheaverageofthedistribution.Imagefromhttps://en.wikipedia.org/wiki/One-_and_two-tailed_tests
Source: HerbRoitblat,PredictiveCodingGlossary.
Online
Whencomputersandotherdevicesareconnectedtothenetwork.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
TheconditionofacomputerbeingconnectedtoacomputerizedinformationsystemsuchasLexis.OftenreferstobeingconnectedtotheInternet.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Documentsstoredontheharddriveormagneticdiskofacomputerthatareavailableimmediately.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Connected(toanetwork).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
OnlineStorage
Thestorageofelectronicdataasfullyaccessibleinformationindailyuseonthenetworkorelsewhere.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
OnlineSummary
Adigestorsummaryofadocumentcreateddirectlyfromthecomputerscreenbyreadingthedocumentandusingthecutandpastefunctiontomoveexcerptstoaseparatefile.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Ontology
ArepresentationoftherelationshipsamongwordsandtheirmeaningsthatisricherthanaTaxonomy.Forexample,anOntologycanrepresentthefactthatawheelisapartofabicycle,thatgoldisyellow,andsoon.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 221
Acategoricalorconceptualstructurethatmaynotbestrictlyhierarchical(cf.taxonomy).Conceptscanberelatedtooneanotherincomplexways.Forexample,anontologymayrepresentthatlawyers,paralegals,andjudgesareassociatedwithoneanother(oneisnotstrictlyasubsetoftheother).
Source: HerbRoitblat,Search2020:TheGlossary.
OpenDatabaseConnectivity
See: ODBC(OpenDatabaseConnectivity)
OperatingSystem(OS)
Softwarewhichdirectstheoverallactivityofacomputer(e.g.MS-DOS,Windows,Linux,etcetera).
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Themostimportantprogramthatrunsonacomputer.Everygeneral-purposecomputermusthaveanoperatingsystemtorunotherprograms.Operatingsystemsperformbasictasks,suchasrecognizinginputfromthekeyboard,sendingoutputtothedisplayscreen,keepingtrackoffilesanddirectoriesonthedisk,andcontrollingperipheraldevicessuchasdiskdrivesandprinters.Forlargesystems,theoperatingsystemhasevengreaterresponsibilitiesandpowers.Itislikeatrafficcop--itmakessurethatdifferentprogramsandusersrunningatthesametimedonotinterferewitheachother.Theoperatingsystemisalsoresponsibleforsecurity,ensuringthatunauthorizedusersdonotaccessthesystem.
Source: IbisConsulting,Glossary.
Softwarethatcontrolstheoperationofacomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thesoftwarethattherestofthesoftwaredependsontomakethecomputerfunctional.OnmostPCsthisisWindowsortheMacintoshOS.UnixandLinuxareotheroperatingsystemsoftenfoundinscientificandtechnicalenvironments.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
DOS
Linux
MicrosoftDOS
MicrosoftWindows
Networkoperatingsystem
NOS
UNIX
Windows
Xenix
©2016EDRMLLC
OpticalCharacterRecognition
See: OCR(OpticalCharacterRecognition)
OpticalDisk
Computermediasimilartoacompactdiscthatcannotberewritten.Anopticaldriveusesalasertoreadthestoreddata.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Storagemedia
WORMdisk
Zipdisk
OriginalEquipmentManufacturer
See: OEM(OriginalEquipmentManufacturer)
OS
See: OperatingSystem(OS)
OST
AnofflinestoragemailcontainerthatrequiresconversiontoPSTbeforeextraction,inordertobeprocessedasmailmessagesandtheirattachments.
Source: IbisConsulting,Glossary.
MicrosoftOutlookOfflinefileusedtosaveemails.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
©2016EDRMLLC
OtherNumberField
AdatafieldinadatabaseusedtocapturenumbersotherthantheprimaryBatesstampnumberthatappearsonthedocument.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Productionsource
Recipient
Subjectcategory
Summary
Text
Output
Thefolderorfilescreatedtocontaindatathatresultsfromagivenprocess.
Source: IbisConsulting,Glossary.
P
PackBits
AcompressionschemewhichoriginatedwiththeMacintosh.Suitableonlyforblack&white.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Packet
Afixedblockofdatatransmissionwhichalsocontainsidentityandroutinginformation.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Page
Asingleimageofa“onepieceofpaper.”Oneorseveralpagesmakeupa“document.”
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 224
PagesPerMinute(PPM)
Ameasurementofthethroughputspeedofascanner-howmanyletter-sizepagesthescannercanscaninoneminute.Beware:ppmcanbemisleading.
Source: RSI,Glossary.
PantoneMatchingSystem(PMS)
Acolorstandardinprinting.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PaperDiscovery
Paperdiscoveryreferstothediscoveryofwritingsonpaperthatcanbereadwithouttheaidofsomedevices.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
PaperStylesandDefinitions
1. AcidFreePaper–Won'tchangecolor(yellow)formanyyears.2. Brightness–Thepercentageoflightthepaperreflects.Mostwhitepapersreflect60%
to90%.3. CoatedPapers–"glossy"paper,coatedwithclay.4. Cotton"Rag"Paper–Premiumpaperwith25%to100%cottonfibers.5. Laidfinish–Papersurfaceembossedwithlinestoresemblehandmadepaper.6. Ream–500sheets.7. Vellumfinish–Alesssmoothversionofrealvellum(fineparchment).8. Wovefinish–Verysmoothsurface.Characteristicofthemajorityofpapersmade.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Parallel
Referstomultipledatabitsstoredortransmittedsimultaneously.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Transmissionofallthebits(e.g.inacharacter)atthesametime.Ifthecharacterhaseightbits,thereareeightwires.Fasterandmoreexpensivethanserialwheretheeightbitswouldbesent,"sideways",oneatatime.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 225
ParallelPort
Aparallelportisusedforprintingbecauseitisfasterthanaserialport.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ParallelTrial
AnExperimentalDesignforcomparingtwosearchorreviewprocessesusingthesameDocumentCollectionandInformationNeed,inwhichbothprocessesareappliedconcurrentlybutindependently,andthentheresultsofthetwoeffortsarecompared.(Cf.CrossoverTrial.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ParametricSearch
Parameterizedsearchallowssearchingtobebasednotonkeywordsbutoncertainparameters,suchasadocument’smetadata.Parameterizedsearchisalsoknownasfieldedsearch,becauseitisfrequentlyperformedondatastoredwithinthefieldsofadatabasetable.ExamplesincludeDateRange,Metadata,Custodian,restrictionsorpromotionsbasedondocumenttags/reviewcalls.
Source: EDRMSearchGlossary.
ParentDocument
Theprimarydocumentinasetofrelateddocuments,suchasafaxcoversheetoratransmittalletter.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Parent-ChildRelationship
Inanytaxonomy,thesuperiorcategorycanbecalledaparent,anditssubcategoriescanbecalledchildren.Anemailcanbeconsidered,forexample,tobeaparentofanyofitsattachments.Conversely,anattachmentcanbeconsideredtobeachildoftheemailtowhichitisattached.
PasswordProtection
Theuseofpersonalandconfidentialidentificationtoallowindividualusersaccesstoacomputersystemorspecificprograms.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 226
Path
Therouteofdirectoriesthroughwhichacomputersearchestofindaparticularfile.Thepathnameisthefullfilename,includingthenameofthedirectoryonwhichthefileisstored.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PatternMatching
ThescienceofdesigningcomputerAlgorithmstorecognizenaturalphenomenalikepartsofspeech,faces,orspokenwords.
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
PatternRecognition
Anelectronicapplicationutilizinganalgorithmthatsearchesdataforlikepatternsandflagsorextractsthepertinentdata.Forinstance,inlookingforaddresses,alphacharactersfollowedbyacommaandaspacefollowedbytwocapitalalphacharactersfollowedbyaspacefollowedbyfiveormoredigitsareusuallythecity,stateandzipcode.Byprogrammingtheapplicationtolookforthatpattern,theinformationcanbeelectronicallyextractedratherthanre-keyedbyhumanintervention.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
DirtyOCR
ICR
OCR
OpticalCharacterRecognition
PB(Petabyte)
Apetabyteisameasureofcomputerdatastoragecapacityandisonethousandmillionmillion(1,000,000,000,000,000)bytes.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Bit
Byte
KB-kilobyte
MB-megabyte
GB-gigabyte
TB-terabyte
EB-exabyte
©2016EDRMLLC
PC(PersonalComputer)
TechnicallyacomputerthatconformstothePCstandardsetbyIBM,thePCnowreferstoanydesktopcomputerotherthanaterminalonaUnixsystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PCI(PeripheralComponentInterconnect)
Ahigh-speedinterconnectlocalbususedtosupportmultimediadevices.PromotedbyDigitalamongothers.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PCMCIA(PersonalComputerMemoryCardInternationalAssociation)
Plug-incardsforcomputers(usuallyportables),whichextendthestorageand/orfunctionality.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PCX(PersonalComputereXchange)
ThefileformatusedfordrawingsbyCorelPaintandWindowsPaintBrush.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PDA(PersonalDigitalAssistant)
Anysmallhandheldwirelessdevicethatprovidescomputinganddatastorageabilities.ExamplesofPDAsincludethePalmPilotandtheBlackBerry.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Ahand-heldmicrocomputerthatfunctionslikeanelectronicrolodexandoftenconnectstoalargercomputerforsharingortransferringinformation.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Asmall,usuallyhand-held,computerwhich"assists"businesstasks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PDF(PortableDocumentFormat)
AproprietaryformatofAdobeCorporation,ithasbecomeadefactostandardfortransmittingdocumentsthatthesenderdoesnotwanttobealteredandfortransmittingdocumentstocommercialprintersandtotheWebforonlinepublishing.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 228
Source: RenewData,Glossary(10/5/2005).
AfileformatdevelopedbyAdobeSystems.PDFcapturesformattinginformationfromavarietyofdesktoppublishingapplications,makingitpossibletosendformatteddocumentsandhavethemappearontherecipient'smonitororprinterastheywereintended.ToviewafileinPDFformat,youneedAdobeAcrobatReader,afreeapplicationdistributedbyAdobeSystems.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Afilestandardfordocumentsthatcanbeprocessed(generallyviewedandprinted)byanycomputer,regardlessofthespecificapplicationprogramwhichcreatedtheoriginal.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AnAdobetechnologyforformattingdocumentssothattheycanbeviewedandprintedusingtheAdobeAcrobatreader.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
PDF’scanbereadusingAdobeAcrobatReader(afreeprogram),regardlessoftheprogramusedtocreatetheoriginaldocument.APDFdocumentcancontaintext,images,orboth.OnlyPDFscontainingtextcanbesearcheddirectly.ThosecontainingimagesonlymustbeOCRed.
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PNG
Portablenetworkgraphic
SearchableTIFF
Single-pageTIFF
TIFF
Peer-to-PeerNetwork
Anetworkofcomputersconfiguredtoallowcertainfilesandfolderstobesharedwitheveryoneorwithselectedusers.Peer-to-peernetworksarequitecommoninsmallofficesthatdonotuseadedicatedfileserver.AllclientversionsofWindows,MacandLinuxcanfunctionasnodesinapeer-to-peernetworkandallowtheirfilestobeshared.
Source: PCMag,Definitionof:peer-to-peernetwork,http://www.pcmag.com/encyclopedia/term/49056/peer-to-peer-network
Seealso:
Client/servernetwork
LAN-localareanetwork
MAN-metropolitanareanetwork
Network
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 229
SAN-storageareanetwork
Standalonecomputer
WAN-wideareanetwork
Peripheral
Anyhardwaredevicethatinterfaceswithacomputer,suchasaprinter,anexternalmodem,orascanner.Interfacingmaytakeplacethroughthecomputer’sparallelandserialportsorthroughaspecificinterfacecard.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PeripheralComponentInterconnect
See: PCI(PeripheralComponentInterconnect)
PersonalComputer
See: PC(PersonalComputer)
PersonalComputereXchange
See: PCX(PersonalComputereXchange)
PersonalComputerMemoryCardInternationalAssociation
See: PCMCIA(PersonalComputerMemoryCardInternationalAssociation)
PersonalDigitalAssistant
See: PDA(PersonalDigitalAssistant)
PersonalInformationManager(PIM)
SoftwarethatperformsthefunctionsofRolodex.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PersonalStorageFile(PST)
TherearetwotypesofOutlookDataFilesusedbyOutlook.AnOutlookDataFile(.pst)isusedformostaccounts....OutlookDataFiles(.pst)areusedforPOP3,IMAP,andweb-basedmailaccounts.WhenyouwanttocreatearchivesorbackupyourOutlookfoldersanditemsonyourcomputer,suchasExchangeaccounts,youmustcreateanduseadditional.pstfiles....APersonalFoldersfile(.pst)isanOutlookdatafilethatstoresyourmessagesandotheritemsonyourcomputer.ThisisthemostcommonfileinwhichinformationinOutlookissavedbyhomeusersorinsmallorganizations....
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 230
Source: IntroductiontoOutlookDataFiles(.pstand.ost),http://office.microsoft.com/en-us/outlook-help/introduction-to-outlook-data-files-pst-and-ost-HA010354876.aspx.
InMicrosoftOutlook,thePersonalFoldersfile(.pst)isadatafilethatstoresallofauser'smessagesandotheritemsonhis/hercomputer.AnOutlookusercancreateoneormore.pst'stoorganizeandbackupitemsforsafekeeping.Evenwhenane-mailsystemisbeingrunonaMicrosoftExchangeServer,Outlookdatacanbebackeduptoa.pstfilestoredeitherlocallyonaharddriveoronanetworkdrive--ratherthanonthee-mailserver.Each.pstfilecontainsallofone'sOutlookfolders,includingtheInbox,Calendar,andContacts.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingMicrosoftOfficeOnline,http://office.microsoft.com/en-us/assistance/HA010875321033.aspx
TheplacewhereOutlookstoresitsdata(whenOutlookisusedwithoutMicrosoft®ExchangeServer).APSTfileiscreatedwhenamailaccountissetup.AdditionalPSTfilescanbecreatedforbackingupandarchivingOutlookfolders,messages,formsandfiles.ThefileextensiongiventoPSTfilesis.pst.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
AMSOutlookmailcontainerthatrequiresextractioninordertobeprocessedasmailmessagesandtheirattachments.
Source: IbisConsulting,Glossary.
AfileusedinOutlooktosaveacollectionofemails.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
Petabyte
See: PB(Petabyte)
PhaseChange
Amethodofstoringinformationonrewritableopticaldisks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 231
Phases(Stages)
Distinctsegmentsofthee-discoveryprocesswhichcontainmeasurableactivitiesthatcanbetrackedaccordingtovolume,costandtime.ThesesegmentscorrespondtotheL600codesfor:Identification(L600),Preservation(L610),Collection(L620),Processing(L630),Review(L650),Analysis(L660),Production(L670),Presentation(L680)andProjectManagement(L690).
Source: EDRMMetricsGlossary
PhonicSearch
Phonicsearchinglooksforawordthatsoundslikethewordyouaresearchingforandbeginswiththesameletter.
Source: dtSearchSupport,PhonicSearching,https://support.dtsearch.com/webhelp/dtsearch/phonic_s.htm
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
PhraseSearch
Asearchconsistingofmultiplekeywordsseparatedbyspacestoformasinglephrase.Foradocumenttomatchthissearch,theentirephraseasenteredmustbecontainedwithinthedocument.
Source: EDRMSearchGuideGlossary.
Thesearchphrase“MassachusettsMutual”wouldlocatetextwherethewordsaresidebyside.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch Adaptivepatternrecognition
Associativeretrieval
Booleansearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 232
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
PhysicalTarget
Whentheforensicimagingprocesstargetstheentirephysicaldriveordatastoragemedia.
Source: EDRMCollectionStandards
PhysicalUnitization
Theassemblyofindividuallyscannedpagesintodocuments:
• Physicalunitizationutilizesactualobjectssuchasstaples,paperclipsandfolderstodeterminepagesthatbelongtogetherasdocumentsforarchivalandretrievalpurposes.
• Logicalunitizationistheprocessofhumanreviewofeachindividualpageinanimagecollectionusinglogicalcuestodeterminepagesthatbelongtogetherasdocuments.Suchcuescanbeconsecutivepagenumbering,reporttitles,similarheadersandfootersandotherlogicalcues.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Pica
Onesixth(1/6)ofaninch.Usedtomeasuregraphics/fonts.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PICT(PictureFormat)
AcolorfileformatexclusivelyforMacintosh.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 233
PictureElement(Pixel)
Primaryunitofcoloronacomputermonitororinanelectronicimage.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Thebasicbuildingblockofallimages--asimpledot.Inbitonalimages,itismerelyablackorwhitedot(see"Bitonal"definitionabove).Ingreyscaleimages,dotswillhavebetween1-to-256possiblevaluesofgrey(foran8-bitgreyscaleimage).
Source: RSI,Glossary.
Adot.Onestep/addressablepositioninthetotalTVorCRTpresentation.TheminimumVGAdisplayhas307,200pixels(640by480).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Apictureelement.Apixelisthesmallestdotonthescreenofacomputerdisplay.Screenresolutionisusuallymeasuredinthenumberofpixelshorizontallyandverticallythatthescreencandisplay(rangingfrom640×480to1280×1024,orhigher).Generally,thehighertheresolution,thatis,themorepixelsintheimage,theclearertheimagewillappear.
PictureFormat
See: PICT(PictureFormat)
PieceofMedia(POM)
Oneunitofphysicalmedia(tapes,ZIP/Jazdisks,DLT,HDD,floppydisks,FTP’edmaterialore-mailedbundlesoffiles,etc.).
Source: IbisConsulting,Glossary.
PIM
See: PersonalInformationManager(PIM)
Pitch
Characters(ordots)perinch,measuredhorizontally.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PivotTable
APivotTablereportisaninteractivetable,suchasthatfoundinExcelthatcanbeusedtosummarizedata.Forexample,inanexpensereport,youcanuseittosumallofthemoneyspentformeals,oryoucansummarizehowmuchwasspenteachday.Thesetablesareinteractive,onecanrotateitsrowsandcolumnstoseedifferentsummariesofthesourcedata,filterthedatabydisplayingdifferentpages,ordisplaythedetailsforareasofinterest.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 234
Pixel
See: PictureElement(Pixel)
Plaintext
Theleastformattedandthereforemostportableformoftextforcomputerizeddocuments.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Platform
Theunderlyinghardwareorsoftwareforasystem.Forexample,theplatformmightbeanIntel80486processorrunningDOSVersion6.0.TheplatformcouldalsobeUNIXmachinesonanEthernetnetwork.Theplatformdefinesastandardaroundwhichasystemcanbedeveloped.Oncetheplatformhasbeendefined,softwaredeveloperscanproduceappropriatesoftware-andmanagerscanpurchaseappropriatehardwareandapplications.
Source: IbisConsulting,Glossary.
Platform=operatingsystem(orfamilyofoperatingsystems)onthesoftwareside.Thetermcross-platformreferstoapplications,formats,ordevicesthatworkondifferentplatforms.Forexample,across-platformprogrammingenvironmentenablesaprogrammertodevelopprogramsformanyplatformsatonce.
Plug-in
AprogramthatenablesaWebbrowsertopresentnon-HTMLdocuments,suchasAdobeAcrobatdocumentsorsoundandvideoprograms.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
PMS
See: PantoneMatchingSystem(PMS)
PNG(PortableNetworkGraphic)
PortableNetworkGraphics(PNG/ˈpɪŋ/)isarastergraphicsfileformatthatsupportslosslessdatacompression.PNGwascreatedasanimproved,non-patentedreplacementforGraphicsInterchangeFormat(GIF),andisthemostusedlosslessimagecompressionformatontheInternet.
Source: Wikipedia,PortableNetworkGraphics,https://en.wikipedia.org/wiki/Portable_Network_Graphics
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PortableDocumentFormat
SearchableTIFF
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 235
Single-pageTIFF TIFF
POD(PrintOnDemand)
Documentimagesarestoredinelectronicformatandareavailabletobequicklyprintedandintheexactquantityrequired,longorshortruns.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PointEstimate
ThemostlikelyvalueforaPopulationcharacteristic.WhencombinedwithaMarginofError(orConfidenceInterval)andaConfidenceLevel,itreflectsaStatisticalEstimate.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
PointtoPointProtocol(PPP)
Astandardforconnectingtwocomputersfortransferringdata.
Pointer
Apointerisanindexentryinthedirectoryofadisk(orotherstoragemedium)thatidentifiesthespaceonthediscinwhichanelectronicdocumentorpieceofelectronicdataresides,therebypreventingthatspacefrombeingoverwrittenbyotherdata.Inmostcases,whenanelectronicdocumentis“deleted,”thepointerisdeleted,whichallowsthedocumenttobeoverwritten,butthedocumentisnotactuallyerased.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Anindexentryinthedirectoryofaharddiskthatidentifiesthespaceonthediskwhereaspecificfileislocated.Whenafileis“deleted,”itisactuallythepointerwhichiserasedandnotthefileitself.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PolicyIntegration
Aformalizedcommonsetofgoalsandrulesthatpromotecross-functionalcommunication,collaboration,andoptimization.Informationgovernanceeffortscanbecrippledbyfailuretointegratepolicy.
Source: IGRMWhitePaper
Polysemy
Asinglewordorexpressionhavingmultiplemeanings.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 236
Source: EDRMSearchGlossary.
POM
See: PieceofMedia(POM)
Population
Theuniverseofthingsaboutwhichwearetryingtoinferwithoursamples.Forexample,thepopulationmaybethesetofdocumentsthatwewanttoclassifyasputativelyresponsiveorputativelynon-responsive.Thegroupfromwhichwepulloursamples.Alsocalledthesamplingframe.
Source: HerbRoitblat,PredictiveCodingGlossary.
Port
Aninterfaceforconnectingperipheralswiththecomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thepartofthecomputerthroughwhichaperipheraldevicemaycommunicate,oftenaspecifictypeofplug.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Portability
Theabilitytotransportacomputeranddatafromonelocationtoanother.Typicallyafeatureoflaptopornotebookcomputers,butalsoafeatureofportabledrivesortapesystems.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PortableDocumentFormat
See: PDF(PortableDocumentFormat)
PortableDrive
Anexternaldiskdrivethatispluggedintoaportonacomputer,typicallyaUSBorFireWireport.Typicallyusedforbackup,butalsoassecondarystorage.Suchunitsrivalinternaldrivesincapacity.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingTechWebTechEncyclopedia,http://www.techweb.com/encyclopedia/defineterm.jhtml?term=portableharddrive.
Seealso:
©2016EDRMLLC
Diskdrive
Floppydiskdrive
Jazdrive
Magneto-opticaldrive
Storagedevice
Tapedrive
Zipdrive
PortableNetworkGraphic
See: PNG(PortableNetworkGraphic)
PortableVolume
Afeaturethatfacilitatesthemovingoflargevolumesofdocumentswithoutrequiringcopyingmultiplefiles.PortablevolumesenableindividualCDstobeeasilyregrouped,detachedandreattachedtodifferentdatabasesforabroaderinformationexchange.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Portal
Awebsitewhichgivesentrytomultipleothersitesandservices.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
PortraitMode
Adisplaywheretheheightexceedsthewidth.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
PortraitOrientation
Animageregisteredsothatitistallerthanitiswide,withthenarrowedgerunningalongtopandbottom.Whenscanning,orientationisdeterminedbytheleadingedgeofthedocument.
Source: RSI,Glossary.
Seealso:
Landscapeorientation
PositiveAgreement
TheProbabilitythat,ifonereviewerCodesaDocumentasRelevant,asecondindependentreviewerwillalsoCodetheDocumentasRelevant.EmpiricalstudiesshowthatPositiveAgreementratesof70%aretypical,andPositiveAgreementratesof80%arerare.PositiveAgreementshouldnotbeconfusedwithAgreement(whichisalessinformativemeasure)orOverlap(whichisanumericallysmallermeasurethatconveyssimilarinformation).Undertheassumptionthatthetworeviewersareequallylikelytoerr,Overlapisroughlyequaltothe
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 238
squareofPositiveAgreement.Thatis,ifPositiveAgreementis70%,Overlapisroughly70%x70%=49%.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
PositivePredictiveValue(PPV)
SeePrecision.PositivePredictiveValueisatermusedinSignalDetectionTheory;PrecisionistheequivalentterminInformationRetrieval.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
PPM
See: PagesPerMinute(PPM)
PPP
See: PointtoPointProtocol(PPP)
PPV
See: PositivePredictiveValue(PPV)
PracticeDirection
PracticeDirections:theseareofficialadjunctstotheCPRandprovidemandatedguidanceforpractitionersinconductinglitigation.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
PracticeManagementSystem
AlsoknownasCaseManagementSystem(CMS).Suchsystemsmayincludefeaturessuchascalendar/docket,conflict-checking,documentassembly,andmaintenanceofdatabasesofclientandcaseinformation.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Precision
Precisionmeasuresthenumberoftrulyresponsivedocumentsintheretrievedsetofresponsivedocuments.Seealso,Recall.
Source: EDRMSearchGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 239
ThefractionofDocumentsidentifiedasRelevantbyasearchorrevieweffort,thatareinfactRelevant.AlsoreferredtoasPositivePredictiveValue.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Precisionistheproportionofretrieveddocumentsthatareresponsive.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
Recall
Precision-RecallCurve
ThecurverepresentingthetradeoffbetweenPrecisionandRecallforagivensearchorrevieweffort,dependingonthechosenCutoffvalue.SeeRecall-PrecisionCurve.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Precision-RecallTradeoff
ThenotionthatmostsearchstrategiescanbeadjustedtoincreasePrecisionattheexpenseofRecall,orviceversa.Atoneextreme,100%RecallcouldbeachievedbyasearchthatreturnedtheentireDocumentPopulation,butPrecisionwouldbelow(equaltoPrevalence).Attheotherextreme,100%PrecisioncouldbeachievedbyasearchthatreturnedasingleRelevantDocument,butRecallwouldbelow(equalto1/N,whereNisthenumberofRelevantDocumentsintheDocumentPopulation).Moregenerally,abroadersearchreturningmanyDocumentswillhavehigherRecallandlowerPrecision,whileanarrowersearchreturningfewerDocumentswillhavelowerRecallandhigherPrecision.APrecision-RecallCurveillustratesthePrecision-RecallTradeoffforaparticularsearchmethod.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
PredictiveCoding
Anindustry-specifictermgenerallyusedtodescribeaTechnology-AssistedReviewprocessinvolvingtheuseofaMachineLearningAlgorithmtodistinguishRelevantfromNon-RelevantDocuments,basedonaSubjectMatterExpert'sCodingofaTrainingSetofDocuments.SeeSupervisedLearningandActiveLearning.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 240
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Agroupofmachinelearningtechnologiesthatpredictwhichdocumentsareandarenotresponsivebasedonthedecisionsappliedbyasubjectmatterexperttoasmallsampleofdocuments.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
CAR TAR
PresentationPhase
DisplayingESIbeforeaudiences(atdepositions,hearings,trials,etc.),especiallyinnativeandnear-nativeforms,toelicitfurtherinformation,validateexistingfactsorpositions,orpersuadeandaudience.
Source; EDRMStages
CorrespondstoUTBMSCodeL680.ActivitiesandactionstoprepareanddisplayESIbeforeaudiences(atdepositions,hearings,trials,etc.),especiallyinnativeandnear-nativeforms,toelicitfurtherinformation,validateexistingfactsorpositions,orpersuadeanaudience.
Source: EDRMMetricsGlossary
PreservationPhase
EnsuringthatESIisprotectedagainstinappropriatealterationordestruction.
Source: EDRMStages
CorrespondstoUTBMSCodesL610-L619.PreservationOrder,LegalHold,QualityAssuranceandControl.
Source: EDRMMetricsGlossary
Prevalence
ThefractionofDocumentsinaPopulationthatareRelevanttoanInformationNeed.AlsoreferredtoasRichnessorYield.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Therichnessorproportionofresponsivedocumentsinacollection.Morebroadly,theprevalencereferstotheproportionofonekindofiteminapopulationofitems.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 241
PrincipalComponentAnalysis
Amathematicaltechniquethatsummarizesthecorrelationbetweenitems.OneofthetechniquesusedineDiscoveryasthebasisforconceptsearch,wheretheitemsarewords.
Source: HerbRoitblat,Search2020:TheGlossary.
PrintOnDemand
See: POD(PrintOnDemand)
PrivateNetwork
AnetworkthatisconnectedtotheInternetbutisisolatedfromtheInternet.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Privilege
Aspecialandexclusivelegaladvantageorright.Examplesincludeattorneyworkproductandcertaincommunicationsbetweenanindividualandhisorherattorney,whichareprotectedfromdisclosure.
PrivilegeLog
Arecordoftheresponsiveand/orrelevantdocumentsthatarebeingwithheldfromproductiononaclaimthattheyeithercontainattorney-clientcommunicationorareattorneywork-product.Thoughthereisnotstandardruledescribingthenecessarycontentforaprivilegelog,theFederalRulesofCivilProcedurecontainageneralrequirementthataprivilegelog“describethenature”oftheprivilegeddocumentinamannerthat“willenableotherpartiestoassesstheclaim.”Fed.R.Civ.P.26(b)(5)(A).
Source: EDRMMetricsGlossary
AlistofasetofdocumentsthataProducingPartydidnotproduceonaccountofPrivilegesuchasAttorney-ClientPrivilege.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
PrivilegedDocuments
AsetofdocumentsthataProducingPartyisnotrequiredtoprovide,sincetheyfallintoPrivilegesuchasAttorney-ClientPrivilege.TheexistenceofsuchdocumentsshouldberecordedinthePrivilegeLog.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 242
ProbabilisticLatentSemanticAnalysis
AvariantofLatentSemanticAnalysisbasedonconditionalProbabilityratherthanoncorrelation.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Astatisticalprocedureforfindingtheunderlyingdimensionsofcorrelatedterms.LikeLatentSemanticAnalysis,thisprocedureattemptstocapturethemeaningsharedbymultipletermstoprovideaconceptsearchcapability.ItdifferssomefromLSAinthatitinvolvesadifferentstatisticalmodel.Alsocalledprobabilisticlatentsemanticindexing.
Source: HerbRoitblat,PredictiveCodingGlossary.
ProbabilisticModel
Aclassofmathematicalmodelsthataredescribedinthelanguageofprobabilitywithoutnecessarilyinvolvingrandomness.Forexample,ifwefindthatthreetimesoutoffour,whenacertainwordisusedinadocument,thedocumentisresponsive,thenaprobabilisticmodelwillincludeanestimatethattheprobabilitythatadocumentcontainingthatword(allotherthingsbeingequal)ismoreprobablyresponsive(75%)thannon-responsive(25%).
Source: HerbRoitblat,Search2020:TheGlossary.
Probability
Thefraction(ThefractionofasetofDocumentshavingsomeparticularproperty(typicallyRelevance).)oftimesthataparticularoutcomewouldoccur,shouldthesameactionberepeatedunderthesameconditionsaninfinitenumberoftimes.Forexample,ifoneweretoflipafaircoin,theProbabilityofitlanding“heads”isone-half,or50%;asonerepeatsthisactionindefinitely,thefractionoftimesthatthecoinlands“heads”willbecomeindistinguishablefrom50%.Ifoneweretofliptwofaircoins,theProbabilityofbothlanding“heads”isone-quarter,or25%.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ProcessTransparency
hesharedownershipandexecutionofInformationGovernanceprocessesensuringthataccountabilitiesanddependenciesacrossthestakeholdersareclearlydefinedbyeachgrouptopromoteefficientandeffectivemanagementofinformation.
Source: IGRMWhitePaper
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 243
ProcessingPhase
ReducingthevolumeofESIandconvertingit,ifnecessary,toformsmoresuitableforreviewandanalysis.
Source: EDRMStages
CorrespondstoUTBMSCodeL630-L639.ESIStage,PreparationandProcess,Scanning-HardCopy,ForeignLanguageTranslation,ExceptionHandling,QualityAssuranceandControl.
Source: EDRMMetricsGlossary
ProducingParty
ApartythatownsthecompletecollectionofESI,andisresponsibleforproducingaportionoftheESIthatisdeemedtoberelevantforalegalcaseorlegalenquiry.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
Production
DeliveringESItoothersinappropriateforms&usingappropriatedeliverymechanisms.
Source: EDRMStages.
Deliveryofdataorinformationinresponsetoaninterrogatory,subpoenaordiscoveryorderorasimilarlegalprocess.
Source: RenewData,Glossary(10/5/2005).
ProductionDe-Duplication
Cullingofadocumentifmultiplecopiesofthatdocumentresidewithinthesameproductionset.Forexample,iftwoidenticaldocumentsarebothmarkedresponsive,non-privileged,productionde-duplicationensuresthatonlyoneofthosedocumentsareproduced.Contrastwithcasede-duplicationandcustodiande-duplication.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
De-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
VerticalDeduplication
©2016EDRMLLC
ProductionPhase
DeliveringESItoothersinappropriateformsandusingappropriatedeliverymechanisms.
Source: EDRMStage
CorrespondstoUTBMSCodesL670-L679.ConversionofESItoProductionFormat,QualityAssuranceandControl.
Source: EDRMMetricsGlossary
ProductionSource
Adatafieldinadatabasethatrecordstheindividualorcompanythatproducedtheparticulardocument.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Recipient
Subjectcategory
Summary
Text
Program
Aseriesofinstructionstothecomputer.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thetermforasoftwareapplication.
Source; LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ProjectManagementPhase
CorrespondstoUTBMSCodeL690.ActivitiesoractionstoassociatedwithsupervisingormanagingspecificactivitiesoractionsthroughouttheEDRMcontinuumsuchasconductingmeetingsandteamcalls,developingworkplans,budgets,forecasts,reportsandother
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 245
meaningfulactivitiesorforgeneralprojectmanagementnotassociatedwithaparticular"L"code.
Source: EDRMMetricsGlossary
ProjectManager
Anindividualresponsibleforadministrationandsupervisionoveraparticulardatabaseorautomationproject.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Prompt
Adisplaythataskstheoperatortoperformaspecificaction.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
TheDOSprompt.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Proportion
ThefractionofasetofDocumentshavingsomeparticularproperty(typicallyRelevance).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Proportionality
PursuanttoFederalRulesofCivilProcedure26(b)(2)(B),26(b)(2)(C),26(g)(1)(B)(iii),andotherfederalandstateproceduralrules,thelegaldoctrinethatElectronicallyStoredInformationmaybewithheldfromproductionifthecostandburdenofproducingitexceedsitspotentialvaluetotheresolutionofthematter.Proportionalityhasbeeninterpretedinthecaselawtoapplytopreservationaswellasproduction.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TheoverridingobjectiveoftheCPRistoenablethecourttodealwithcasesjustly(CPR1).Specificallythisisstatedtoinclude"dealingwiththecaseinwayswhichareproportionate..."anditthengoesontolistfactorswhichneedtobeconsideredtodeterminewhatisproportionate.TakentogetherthesefactorsaregenerallyreferredtotogetherasProportionality.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 246
ProximitySearch
AProximitySearchsearchesformultiplekeywords.Thematchingdocumentsmustcontainallthekeywords,withthekeywordsoccurringwithinaspecifiednumberofwordsfromeachother.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
Retrievesawordonlywhenitoccurswithinaspecificnumberoflinesorwordsofanotherword.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
For"full-text"searches,theabilitytolookforwordswhicharewithinaprescribeddistanceofanotherword(e.g.find"glove"within15wordsof"baseball".)
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
PST
See: PersonalStorageFile(PST)
Q
QA&Control
Acommonelementwithineache-discoveryPhasewhichreferstodefinedsteps,proceduresandmethodstakentoensurethatworkisdonecompletely,accuratelyandinamannerwhichisconsistentwithexpectations,instructionsandbestpractices.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 247
Source: EDRMMetricsGlossary
QBIC
See: QueryByImageContent(QBIC)
QIC
See: QuarterInchCartridge(QIC)
Quad-Gram
AnN-GramwhereN=4(i.e.,a4-gram).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
QualityAssurance
Amethodtoensure,afterthefact,thatasearchorreviewefforthasachievedreasonableresults.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
QualityControl
Ongoingmethodstoensure,duringasearchorrevieweffort,thatreasonableresultsarebeingachieved.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Anyprocessusedtochecktheaccuracyandconsistencyofinformationcodedintoadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Theprocessofensuringthehighestlevelofresultsinagiventask.Indocumentmanagementprocesses,thisincludesimagequality(resolution,skew,speckle,legibility);dataquality(correctinformationinappropriatefields,validateddatafordates,addresses,names/issueslists).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
QualityControl/QualityAssurance
Processofvalidationduringpostselectionofdata;throughoutreview,pre-productiontoidentifyinconsistenciesindocumentproductions,totestforconflictingreviewcalls.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 248
Source: EDRMSearchGlossary.
QuarterInchCartridge(QIC)
Digitalrecordingtape,2000feetlong,withanuncompressedcapacityof5GB.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
Tape
Query
Aformalsearchcommandprovidedasinputtoasearchtool.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Askforinformationordata.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Arequestfordatasenttoadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Asearchrequestinadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Aqueryisarequesttoasearchengineorsimilarinformationretrievalsystem.Queriesmayconsistofkeywords,phrases,complexexpressions,orevenwholedocuments.
Source: HerbRoitblat,Search2020:TheGlossary.
QueryByImageContent(QBIC)
AnIBMsearchsystemforstoredimageswhichallowstheusertosketchanimageandthensearchtheimagesfilestofindthosewhichmostcloselymatch.Theusercanspecifycolorandtexture–suchassandybeachesorclouds.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 249
QueryExpansion
TheprocessofaddingSearchTermstoaQuerytoimproveRecall,oftenattheexpenseofdecreasedPrecision.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Aprocesswhereinaquerysubmittedbyauserismodifiedtoincludeadditionalterms.Theexpandedquerymayincludesynonymsoftheinitialquery,spellingalternatives,orotherrelatedwords.Queryexpansionisoneofthemethodsusedtosupportconceptsearch.
Source: HerbRoitblat,Search2020:TheGlossary.
R
R-Squared(R2)
Astatisticalmeasureindicatinghowgoodonetermisatpredictinganother.PerfectpredictionswouldresultinR2valuesof1.0.Ifonetermisuselessforpredictingtheother,thenR2wouldbe0.0.ThehighertheR2,thebettertheprediction.Moretechnically,R2measureshowwellvariabilityinonetermpredictsthevariabilityintheother.
Source: HerbRoitblat,PredictiveCodingGlossary.
R2
See: R-Squared(R2)
RAID(RedundantArrayofIndependentDisks)
ArraysorJukeboxesofCD-ROM'sorCD-R's.Therearefivecommonlyused,differentlevelsofdataprotection,RAID1throughRAID5,whicharetradeoffsofprotectionversusstoragecapacity.Theseinclude:
• Level0:Datawritteninblocksacrossmultipledriveswithoutanprotectiononfailures.• Level1:DiskMirroring.• Level3:Thedrivespindlesaresynchronizedsuchthattheheadsallseekatthesame
timeandarepositionedoverthesameread/writeareassimultaneously.Dataiswrittenonebitatatimewithparitytoaseparatedrive.Thusiftherewerefourdisksinthearrayandtherewasamegabyteofdatatotransferredat1MB/sec,theeffectiverateis4MB/sec.
• Level5:Writesdatainchunks(usuallysmallerblocks512bytesto2K)withtheparitystripedalongwiththedata.AchievesahigherI/Orate.
FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 250
RAM(RandomAccessMemory)
Thehardwareinsideacomputerthatretainsmemoryonashort-termbasisandstoresinformationwhiletheuserutilizesthecomputer.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Themainmemoryofthecomputer,whereactivesoftwareandtemporaryfilesarestoredandmostofthecomputer’sworkisperformed.DatastoredinRAMaretemporarilystoredandarelostwhenthecomputeristurnedoff.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Memorywhichcanbereadorwritteninanysectionwithoneinstructionsequence.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Theworkingmemoryofthecomputerintowhichapplicationprogramscanbeloadedandexecuted.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
DRAM Memory ROM
RANDStudy
A2012study(NicholasM.Pace&LauraZakaras,WheretheMoneyGoes:UnderstandingLitigantExpendituresforProducingElectronicDiscovery,RANDInstituteforCivilJustice(2012)),indicatingthatDocumentreviewaccountsfor73%ofElectronicDiscoverycosts,andconcludingthat“[t]heexponentialgrowthindigitalinformation,whichshowsnosignsofslowing,makesacomputer-categorizedreviewstrategy,suchaspredictivecoding,notonlyacost-effectivechoicebutperhapstheonlyreasonablewaytohandlemanylarge-scaleproductions.”
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Random
Unpredictable.Randomselectionmeansthateachitemhasanequalchanceofbeingselectedandthereisnosystematicbiastoselectoneitemratherthananother.Coinflipsarerandom.Knowingthatonecoinflipcameupheadsdoesnotchangethelikelihoodthatthenextcoinflipwillcomeupheads(thesecoinflipsaresaidtobeindependent).
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 251
RandomAccessMemory
See: RAM(RandomAccessMemory)
RandomSample/RandomSampling
AsubsetoftheDocumentPopulationselectedbyamethodthatisequallylikelytoselectanyDocumentfromtheDocumentPopulationforinclusionintheSample;theSampleresultingfromsuchaction.RandomSamplingisthebasisofStatisticalEstimation.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Thestatisticalprocessofchoosingobjectsrandomly,meaningthateachobjecthasanequalchanceofbeingselected.Randomsamplingcanbeusedtotrainpredictivecodingsystemsandtoevaluatetheirefficacy.
Source: HerbRoitblat,PredictiveCodingGlossary.
RangeSearch
Adatabasequerywithinacertainrangeofdatesordocumentnumbers.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Raster
Representsimagesbyahorizontalandverticalarrayofdotsorpixels.Amethodofrepresentinganimagewithagrid(or“map”)ofdotsorpixels.TypicalrasterfileformatsareGIF,JPEG,TIFF,PCX,BMP,etc.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 252
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
RAWImageFile
ARAWimagefileisabit-by-bitcopyofdataonadiskorvolume,withoutadditions,deletions,ormetadata.Originallyusedbydd,theRAWimageformatissupportedbymostcomputerforensicapplications.
Source: http://www.forensicswiki.org/wiki/Raw_Image_Format
RDBMS(RelationalDatabaseManagementSystem)
RelationalDatabaseManagementSystem.Thisisatechnicaltermfortheclassofsoftwareprogramsthatmanagedatausingarelationalschema,suchasMicrosoftSQLServerorOracle.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
ReadOnlyMemory(ROM)
Read-onlymemory(ROM)isatypeofnon-volatilememoryusedincomputersandotherelectronicdevices.DatastoredinROMcanonlybemodifiedslowly,withdifficulty,ornotatall,soitismainlyusedtostorefirmware(softwarethatiscloselytiedtospecifichardwareandunlikelytoneedfrequentupdates)orapplicationsoftwareinplug-incartridges.
Source: Wikipedia,Read-onlymemory,https://en.wikipedia.org/wiki/Read-only_memory
Seealso:
DRAM Memory RAM
Reboot
Tostopandstarttheoperatingsystemagain.Usuallydonewhenaproblemoccursorthecomputer“locksup”andisaccomplishedbypressingtheControl,Alternate,andDeletekeysatthesametime.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Recall
Recallmeasuresthenumberofresponsivedocumentsretrievedcomparedtothetotalnumberofresponsivedocumentsinthecorpus.Recallcannotbeabsoluteunlessalldocumentshavebeensearchedandallhavebeenreviewed.SinceRecallmeasurestheratioofresponsivedocumentsagainstthefullcorpus,thenumberofresponsivedocumentsinthecorpusisdifficulttodetermine.SeetheEDRMSearchGuideregardingprecision,recall,andsamplingformoreinformation.Seealso,Precision.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 253
Source: EDRMSearchGlossary.
ThefractionofRelevantDocumentsthatareidentifiedasRelevantbyasearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Theproportionofresponsivedocumentsintheentirecollectionthathavebeenretrieved.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
Precision
Recall-PrecisionCurve
ThecurverepresentingthetradeoffbetweenRecallandPrecisionforagivensearchorrevieweffort,dependingonthechosenCutoffvalue.
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
Recall-PrecisionGraph
Agraphthatshowsthetradeoffbetweenprecisionandrecall.Typically,thehighertherecalllevel,thelowertheprecisionlevel.Inordertogetmoreoftheresponsivedocuments,oneusuallyhastoacceptmoreirrelevantdocuments.
Source: HerbRoitblat,PredictiveCodingGlossary.
ReceiverOperatingCharacteristicCurve(ROC)
InSignalDetectionTheory,agraphofthetradeoffbetweenTruePositiveRateandFalsePositiveRate,astheCutoffisvaried.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Recipient
Adatafieldcontainingthenameoftheindividualorcompanywhoreceivedaspecificdocument.Alsocalled"addressee"field.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield Beginningdocumentnumber
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 254
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Subjectcategory
Summary
Text
Record
Information,regardlessofmediumorformat,thathasvaluetoanorganization.Collectivelythetermisusedtodescribebothdocumentsandelectronicallystoredinformation.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Acollectionofrelatedfieldsoritemsofdata,treatedasaunit.Forexample,eachlistinginaPersonalInformationManagerisarecord.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Anindividualiteminadocumentdatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
RecordLevelDeletion
Deletionistheprocesswherebydataisremovedfromactivefilesandotherdatastoragestructuresoncomputersandrenderedinaccessibleexceptusingspecialdatarecoverytoolsdesignedtorecoverdeleteddata.Deletionoccursinseverallevelsonmoderncomputersystems:
1. Fileleveldeletion:Deletiononthefilelevelrendersthefileinaccessibletotheoperatingsystemandnormalapplicationprogramsandmarksthespaceoccupiedbythefile'sdirectoryentryandcontentsasfreespace,availabletoreusefordatastorage.
2. Recordleveldeletion:Deletionontherecordleveloccurswhenadatastructure,likeadatabasetable,containsmultiplerecords;deletionatthislevelrenderstherecordinaccessibletothedatabasemanagementsystem(DBMS)andusuallymarksthespaceoccupiedbytherecordasavailableforreusebytheDBMS,althoughinsomecasesthespaceisneverreuseduntilthedatabaseiscompacted.Recordleveldeletionisalsocharacteristicofmanye-mailsystems.
3. Byteleveldeletion:Deletionatthebyteleveloccurswhentextorotherinformationisdeletedfromthefilecontent(suchasthedeletionoftextfromawordprocessingfile);suchdeletionmayrenderthedeleteddatainaccessibletotheapplicationintendedtobeusedinprocessingthefile,butmaynotactuallyremovethedatafromthefile's
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 255
contentuntilaprocesssuchascompactionorrewritingofthefilecausesthedeleteddatatobeoverwritten.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Deletionistheprocesswherebydataisremovedfromactivefilesandotherdatastoragestructuresoncomputersandrenderedinaccessibleexceptusingspecialdatarecoverytoolsdesignedtorecoverdeleteddata.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Removingactivefilesmakingthemunavailable.Specialdatarecoverytoolscanstillretrievethesefiles.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
RecordLifecycle
Thetimeperiodfromwhenarecordiscreateduntilitisdisposed.
Source; KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
RecordsCustodian
Arecordscustodianisanindividualresponsibleforthephysicalstorageandprotectionofrecordsthroughouttheirretentionperiod.Inthecontextofelectronicrecords,custodianshipmaynotbeadirectpartoftherecordsmanagementfunctioninallorganizations.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
RecordsManagement
RecordsManagementistheplanning,controlling,directing,organizing,training,promotingandothermanagerialactivitiesinvolvingthelifecycleofinformation,includingcreation.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Theprocessofmaintainingbusinessdocumentsorrecords.Arecordsmanagementplanincludespoliciesfordocumentretentionanddestruction.Recordsmanagementplansareoftendesignedbyacollaborationamonginformationtechnology,businessunits,andlegaldepartments.
RecordsRetentionPeriod
Thelengthoftimeagivenrecordsseriesmustbekept,expressedaseitheratimeperiod(i.e.,fouryears),aneventoraction(i.e.,audit),oracombination(i.e.,sixmonthsafteraudit).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 256
RecordsRetentionSchedule
Aplanforthemanagementofrecords,listingtypesofrecordsandhowlongtheyshouldbekept;thepurposeistoprovidecontinuingauthoritytodisposeofortransferrecordstohistoricalarchives.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Red,GreenandBlue(RGB)
Thethreeprimarycolorsintheadditivecolorfamilywhichcreateallthecomputercolorvideosignalsforacomputer'scolorterminal.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Redact
Aportionoftheimageisblackedoutintentionallytoconcealinformationfromthedocument.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Theprocessofremovingprivilegedinformationfromadocument.Thisisusuallyaccomplishedbyplacingablackareaovertheprivilegedtext.
ReducedInstructionSetChip(RISC)
Atypeofcomputerchipthatcombinesmanyinstructionsinordertospeedupprocessing.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
RedundantArrayofIndependentDisks(RAID)
See: RAID(RedundantArrayofIndependentDisks)
RefreshRate
HowmanytimesasecondandimageonaCRTorTVisupdated.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Region
Anareaofanimagefilethatisselectedforspecializedprocessing.Alsocalleda“zone.”
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 257
Registration
Liningupaformsimagetodeterminewhichfieldsarewhere.Also,enteringpagesintoascannersuchthattheyarecorrectlyread.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Registry
ThesystemconfigurationfilesusedbyMicrosoftWindowstostoresettingsaboutuserpreferences,installedsoftware,hardwareanddriversandothersettingsrequiredforWindowstoruncorrectly.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
RegularExpressions
Apatternthatdescribeswhatthesearchshouldreturnbasedonspecialcharactersaddedtothekeyword.Forexample,car*usesthecharacter*asawildcard,andtheresultingdocumentsshouldcontainwordsthatbeginwiththecharacters“car”,suchascar,cartoon,orcartography.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
RelatedWordSearch
Relatedwordssearchallowsalegalprofessionaltospecifyawordandotherwordsthataredeemedtoberelatedtoit.Typically,suchrelatedwordsaredeterminedaseitherpartofconceptsearchorbystatisticalco-occurrencewithotherwords.
Source: EDRMSearchGlossary.
RelationalDatabase
Arelationaldatabaseisacollectionofdataitemsorganizedasasetofformally-describedtablesfromwhichdatacanbeaccessedorreassembledinmanydifferentwayswithouthavingtoreorganizethedatabasetables.InventedbyE.F.CoddatIBMin1970.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingSearchDatabase.com,http://searchoracle.techtarget.com/sDefinition/0,,sid41_gci212885,00.html.
Adatabaseinwhichsomeitemsinonetypeofrecordrefertoitemsinanothertypeofrecord.Relationaldatabasesgenerallylinktogethertwoormoretablesorfilesfromdifferentdatabasesthroughacommonfieldorwithinranges,thusallowingsearchesofmultiplefields,suchasdates.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 258
Adatabasecontainingrecordsinfieldsthataresomehowconnectedor“related.”Thisallowssimultaneoussearchesofmultiplefields.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Astyleofdatastorageandaccesswherethedataarestoredintables.Eachrowcontainsonerecord,andeachcolumncontainsonevariableforthatrecord.Relationaldatabasesalsoallowreferences(relations)betweentables.SQL,structuredquerylanguageisthetypicalmethodusedtoaccesstheinformationinrelationaldatabases.
Source: HerbRoitblat,Search2020:TheGlossary.
Seealso:
Database
Flatfiledatabase
Fulltextdatabase
SQL
WAIS-wideareainformationserver
RelationalDatabaseManagementSystem
See: RDBMS(RelationalDatabaseManagementSystem)
Relevance/Relevant
InInformationRetrieval,aDocumentisconsideredRelevantifitmeetstheInformationNeedofthesearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
RelevanceFeedback
AnActiveLearningprocessinwhichtheDocumentswiththehighestlikelihoodofRelevancearecodedbyahuman,andaddedtotheTrainingSet.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Aclassofmachinelearningtechniqueswhereusersindicatetherelevanceofitemsthathavebeenretrievedforthemandthemachinelearnstherebytoimprovethequalityofitsrecommendations.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 259
RelevanceRanking
AsearchmethodinwhichtheresultsarerankedfromthemostlikelytotheleastlikelytobeRelevanttoanInformationNeed;theresultofsuchranking.GoogleWebSearchisanexampleofRelevanceRanking.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
RelevancyRank
Ameasurementofrelevancyofadocument,sothattheSearchHitswithinaSearchResultscanbeordered.Relevancymeasurementsofteninvolvecountingthenumberofoccurrencesofakeywordwithinadocument,aswellasnumberofdocumentsakeywordisfoundin.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
RemoteConnectivity
Theuseofacomputeroutsidetheuser’soffice.Commonlyassociatedwiththeuseofportablelaptopornotebookcomputers,butmayalsorefertotheabilitytoaccesscomputersfromotheroffices,fromthecourtroom,orfromtheclient’soffice.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Render
TheprocessofconvertingresponsivedocumentsintoastandardformattypicallyTIFFsorPDFs.Thesedocumentsarehistoricallydeliveredonpaper,thoughtheymaybeproducedasimagesorasoriginalelectronicfilesdependingonthecase,requestsoftheattorneys,etc.
Report
Theuseofacomputeroutsidetheuser’soffice.Commonlyassociatedwiththeuseofportablelaptopornotebookcomputers,butmayalsorefertotheabilitytoaccesscomputersfromotheroffices,fromthecourtroom,orfromtheclient’soffice.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Repository
Acentralizeddatabasestoredonacomputerthathousesspecificinformation.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 260
RepositoryforElectronicRecordsisadirectaccessdeviceonwhichtheelectronicrecordsandassociatedmetadataarestored.Sometimescalleda“recordsstore,”“onlinerepository”or“recordsarchive.”
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
RequestforAdmission
Inacivilaction,arequestforadmissionisadiscoverydevicethatallowsonepartytorequestthatanotherpartyadmitordenythetruthofastatementunderoath.Ifadmitted,thestatementisconsideredtobetrueforallpurposesofthecurrenttrial.Partiesmayalsousethisdiscoverydevicetorequestthatotherpartiesverifythatdocumentsaregenuine.
Source: LegalInformationInstituteWex,Requestsforadmission,https://www.law.cornell.edu/wex/requests_for_admission
Externallinks:
Rule36.RequestsforAdmission,https://www.law.cornell.edu/rules/frcp/rule_36
Seealso:
Discoveryrequest Documentrequest Interrogatory
RequestforComments(RFC)
Themeansbywhichinternetstandardsarecreatedandmodified.RFCsaredistributedbytechnicalexpertsactingontheirowninitiativeandreviewedbytheInternetatlarge,ratherthanformallypromulgatedthroughaninstitutionsuchasANSI.Forthisreason,theyremainknownasRFCsevenonceadoptedasstandards.
RequestforProductionofDocuments
See: DocumentRequest
RequestingParty
ApartythatdoesnotowntheESIandisrequestingthattheProducingPartywhichownstheESItoprovidesomesubsetoftheESIbasedonaSearchRequest.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
ResidualData
Sometimesreferredtoas"ambientdata,"referstodatathatisnotactiveonacomputersystem.Residualdataincludes:
Datafoundonmediafreespace;
Datafoundinfileslackspace;and
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 261
Datawithinfilesthathasfunctionallybeendeletedinthatitisnotvisibleusingtheapplicationwithwhichthefilewascreated,withoutuseofundeleteorspecialdatarecoverytechniques.
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Datathatisnotcurrentlyliveonthecomputersystem,includingdatafoundinfileslackspace,datafoundonmediafreespace,anddatafromdeletedfiles.Alsoknownas"ambientdata."
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Datathatisnotactiveonacomputersystemsuchasdatainmediafreespace,slackspaceorfilesthathavebeen“deleted”.Sometimescalled"ambientdata."
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Ambientdata
Fragmenteddata
Freespace
Slackspace
Swapfile
Unallocatedspace
Resolution
Indicatesthenumberofdots,oftenmeasuredindpi,thatmakeupanimageonascreenorprinter.Thelargerthenumberofdots,andthusthehighertheresolution,thefinerandsmootherimagescanappearwhendisplayedatagivensize.Lowresolutioncausesjaggedcharacters.Theidealresolutionisatrade-offbetweenqualityandtheoverheadinstoragepowerandprocessingstrengthrequiredtouseit.
Source: RSI,Glossary.
Thevisualclarityofadisplayscreenorprinter.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ResponseBias
OnetypeofResponseBiascanoccurifthesamplingprocessconsidersthecontentofthedocuments.Seealso,Non-ResponseBias.
Source: EDRMSearchGlossary.
Seealso:
Non-ResponseBias
ResponsiveFile
Afilethatisresponsivetooneofthefilters(fulltextsearch,extension/size,date,MD5-known,cookies,sender/recipient,andcustomprocessing)inanelectronicdiscoveryprocess.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 262
Source: IbisConsulting,Glossary.
Seealso:
Responsive/RelevantDocuments
Responsive/RelevantDocuments
AsubsetofESIthatmatchespotentiallythedesiredsetofdocumentsforthecase.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
Seealso:
Responsive/RelevantDocuments
Responsiveness
ADocumentthatisRelevanttoanInformationNeedexpressedbyaparticularrequestforproductionorsubpoenainacivil,criminal,orregulatorymatter.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Astandardthatmeasureswhetheradocumentfitstheestablishedparametersofthedocumentrequest.
Restore
Amethodofpreparingadatasetforprocessingbyconvertingmailbackupstomailarchives(forexample,PSTforMSOutlookandNSFforLotusNotes).
Source: IbisConsulting,Glossary.
Indatamanagement,restoreisaprocessthatinvolvescopyingbackupfilesfromsecondarystorage(tape,Zipdiskorotherbackupmedia)toharddisk.Arestoreisperformedinordertoreturndatatoitsoriginalconditioniffileshavebecomedamaged,ortocopyormovedatatoanewlocation.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Totransferdatafromabackupmedium(suchastapes)toanon-linesystem,oftenforthepurposeofrecoveryfromaproblem,failure,ordisaster.Restorationofarchivalmediaisthetransferofdatafromanarchivalstoretoanon-linesystemforthepurposesofprocessing(suchasquery,analysis,extractionordispositionofthatdata).Archivalrestorationofsystemsmayrequirenotonlydatarestorationbutalsoreplicationoftheoriginalhardwareandsoftwareoperatingenvironment.Restorationofsystemsisoftencalled“recovery”.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 263
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingWhatis.com,http://searchstorage.techtarget.com/sDefinition/0,,sid5_gci965124,00.html.
RetentionPeriod
See: RecordsRetentionPeriod
RetentionSchedule
See: RecordsRetentionSchedule
Retrieval
Theon-screenresultofaquery.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ReviewFeedbackValidation
Reviewfeedbackvalidationinvolvescrossreferencingtheresultsofsearchwiththecallsmadebyattorneysduringdocumentreview.Thedocumentlevelclassificationasrelevantorprivilegedprovideskeeninsightintorefiningthesearchandselectioncriteriaorinidentifyinggapsthatrequireadditionalanalysis.ThisfeedbackwillbeusedforadditionalanalysisandtorefinetheSearchCriteriasets.Thefeedbackmayidentifycategoriesofdocumentsthatarenotyieldingresponsivedocumentsandorcouldidentifydocumentstobeexcludedfromthereviewset.Also,thefeedbackmayidentifynewcategoriesofdocumentsthatshouldbeincludedandthecriteriawillbebroadenedtoincludethosedocumentsinthereviewset.
Source: EDRMSearchGlossary.
ReviewPhase
EvaluatingESIforrelevanceandprivilege.
Source: EDRMStages
CorrespondstoUTBMSCodeL650-L659.HostingCosts,ReviewPlanningandTraining,ObjectiveandSubjectiveCoding,FirstPassDocumentReview,SecondPassDocumentReview,PrivilegeReview,Redaction,QualityAssuranceandControl.
Source: EDRMMetricsGlossary
Rewritable
Storagedeviceswherethedatamaybewrittenmorethanonce–typicallyharddrives,floppiesandopticaldisks.Theassetsarere-use,highspeedandcapacity.TheopticaldiskshavethesamebasiccharacteristicsasaCD-ROM,exceptthatyoucanwriteovertheexistingdata.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 264
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
RFC
See: RequestforComments(RFC)
RFCCompliantEmail
Emailsthatareconsistentwiththeinternetstandardsforsuchdocuments.Thestandardsareestablishedthrougharequestforcommentsresultinginageneralandopendiscussion.EmailRFCsincludeRFC1939--PostOfficeProtocol,andRFC2821;SimpleMailTransferProtocol.Compliancetotheseprotocolsensuresthattheemailcanbeprocessedaccurately.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFC822
Single-mailarchive
Single-mailcontainer
SMTP
RFC822
ThestandardfortheformatofARPAInternettextmessages.Thisisauniversal(andoutdated)standardfore-mailthatisentirelytext-based,portableandreadablebyvirtuallyanysystem.
Source: IbisConsulting,Glossary.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
Single-mailarchive
Single-mailcontainer
SMTP
RGB
See: Red,GreenandBlue(RGB)
Richness
Theproportionorprevalenceofresponsivedocumentsinacollection.
Source: HerbRoitblat,PredictiveCodingGlossary.
SeePrevalenceorYield.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 265
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
RISC
See: ReducedInstructionSetChip(RISC)
ROC
See: ReceiverOperatingCharacteristicCurve(ROC)
Rollback
Thefunctionalitytoundoapplicationprocesses.
Source: IbisConsulting,Glossary.
RollingCollection/RollingIngestion
AprocessinwhichtheDocumentCollectionisperiodicallyaugmentedasnew,potentiallyRelevantDocumentsareidentifiedandgathered.WhenevertheDocumentCollectionisaugmented,theresultsofpriorsearchorrevieweffortsmustbesupplementedtoaccountforthenewDocuments.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
RollingProduction
AprocessinwhichResponsiveDocumentsaredeliveredincrementallytoarequestingpartytoprovidetimely,partialsatisfactionofaDocumentrequest.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ROM
See: ReadOnlyMemory(ROM)
RotaryCamera
Inmicrofilming,thepapersareread"onthefly"withacamerathat'ssynchronizedtothemotion.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Router
Apieceofhardwarethatroutesdatafromalocalareanetwork(LAN)toaphoneline.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 266
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Rule
Aformalstatementofoneormorecriteriausedtodetermineaparticularoutcome,e.g.,whethertoCodeaDocumentasRelevantorNon-Relevant.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
RuleBase
AsetofRulescreatedbyanexperttoemulatethehumandecision-makingprocessforthepurposesofClassifyingDocumentsinthecontextofE-Discovery.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Rule-BasedWorkflow
Aprogrammedseriesofautomatedstepsthatroutedocumentstovarioususersonamulti-userimagingsystem.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
AdHocWorkflow Rule-BasedWorkflow Workflow
S
S-HTTP(SecureHTTP)
SecureHTTPorS-HTTPenablesthesecureexchangeofinformationandfilesontheWeb.S-HTTPfilesareencryptedand/orcontainadigitalcertificate.Thistypeoftransactionsecurityislikelytobeusedbyfinancialinstitutions,becauseS-HTTPismoresecurethanauserIDandpassword.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Sample/Sampling
AsubsetoftheDocumentPopulationusedtoassesssomecharacteristicofthePopulation;theactofgeneratingsuchasubsetoftheDocumentPopulation.SeeIntervalSample,JudgmentalSample,StatisticalEstimate,StatisticalSample,orSystematicSample.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 267
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Theprocessofselectingasubsetofitemsfromapopulationandinferringfromthecharacteristicsofthesamplewhatthecharacteristicsofthepopulationarelikelytobe.Oftenreferstoasimplerandomsample,whicheachiteminthepopulationhasanequalchanceofbeingselectedinthesample.
Source: HerbRoitblat,PredictiveCodingGlossary.
SampleSize
ThenumberofdocumentsdrawnatrandomthatareusedtocalculateaStatisticalEstimate.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Sampling
Samplingisamethodofreviewingstatisticalratiosofcompleteorportionsofaclassifiedcorpusforthepurposesofvalidation.
Source: EDRMSearchGlossary.
Theprocessofstatisticallytestingdataforthepresenceofrelevantinformation.Oftenusedtoprovidecourtswithacostestimateinordertoallocatecostsharing.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Samplingusually(butnotalways)referstotheprocessofstatisticallytestingadatabaseforthelikelihoodofrelevantinformation.Itcanbeausefultechniqueinaddressinganumberofissuesrelatingtolitigation,includingdecisionswhatrepositoriesofdataareappropriatetosearchinaparticularlitigation,anddeterminationsofthevalidityandeffectivenessofsearchesorotherdataextractionprocedures.Samplingcanbeusefulinprovidinginformationtothecourtabouttherelativecostburdenversusbenefitofrequiringapartytoreviewcertainelectronicrecords.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Source: MerrillCorporation,ElectronicDiscoveryGlossary.
SamplingDistribution
Theprobabilitydistributionofagivenmeasurebasedonarandomsample.Somevaluesaremorelikelythanothers.Thesamplingdistributiontellsaboutthelikelihoodorprobabilityofeachvalue.Wecanusesamplingdistributionstotesthypotheseswithouthavingtocomputeforourselvesallpossiblecombinationsoftheeventsthatleadtotheoutcome.Themostcommonlyusedsamplingdistributionistheso-callednormalorGaussiandistribution,thefamiliarbell-shapedcurve.Valuesnearthecenterofthedistribution,themeanoraverage,are
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 268
morelikelythanvaluesthatarefarawayfromthecenter.Whendrawn,thesamplingheightofthesamplingdistributionshowstheprobabilityofthatvalue.Theareaunderthecurvetellsusabouttheprobabilityofscorescoveredbythatarea.
Source: HerbRoitblat,PredictiveCodingGlossary.
SamplingFrame
See: Population
SamplingRate
Thefrequencyatwhichanalogsignalsareconvertedtodigitalvaluesduringdigitization.Thehighertherate,themoreaccuratetheprocess.InprintingThenumberofpixelsscannedperhalftonedot.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SAN(StorageAreaNetwork)
Astorage-areanetwork(SAN)isadedicatedhigh-speednetwork(orsubnetwork)thatinterconnectsandpresentssharedpoolsofstoragedevicestomultipleservers.
Source: TechTarget,storageareanetwork(SAN)definition,http://searchstorage.techtarget.com/definition/storage-area-network-SAN
Astorageareanetwork(SAN)isanetworkwhichprovidesaccesstoconsolidated,blockleveldatastorage.SANsareprimarilyusedtoenhancestoragedevices,suchasdiskarrays,tapelibraries,andopticaljukeboxes,accessibletoserverssothatthedevicesappeartotheoperatingsystemaslocallyattacheddevices.ASANtypicallyhasitsownnetworkofstoragedevicesthataregenerallynotaccessiblethroughthelocalareanetwork(LAN)byotherdevices.ThecostandcomplexityofSANsdroppedintheearly2000stolevelsallowingwideradoptionacrossbothenterpriseandsmalltomedium-sizedbusinessenvironments.
Source: Wikipedia,Storageareanetwork,https://en.wikipedia.org/wiki/Storage_area_network
Seealso:
Client/servernetwork
LAN-localareanetwork
MAN-metropolitanareanetwork
Network
Peer-to-peernetwork
Standalonecomputer
WAN-wideareanetwork
Sanctions
Consequences,punishments,andpenaltiesimposedbythecourtforviolationoftherulesorordersofthecourt,orbyregulatorsforviolationsoftherulesorordersofregulatorybodies.
Source: IbisConsulting,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 269
Sandbox
Anetworkorseriesofnetworksthatarenotconnectedtoothernetworks.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
SansSerif
Withoutserif.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Scalability
Theabilityofasystemtoaddhardwaretoincreasepowerorperformancewithoutrequiringanyadjustmentstotheunderlyingsystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thecapacityofasystemtoexpandwithoutrequiringmajorreconfigurationorre-entryofdata.Multipleserversoradditionalstoragecanbeeasilyadded.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ScaletoGray
Anoptiontodisplayablackandwhiteimagefileinanenhancedmode,makingiteasiertoview.Ascale-to-graydisplayusesgrayshadingtofillingapsorjumps(knownasaliasing)thatoccurwhendisplayinganimagefileonacomputerscreen.Alsoknownasgrayscale.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Scan
Theprocessofcreatinganelectronicimageofapaperdocument,usuallyforthepurposeofloadingintoalitigationsupportsystem.
Source: LitSavantLtd.,Glossary,http://www.litsavant.com/full-glossary.aspx
Scanningistheprocessofconvertingahardcopypaperdocumentintoadigitalimageforuseinacomputersystem.Afteradocumenthasbeenscanned,itcanbereviewedusingfieldandfull-textsearching,instantdocumentretrieval,andacompleterangeofelectronicdocumentreviewoptions.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 270
Scanner
Aninputdevicecommonlyusedtoconvertpaperdocumentsintocomputerimages.Scannerdevicesarealsoavailabletoscanmicrofilmandmicrofiche.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Double-sidedscanner
Duplexscanner
Flatbedscanner
Simplexscanner
Scanning
See: Scan
ScanningSoftware
Softwarethatenablesascannertodeliverindustrystandardformatsforimagesinacollection.Enablestheuseofcodingoftheimages.IPRO,DocuLexandZyImageareseveralexamples.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SCSI(SmallComputerSystemsInterface)
Pronounced"scuzzy".Anindustrystandard(ofsorts)forconnectingperipheraldevicesandtheircontrollerstoamicroprocessor.SCSIdefinesbothhardwareandsoftwarestandardsforcommunicationbetweenahostcomputerandaperipheral.ComputersandperipheraldevicesdesignedtomeetSCSIspecificationsshouldworktogether.
Source: RSI,Glossary.
Pronounced“skuzzy.”Astandardforattachingperipherals(notablymassstoragedevicesandscanners)tocomputers.SCSIallowsforupto7devicestobeattachedinachainviacables.ThecurrentSCSIstandardis“SCSIII,”alsoknownas“FastSCSI.”
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SCSIScannerInterface
Adeviceusedtoconnectascannerwithacomputer.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Search
Amethodoffindingtermswithindatasets.SearchtypesincludeBooleanconnectorsandspecialcharacterstodefineasearch.Typesinclude:naturallanguage(simplewordsor
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 271
phrases),Booleanoperators,andsearchesintroducedbyspecialcharactersforwildcard,stemming,fuzzy,phonic,synonym,numericrangeandvariableweightingsearches.
Source: IbisConsulting,Glossary.
Theabilitytolookwithinthedataandsearchbyaname,dateorkeywordtofinddesiredinformation.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Adatabasequery.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Theprocessoflocatingandidentifyingdocumentsthatarerelevant.Ininformationretrieval,thewordusuallyreferstoanactiveprocesswhereauserentersaqueryconsistingofoneormoreterms.Inresponsetheinformationretrievalsystemorsearchenginereturnsa(typicallyranked)setofdocumentsthatcorrespondtothatquery.
Source: HerbRoitblat,Search2020:TheGlossary.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
SearchEngine
Asearchcomponentthatimplementstheactualprocessofinterpretingasearchrequestandidentifyingsubsetsofdocuments.Forexample,adatabasemanagementsystemsuchasMicrosoftSQLServercontainsacomponentthatmanagessearchesofthedatastoredinitsdatabases.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 272
SearchEngineOptimization(SEO)
ChangesmadetoaWebpagethatimprovesthepositioningofthatpagewithoneormoresearchengines.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
SearchEnginePositioning(SEP)
TheprocessoforderingWebsitesorWebpagesbyasearchengineordirectorysothatthemostrelevantsitesappearfirstinthesearchresults.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
SearchHit
AdocumentinESIthatisconsideredtomatchtherequestedSearchQuery.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
SearchQuery
Awell-formulatedSearchrequestthatanautomatedsearchenginecaninterpretinordertoproducematchingresults.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
SearchResults
AcollectionofSearchHitsthatmatchtheintendeddocumentsofaSearchRequest.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
SearchSyntax
Aparticularsearchlanguagerequiredbyasoftwareprogram.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
SearchTermList
Thelistofkeywordsprovidedbyaclientforthepurposeofsearchingadatasetforresponsivefilesusingfulltextsearch.
Source: IbisConsulting,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 273
SearchableTIFF
Animagedfileaccompanied,inadatabase,byOCR'dtextthatissearchable.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PNG
PortableDocumentFormat
Portablenetworkgraphic
Single-pageTIFF
TIFF
SECRegulation10b(5)
SecuritiesandExchangeCommissionregulationgoverningtherightsofshareholders.ManylawsuitsbyshareholdersarefiledunderRule10b(5).
Source: IbisConsulting,Glossary.
SECRegulation17a4
SecuritiesandExchangeCommissionregulationrelatingtodataretentionforfinancialservicesfirms.
Source: IbisConsulting,Glossary.
SecondRequest
AdocumentrequestbyeithertheDepartmentofJustice(DOJ)ortheFederalTradeCommission(FTC)for“additionalinformationanddocumentarymaterialrelevantto[a]proposedacquisition”undertheHart-Scott-RodinoAntitrustImprovementsActof1976(the“HSRAct.”)
Source: IbisConsulting,Glossary.
SecureHTTP
See: S-HTTP(SecureHTTP)
Sedona/SedonaConference
TheSedonaConference®(https://thesedonaconference.org)isanonprofit,501(c)(3)researchandeducationalinstitute,foundedin1997byRichardG.Braman,dedicatedtotheadvancedstudyoflawandpolicyintheareasofantitrust,complexlitigation,andintellectualpropertyrights.Sedonasponsorsapreeminentthink-tankintheareaofElectronicDiscoveryknownasWorkingGroup1onElectronicDocumentRetentionandProduction.Sedonaiswellknownforitsthoughtful,balanced,andfreepublications,suchasTheSedonaConference®Glossary:E-Discovery&DigitalInformationManagement(ThirdEdition,Sept.2010),TheSedonaPrinciples
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 274
AddressingElectronicDocumentProduction,SecondEdition(June2007),andTheSedonaConference®CooperationProclamation(July2008).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
SeedSet
TheinitialTrainingSetprovidedtothelearningAlgorithminanActiveLearningprocess.TheDocumentsintheSeedSetmaybeselectedbasedonRandomSamplingorJudgmentalSampling.SomecommentatorsusethetermmorerestrictivelytoreferonlytoDocumentschosenusingJudgmentalSampling.OthercommentatorsusethetermgenerallytomeananyTrainingSet,includingthefinalTrainingSetinIterativeTraining,ortheonlyTrainingSetinnon-IterativeTraining.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Acollectionofpre-categorizeddocumentsthatisusedastheinitialtrainingforapredictivecodingsystem.
Source: HerbRoitblat,PredictiveCodingGlossary.
SelfCollection
Aprocesswhereindividualcustodiansidentifyandcopypotentiallyrelevantfilesfordiscovery.
Source: EDRMCollectionStandards
Sender/RecipientFilter
Afilteroptionthatallowsforincluding/excludingselectedsendersandrecipientsofe-mailmessages.
Source: IbisConsulting,Glossary.
Seealso:
Datefilter
Extensions/sizesfilter
Filter
MD5-knownfilter
Sensor
Amechanismformeasuringsomefeatureoftheenvironmentordevicethatcanbeusedcomputationally.
Source: HerbRoitblat,Search2020:TheGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 275
SentimentAnalysis
Aprocessforidentifythesentimentinatext(forexample,aTweet,blogpost,ordocument).Typicallysentimentanalysisidentifieswhetherthetextexpressesapositive(e.g.,happiness)ornegative(e.g.,anger)emotion,thoughmoresubtledistinctionsarealsopossible.
Source: HerbRoitblat,Search2020:TheGlossary.
SEO
See: SearchEngineOptimization(SEO)
SEP
See: SearchEnginePositioning(SEP)
SequencedPacketExchange(SPX)
AcommunicationsprotocolusedbyNovellnetworks.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Externallinks:
WebopediaComputerDictionary,http://www.webopedia.com/TERM/S/SPX.html.
Serial
Datastoredortransmittedsequentially,asopposedtoparallel.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
SerialLineInternetProtocol(SLIP)
AcommunicationsstandardusedinInternetcommunications.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Serif
Thelittlecrossbarsorcurlsattheendofstrokesontypefonts.Forexample,inthissentence,thehorizontallineatthebottomoftheletter‘r’.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Server
AnycomputeronanetworkthatcontainsdataorapplicationssharedbyusersofthenetworkontheirclientPCs.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 276
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
ServiceBureau
AvendorwhichperformsALSservicessuchasphotocopying,scanning,imaging,codingand,morerecently,e-discoveryservices.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ServiceLevelAgreement(SLA)
Aservice-levelagreementisacontractthatdefinesthetechnicalsupportorbusinessparametersthatanapplicationserviceproviderorotherIToutsourcingfirmwillprovideitsclients.Theagreementtypicallyspellsoutmeasuresforperformanceandconsequencesforfailure.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
ServicePack
Softwareprogrambugfixes.Theyareavailableafterreleaseofthesoftwareprogram.Theyalsousuallyaddresscompatibilityissues.Often,acertainservicepackisrequiredtoenablecertainsoftwareeithertorunortorunwell.
SGML(StandardGeneralizedMarkupLanguage)
Aninformalindustrystandard(linguafranca)foropensystemsdocumentmanagementwhichspecifiesthedataencodingofadocument'sformatandcontent.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Atextbased“language”fordescribingthecontentandstructureofdocuments.SGMLisused,forexample,bysomegovernmentagenciestopublishreportsthatareuseablebybothmachinesandhumanreaders.HTMLisasimplifiedapplicationofSGML.
Seealso:
HTML
Java
JavaScript
SGML/HyTime
XML
SGML/HyTime
AmultimediaextensiontoSGML,sponsoredbyDOD.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
©2016EDRMLLC
HTML
Java
JavaScript
SGML
XML
SHA-1
Incryptography,SHA-1(SecureHashAlgorithm1)isacryptographichashfunctiondesignedbytheUnitedStatesNationalSecurityAgencyandisaU.S.FederalInformationProcessingStandardpublishedbytheUnitedStatesNIST.SHA-1producesa160-bit(20-byte)hashvalueknownasamessagedigest.ASHA-1hashvalueistypicallyrenderedasahexadecimalnumber,40digitslong.
Source: Wikipedia,SHA-1,https://en.wikipedia.org/wiki/SHA-1
Seealso:
Hash
Hashvalue
Hashing/Hash/HashValue
MD5
ShadowIT
Projects,devices,orsoftwarethatareusedbyemployeeswithoutthecontrol,permission,oreventheawarenessoftheorganization’sinformationtechnologyprogram.ShadowITisrelatedtoBYOD(bringyourowndevice),butshadowITimpliesanungoverneduseoftechnology,andBYODmayhavelimitedorevendeepcontrols.
Source: HerbRoitblat,Search2020:TheGlossary.
Shingling
AFeatureEngineeringmethodinwhichtheFeaturesconsistofallN-Gramsinatext,forsomenumberN.Forexample,theTrigramShinglingofthetext“Tobeornottobe”consistsoftheFeatures“tobeor”;“beornot”;“ornotto”;and“nottobe.”NotethattheFeaturesoverlaponeanotherinthetext,suggestingthemetaphorofroofshingles.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Sibling
Asiblingisadocumentthatsharesacommonparentwiththedocumentinquestion(e.g.twoattachmentsthatsharethesameparentemailoraresiblingdocumentsinthesameZipfile).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
SignalDetectionTheory
Inventedatthesametimeandinconjunctionwithradar,thescienceofdistinguishingtrueobservationsfromspuriousones.SignalDetectionTheoryiswidelyusedinradioengineeringandmedicaldiagnostictesting.ThetermsTruePositive,TrueNegative,FalsePositive,False
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 278
Negative,Sensitivity,Specificity,ReceiverOperatingCharacteristicCurve,AreaUndertheROCCurve,andInternalResponseCurve,allarisefromSignalDetectionTheory.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Significance/Significant
Theconfirmation,withagivenConfidenceLevel,ofapriorhypothesis,usingaStatisticalEstimate.TheresultissaidtobeStatisticallySignificantifallvalueswithintheConfidenceIntervalforthedesiredConfidenceLevel(typically95%)areconsistentwiththehypothesisbeingtrue,andinconsistentwithitbeingfalse.Forexample,ifthehypothesisisthatfewerthan300,000DocumentsareRelevant,andaStatisticalEstimateshowsthat,290,000DocumentsareRelevant,plusorminus5,000Documents,wesaythattheresultisSignificant.Ontheotherhand,iftheStatisticalEstimateshowsthat290,000DocumentsareRelevant,plusorminus15,000Documents,wesaythattheresultisnotSignificant,becausetheConfidenceIntervalincludesvalues(i.e.,thevaluesbetween300,000and305,000)thatcontradictthehypothesis.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Statisticallysignificantmeansthattheobservedresultsareunlikelytohaveoccurredbychance.Usedinstatisticaldecisionstodecidewhetheradifference,forexample,islargeenoughthatitisunlikelytohappenedbychancefromthesamplingdistribution.Instatistics,ingeneral,significance,referstowhethertheoutcomeissounlikelyunderthenullhypothesis(norealdifference)thatwerejectthenullhypothesisandacceptthealternative.Forexample,weselectarandomsampleofstudentsfromeachoftwoschoolsandwemeasuretheirreadingcomprehension.Thenullhypothesisisthatthereisnodifferencebetweenschoolsonreadingcomprehension.Themotivatedhypothesisisthatthereisadifference.Ifthedifferencebetweenmean(average)readingcomprehensionofthesetwosamplesissufficientlylargethatitisunlikely,thenwesaythatthedifferenceissignificant,andthatthetwoschoolsdifferintheirreadingcomprehension.Itisamisnomertospeakaboutasignificantrandomsample.Significancereferstothiskindofhypothesistest,notthesizeofthesample.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
SimilarDocumentSearch
Asearchthatfindsalldocumentssimilartotheprimarydocument.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
AdHocSearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 279
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
SIMM(Single,In-LineMemoryModule)
Asearchthatfindsalldocumentssimilartotheprimarydocument.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SimpleMailTransferProtocol
See: SMTP(SimpleMailTransferProtocol)
Simplex
One-sidedpage(s).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SimplexScanner
Adocumentscannerthatcopiessingle-sideddocuments.
Source: RSI,Glossary.
Seealso:
Double-sidedscanner
Duplexscanner
Flatbedscanner
Scanner
Single,In-LineMemoryModule
See: SIMM(Single,In-LineMemoryModule)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 280
Single-MailArchive
Anaggregateofe-mailmessagesandattachmentssavedoutsideofmulti-mailcontainerslikePSTorNSF.Single-mailarchivescantaketheformofMSG,EML,TXT,HTML,orotherfileformats.
Source: IbisConsulting,Glossary.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailcontainer
SMTP
Single-MailContainer
Acontainer,orfile,thatholdsasinglee-mailmessage,suchasEMLandMSG.
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
SMTP
Single-PageText
Extracted,single-pagetextfiles.
Source: IbisConsulting,Glossary.
Single-PageTIFF
ThestandardoutputformatforTIFFimages,whereonepage=oneTIFFimage.SomeloadfileformatsrequiremultipleTIFFfilesmergedintooneTIFFfilewithmultiplepages.
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PNG
PortableDocumentFormat
Portablenetworkgraphic
SearchableTIFF
TIFF
©2016EDRMLLC
SingularValueDecomposition
Amathematicaltechniquethatsummarizesthecorrelationbetweenitemsandtheirfeatures.OneofthetechniquesusedineDiscoveryasthebasisforconceptsearch,wheretheitemsaredocumentsandthefeaturesarewords.
Source: HerbRoitblat,Search2020:TheGlossary.
Skew
Duringprintingorscanning,thecontentsofapagearealmostneverexactlyvertical,whichreferredtoasbeingskewed.
Source: RSI,Glossary.
Seealso:
De-Skew
SLA
See: ServiceLevelAgreement(SLA)
SlackSpace
Aformofresidualdata,slackspaceistheamountofon-diskfilespacefromtheendofthelogicalrecordinformationtotheendofthephysicaldiskrecord.Slackspacecancontaininformationsoft-deletedfromtherecord,informationfrompriorrecordsstoredatthesamephysicallocationascurrentrecords,metadatafragmentsandotherinformationusefulforforensicanalysisofcomputersystems.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Remnantdatafromdeletedfilesstilllocatedinclustersonaharddrive.
Source: RenewData,Glossary(10/5/2005).
Thedifferenceinemptybytesofthespacethatisallocatedinclustersminustheactualsizeofthefiles.Alsodescribedasthedatafragmentsstoredrandomlyonaharddriveduringthenormaloperationofacomputer,ortheresidualdataleftontheharddriveafternewdatahasoverwrittensomeofthepreviouslystoreddata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Thedifferencebetweenthesizeofafileandthesizeofthevariousclusterswhereitisstored,sincethefilesegmentsmaybesmallerthantheclusterswheretheyreside.Mayalsorefertodatafragmentsstoredrandomlyonaharddriveduringthenormaloperationofacomputerorresidualdataleftonaharddriveafternewdatahasoverwrittendeletedfiles.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 282
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Ambientdata
Fragmenteddata
Freespace
Residualdata
Swapfile
Unallocatedspace
SLIP
See: SerialLineInternetProtocol(SLIP)
SmallComputerSystemsInterface
See: SCSI(SmallComputerSystemsInterface)
SmartCard
Acreditcardsizedevicewhichcontainsamicroprocessor,memoryandabattery.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SMP(SymmetricMulti-Processing)
AsystemdesignofmultipleCPUsinwhichanyCPUcanbeassignedanyapplicationtask.Typically,oneCPUisthecontrollerandhandlessystemboot,I/Orequests,anddistributionoftaskstotheotherCPUs.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
SMTP(SimpleMailTransferProtocol)
SimpleMailTransferProtocol(SMTP)isanInternetstandardforelectronicmail(email)transmission.FirstdefinedbyRFC821in1982,itwaslastupdatedin2008withtheExtendedSMTPadditionsbyRFC5321—whichistheprotocolinwidespreadusetoday.
Source: Wikipedia,SimpleMailTransferProtocol,https://en.wikipedia.org/wiki/Simple_Mail_Transfer_Protocol
(pronouncedasseparateletters)ShortforSimpleMailTransferProtocol,aprotocolforsendinge-mailmessagesbetweenservers.Moste-mailsystemsthatsendmailovertheInternetuseSMTPtosendmessagesfromoneservertoanother;themessagescanthenberetrievedwithane-mailclientusingeitherPOPorIMAP.Inaddition,SMTPisgenerallyusedtosendmessagesfromamailclienttoamailserver.ThisiswhyyouneedtospecifyboththePOPorIMAPserverandtheSMTPserverwhenyouconfigureyoure-mailapplication.
Source: Webopedia,SMTP-SimpleMailTransferProtocoldefinition,http://www.webopedia.com/TERM/S/SMTP.html
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 283
Seealso:
Container
EML
Mailcontainer
Mailbox
MSG
Multi-mailcontainer
NSF
OST
PST
RFCcompliantemail
RFC822
Single-mailarchive
Single-mailcontainer
SocialNetworkAnalysis
Investigationsofwhoinanorganizationiscommunicatingwithwhom.Theseconnectionsareoftendisplayedasanetworkdiagram,withindividualsasnodesandtheemailsorothercommunicationsbetweenthemaslinks.Socialnetworksareoftenusefultodeterminehowinformationhasbeenflowingthroughanorganization.Theycanalsohelptoidentifyindividualswithspecifickindsofknowledge.
Source: HerbRoitblat,Search2020:TheGlossary.
Software
Anysetofinstructionsstoredoncomputer-readablemediathattellsacomputerwhattodo.Includesoperatingsystemsandsoftwareapplications.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: RSI,Glossary.
Aseriesoffilescontaininginstructionstothecomputerforperformingfunctions.Asoftware“program”containstheinstructionstoacceptdataincertainformats.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Codedinstructions(programs)thatmakeacomputerdousefulwork.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
SoftwareApplication
See: Application
Sort
Puttingareportinaparticularorder,suchaschronologicalornumerical.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 284
Sound-Alike
Asearchmethodwherebythecomputerproducesalistofwordsthat“sound”similartothedesiredwordandcanthemselvesbesearched.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
SourceFile
Therawdatareceivedfromaclient,eitherondigitalmediaoruploadedtothenetwork.
Source: IbisConsulting,Glossary.
Splatter
Datathatshouldbekeptononediscofajukeboxgoesinsteadtomultipleplatters.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Spoliation
Spoliationisthedestructionoralterationofevidenceduringon-goinglitigationorduringaninvestigationorwheneithermightoccursometimeinthefuture.Failuretopreservedatathatmaybecomeevidenceisalsospoliation.
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingNorcrossGroupFAQ's,http://norcrossgroup.com/faq.html#14.
Generally,theintentionalornegligentdestructionoralterationofevidencewhenthereiscurrentlitigationoraninvestigationorthereisreasonableanticipationthateithermayoccurinthenearfuture.Somejurisdictionsalsodefineitasafailuretopreserveinformationthatmaybecomeevidence.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 285
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Theintentionalalterationordestructionofarelevantdocumentordocuments.
Source: IbisConsulting,Glossary.
Theoriginallegaldefinitionwasthedestructionofathingbytheactofastranger;asintheerasureoralterationofawritingbytheactofastranger.Ine¬discoverycasesthefocushasbeenontheintentionalnatureoftheact,whichcanincludedeletion,partialdestructionoralteration,generallybyapartytotheactionorsomeoneundertheircontrol.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Spoliationisthedestructionofrecordswhichmayberelevanttoongoingoranticipatedlitigation,governmentinvestigationoraudit.Courtsdifferintheirinterpretationofthelevelofintentrequiredbeforesanctionsmaybewarranted.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Theintentional,negligent,orreckless,loss,destruction,alterationorobstructionofrelevantevidence.
SPP(StandardParallelPort)
Astandardparallelport(SPP)isaportforconnectingvariousrelativelyhighbandwidthperipherals,mostcommonlyprinters,toaPC.LaterversionsoftheSPPallowduplexcommunication.TheyusetheDB-25connector.TheoriginalSPP,byCentronics,wasintroducedin1970andsoonbecamethedefactoindustrystandard.However,anumberofdifferentmanufacturersusedtheSPPwithavarietyofconnectors,suchastheDC-35,theDD50andtheM50.
Source: Techopedia,StandardParallelPort(SPP),https://www.techopedia.com/definition/3667/standard-parallel-port-spp
Spreadsheet
Asoftwareprogramthatarrangesdatainamatrixofcellsandperformscalculationsbasedonthearrangementofthecells.ThemostpopularspreadsheetsareLotus1-2-3andMicrosoftExcel.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Acompilationofdataintableformarrangedincolumnsandrows.ProgramssuchasExcelbuildfilesthatcontainoneormorespreadsheets.Spreadsheetsmayalsocontaingraphicandotherelementsandmayhiddendata.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 286
SpreadsheetFormula
Amathematicalformulaappliedtoacelltocalculatethecontents,e.g.cellA3=cellA1+cellA2.
SPX
See: SequencedPacketExchange(SPX)
SQL(StructuredQueryLanguage)
Atypeofrelationaldatabasemanagementsystem(RDBMS).Relationshipsinarelationaldatabasearerepresentedbylinkagesthatexistbetweentwoormorepiecesofdata.ThefinaldefiningfeatureofSQLisitsabilitytoreturndatafromonedatafieldbasedonitsrelationshipwithanotherdatafield.SeealsoRelationalDatabaseManagementSystems.
Source: EDRMSearchGlossary.
SQLisastandardprogramminglanguageforgettinginformationfromandupdatingadatabase.AlthoughSQLisastandard,manydatabaseproductssupportSQLwithproprietaryextensionstothestandardlanguage.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thepopularstandardforrunningdatabasesearches(queries)andreports.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
StructuredQueryLanguage,thelanguageusedtocontroltraditionalrelationaldatabases.Arelationaldatabasestoresdatainoneormoretables.Itcanalsorepresentrelationsbetweenthecolumnsofonetable(variablesinthattable)andcolumns(variables)inanothertable,hencethename“relationaldatabase.”
Source: HerbRoitblat,Search2020:TheGlossary.
Seealso:
Database
Flatfiledatabase
Fulltextdatabase
Relationaldatabase
WAIS-wideareainformationserver
StandAloneComputer
Asinglecomputernotconnectedtoanetwork.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Asinglecomputer,asdistinctfromacomputerattachedtoanetwork.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Apersonalcomputerthatisnotconnectedtoanyothercomputerornetwork,exceptpossiblythroughamodem.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 287
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
Client/servernetwork
LAN-localareanetwork
MAN-metropolitanareanetwork
Network
Peer-to-peernetwork
SAN-storageareanetwork
WAN-wideareanetwork
StandardGeneralizedMarkupLanguage
See: SGML(StandardGeneralizedMarkupLanguage)
StandardParallelPort
See: SPP(StandardParallelPort)
StatisticalEstimate
AquantitativeestimateofaPopulationcharacteristicusingStatisticalEstimation.ItisgenerallyexpressedasaPointEstimateaccompaniedbyaMarginofErrorandaConfidenceLevel,orasaConfidenceIntervalaccompaniedbyaConfidenceLevel.
Source; MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
StatisticalEstimation
TheactofestimatingtheProportionofaDocumentPopulationthathasaparticularcharacteristic,basedontheProportionofaRandomSamplethathasthesamecharacteristic.MethodsofStatisticalEstimationincludeBinomialEstimationandGaussianEstimation.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
StatisticalModel
AmathematicalabstractionoftheDocumentPopulationthatremovesirrelevantcharacteristicswhilelargelypreservingthoseofinterestforaparticularpurpose.ForthepurposeofcomputingRecall,aStatisticalModelneedonlyconsiderwhetherornottheDocumentsareRelevant,andwhetherornottheDocumentsareCodedRelevant,notanyothercharacteristicsoftheDocuments.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 288
StatisticalSample/StatisticalSampling
AmethodinwhichaSampleoftheDocumentPopulationisdrawnatrandom,sothatstatisticalpropertiesoftheSamplemaybeextrapolatedtotheentireDocumentPopulation;theSampleresultingfromsuchaction.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Status
Acommonelementwithineache-discoveryPhasewhichreferstotheactivities,tasksandmethodsundertakeninrelationtoadefinedobjectivewithinaPhase.InProjectManagement,benchmarkingofcurrentworkagainstexpressed,intendedorexpectedoutcome,andreportingonsame.
Source: EDRMMetricsGlossary
Stemming
Asearchoptionthatreturnsmatchesforallvariationsoftherootwordoftheinitialqueryword.Forexample,ifthequerywordwassing,thenifasearchusedstemmingthesearchresultswouldmatchsinging,sang,sung,song,andsongsaswellassing.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
InKeywordorBooleanSearch,orFeatureEngineering,theprocessofequatingallformsofthesamerootword.Forexample,thewords“stem,”“stemming,”“stemmed,”and“stemmable”wouldallbetreatedasequivalent,andwouldeachyieldthesameresultwhenusedasaSearchTermsinaQuery.Insomesearchsystems,stemmingisimplicitandinothers,itmustbemadeexplicitthroughparticularQuerysyntax.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Theprocessofremovingprefixesandsuffixesfromwordsbeforeindexingthemandaspartofqueryprocessing.Forexample,theword“swimming”couldbestemmedto“swim.”Ifwordsarestemmedastheyareindexed,thequerymustalsostemthewordssothatthequerycanmatchtheindex.Inasystemthatusesstemming,severalwordformscanbeindexedidentically,forexample,“swimmer”and“swimming”wouldbothbeindexedas“swim.”
Source: HerbRoitblat,Search2020:TheGlossary.
Seealso:
AdHocSearch Adaptivepatternrecognition
Associativeretrieval
Booleansearch
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 289
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Stipulation
Stipulationisanagreementmadebetweenopposingpartiespriortoapendinghearingortrial.Forexample,bothpartiesmightstipulatetocertainfacts,andthereforenothavetoarguethosefactsincourt.Afterthestipulationisenteredinto,itispresentedtothejudge.
Source: EDRMPresentationGuide.
StopWord
AcommonwordthatiseliminatedfromIndexing.EliminatingStopWordsfromIndexingdramaticallyreducesthesizeoftheIndex,whileonlymarginallyaffectingthesearchprocessinmostcircumstances.ExamplesofStopWordsinclude“a,”“the,”“of,”“but,”and“not.”Becausephrasesandnamessuchas“Tobeornottobe,”and“TheWho,”containexclusivelyStopWordsthatwouldnotbeIndexed,theywouldnotbeidentified(oridentifiable)throughaKeywordSearch.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Wordsthataresocommonlyusedthatthereislittlelossbyignoringtheminaqueryorwhenthedocumentsareindexed.Manysystemsinvolvestopwordlists,whichmayincludewordslike“I”,“he”,“are”,and“is.”
Source: HerbRoitblat,Search2020:TheGlossary.
StorageAreaNetwork
See: SAN(StorageAreaNetwork)
StorageDevice
Anydevicethatacomputerusestostoreinformation.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 290
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Diskdrive
Floppydiskdrive
Jazdrive
Magneto-opticaldrive
Portabledrive
Tapedrive
Zipdrive
StorageMedia
Anyremovabledevicethatstoresdata.Seemagneticoropticalstoragemedia.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
WORMdisk
Zipdisk
Store
Toplaceinformationontoadiskwhereitisavailableforlateruse.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
StratifiedSampling
Aformofrandomsamplinginwhichthepopulationisformedintosubgroupsor“strata.”Objectsineachgrouparesampledinthesameproportionasthesizeofthegroupistothewholepopulation.Eachobjecthasanequalchanceofbeingsampled,butitalsoensuresthateachgroupissampledproportionately.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 291
StructuredData
StructuredDataisdatathatisorganized.Themostcommontypeisdatabasecontent.ItreferstoanytypeofdataorganizedsuchasInternetdataorothertypesofdatathathasbeenindexed.
Source: EDRMMetricsGlossary
Datathatresidesinafixedfieldwithinarecordorfileiscalledstructureddata.Thisincludesdatacontainedinrelationaldatabasesandspreadsheets.
Source: http://www.webopedia.com/TERM/S/structured_data.html.
StructuredQueryLanguage
See: SQL(StructuredQueryLanguage)
SubjectCategory
Adatafieldinadatabaseusedtocapturespecificsubjectcodes.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Summary
Text
SubjectCode
Acodeforacase-specificlegalorfactualsubject.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
SubjectMatterExpert(s)
Oneormoreindividuals(typically,butnotnecessarily,attorneys)whoarefamiliarwiththeInformationNeedandcanrenderanauthoritativedeterminationastowhetheraDocumentisRelevantornot.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 292
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
SubjectiveCoding
TheSubjectiveCodingofadocumentinvolveslinkingalegalinterpretationtoanindividualdocument.Indirectoppositiontoobjectivecoding,inwhichbibliographicdataaboutthedocumentisrecorded.SubjectiveCodingtypesincludetheclassificationofdocumentsasprivilegedandresponsive,andthecategorizationofdocumentsbylegalissue(“issuecoding”).
Source: EDRMMetricsGlossary
Enteringinformationfromadocumentthatrequiresthecodertoexercisejudgment,suchassubjectorissuecodes.Thisfieldisoftenleftblankforthelawfirm’sparalegalsorassociatestofillin.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Thecodingofadocumentusinglegalinterpretationasthedatathatfillsafield.Performedbyparalegalsorothertrainedlegalpersonnel.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Categorizingdocumentsbytheirresponsivenesstospecificcaseissuesortopics.
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Tag
Taxonomiccoding
Verbatimcoding
SubtractiveColors
Sincethecolorsofobjectsarewhitelightminusthecolorabsorbedbytheobject,theyarecalledsubtractive.Thisishowinkonpaperworks.ThesubtractivecolorsofprocessinkareCMYK(Cyan,Magenta,YellowandBlack)andarespecificallybalancedtomatchadditivecolors(RGB).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Summary
Adatafieldinadatabasethatrecordsthesummaryofadocument.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 293
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Text
SuperVideoGraphicAdapter(SVGA)
AvideographicadapterwhichexceedstheminimumVGAstandardof640by480by16colors.Canreach1600by1280and256colors.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SupervisedLearning
AMachineLearningmethodinwhichthelearningAlgorithminfershowtodistinguishbetweenRelevantandNon-RelevantDocumentsusingaTrainingSet.SupervisedLearningcanbeastand-aloneprocess,orusedrepeatedlyinanActiveLearningprocess.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Akindofmachinelearningwheretheobjectsarelabeledbyanexteriorsource,typically,asubjectmatterexpert.Thegoalofsupervisedlearningistypicallytoreplicatethedecisionpatternoftheoutsideexpertandapplythesamepatternstopreviouslyunseenobjects.
Source: HerbRoitblat,Search2020:TheGlossary.
SupportVectorMachine
Astate-of-the-artSupervisedLearningAlgorithmthatseparatesRelevantfromNon-RelevantDocumentsusinggeometricmethods(i.e.,geometry).EachDocumentisconsideredtobeapointin[hyper]space,whosecoordinatesaredeterminedfromtheFeaturescontainedintheDocument.TheSupportVectorMachinefindsa[hyper]planethatbestseparatesRelevantfromNon-RelevantTrainingExamples.DocumentsoutsidetheTrainingSet(i.e.,uncodedDocumentsfromtheDocumentCollection)arethenClassifiedasRelevantornot,dependingonwhichside
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 294
ofthe[hyper]planetheyfallon.AlthoughaSupportVectorMachinedoesnotcalculateaProbabilityofRelevance,onemayinferthattheClassificationofDocumentsclosertothe[hyper]planeislesscertainthanforthosethatarefarfromthe[hyper]plane.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Amachine-learningapproach,usedforcategorizingdata.ThegoaloftheSVMistolearntheboundariesthatseparatetwoormoreclassesofobjects.Givenasetofalreadycategorizedtrainingexamples,anSVMtrainingalgorithmidentifiesthedifferencesbetweentheexamplesofeachtrainingcategoryandcanthenapplysimilarcriteriatodistinguishingfutureexamples.
Source: HerbRoitblat,Search2020:TheGlossary.
Source: HerbRoitblat,PredictiveCodingGlossary.
SVGA
See: SuperVideoGraphicAdapter(SVGA)
SwapFile
Aswapfile(orswapspaceor,inWindowsNT,apagefile)isaspaceonaharddiskusedasthevirtualmemoryextensionofacomputer'srealmemory(RAM).Havingaswapfileallowsyourcomputer'soperatingsystemtopretendthatyouhavemoreRAMthanyouactuallydo.TheleastrecentlyusedfilesinRAMcanbe"swappedout"toyourharddiskuntiltheyareneededlatersothatnewfilescanbe"swappedin"toRAM.Inlargeroperatingsystems(suchasIBM'sOS/390),theunitsthataremovedarecalledpagesandtheswappingiscalledpaging.
Source: TechTarget,swapfile(swapspaceorpageful)definition,http://searchwindowsserver.techtarget.com/definition/swap-file-swap-space-or-pagefile
Seealso:
Ambientdata
Fragmenteddata
Freespace
Residualdata
Slackspace
Unallocatedspace
SymmetricMulti-Processing
See: SMP(SymmetricMulti-Processing)
SynonymSearch
Asynonymsearchreturnsdocumentsthatcontaintermssimilarinmeaningtothequerywords,usuallyusingathesaurustodeterminewhichtermswouldmatchthequerywords.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 295
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Synonymy
Havingtheequivalenceofmeaning;havingthesamedefinitionwithouthavingthesameexpression.
Source: EDRMSearchGlossary.
SyntheticDocument
Anindustry-specifictermgenerallyusedtodescribeanartificialDocumentcreatedbyeithertherequestingpartyortheproducingparty,aspartofaTechnology-AssistedReviewprocess,foruseasaTrainingExampleforaMachineLearningAlgorithm.SyntheticDocumentsarecontrivedDocumentsinwhichonepartyimagineswhattheevidencemightlooklikeandreliesontheMachineLearningAlgorithmtofindactualDocumentsthataresimilartotheartificialDocument.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Sysadmin
Asystemadministrator,orsysadmin,isapersonwhoisresponsiblefortheupkeep,configuration,andreliableoperationofcomputersystems;especiallymulti-usercomputers,suchasservers.
Source: http://en.wikipedia.org/wiki/System_administrator.
Thepersoninchargeofkeepinganetworkworking.Alsoreferredtoassysadminorsysop.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 296
Sysop
See: Sysadmin
SystemAdministrator
See: Sysadmin
SystemProgram
Programsthatcontroltheinternaloperationsofacomputersystem.Examplesareoperatingsystems,compilers,interpreters,assemblers,andmathematicalroutines.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
SystemRegistry
ThesystemconfigurationfilesusedbyMicrosoftWindowstostoresettingsaboutuserpreferences,installedsoftware,hardwareanddriversandothersettingsrequiredforWindowstoruncorrectly.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
SystematicSample/SystematicSampling
ASamplingmethodinwhicheveryNthDocument(forsomefixednumberN)isselected,whentheDocumentsareconsideredinsomeprescribedorder;theSampleresultingfromsuchaction.ASystematicSampleisrandom(andhenceatrueStatisticalSample)onlywhentheprescribedorderisitselfrandom.SometimesreferredtoasanIntervalSample/IntervalSampling.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Systems
Acommonelementwithineache-discoveryPhasewhichreferstocomputerstoragedevices,activeapplicationsforthestorageoruseofdataorESI;ortoworkprocessesdesignedtoachieveaspecifiedresult.
Source: EDRMMetricsGlossary
T
Tag
Anemendationaddedtoadocumentduringthereviewprocess.Tagscanbeusedtoassigndocumentstoissues,toindicatewhichonesshouldbeprinted,orforanyotherreasonthecaserequires.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 297
TaggedImageFileFormat(TIFF)
Agraphicfileformatusedforstoringstill-imagebitmaps.TIFFsarestoredintaggedfields,andprogramsusethetagstoacceptorignorefields,dependingontheapplication.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Fenwick&WestLLP,FWPSeDiscoveryTerminology(11/6/2005).CitingFios'eDiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary_sz.html.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Awidelyusedbit-mappedgraphicsfileformat.Thisisessentiallyapictureofadocument.
Source: RenewData,Glossary(10/5/2005).
Abitmappedgraphicsfileformatthatcontainsapictureofadocument.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Graphicfilesthatportrayasinglepageofafileforviewingpurposeswitha.tifextension(inthecaseofMulti-pageTIFFs,outputimagescanconsistofmultiplepages).
Source: IbisConsulting,Glossary.
Oneofseveralstandardsformakingelectronicimages.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
The"defacto"electronic/computerstandardforscanned,bit-mappedimages–8bitcolorandgrayscale.Originatedin1986asajointprojectofMicrosoftandAldus.Includesseveraltypesandgroupswhicharecompressed&uncompressed.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Oneofthemostwidelysupportedfileformatsforstoringbit-mappedimages.FilesinTIFFformatoftenendwitha.tifextension.10
Thisimageformatiscommonlyusedasthestandardfiledeliveryformatforproduction.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Seealso:
GIF
GraphicInterchangeFile
Imagefileformat
Jointphotographicexpertgroup
JPEG
Multi-pageTIFF
PNG
PortableDocumentFormat
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 298
Portablenetworkgraphic SearchableTIFF Single-pageTIFF
Tape
Randommemorywhichcanbereadbutnotwritten(i.e.changed).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Backup
Backuptape
DAT-digitalaudiotape
Dataextraction
Digitalaudiotape
Disasterrecoverytape
DLT-digitallineartape
Magneticstoragemedia
Media
QIC-quarterinchcartridge
TapeBackup
Aprocessofcopyingelectronicdatafromastoragedevice,suchasacomputer'sharddrive,toatapecartridgedevice.Thissecuritymeasureensuresthatthedataisnotlostintheeventofanequipmentfailureordisaster.Tapebackupcanbeachievedmanuallyorprogrammedtooccurautomatically.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
TapeDrive
Ahardwaredeviceusedtostoredataonamagnetictape.Tapedrivesareusuallyusedtobackuplargequantitiesofdataduetotheirlargecapacityandcheapcostrelativetootherdatastorageoptions.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Diskdrive
Floppydiskdrive
Jazdrive
Magneto-opticaldrive
Portabledrive
Storagedevice
Zipdrive
TAPI(TelephonyApplicationProgrammingInterface)
AMicrosoft-basedstandardforbasictelephoneservicesthatallowsaPCtoaccessphonebooks,controlphoneequipment,andinterfacewithvoice-mailande-mailsystems.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 299
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
TAR
See: Technology-AssistedReview(TAR)
TargaImageFileFormat(TGA)
Thisisa"scannedformat"–widelyusedforcolor-scannedmaterials(24-bit)aswellasbyvarious"paint"anddesktoppublishingpackages
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
TargetedCollectionStrategies
Atargetedcollectionstrategyisonethatisspecificallydesignedtoavoidovercollectionofdatathatisknowntobeirrelevantorpotentiallyirrelevant.Anon-targetedstrategyisdesignedtocollectallthedatafromaparticularstoragedeviceorrepositoryinacomprehensivemanner.Cullingandfilteringprotocolsaresubsequentlyappliedtothedatacorpustoeithereliminatenon-responsivedataorisolateresponsivedata.Atargetedstrategytakesintoconsiderationcullingandfilteringprotocolsatthepointofcollection(e.g.onlycollectingacustodian’semailinboxasopposedtoimagingtheirentireharddrive).
Source: EDRMMetricsGlossary
TaxonomicCoding
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Verbatimcoding
Taxonomy
Ahierarchicalorganizationalschemethatarrangesthemeaningsofwordsintoclassesandsubclasses.Forexample,vehicles,aircraft,andshipsaremodesoftransportation;cars,trucks,andbicyclesarevehicles,andFordsandChryslersarecars.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Aspecificcodinglanguageandterminologydevelopedforuseinaparticularcase.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 300
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Ahierarchicalcategoricalstructurewhereeachclassmaycontainoneormoresubclasses.Thescientificclassificationsystemoforganizingplantsandanimalsintoaspecificphylum,family,genus,species,etc.isanexampleofataxonomicsystem.Eachsubclassisanexampleofitsparentclass.
Source: HerbRoitblat,Search2020:TheGlossary.
TB(Terabyte)
Atrillionbytes,oramillionmegabytes.TheentirecollectionoftheLibraryofCongresswouldequalapproximately20terabytesifdigitized.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Atrillionbytes,ormorecorrectly1,024megabytes.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Aterabyteisameasureofcomputerdatastoragecapacityandisonethousandbillion(1,000,000,000,000)bytes.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
A1000Gigabytes(GB)or1,099,511,627,776bytes.
Seealso:
Bit
Byte
KB-kilobyte
MB-megabyte
GB-gigabyte
PB-petabyte
EB-exabyte
TCP(TransmissionControlProtocol)
TheprotocolusedinconjunctionwithInternetProtocol(IP)totransmitinformationovertheInternetintheformofunits.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
TCP/IP(TransmissionControlProtocol/InternetProtocol)
AcollectionofprotocolsthatdefinethebasicworkingsofthefeaturesoftheInternet.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Networkcommunicationsprotocol.ThisistheprotocolusedbytheInternet.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 301
Technology-AssistedReview(TAR)
AprocessforPrioritizingorCodingaCollectionofDocumentsusingacomputerizedsystemthatharnesseshumanjudgmentsofoneormoreSubjectMatterExpert(s)onasmallersetofdocumentsandthenextrapolatesthosejudgmentstotheremainingDocumentCollection.SomeTARmethodsuseMachineLearningAlgorithmstodistinguishRelevantfromNon-RelevantDocuments,basedonTrainingExamplesCodedasRelevantorNon-RelevantbytheSubjectMatterExperts(s),whileotherTARmethodsderivesystematicRulesthatemulatetheexpert(s)’decision-makingprocess.TARprocessesgenerallyincorporateStatisticalModelsand/orSamplingtechniquestoguidetheprocessandtomeasureoverallsystemeffectiveness.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Anyofanumberoftechnologiesthatusetechnology,usuallycomputertechnology,tofacilitatethereviewofdocumentsfordiscovery.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
CAR PredictiveCoding TAR
Telecommunications
Datatransmissionbetweenacomputersystemandremotedevices,usuallyovertelephonelines.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Telephony
Convertingsoundsintoelectronicsignalsfortransmission.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
TelephonyApplicationProgrammingInterface
See: TAPI(TelephonyApplicationProgrammingInterface)
Terabyte
See: TB(Terabyte)
TermFrequencyandInverseDocumentFrequency(TF-IDF)
AnenhancementtotheBagofWordsmethodinwhicheachwordhasaweightbasedonTermFrequency–thenumberoftimesthewordappearsintheDocument–andInverseDocumentFrequency–reciprocalofthenumberofDocumentsinwhichthewordoccurs.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 302
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TermSearch
Avariantofkeywordsearch,withtheemphasisonsearchingforcombinationsofwordssuchasphrases.
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Topicalsearch
Weightedrelevancesearch
Wildcardsearch
Terminal
Adevicewithinputandoutputdevices(keyboardandmonitor)connectedtoacomputersystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Text
Adatafieldthatallowstheentryoftextinamannersimilartowordprocessingsoftware,butislimitedtoaspecificnumberofcharacters.Textfieldscanbesortedandaretypicallyusedfornames.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Attachmentfield
Attorneynotesfield
Authorfield
Beginningdocumentnumber
Beginningnumberfield
Copyeefield
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 303
Cross-referencefield
Customizeddatafield
Customizedfielddefinition
Datafielddefinition
Datefield
Enddocumentnumber
Field
Index/codingfield
Keyfield
Marginalia
Namesmentionedintext
Notefield
Othernumberfield
Productionsource
Recipient
Subjectcategory
Summary
TextClustering
Textclusteringisatechnologythatanalyzesadocumentcollectionandorganizesthedocumentsintogroupsbasedonfindingdocumentsthataresimilartoeachotherbasedonwordscontainedwithinit(suchasnounphrases).Textclusteringestablishesanotionof“distancebetweendocuments”andattemptstoselectenoughdocumentsintotheclustersoastominimizetheoverallpair-wisedistanceamongallpairsofdocuments.
Source: EDRMSearchGlossary.
TextExtraction
Theprocessofpullingthetextanddatafromelectronicdocumentsforthepurposesofloadingthedataintoadatabase.Theprocessremovesformatting,andgraphicsfromadocumentleavingonlythetext.
TextREtrievalConference
See: TREC
TF-IDF
See: TermFrequencyandInverseDocumentFrequency(TF-IDF)
TF-IDF
Ininformationretrieval,aweightingproceduresothatsomewordsinaqueryordocumentgetemphasizedmorethanothers.AdocumentisrankedhigherusingTF-IDFwhenithasmoreoccurrencesofthequeryterm(TFortermfrequency)andrankslowerwhenthewordoccursinmoredocuments(IDForinversedocumentfrequency).TherearedifferentrulesfordecidinghowtocombineTFwithIDF,oncommonruleistorankthedocumentsbasedontheratioofTFtolog(IDF).
Source: HerbRoitblat,Search2020:TheGlossary.
TGA
See: TargaImageFileFormat(TGA)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 304
TheGenerallyAcceptedRecordkeepingPrinciples®(“ThePrinciples”)
ThePrinciplesreflectstandardsandguidelinesrelatedtorecordsmanagement,developedbyARMAInternational,anot-for-profitprofessionalassociationandawidely-recognizedauthorityonmanagingrecordsandinformation.ThePrinciplesinclude:(1)PrincipleofAccountability,(2)PrincipleofIntegrity,(3)PrincipleofProtection,(4)PrincipleofCompliance,(5)PrincipleofAvailability,(6)PrincipleofRetention,(7)PrincipleofDisposition,and(8)PrincipleofTransparency.
Source: IGRMWhitePaper
ThesaurusExpansion
InKeywordorBooleanSearch,replacingasingleSearchTermbyalistofitssynonyms,aslistedinathesaurus.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Thing
AthinginthecontextofInternetofthings(IoT),isanyobjectthatcouldbeconnectedtotheInternet,eachofwhichwouldhaveauniqueURI.
Source: HerbRoitblat,Search2020:TheGlossary.
Threading
Organizingemailsintoconversationalgroups.Forexample,ifJohnsendsanemailtoMaryandshereplies,bothemailsarepartofthesameconversationalthread.
Source: HerbRoitblat,Search2020:TheGlossary.
ThumbDrive
Alsoknownaskeychaindrive,thumbdriveandUSBflashdrive.
Thumbnail
Asmallversionofanimageusedforquickoverviewsortogetageneralideaofwhattheimagelookslike.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
TIFF
See: TaggedImageFileFormat(TIFF)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 305
TIFFGroupIII
Aone-dimensionalcompressionformatforstoringblackandwhiteimagesthatisutilizedbymostfaxmachines.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
TIFFGroupIV
Atwo-dimensionalcompressionformatforstoringblackandwhiteimages.Typicallycompressesata20-to-1ratioforstandardbusinessdocuments.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
TIFFing
Theprocessofopeningfilesintheirnativeapplications,extractingtextandprintingthemasTIFFimages.
Source: IbisConsulting,Glossary.
Time
Referstothemeasurablehoursinvolvedineachidentifiabletask,activityoraction.TimeisalsoavariableelementofCostandVolume.
Source: EDRMMetricsGlossary
Tokenization
Anoperationthatexaminesadocumentorblockoftextandbreaksthetextintowords.Typically,aspaceisusedtoseparatewords,butspecialcharacterssuchasahyphen,period,orquotationmarkcanalsobeused.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
ToolKitWithoutAnInterestingName(TWAIN)
Auniversaltoolkitwithstandardhardware/softwaredriversformulti-mediaperipheraldevices.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
TopicalSearch
Averticalsearchengineasdistinctfromageneralwebsearchengine,focusesonaspecificsegmentofonlinecontent.Theyarealsocalledspecialtyortopicalsearchengines.Theverticalcontentareamaybebasedontopicality,mediatype,orgenreofcontent.Commonverticalsincludeshopping,theautomotiveindustry,legalinformation,medicalinformation,scholarly
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 306
literature,andtravel.Examplesofverticalsearchenginesinclude;Mocavo,Nuroa,TruliaandYelp.Incontrasttogeneralwebsearchengines,whichattempttoindexlargeportionsoftheWorldWideWebusingawebcrawler,verticalsearchenginestypicallyuseafocusedcrawlerwhichattemptstoindexonlyrelevantwebpagestoapre-definedtopicorsetoftopics.
Source: Wikipedia,Verticalsearch,https://en.wikipedia.org/wiki/Vertical_search
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Weightedrelevancesearch
Wildcardsearch
TrainingExample
OneDocumentfromaTrainingSet.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TrainingSet
ASampleofDocumentsCodedbyoneormoreSubjectMatterExpert(s)asRelevantorNon-Relevant,fromwhichaMachineLearningAlgorithmtheninfershowtodistinguishbetweenRelevantandNon-RelevantDocumentsbeyondthoseintheTrainingSet.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TransmissionControlProtocol
See: TCP(TransmissionControlProtocol)
TransmissionControlProtocol/InternetProtocol
See: TCP/IP(TransmissionControlProtocol/InternetProtocol)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 307
TransmissionSpeed
Therateatwhichdatapassesthroughcommunicationslines;usuallymeasuredinbitspersecond(bps).
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
TREC
TheTextREtrievalConference,sponsoredbytheNationalInstituteofStandardsandTechnology(NIST),whichhasrunsince1992,to“supportresearchwithintheinformationretrievalcommunitybyprovidingtheinfrastructurenecessaryforlarge-scaleevaluationoftextretrievalmethodologies.Inparticular,theTRECworkshopserieshasthefollowinggoals:toencourageresearchininformationretrievalbasedonlargetestCollections;toincreasecommunicationamongindustry,academia,andgovernmentbycreatinganopenforumfortheexchangeofresearchideas;tospeedthetransferoftechnologyfromresearchlabsintocommercialproductsbydemonstratingsubstantialimprovementsinretrievalmethodologiesonreal-worldproblems;andtoincreasetheavailabilityofappropriateevaluationtechniquesforusebyindustryandacademia,includingdevelopmentofnewevaluationtechniquesmoreapplicabletocurrentsystems.”
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TheTextREtrievalConference,organizedtheUSNationalInstituteofStandardsandTechnology.TRECisanannualconferenceandfriendlycompetitionamonginformationretrievalsystemsintendedtopromotethescienceoftextretrieval.ForseveralyearsTRECincludedalegaltrack,whichinvestigatedtextretrievalinthecontextofdiscovery.
Source: HerbRoitblat,Search2020:TheGlossary.
Externallink:
TextREtrievalConference(TREC),http://trec.nist.gov
TRECLegalTrack
From2006through2011,TRECincludedaLegalTrack,whichsought“toassesstheabilityofinformationretrievaltechniquestomeettheneedsofthelegalprofessionfortoolsandmethodscapableofhelpingwiththeretrievalofelectronicbusinessrecords,principallyforuseasevidenceincivillitigation.”
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 308
Triage
Theprocessofreviewingadocumentset(discoveryset)forresponsivenessand/orprivilege.Triagereferstothepracticeofquicklyidentifyingwhichdocumentsrequireadditionalattentionandwhichcanbeeasilyclassifiedaseitherresponsiveornonresponsive.
Trigram
AnN-GramwhereN=3(i.e.,a3-gram).
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TrueNegative(TN)
ANon-RelevantDocumentthatiscorrectlyidentifiedasNon-Relevantbyasearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Oneoffourresponsestatesinacategorizationtask.Truenegativeresponsesarethosethataretrulyinthenegativecategoryandareclassifiedasnegative.
Source: HerbRoitblat,PredictiveCodingGlossary.
Seealso:
FalseNegative(FN) FalsePositive(FP) TruePositive(TP)
TrueNegativeRate(TNR)
Thefraction(orProportion)ofNon-RelevantDocumentsthatarecorrectlyidentifiedasNon-Relevantbyasearchorrevieweffort.
Source; MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TruePositive(TP)
ARelevantDocumentthatiscorrectlyidentifiedasRelevantbyasearchorrevieweffort.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Oneoffourresponsestatesinacategorizationtask.Truepositiveresponsesarethosethataretrulyinthepositivecategoryandareclassifiedaspositive.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 309
Seealso:
FalseNegative(FN) FalsePositive(FP) TrueNegative(TN)
TruePositiveRate(TPR)
Thefraction(orProportion)ofRelevantDocumentsthatarecorrectlyidentifiedasRelevantbyasearchorrevieweffort.TruePositiveRateisatermusedinSignalDetectionTheory;RecallistheequivalentterminInformationRetrieval.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
TrueResolution
The"true"opticalresolutionofascanneristhenumberofpixelsperinch(withoutanysoftwareenhancements).
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Truncation
ASearchSpecificationthatindicatesthatmatchingdocumentsmustcontainwordsthatbeginwiththelettersentered,butthatthematchingwordscanendwithanycombinationofletters.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
TWAIN
See: ToolKitWithoutAnInterestingName(TWAIN)
TWAINScannerDriver
Aspecializedapplicationusedforcommunicationbetweenscannersandcomputers.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Two-TailedDistribution
See: Two-TailedTest
Two-TailedTest
Aconfidenceintervalthatisarrangedsymmetricallyaroundtheaverageormeanofadistribution.Thetails,outsideoftheconfidenceintervalareofequalsize.Alsocalledtwo-taileddistribution.
Source: HerbRoitblat,PredictiveCodingGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 310
Typeface
Thereareover10,000typefacesavailableforcomputers.Thegeneralcategoriesare:
1. Oldstyle:Faceshaveslantedserifs,gradualthicktothinstrokesandaslantedstress(the"O"appearsslanted)
2. Modern:Faceshavethin,horizontalserifs,radicalthicktothinstrokesandaverticalstreet(the"O"doesnotappeartoslant.)
3. SlabSerif:Faceshavethick,horizontalserifs,littleornothick-to-thininthestrokesandaverticalstress(the"O"appearsvertical).
4. SansSerif:Faceshavenoserifs.5. Script:Fromelaboratehandwritingstylestocasual,freeform,unconnectedletterforms.6. Decorative:Unusualfonts,designedtobeverydifferentandattentiongetting.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
U
Ultrafiche
Microfichewhichcanhold1,000documents/sheetasopposedtothenormal270.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
UnallocatedSpace
Spaceonaharddrivethatpotentiallycontainsintactfiles,remnantsoffiles,subdirectories,ortemporaryfilesthatwerecreatedandthendeletedbyeitheracomputerapplication,theoperatingsystemortheoperator.
Source: RenewData,Glossary(10/5/2005).
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Seealso:
Ambientdata
Fragmenteddata
Freespace
Residualdata
Slackspace
Swapfile
UncertaintySampling
AnActiveLearningapproachinwhichtheMachineLearningAlgorithmselectstheDocumentsastowhichitisleastcertainaboutRelevance,forCodingbytheSubjectMatterExpert(s),andadditiontotheTrainingSet.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 311
Unicode/UnicodeTransformationFormat
Allelectronicdataisrepresentedassequencesofbits,ornumbers.Eachalphabetorscriptusedinalanguageismappedtoauniquenumericvalue,or‘encoded’foruseonacomputerusingastandardknownasUnicode.WithinUnicode,eachletterorcharacterhasbeenassigneditsownuniquevalueintheUnicodeencodingschemes,knownastheUnicodeTransformationFormat(UTF).TheUTFutilizesmultipleencodingschemes,ofwhichthemostcommonlyusedareknownasUTF-8andUTF-16.Forexample,theEnglishalphabetandthemorecommonpunctuationmarkshavebeenassignedvaluesbetween0and255,whileTibetancharactershavebeenassignedthevaluesbetween3,840(writtenasx0F00)and4,095(writtenasx0FFF).Allmodern(andmanyhistorical)scriptsaresupportedbytheUnicodeStandard.Unicodeprovidesauniquenumberforeverycharacter,regardlessoftheplatform,program,orlanguage.TheUnicodeStandardisdescribedindetailatthewebsitehttp://www.unicode.org.Seealso,CharacterEncoding.
Source: EDRMSearchGlossary.
UnifiedGovernance
UnifiedGovernanceisamarriagebetweenpolicyintegrationandprocesstransparency.Effectiveunifiedgovernancecreatesanorganizationalenvironmentwherebythekeystakeholdershaveadefinedpartnershipwithexecutivebuy-inandoversighttocreateauniformapproachandtoestablishastronglinkagebetweenlegalobligationsforinformation,recordsmanagement,andIT;andthedutyandvalueassociatedwiththedataasset.
Source: IGRMWhitePaper
UniformResourceIdentifier(URI)
Asymbolicstringrepresentationofthelocationofaninternetresource.AURL,UniformResourceLocatorisonetypeofURIforobjectsontheWorldWideWeb."Webaddress"isaURLthatusestheHTTPorHTTPSprotocol.
Source: HerbRoitblat,Search2020:TheGlossary.
Unitization
Theassemblyofindividuallyscannedpagesintodocuments:
• Physicalunitizationutilizesactualobjectssuchasstaples,paperclipsandfolderstodeterminepagesthatbelongtogetherasdocumentsforarchivalandretrievalpurposes.
• Logicalunitizationistheprocessofhumanreviewofeachindividualpageinanimagecollectionusinglogicalcuestodeterminepagesthatbelongtogetherasdocuments.Suchcuescanbeconsecutivepagenumbering,reporttitles,similarheadersandfootersandotherlogicalcues.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 312
UniversalSerialBus(USB)
APlugandPlayinterfacebetweenacomputerandaperipheralsuchasamouse,keyboard,digitalcamera,printerorscanner.UnlikedevicesconnectedviaSCSIports,USBdevicescanbeaddedtoandremovedfromthecomputerwithouthavingtorebootthecomputer.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
UNIX
Pronouncedyoo-niks,apopularmulti-user,multitaskingoperatingsystemdevelopedatBellLabsintheearly1970s.Createdbyjustahandfulofprogrammers,UNIXwasdesignedtobeasmall,flexiblesystemusedexclusivelybyprogrammers.
Source: http://www.webopedia.com/TERM/U/UNIX.html
AnoperatingsystemdevelopedbyBellLaboratoriesthatoffersmulti-userfunctionalityanduseshigh-levelprograms.OnPCs,itisoftenmarketedunderthenameXenix.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Asoftwareoperatingsystem.OriginallypioneeredbyBellLabs–nowwidelyusedbyworkstations.
Source:FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
DOS
Linux
MicrosoftDOS
MicrosoftWindows
Networkoperatingsystem
NOS
Operatingsystem
OS
Windows
Xenix
UnstructuredData
Datathatisnotintabularordelimitedformat.Filetypesincludewordprocessingfiles,htmlfiles(webpages),projectplans,presentationfiles,spreadsheets,graphics,audiofiles,videofilesandemails.
Source: RenewData,Glossary(10/5/2005).
UnsupervisedLearning
AMachineLearningmethodinwhichthelearningAlgorithminferscategoriesofsimilarDocumentswithoutanytrainingbySubjectMatterExpert(s).ExamplesofUnsupervisedLearningmethodsincludeClusteringandNear-DuplicateDetection.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 313
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Akindofmachinelearningwheretheobjectsarenotlabeledbyanexteriorsource.Instead,themachinelearningsystemorganizestheobjectsbasedonimplicitcriteriathatitderives.Theselectionofcriteriaisafunctionofthespecificlearningmethodsthatareemployed,thenatureoftheobjects,andthewayinwhichfeaturesoftheobjectarerepresented.Clusteringisanexampleofanunsupervisedmachinelearningmethod.Thegoalofunsupervisedlearningistypicallytoidentifyhiddenstructureinunlabeleddata,tosummarizekeyfeaturesofthedata.
Source: HerbRoitblat,Search2020:TheGlossary.
Upload
Totransferdatafromauser’scomputertoaremotecomputersystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
URI
See: UniformResourceIdentifier(URI)
USB
See: UniversalSerialBus(USB)
USBFlashDrive
See: JumpDrive
UserCreatedFile
Afilethatwascreatedbecauseoftheactionsoftheuser,withorwithoutintentorawareness.Excludessystemgeneratedlogfiles.
Source: DavidGreetham,[email protected](2008).
Datacreatedbyapersonoraperson'sinteractionwithacomputer.
Source: JohnMartin,[email protected](2008).
UserGroup
Anyorganizationmadeupofcomputerusers(asopposedtovendors)designedtogivetheusersaforumtoshareinformationaboutaparticularsystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 314
UserGuide
Asetofinstructionsoramanualforasoftwareprogramorhardwaresystem.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
User-Friendly
Termusedtodescribeasoftwareprogramthatisbotheasytolearnandeasytouse.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
UTBMS:ConversiontoProductionFormat
Theprocessofrestoringdatathathasbeen“deleted”fromastoragedeviceorretrievingdatafromadevicethathasfailed,beencorrupted,isdamaged,orconsideredinaccessible.
Source: EDRMMetricsGlossary
UTBMS:DataRecovery
Theprocessofrestoringdatathathasbeen“deleted”fromastoragedeviceorretrievingdatafromadevicethathasfailed,beencorrupted,isdamaged,orconsideredinaccessible.
Source: EDRMMetricsGlossary
UTBMS:DataSteward
Adatastewardissomeonethatisresponsibleformaintainingandmanagingthedataassetsofaparticularorganization.Theroleofdatastewardcanbecontrastedfromadatacustodianinthat,thoughtheybothmaysharecertainresponsibilitieswithregardstodata,adatacustodian,inthee-discoverycontext,isoftenusedtodescribetheindividualresponsiblefortheday-to-daycontrolofacertaindataset(i.e.anindividualisthedatacustodianfortheiremailinbox,anITmanageristhedatacustodianforfilesharesonanetworkserver).
Source: EDRMMetricsGlossary
UTBMS:Defensibility
AhighlyprizedInformationGovernance(IG)solutionattributetodayandintheforeseeablefutureisDefensibility.CE-discoveryandarchivingvendorstoutthisconcept,especiallyasitrelatestoanentity’slitigationorregulatoryactivitiesanditsabilitytoproduce,inatimelyfashion,writtendocumentsorESI.However,inpractice,defensibilityisamuchbroaderconcept.
IntheparlanceoftheIGspace,defensibilitycanapplyto:
• Theabilitytodemonstratethatappropriate,achievable,consistentpoliciesgoverningthemanagementofphysicalandelectronicrecordshavebeendevelopedand
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 315
implementedandthatemployeeshavebeeninformedandeducatedonthosepoliciesaswellasofferedongoingtrainingandupdates.
• Theabilitytodemonstraterepeatableprocessesthatsupportafirm’sneedtocomplywithlegalorregulatoryrequirements.
• TheabilitytorespondtolegalorregulatoryediscoveryrequestsinatimelyfashiontothwartquestionablelitigationsorpotentialfinesthatcouldbeleviedduetoinabilitytoproduceESI.
• Theimplementationofsolutionsthatofferpredictabilitywhetheritbetosupportcompliancewithretentionpolicies,theabilitytocaptureallappropriateESIortheabilitytoscaleasmorecontentismanagedelectronicallywithassurancethatallneededESIiscapturedandpreserved.
• Thecreationofaninformationsecuritystrategythatlimitsbothexternalandinternalrisksandbreacheswhentheyoccur.
• Ariskmanagementstrategythatidentifiespotentialliabilities,improvesdisasterpreparednessandprotectscorporateandpersonalassets.
Source: http://wikibon.org/wiki/v/Not_your_Fathers_Enterprise_Information_Archiving_Solution:_The_Next_Generation_Defined
Source: EDRMMetricsGlossary
UTBMS:ESIDataMap
Datamappingfindsorsuggestsassociationsbetweenfileswithinalargebodyofdata,whichmaynotbeapparentusingothertechniques.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
UTBMS:ESIInventory
AnESIinventoryisasystematicprocessforidentifyingalloftherecordsandnon-recordinformationinanorganization,whocreates,uses,orreceivestheinformation,andwhereusersstoreit.Acompletedinventoryprovidesacompletepictureoftheinformationenvironment.ThispictureisveryhelpfulforassessingtheneedsofyourRIMprogram.
Source: http://www.aiim.org/community/blogs/expert/carrying-out-a-records-inventory#sthash.TUKP9zzt.dpuf
UTBMS:ESIPreparation
PreparingESI(electronicallystoredinformation)forprocessingandpresentation.
Source: EDRMMetricsGlossary
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 316
UTBMS:ESIPresentation
PresentingESI(electronicallystoredinformation)tothedesiredviewer,whetheritbeindepositions,production,trial,etc.ESImaybepresentedinnativeformat,nearnativeformat,orinsomeotherformacceptabletotheparties.
Source: EDRMMetricsGlossary
UTBMS:ESIProcessing
Anyactiontakenondatausingtechnologytoreduceadatacorpusbasedonspecificcriteria,organizedataaccordingtocertainparameters,orconvertdatatoanotherformatmoresuitableforreviewandanalysis.
Source: EDRMMetricsGlossary
UTBMS:ESIStaging
DatastagingistheprocessbywhichoriginalESIfilesarecopied,isolated,andstoredinaforensicallysoundmannerforfutureuse.
Source: EDRMMetricsGlossary
UTBMS:ExceptionHandling
Exceptionhandlingistheprocessofrespondingtotheoccurrence,duringcomputation,ofexceptions–anomalousorexceptionaleventsrequiringspecialprocessing–oftenchangingthenormalflowofprogramexecution.Itisprovidedbyspecializedprogramminglanguageconstructsorcomputerhardwaremechanisms.Exceptionsalsooccurwhendoingareview,forexample.
Ingeneral,anexceptionishandled(resolved)bysavingthecurrentstateofexecutioninapredefinedplaceandswitchingtheexecutiontoaspecificsubroutineknownasanexceptionhandler.Iftheexceptionstatepermitscontinuation,thehandlermaylaterresumetheexecutionattheoriginalstateusingthesavedinformation.Forexample,afloatingpointdividebyzeroexceptionwilltypically,bydefault,allowtheprogramtoberesumed,whileanoutofmemoryconditionmightnotberesolvabletransparently.
Alternativeapproachestoexceptionhandlinginsoftwareareerrorchecking,whichmaintainsnormalprogramflowwithsubsequentexplicitchecksforcontingenciesreported,usingspecialreturnvaluesorsomeauxiliaryglobalvariablesuchasC’serrnoorfloatingpointstatusflags;orinputvalidationtopreemptivelyfilterexceptionalcases.
Source: http://en.wikipedia.org/wiki/Exception_handling.Formoreinformationonexceptionhandling,seehttp://www.meridiandiscovery.com/articles/exception-handling-and-reporting-in-e-discovery/.
Source: EDRMMetricsGlossary
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 317
UTBMS:FirstPassDocumentReview
Whereadocumentreviewisorganizedinstages,thefirstpassdocumentreviewisthefirstlookatthedocumentsthatwereidentifiedaspotentiallyresponsiveorrelevantfromtheinitialdocumentcollection.Typically,afirstpassrevieweranalyzesthedocumentsforrelevanceorresponsivenessandcodesormarksthemassuch.Often,thereviewerwillcodeforconfidentialityandmakeaninitialprivilegedeterminationduringthefirstpassreview.
Source: EDRMMetricsGlossary
UTBMS:ForensicAnalysisActivity
Forensicanalysisistheuseofcontrolledanddocumentedanalyticalandinvestigativetechniquestoidentify,collect,examine,andpreservedigitalinformation.Recognizingthefragilenatureofdigitaldata,andthelegalandregulatoryrequirementstoproperlypreserveelectronicallystoredinformation(ESI)duringforensicinvestigations.
Source: EDRMMetricsGlossary
UTBMS:HostingCosts
Thecosttohostdataonadatabaseorreviewplatform;traditionally,thehostingphaseoccursafterdataiscollected,processed,andloadedtothereviewtool.CostofhostingistypicallybyGBpermonth.
Source: EDRMMetricsGlossary
UTBMS:LegalHold
Alegalholdisacommunicationissuedasaresultofcurrentoranticipatedlitigation,audit,governmentinvestigationorothersuchmatterthatsuspendsthenormaldispositionorprocessingofrecords.Legalholdscanencompassbusinessproceduresaffectingactivedata,including,butnotlimitedto,backuptaperecycling.ThespecificcommunicationtobusinessorITorganizationsmayalsobecalleda“hold,”“preservationorder,”“suspensionorder,”“freezenotice,”or“holdnotice.”
Source: SharonD.Nelson,BruceOlson,JohnW.Simek,TheElectronicEvidenceandDiscoveryHandbook:Forms,ChecklistsandGuidelines,AmericanBarAssociationLawPracticeDivision(2006).
Source: EDRMMetricsGlossary
UTBMS:NativeFormat
Electronicdocumentshaveanassociatedfilestructuredefinedbytheoriginalcreatingapplication.Thisfilestructureisreferredtoasthe“nativeformat”ofthedocument.Becauseviewingorsearchingdocumentsinthenativeformatmayrequiretheoriginalapplication(i.e.,viewingaMicrosoftWorddocumentmayrequiretheMicrosoftWordapplication),documentsareoftenconvertedtoastandardfileformat(i.e.,tiff)aspartofelectronicdocumentprocessing.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 318
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Source: EDRMMetricsGlossary
UTBMS:Near-LineStorage
Near-linestorageisusedasaninexpensive,scalablewaytostorelargevolumesofdata.Near-linestoragedevicescanincludeDATandDLTtapes,opticalstorage,andstandardalsoslowerP-ATAandSATAharddiskdrives.Near-Lineimpliesthatthestorageisnotimmediatelyavailable,butcanbemadeonlinequicklywithouthumanintervention.Near-linecanbeslower,butgenerally,thetypeofdatastoredinnear-linesystemsdoesnotrequireinstantaccess.
Source: http://www.webopedia.com/TERM/N/near-line_storage.html.
Source: EDRMMetricsGlossary
UTBMS:Near-NativeForms
Anear-nativeformatdescribesanelectronicdocumentthathasbeenalteredorconvertedfromitsoriginalforminordertoprovideenhancedcontentcontroltoaproducingpartywhilemaintainingalevelofusabilityconsistentwithitsoriginalformat(e.g.conversionofaworddocumenttoaTIFFimagewithOCRtosupportredactions).
Source: EDRMMetricsGlossary
UTBMS:Non-CustodialData
Dataorrecordsthatarenotcreatedormaintainedbyanindividualuser,orwhosephysicalstorageandprotectionduringtheretentioncyclearemaintainedbyasystemcustodianandnotend-users.Examplesofnon-custodialdatamayincludedataincertainstructuredsystems,oraccesscontrolorsimilarlogs.Itmaynotbepossibletoattributeauthorshiptonon-custodialdata.IncontrasttoCustodialData.
Source: EDRMMetricsGlossary
UTBMS:ObjectiveCoding
Therecordingofbasicdatasuchasdate,author,ordocumenttype,fromdocumentsintoadatabase.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Extractinginformationfromelectronicdocumentssuchasdatecreated,authorrecipient,CCandlinkingeachimagetotheinformationinpre-definedobjectivefields.IndirectoppositiontoSubjectivecodingwherelegalinterpretationsofdatainadocumentarelinkedtoindividualdocuments.Alsocalledbibliographiccoding.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 319
Extractingvarioussegmentsofinformationfromadocumentsuchasitsauthor,recipient,mailingdate,orotherfields,etc.ObjectiveCodingisusuallydonefromthedocumenttextorimagebecausemetadataorsearchabletextmaybeunavailable(e.g.ahandwrittendocumentthathasbeenscanned),orthedocumentmaycontaininaccuratemetadata(e.g.metadataassociatedwithadocumentwrittenandsignedbyapartnermightreflecttheadministrativeassistantastheauthorwherethedocumentwasoriginallytypedontheassistant’scomputer).
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Subjectivecoding
Tag
Taxonomiccoding
Verbatimcoding
UTBMS:Off-LineStorage
Anystoragemediumthatisnotimmediatelyavailable,andmustbeinsertedintoastoragedrivebyapersonbeforeitcanbeaccessedbythecomputersystem.ExamplesincludeCD/DVDopticalmedia,USBmemorysticks,andtapecartridges.Offlinestorageisalsocalledremovablestorage.
Source: http://www.webopedia.com/TERM/O/offline_storage.html.
Source: EDRMMetricsGlossary
UTBMS:On-LineStorage
Onlinestorageisfullyaccessibleandimmediatelyavailable.ThisincludesDRAMmemory,solid-statedrives(SSD),andalways-onspinningdisk,regardlessofrotationalspeed.Incontrasttonear-linestorageandoff-linestorage.
Source: EDRMMetricsGlossary
UTBMS:PreservationOrder
Alsocalleda“legalhold,”“hold,”“holdorder,”“holdnotice,”“suspensionorder,”or“freezenotice”.APreservationOrderisacommunicationissuedasaresultofpendingorreasonablyanticipatedlitigationorgovernmentinvestigationoractiondirectingthesuspensionofthenormaldispositionorprocessingofrecords,includingelectronicallystoredinformation.
Source: EDRMMetricsGlossary
UTBMS:PrivilegeReview
Areviewofthedocumentsidentifiedasresponsiveorrelevantinaparticularlegalproceedingfortheadditionallegalclassificationofprivilegewhetherasattorney-clientcommunicationorunderthework-productdoctrine.Thelawpermitsadisclosingpartytowithholdproductionof
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 320
documentsonthegroundsoflegalprivilege.Usually,aPrivilegeLogisgeneratedinconjunctionwiththePrivilegeReview.SeealsoPrivilegeLog.
Source: EDRMMetricsGlossary
UTBMS:ProductionFormat
Theformatinwhichvariousdocumentsaredeliveredfromonepartytoanotherduringthecourseofalegalproceeding.Availableformatsfordocumentproductionarenative,nearorquasi-native,image(e.g.TIFForPDFimages),andpaper.Rule26(f)setsanexpectationthatthemethodandformatbywhichESIistobeproducedshouldbeconsideredandnegotiatedbythepartiesearlyinthediscoveryprocess.FRCP34(b)(1)(E)(ii)statesthat“ifarequestdoesnotspecifyaformforproducingESI,apartymustproduceitinaformorformsinwhichitisordinarilymaintainedorinareasonablyusableformorforms.”Akeyquestionregardingproductionformatsiswhethertoincludeassociatedmetadata.Seealso:native,near-native,image,andpaper.
Source: EDRMMetricsGlossary
UTBMS:ProjectManagement
Projectmanagementisthedisciplineofplanning,organizing,motivating,andcontrollingresourcestoachievespecificgoals.
Source: http://en.wikipedia.org/wiki/Project_management.
Aprojectisatemporaryendeavorwithadefinedbeginningandend,undertakentomeetspecificgoalsandobjectivesandcanbedistinguishedfromoperations(businessasusual).Theprimaryobjectiveofprojectmanagementistodelivertheprojectgoalswhilemanagingtheconstraintsonprojectdelivery.Theprimaryconstraintsarescope,time,qualityandbudget.
Source: PMI(2010).AGuidetotheProjectManagementBodyofKnowledgep.27-35.
Thesecondarychallengesaretooptimizetheallocationofnecessaryinputsandintegratethemtomeetpre-definedobjectives.
Source: http://en.wikipedia.org/wiki/Project_management.
Projectmanagementprinciplesapplytoe-discovery,andmoste-discoveryactivitiesareprojects.
UTBMS:Redaction
Theprocessingofeditingthecontentofadocument,usuallybyobscuringorremovingcertainsensitive,confidentialorprivilegedinformation,priortoitsproductionfromonepartytoanother.
Source: EDRMMetricsGlossary
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 321
UTBMS:SecondPassDocumentReview
Whereadocumentreviewisorganizedinstages,thesecond-passdocumentreviewisthesecond,moredetailedreviewofdocumentsthatwereidentifiedaspotentiallyresponsiveorrelevantinthefirst-passreview.Second-passreviewcanconsistofadetailedreviewofdocumentstodeterminewhatdocumentsshouldbewithheldproductiononthegroundsofprivilege,relevanceorotherfactorsandwhichdocumentsshouldberedacted.Second-passreviewcanalsobeusedtoquality-check(QC)thefirst-passreview.Second-passreviewisfrequentlyperformedbymoreseniorattorneys.Incontrasttofirst-passreview.Seealso:DocumentReview.
Source: EDRMMetricsGlossary
UTBMS:SecondaryLineStorage
Computerstorage,asondiskortape,supplementaltoandslowerthanmainstorage,andnotunderthedirectcontroloftheCPUandgenerallycontainedoutsideit.
Source: http://dictionary.reference.com/browse/secondary+storage.
Source: EDRMMetricsGlossary
UTBMS:StructuredData
StructuredDataisdatathatisorganized.Themostcommontypeisdatabasecontent.ItreferstoanytypeofdataorganizedsuchasInternetdataorothertypesofdatathathasbeenindexed.
Source: EDRMMetricsGlossary
Datathatresidesinafixedfieldwithinarecordorfileiscalledstructureddata.Thisincludesdatacontainedinrelationaldatabasesandspreadsheets.
Source: http://www.webopedia.com/TERM/S/structured_data.html.
UTBMS:SubjectiveCoding
TheSubjectiveCodingofadocumentinvolveslinkingalegalinterpretationtoanindividualdocument.Indirectoppositiontoobjectivecoding,inwhichbibliographicdataaboutthedocumentisrecorded.SubjectiveCodingtypesincludetheclassificationofdocumentsasprivilegedandresponsive,andthecategorizationofdocumentsbylegalissue(“issuecoding”).
Source:EDRMMetricsGlossary
Enteringinformationfromadocumentthatrequiresthecodertoexercisejudgment,suchassubjectorissuecodes.Thisfieldisoftenleftblankforthelawfirm’sparalegalsorassociatestofillin.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 322
Thecodingofadocumentusinglegalinterpretationasthedatathatfillsafield.Performedbyparalegalsorothertrainedlegalpersonnel.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Categorizingdocumentsbytheirresponsivenesstospecificcaseissuesortopics.
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Tag
Taxonomiccoding
Verbatimcoding
UTBMS:UnstructuredData
UnstructuredDataisthemajorityofdatacreatedtoday.Itistheoppositeof“structureddata”suchasindexeddatafoundinadatabasebecauseitisnotpre-organizedorpre-defined.ExamplesofunstructureddataincludeMicrosoftWordandotherwordprocessingdocuments;spreadsheets;email;Webpages;images;videos;andtext.
Source: EDRMMetricsGlossary
Utilities
Asetofroutinesdesignedtoserviceaprogramorsystem.Examplesareutilitiesfilemaintenance,informationrecoveryfromdamageddisks,diskinitializing,diskcopying,routinesystemmaintenancechecks,andsupervisoryfunctions.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
V
V.32bis
TheITUstandardfor14.4kbsmodemcommunications.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
V.34
TheproposedITUstandardfor28.8kbsmodemcommunications.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 323
VAD
See: VAR
Validation
Theactofconfirmingthataprocesshasachieveditsintendedpurpose.ValidationmayinvolveStatisticalorJudgmentalSampling.
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
ValidationMethodologies
ValidationmethodologiesinvolvethecaseteaminreviewingsamplesofdocumentstodeterminelitigationrelevancetoclassifydocumentsasResponsiveorNotResponsivetotheissuesofthecaseandthereforeincreasingtheprecisionofthesearchresults.Resultsofakeywordoriterativesearchmaybevalidatedbyobservingthefrequencyofhits,validatingdroppeditems,samplingnon-hits,andreviewcallfeedbackanalysis.
Source: EDRMSearchGlossary.
ValidationTable
Alsocalleda“lookuptable.”Apre-definedsetofentriesforaspecificfield,oftenabbreviations,whichappearwhenthecodermovestothatfield.Validationtablesareusedtocutdownonerrorsduringdataentry.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Value
Utilityorbusinesspurposeofspecificinformation.Thelineofbusinesshasaninterestininformationproportionaltoitsvalue—thedegreetowhichithelpsdrivethe“Profit”orpurposeoftheenterpriseitself,itsmissionandgoals.
Source: IGRMWhitePaper
Value-AddedDealer
See: VAR
Value-AddedReseller
See: VAR
Value-AddedSpecialtyDistributor
See: VAR
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 324
VAR
Value-AddedResellerValue-AddedDealerValue-AddedSpecialtyDistributor
Companiesorpeoplewhosellcomputerhardwareorsoftwareand"add-value"intheprocess.Mostusuallythevalueaddedisspecifictechnicalormarketingknowledgeand/orexperience.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
VASD
See: VAR
VDT(VideoDisplayTerminal)
Genericnameforalldisplayterminals.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
VectorGraphics
Asetofroutinesdesignedtoserviceaprogramorsystem.Examplesareutilitiesfilemaintenance,informationrecoveryfromdamageddisks,diskinitializing,diskcopying,routinesystemmaintenancechecks,andsupervisoryfunctions.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Vendor
Thesellerofcomputersorapplications.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Vendor-AddedMetadata
Datacreatedandmaintainedbytheelectronicdiscoveryvendorasaresultofprocessingthedocument.Whilesomevendor-addedmetadatahasdirectvaluetocustomers,muchofitisusedforprocessreporting,chainofcustody,anddataaccountability.Contrastwithcustomer-addedmetadata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 325
Seealso:
Customer-addedmetadata
Documentmetadata
Emailmetadata
Extrinsicdata
Fileparameters
Filesystemmetadata
File-specificmetadata
Generalmetadata
Metadata
VerbatimCoding
Extractingdatafromdocumentsinacollectioninawaythatmatchesexactlyastheinformationappearsinthedocuments.Theoppositeofthestandardizationtypecodingtreatment.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
BibliographicCoding
Coding
Indexing
IssueCode
Issuecoding
Levelcoding
Objectivecoding
Subjectivecoding
Tag
Taxonomiccoding
Version
Areleaseofasoftwareprogram.Normally,newversionsincludeadditionalfeatures.
VerticalDeduplication
Deduplicationwithinacustodian;identicalcopiesofaDocumentheldbydifferentcustodiansarenotDeduplicated.(Cf.HorizontalDeduplication.)
Source: MauraR.GrossmanandGordonV.Cormack,EDRMpage&TheGrossman-CormackGlossaryofTechnology-AssistedReview,withForewordbyJohnM.Facciola,U.S.MagistrateJudge,2013Fed.Cts.L.Rev.7(January2013).
Seealso:
Basicde-duplication
Casede-duplication
Custodiande-duplication
De-duplication
Duplicate
Dynamicde-duplication
GlobalDeduplication
HorizontalDeduplication
Productionde-duplication
VESA(VideoElectronicsStandardsAssociation)
Concentratesoncomputervideostandards.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 326
VGA(VideoGraphicsAdapter)
APCindustrystandard,firstintroducedbyIBMin1987,forcolorvideodisplays.Theminimumdot(pixel)displayis640by480by16colors.Then"SuperVGA"wasintroducedat800x600x16,then256colors.VGAcanextendto1024by768by256colors.ReplacesEGA,anearlierstandardandtheevenolderCGA.Newerstandarddisplayscanrangeupto1600by1280.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
VideoDisplayTerminal
See: VDT(VideoDisplayTerminal)
VideoElectronicsStandardsAssociation
See: VESA(VideoElectronicsStandardsAssociation)
VideoGraphicsAdapter
See: VGA(VideoGraphicsAdapter)
VideoScannerInterface
Atypeofdeviceusedtoconnectscannerswithcomputers.ScannerswiththisinterfacerequireascannercontrolboarddesignedbyKofax,XionicsorDunord.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Videoblog(Vlog)
AvlogisaWeblogthatusesvideoasitsprimarymediumfordistributingcontent.Vlogpostsareusuallyaccompaniedbytext,image,andothermetadatatoprovideacontextoroverviewforthevideo.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Virtual
Thecreationofcase-specificwordsandcodestoensureuniformdataentry.Usedinconjunctionwithvalidationtables.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
VirtualPrivateNetwork(VPN)
Awaytoprovideremoteaccesstoanorganization'snetworkviatheInternet.VPNssenddataoverthepublicInternetthroughsecure"tunnels."
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 327
Avirtuallyprivatenetworkthatisconstructedbyusingpublicwirestoconnectnodes.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Vlog
See: Videoblog(Vlog)
VocabularyControl
Thecreationofcase-specificwordsandcodestoensureuniformdataentry.Usedinconjunctionwithvalidationtables.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Volume
VolumeistheprincipalvariableofMetrics.VolumeistheamountofdatathatispartoftheeDiscoverycollection.VolumewillsettheestimateforCostandTime.Forexample,alargedataVolumewillcauseanincreaseintheTimerequiredtocompletethePhasesforProcessing,ReviewandProduction,thusincreasingtheCostoftheproject.Ifthevolumeofdatadecreases,Time&Costwillalsolikelydecrease.
Source: EDRMMetricsGlossary
VPN
See: VirtualPrivateNetwork(VPN)
W
WAIS(WideAreaInformationServer)
Acentraldatabaseusedforinformationaccessbynetworkusersinmultiplephysicallocations.OftenreferstoanInternetdatabase,butWAISservershaveexistedforsometimeoutsidetheInternetarena.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
Database
Flatfiledatabase
Fulltextdatabase
Relationaldatabase
SQL
WAN(WideAreaNetwork)
Acentraldatabaseusedforinformationaccessbynetworkusersinmultiplephysicallocations.OftenreferstoanInternetdatabase,butWAISservershaveexistedforsometimeoutsidetheInternetarena.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 328
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Asystemoflocalareanetworksindifferentphysicallocationsconnectedthroughcommunicationssoftware.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
GenerallyanetworkofPC's,remotetoeachother,connectedbytelecommunicationslines.
Seealso:
Client/servernetwork
LAN-localareanetwork
MAN-metropolitanareanetwork
Network
Peer-to-peernetwork
SAN-storageareanetwork
Standalonecomputer
WAP(WirelessApplicationProtocol)
Awidelyusedsetofprotocolsthatstandardizethemannerinwhichwirelessdevices,suchascellphonesandsomePDAsareabletoaccesspartsoftheInternet,suchase-mailandtheWeb.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
WAV
FileextensionnameforWindowssoundfiles.Compressionisnotrequired..WAVfilescanreach5megabytesforoneminuteofaudio.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Web(WorldWideWeb)
TheWWWismadeupofallofthecomputersontheInternetwhichuseHTML-capablesoftware(Netscape,Explorer,etc.)toexchangedata.DataexchangeontheWWWischaracterizedbyeasy-to-usegraphicalinterfaces,hypertextlinks,images,andsound.TodaytheWWWhasbecomesynonymouswiththeInternet,althoughtechnicallyitisreallyjustonecomponent.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
TheportionoftheInternetwithaGUI.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 329
WebReview
Allowsclientsandlitigatorstoreviewmetadata,text,images,ornativefiles,inanycombination,viaaWebbrowserconnectedtoadocumentrepositoryviaasecureon-lineconnection.
Source: IbisConsulting,Glossary.
WebSite
AcollectionofUniformResourceIndicators(URIs,includingURLs(UniformResourceLocators))inthecontrolofoneadministrativeentity.MayincludedifferenttypesofURIs(i.e.,filetransferprotocolsites,telnetsites,aswellasWorldWideWebsites).
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
WeightedRelevanceSearch
Atypeofsearchthatwillallowtheusertosortandretrievedocumentsaccordingtoastatistical“weight”givenbytheuseofamathematicalrelevancyevaluationprogram.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Wildcardsearch
WhatYouSeeIsWhatYouGet(WYSIWYG)
Pronounced“wizeewig.”Asystemthatallowstheusertoseeonscreenexactlywhatwillbeprintedout.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Display&softwaretechnologywhichshowsonthecomputerscreenexactlywhatyou'llgetwhenyouprintthatscreen.Usuallyrequiresalarge,high-densitymonitor.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 330
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
WideAreaInformationServer
See: WAIS(WideAreaInformationServer)
WideAreaNetwork
See: WAN(WideAreaNetwork)
WildcardSearch
Thewildcardsymbol,typically"*",canbeusedwithanyothersearchtoretrievedifferentvariationsofthesameword,e.g.,“insur*”forinsurance,orinsured.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
AdHocSearch
Adaptivepatternrecognition
Associativeretrieval
Booleansearch
Combinedwordsearch
ComplianceSearch
Conceptsearch
ExploratorySearch
Fulltextsearch
Fuzzysearch
Index
Index/codingfield
Keyword
Keywordsearch
Naturallanguagesearch
Numericrangesearch
Phonicsearch
Phrasesearch
Proximitysearch
Rangesearch
Search
Similardocumentsearch
Sound-alike
Stemming
Synonymsearch
Termsearch
Topicalsearch
Weightedrelevancesearch
Wildcards
Symbolssuchas*or?includedwithinaKeywordtoindicatethatthelocationwherethesymbolsareusedmaymatchasingleletterormultipleletters.
Source: EDRMSearchGuideGlossary.
Source: EDRMSearchGlossary.
Windows
AsoftwareproductthatprovidesanoperatingenvironmentthatrunsunderMS-DOS,usingaGUIthatcanrundifferentprogramsatthesametimeindifferentwindows.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 331
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
DOS
Linux
MicrosoftDOS
MicrosoftWindows
Networkoperatingsystem
NOS
Operatingsystem
OS
UNIX
Xenix
WindowsNTFileSystem
See: NTFilingSystem(NTFS)
WinZip
WinZipisaprogramcommonlyutilizedtozipfiles.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Wipe
Termfordeliberatelyoverwritingapieceofmediaandremovinganytractoffilesorfilefragments.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
WirelessApplicationProtocol
See: WAP(WirelessApplicationProtocol)
Wordnet
AnelectronicthesaurusdevelopedbyGeorgeMillerandhisstudentsatPrincetonUniversity.Usedbysomesystemstoprovidesynonymsforqueryexpansion.
Source: HerbRoitblat,Search2020:TheGlossary.
Workflow
Thestreamofinformationprocessingthroughanorganization.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Workgroup
Agroupofcomputerusersconnectedtoshareindividualtalentsandresourcesaswellascomputerhardwareandsoftware–oftentoaccomplishateamgoal.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 332
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Workstation
Asinglecomputer,eitheradesktopwithaharddiskoradumbterminal.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
ApowerfulmicrocomputerorminicomputerwithaRISCchip,typicallyusedbyengineersorgraphicstechnicians.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Seealso:
Computer
Fileserver
Laptopcomputer
Microcomputer
Minicomputer
Notebookcomputer
Personalcomputer
WorldWideWeb
See: Web(WorldWideWeb)
WORM(WriteOnce,ReadMany)
AnopticaldiscstoragedevicethatuseslasertechnologysimilartotheCD-ROM.InformationwrittentotheWORMdisc,cannotbealtered.TheadvantagesofWORMareincreaseddiscdensityandlifeexpectancy.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
Datastoragedevices(e.g.CD-ROM's)wherethespaceonthediskscanonlybewrittenonce.Thedataispermanentlystored.Thisisoftentoday'sprimarymediaforarchivalinformation.TheexpectedviablelifetimeofaWORMisatleast50years.Sinceit'simpossibletochange,thegovernmenttreatsitjustlikepaperormicrofilmanditisacceptedinlitigationandotherrecord-keepingapplication.Onthenegativeside,thereisnocurrentstandardforhowWORM'sarewritten.TheonlyISOstandardisforthe14"version,manufacturedonlybyonevendor.A5.25"standardisemergingfromtheEuropeanComputerManufacturingAssociationbutisnotyetaccepted.Further,WORMdiscsarewrittenonbothsides,buttherearecurrentlynodrivesthatreadbothsidesatthesametime.Asforspeed,WORMisfasterthantapeorCD-ROM,butslowerthanmagnetic.Typicaldiskaccesstimesrunbetween40and150milliseconds(comparedwith11msforfastmagneticdisksand300msforCD-ROM.Datatransferratesrunbetween1and2MB/sec(comparedwith5to10formagneticdiscsand600KB/secforCD-ROM.Disksizesrunfrom5.25"(1.3gigabytes)to12"(8to10gigabytes)capacities.Thereis
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 333
alsoa14'"disc(13to15gigabytes),onlymanufacturedbyKodak'sopticalstoragegroup.WORM'scanalsobeconfiguredintojukeboxes.Therearevarioustechnologies:
TechnologyDescription Benefit DrawbackAblative:Laserburnsholesindisk
Unalterabledata
Dust,moisturemayaffectmedia
Bubble-forming:Laserformsbubblesinthemedia
Unalterabledata
Fewdrivesavailable
DyePolymer:Laserheatsdyedlayertoformbumps
Potentiallowmediacost
Lasermechanismmoreexpensive;diskswearoutfaster;fewdrivesavailable
Magneto:Laserfocusesmagneticfield
Manysuppliers,longdisklife
NotrueWORMinmulti-function;datatheoreticallyalterable
Phasechange:Laserheatchangesdisk'smolecularstructure
One-passdata(noerasestep)
SameasDyePolymer
FromImagingMagazine,September,1994
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
WORMDisk
Apopulararchivalstoragemediaduringthe1980s.Acknowledgedasthefirstopticaldisks,theyareprimarilyusedtostorearchivesofdatathatcannotbealtered.WORMdisksarecreatedbystandalonePCsandcannotbeusedonthenetwork,unlikeCD-Rs.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
Zipdisk
WriteOnce,ReadMany
See: WORM(WriteOnce,ReadMany)
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 334
WriteProtect
Restrictadiskettefromhavinginformationrecordedtoit.Usedtopreventtheerasureofvaluableinformation.
Source: LegalElectronicDocumentInstitute,BasicPrinciplesofAutomatedLitigationSupport(2005).
WYSIWYG
See: WhatYouSeeIsWhatYouGet(WYSIWYG)
X
X.25
Astandardprotocolfordatacommunications.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
Xerography
Abeamoflighthitsanelectricallychargeddrumandcausesadischargeatthatpoint.Toneristhenappliedwhichstickstothenon-chargedareas.Paperispressedagainstthedrumtoformtheimageandisthenheatedtodrythetoner.Usedinlaserprintersandcopyingmachines.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
XML
See: ExtensibleMarkupLanguage(XML)
Y
Yield
SeePrevalenceorRichness.
Source: TheGrossman-CormackGlossaryofTechnologyAssistedReview(Version1.02,Nov.2102).
Z
Z-TestofProportions
Astatisticalhypothesistestcomparingtwoproportions.Thistestassumesthatthetwoproportionsaredistributedapproximatelyaccordingtothenormaldistribution.Withsamplesizesmorethanafewtens,thisisanappropriateassumption.Itteststhehypothesisthatthereisnodifferencebetweenthetwopopulationproportions.
EDRMGlossary http://www.edrm.net/resources/glossaries/glossary
©2016EDRMLLC 335
Source: HerbRoitblat,PredictiveCodingGlossary.
Zero-LengthFile
Afilewithfileproperties,0bytesize,(0Bor0K)butnocontent,ormetadata(includingcommercialapplicationdata).
Source: IbisConsulting,Glossary.
Zip
Theactofcompressinglargefilesintoasinglefile,calledazipfile.Zipfilestakeuplessstoragespacesotheyeasytosendviaemail.NottobeconfusedwithZipdrive,aportablestorageperipheral.
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Acommonfilecompressionformatthatallowsquickandeasystoragefortransport.Compressesandcombinesoneormoredocumentsbyutilizinganalgorithmthat'removes'whitespaceandreplacesitwhendecompressiontakesplace.Commonlyusedtocombineandsendlargedocumentsviaemail.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
AnopenstandardforcompressionanddecompressionusedwidelyforPCdownloadarchives.ZIPisusedonWindows-basedprogramssuchasWinZipandDragandZip.ThefileextensiongiventoZIPfilesis.zip.
Source: KrollOntrack,GlossaryofTerms,http://www.krollontrack.com/glossaryterms
Analgorithmusedtocreateacompressedarchive.Thearchivecancontainmanydifferentfiles,eachofwhichcanberecoveredby“unzipping”thearchive.
ZipDisk
Alargecapacityfloppydiskthatcanonlybereadfromorwrittentousingaproprietaryzipdiskdrive.
Seealso:
CD
CD-R
CD-ROM
CD-RW
Disc
Disk
Diskette
DVD
DVD-ROM
Floppydisk
Harddisk
Harddrive
Jazdisk
Laserdisc
Magneticdisk
Magneticstoragemedia
Media
Opticaldisk
Storagemedia
WORMdisk
©2016EDRMLLC
ZipDrive
Abrand-namemagneticstoragedevicethatcanholdbetween100and250megabytesofdata.
Source: Fios,E-DiscoveryGlossary,http://discoveryresources.org/01_electronic_discovery_glossary.html
Source: Vinson&ElkinsLLPPracticeSupport,EDDGlossary.
Source: RSI,Glossary.
Seealso:
Diskdrive
Floppydiskdrive
Jazdrive
Magneto-opticaldrive
Portabledrive
Storagedevice
Tapedrive
ZoneOCR
Anadd-onfeatureoftheimagingsoftwarethatpopulatesdocumenttemplatesbyreadingcertainregionsorzonesofadocument,andthenplacingthetextintoadocumentindex.
Source: FormerlyAmericanDocumentManagement,GlossaryofTerms,now5iSolutionsGlossary.
.
.E01File
".E01"isalegacyEnCaseevidencefileformat.An".E01"fileisabyte-for-byterepresentationofaphysicaldeviceoralogicalvolume.
Source: EnCaseForensicImager,Version7.06,User'sGuide.GuidanceSoftware.
.Ex01File
".Ex01"isthecurrentEnCaseevidencefileformat.An".Ex01"fileisabyte-for-byterepresentationofaphysicaldeviceoralogicalvolume.IthasLZcompression,AES256encryptionwithkeypairsorpasswords,andoptionsforMD5hashing,SHA-1hashing,orboth.
Source: EnCaseForensicImager,Version7.06,User'sGuide.GuidanceSoftware.