21
5/14/2019 2019 Stockholm National Paper 2 - Google Docs https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 1/21 Entry into the Stockholm Junior Water Prize 2019 A Novel Method of Monitoring the Health of our Global Fresh Water Supply using DNA Barcoding of Chironomidae (Diptera) Sonja Michaluk New Jersey

Barcoding of Chironomidae (Diptera) Global Fresh Water ......The Chironomidae are the only free-living (non-parasitic) holometabolous insect extant on every continent, including Antarctica,

  • Upload
    others

  • View
    1

  • Download
    0

Embed Size (px)

Citation preview

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 1/21

    Entry into the Stockholm Junior Water Prize 2019

    A Novel Method of Monitoring the Health of our

    Global Fresh Water Supply using DNA

    Barcoding of Chironomidae (Diptera)

    Sonja Michaluk

    New Jersey

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 2/21

    I. Abstract: It is forecast that 66% of our population will experience water scarcity within a decade,

    leaving us more dependent on surface water for drinking. [14] This requires more filtration infrastructure,

    and monitoring of surface water sources. Current methods rely on expensive and technically challenging

    manual identification of biological samples. Macroinvertebrates spend their larval lives within a small

    area of water, showing cumulative effects of habitat alteration and pollutants that chemical testing and

    field sensors do not. [9] Molecular methods enhance biomonitoring programs. This project explores

    deoxyribonucleic acid (DNA) barcoding, to measure waterway health with larval Chironomidae (order

    Diptera), the most widespread macroinvertebrate family. [4] Their complex taxonomy makes manual

    morphological identification difficult. A statistical sampling plan was designed that represents variation

    in geological, ecological, and land use factors. Fou r m ethods of isolation and amplification were

    compared. Statistical analysis shows DNA Barcoding of Chironomidae results in more accurate and

    precise waterway health data, adding significant value for monitoring scarce water resources. The

    learnings from these data are being applied building microbiology capability at a non-profit water study

    institute.

    II. Table of Contents: 1. Introduction - p2

    2. Materials and Methods - p5

    3. Results - p7

    4. Discussion - p14

    5. Conclusions - p18

    6. References - p19

    7. Bibliography - p20

    III. Key Words: Public Water, Chironomidae, Water Scarcity, DNA Barcoding, Surface Water, Water

    Monitoring, Bioassessment, Nonpoint Source Pollution, Macroinvertebrate, COI (cytochrome c oxidase

    subunit 1)

    IV. Abbreviations and Acronyms: COI: Cytochrome c oxidase subunit 1

    NJDEP: New Jersey Department of Environmental Protection

    PCR: Polymerase chain reaction

    V. Acknowledgements: This effort was designed and conducted by the author. Appreciation to Karen

    Lucci of Hopewell Valley Central High School for guidance on phylogenetic tree analysis. Based on

    previous training and experience, Cold Spring Harbor Laboratory graciously allowed open access to

    1

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 3/21

    microbiology labs and equipment to conduct independent experimentation. Special appreciation to Dr.

    Cristina Fernandez for DNA Barcoding principles. Sample preparation, DNA extraction,

    macroinvertebrate identification and chemical analysis was performed in home laboratory underwritten

    by funding awarded for prior research. Additional thanks to Erin Stretz, Dr. Steve Tuorto, and Jim

    Waltman from The Watershed Institute for internships and opportunities to learn aquatic entomology and

    chemical environmental monitoring; Dr. Patricia Shanley for guidance on policy and advocacy;

    Lawrenceville Summer Scholars for robotics and programming.

    VI. Biography: My independent research focuses on how environmental data can be gathered and used

    to inform decision making in terms of how and when we develop our natural and water resources. I have

    been a member of the Society for Freshwater Science since 2014, and I have presented data at their 2015

    Mid-Atlantic Chapter meeting at the Academy of Natural Sciences in Philadelphia, and the 2017 Annual

    Conference in Raleigh, North Carolina. Additionally, I have presented at the Environmental Protection

    Agency (EPA), for receiving a Presidential Award, National Geographic Society in Washington, D.C.,

    as well as New York Academy of Sciences Bicentennial celebration, New York, NY (2017).

    Massachusetts Institute of Technology (MIT) named a minor planet in my honor. Through chemical and

    biological stream assessment (certified for 8 years), I have been monitoring the health of our local

    waterways as an active member of a StreamWatch volunteer program since 2011. Encyclopedia

    Britannica published my definition of “macroinvertebrate.” I was the featured speaker and Watershed

    Hero at The Alliance for Watershed Education River Days 2017 & the East Coast Greenway River Days

    Kick-Off at Fairmount Water Works, Philadelphia.

    My research, data collection, and advocacy have led to environmental improvements: data

    submission to the New Jersey Department of Environmental Protection (NJDEP) and modifications to a

    pipeline construction project that minimized impact to streams, 20 acres of ecologically critical forest

    and wetland being preserved as open space, providing a critical east-west wildlife habitat corridor. My

    efforts to locate and document the southernmost population of a threatened amphibian species support a

    current proposal for categorization of a special wetland habitat as a C1 Stream by the NJDEP.

    1. Introduction : Parts of the world are abundant with fresh water, but 2.7 billion people (about 40% of

    our population) experience water scarcity at least one month a year. [14] This is expected to grow to

    two-thirds of the world’s population within a decade (Falkenmark Water Stress Indicator) as population

    and water usage increase. [14] Less than 1% of the world’s water is accessible as a public water source. [3]

    2

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 4/21

    Water scarcity affects every continent and was listed in 2015 by the World Economic Forum as the

    largest global risk in terms of potential impact over the next decade. [13][6]

    As water scarcity increases, we become more and more dependant on surface water for drinking,

    therefore require more filtration infrastructure, and more monitoring of surface water sources. Currently

    63% of public water (serving a population of 169 million) in the USA is from surface water. [10] Wetlands

    provide surface water filtration, however more than half the world’s wetlands have disappeared. [14]

    New York City makes use of wetlands as a natural water filtration resource for their public water. Over

    1,000,000 acres of protected land in the Catskill/Delaware watersheds provide natural filtration for 90%

    of New York City’s population of 8.5 million. [11] New York is one of only five cities that can rely on

    simpler natural filtration for public water. [11] The New York City Land Acquisition Program purchased or

    protected over 130,000 acres since 1997 and restricts development. [8] A dedicated police force of more

    than 200 members guards the health of the wetlands and prevents illegal dumping. [12]

    Measures of taxa richness and relative abundance provide valuable information on trends in

    ecosystem health. Macroinvertebrates provide a logical choice because they can be seen with the naked

    eye and spend their larval lives in a small area of water and therefore show the cumulative effects of

    habitat alteration, contaminants, and pollutants. Additionally macroinvertebrates play a significant part

    of the food web, preyed upon by fish, birds, reptiles, and amphibians. Current waterway assessment

    methods are based on a procedure defined and popularized and standardized by Hilsenhoff [7] in 1977: a

    100 organism sub-sample is obtained from a Stratified Random Sample taken in the field. Organisms are

    identified to the lowest practical taxonomic level with a microscope and taxonomic keys. [7][9]

    These current methods of surface water monitoring can be expensive and technically

    challenging, relying on manual identification of biological macroinvertebrate samples. Additionally

    macroinvertebrates are relatively easy to identify to family level manually by morphology, however

    genus and species level identification is exponentially more complex. Highly detailed genus and species

    level data is more accurate and precise but difficult to obtain due to cost, specimen condition, incomplete

    taxonomic knowledge, poor taxonomic keys, and lack of trained taxonomists. Error rates of genus and

    species in samples identified by professional taxonomists have been found to be as high as 65%. [4]

    Molecular methods, such as deoxyribonucleic acid (DNA) barcoding from a region of the mitochondrial

    gene COI (cytochrome c oxidase subunit 1), have begun to enhance biomonitoring programs. DNA

    Barcoding offers the promise of a more rapid, accurate (less human error), and precise (species level)

    3

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 5/21

    identification of macroinvertebrate taxa. This is important to obtain accurate environmental assessments.

    DNA Barcoding overcomes limitations of manual taxonomic identification and significantly improves

    the statistical power of bioassessment tools. [15] Taxonomic identification to family level by volunteers is

    widely used for citizen science programs and broad data gathering. Many studies have explored the

    potential of DNA Barcoding for bioassessment, and the increased precision and statistical power

    provided by genus and species level identification. This effort creates a methodology that allows DNA

    Barcoding to be integrated into existing water monitoring programs. This project aims to enhance citizen

    science environmental monitoring programs with a DNA Barcoding methodology and capability in

    order to improve accuracy, precision, and statistical power of results.

    Figure 1. DNA Barcoding

    resolves even cryptic species

    that are morphologically

    indistinguishable

    This project explores the potential of using DNA Barcoding to measure waterway health with the

    larval non-biting midge Chironomidae (order Diptera). Chironomidae are versatile macroinvertebrates

    and a common denominator among most aquatic sites. [4] They occupy many important parts of food

    webs, and includes all functional feeding groups: collector/gatherers, shredders, scrapers, filter-feeders,

    and predators. [4] They have a holometabolous, or complete metamorphosis, life cycle with; egg, larva,

    pupa, and adult. The Chironomidae are the only free-living (non-parasitic) holometabolous insect extant

    on every continent, including Antarctica, and in a great range of altitudes. [4] They have been found 5600

    m above sea level on glaciers in Nepal, and 1360 meters below the surface of freshwater Lake Baikal in

    Russia. This project is concerned with the larval form, which in some species occurs in water films a

    millimeter thick, and in others dwells in arid regions and can tolerate drought (one even survived 18

    months in the vacuum of space). Other larvae are found in glacial meltwater just above freezing, and

    4

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 6/21

    there are even Chironomidae in hot springs over 40°C. There are fully marine species, and some have

    even been found in algae on sea turtle shells. Some Chironomid larvae have hemoglobin which allow

    them to absorb oxygen from and tolerate low-oxygen waters that other macroinvertebrates cannot

    survive. Due to their reddish color they are commonly called bloodworms. Unfortunately for citizen

    scientists, Chironomidae have complex taxonomy that makes manual morphological identification to

    genus and species level extremely difficult. The Hilsenhoff Family Tolerance Value for Chironomids is

    6. This is an average of the Genera Tolerance Values which have been shown to range greatly (e.g. from

    2 to 10 for the genera sampled here). Since they are difficult to identify morphologically, DNA

    Barcoding adds great value, additionally unlike some other macroinvertebrates they lack inhibitors that

    impede amplification using the silica resin isolation method and polymerase chain reaction (PCR)

    primer beads.

    This research hypothesizes that a novel DNA Barcoding process utilizing Chironomidae

    (Diptera) will compare favorably to standard methods of monitoring surface water sources. The purpose

    is to contribute an improved method of bioassessment to aid in preservation of our freshwater resources.

    In phase 1, method development was explored. The independent variables were DNA extraction

    methods and primers used. The dependent variable was the percent amplification of samples. The

    control was the DNA ladder.

    In phase 2, The Chironomidae were explored as an index of waterway health. The independent

    variables were the sample sites, varying freshwater bodies with a statistically planned variety of

    geological, ecological, and anthropogenic factors. The dependent variables were the genera and species

    present. The positive control is a known healthy location (per statistical data) and manual identification.

    The negative control is known unhealthy location.

    The research questions explored here support creation of a microbiology lab at a non-profit water

    study institute that supplements their existing citizen science water monitoring programs. 1) Can DNA

    Barcoding be used as a means of monitoring surface water sources? 2) How do Chironomidae genera

    and species vary in response to variation in geological, ecological, and land use factors? 3) How do

    Chironomidae genera and species vary in response to nutrient pollution? 4) Will this project add new

    species to the Chironomidae data sets in genetic sequence databases used by the scientific community?

    5) What is the effect of different methods of PCR on the amplification of Chironomidae DNA?

    2. Materials and Methods:

    5

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 7/21

    2.1 Risk and Safety: These procedures involve use of ethanol, a Lamotte Water Quality test kit, and

    microliter amounts of DNA isolation, PCR amplification, and gel electrophoresis reagents. Material

    Safety Data Sheets (MSDS) sheets were reviewed. Personal protective equipment was used to protect

    against risk of chemical exposure. Waste liquid was collected and given to Clean Harbors, a company

    specializing in hazardous waste disposal. Training was completed and up to date for equipment,

    chemicals, and taxonomic identification.

    2.2 Procedures: The following methods of DNA isolation were selected (Rapid DNA Isolation,

    PowerSoil Isolation Method (Metabarcoding), Silica Resin Isolation). Research showed these to be more

    likely to work for macroinvertebrates and they are fairly easy and economical for real world use.

    A statistical sampling plan was designed to represent variation in geological, ecological, and land use

    factors. Sample sites were chosen according to a statistical sampling plan to capture a variety of

    geological, ecological, and anthropogenic factors: high gradient vs coastal plain, stream vs. pond,

    healthy ecosystem vs. unhealthy ecosystem.

    2.3 Water Quality Chemical Analysis: Water quality chemical analysis certifications relevant to this

    project were up to date. Chemical sampling was performed with LaMotte water test kit and procedure.

    Nitrates, orthophosphates, dissolved oxygen, pH, and turbidity was monitored over 9 months at 13 sites.

    2.4 Benthic Macroinvertebrate Sampling : Sampling was performed per NJDEP procedure. Freshwater

    macroinvertebrate samples were collected with D-frame net. The percentage of net jabs taken in each

    habitat type corresponded to the percentage of each habitat type’s presence in the stream reach. The

    sample was stored in ethanol. Macroinvertebrates were identified, and those from the Chironomidae

    family (order Diptera) were identified under a microscope and removed for DNA Barcode analysis.

    Stream health was monitored over 9 months at 13 sites.

    2.5 DNA Isolation Procedure : The membrane-bound organelles such as the nucleus and mitochondria

    were dissolved with lysis solution. A sterile plastic pestle was used to liquify the macroinvertebrate

    sample in a 1.5ml tube. Silica resin was used to bind DNA. The nucleic acids were eluted from the silica

    resin with laboratory grade distilled water. Samples were stored at -20 C prior to PCR amplification.

    2.6 Polymerase Chain Reaction (PCR) Amplification : Primers were selected based on sample type.

    Different methods of PCR amplification were tested: Ready-To-Go PCR Beads were activated by adding

    a mix of loading dye and COI primers LCO1490 and HC02198. After bead dissolves, the DNA sample

    was added with micropipette. The PCR tubes was then be mixed by lightly flicking, and centrifuged for

    6

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 8/21

    30 seconds at 13,400 RPM to spin the liquid to the bottom of the tube. Samples were thermal cycled

    with the appropriate temperature profile programmed. NEB Taq 2X Master Mix: 10μL of loading dye

    per sample was mixed with 12.5μL of NEB Taq 2X Master Mix per sample, combined in a 1.5ml tube,

    and shaken gently for mixing. 2μL of sample DNA was then be added with micropipette to the

    correspondly labeled PCR tubes. 23μL of the LCO1490 and HC02198 primer mix was added to each

    PCR tube. The PCR tubes were then be mixed by lightly flicking, and centrifuged for 30 seconds at

    13,400 RPM to spin the liquid to the bottom of the tube. Samples were then be thermal cycled with the

    appropriate temperature profile programmed.

    2.7 Gel Electrophoresis: Agarose gel was poured, and when it was solid it was be placed into the

    electrophoresis chamber. Tris/Borate/EDTA (TBE) buffer was added. PCR samples was loaded, the gel

    was run at 130V and the image were captured. Images for samples prepared with PCR Beads and with

    Master Mix were used to verify DNA amplification.

    2.8 Sequencing: Samples were then sent for DNA Sequencing. Bioinformatic analysis was completed

    by trimming and analyzing the Chironomidae genetic sequences. The final sequences were submitted

    and compared to multiple genetic sequence databases to determine the genus and species of each sample.

    Software tools were programmed and developed to easily calculate biological health scores. The

    appropriate index was selected (High Gradient or Coastal Plain Macroinvertebrate Index).

    3. Results: DNA Barcoding overcomes limitations of manual taxonomic identification and significantly

    improves the statistical power of bioassessment tools. [15] Hilsenhoff tolerance scores of the

    Chironomidae genera sampled and identified using the DNA Barcoding method developed here were

    used with GIS software to provide an overview of water quality.

    Figure 2. Overview of waterway health

    using Hilsenhoff tolerance scores of the

    Chironomidae genera identified by

    DNA Barcoding. [5]

    7

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 9/21

    Highly detailed genus and species level data is more accurate and precise but difficult to obtain

    due to cost, specimen condition, incomplete taxonomic knowledge, poor taxonomic keys, lack of trained

    taxonomists.

    Figure 3. The left diagram shows a taxonomic macroinvertebrate sample identified to class and family

    level by a trained volunteer. The right diagram shows the sample identified to species level by DNA

    Barcoding, and reveals the additional resolution provided by DNA Barcoding.

    An important step to developing a methodology for use of Chironomidae in bioassessment was

    comparing and evaluating molecular analysis methods. Silica resin and PCR bead successfully amplified

    100% of the samples. Four approaches were evaluated, and had very different results in terms of the

    percent of samples that accurately identified.

    Figure 4. Summary of the

    molecular analysis methods

    evaluated. Silica resin and PCR

    bead successfully amplified

    100% of the samples.

    8

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 10/21

    The Chironomidae samples showed the least undetermined nucleotides, best peak quality, and

    best sequence quality. The Gammaridae also responded very well to barcoding, However the

    gammaridae do not have the range of geography and biotic indices that the Chironomidae do.

    Figure 5. Summary of various

    taxa samples identified by DNA

    Barcoding with silica resin and

    PCR beads, and compares their

    response to DNA Barcoding.

    Phred scores were compared using two-sample t-tests (0.01 significance level). This test was selected

    because the number of samples n was less than 30.

    Chironomidae vs. Physidae p = 1.01 x 10 -6 indicating a statistically significant difference.

    Chironomidae vs. Haliplidae p = 7.37 x 10 -8 indicating a statistically significant difference.

    Chironomidae vs. Gammaridae p = .053 indicating a difference that is not significant, however

    Gammaridae were not chosen due to their more limited number of species, geographic range, and

    biotic index range.

    The Chironomidae sampled here aligned by genera with either high gradient streams in piedmont

    geology, or sandy soils and coastal plain geology. Only 13% of the genera sampled were found evenly in

    both geologies.

    Figure 6. Percent of Chironomidae genera based on

    the surface geology they predominantly occur in.

    9

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 11/21

    Nutrient pollution was compared with the weighted average Hilsenhoff scores of the Chironomidae

    genera sampled at each site. The value for nutrient pollution was calculated from the average ppm of

    nitrate and orthophosphate sampled at each site, which was normalized to a value between

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 12/21

    Figure 8. Relationship between the

    weighted average Hilsenhoff scores of

    the Chironomidae genera sampled at

    each site, and the overall historical

    health based on sampling at each site.

    A Bland-Altman analysis showed limits of agreement of -0.853 and 0.868 between the weighted average

    tolerance values of Chironomidae genera barcoded and the standard method that uses manual taxonomic

    identification by morphology. This indicates that the new method proposed here of DNA barcoding

    Chironomidae is in agreement with the current standard to within 1.72 on a zero to ten health scale.

    Figure 9. A Bland-Altman

    analysis showed limits of

    agreement of -0.853 and

    0.868 between the weighted

    average tolerance values of

    the Chironomidae genera

    barcoded and the standard

    method that uses manual

    taxonomic identification by

    morphology.

    The following phylogenetic trees were used to analyze the genetic relationships between selections of

    the Chironomidae sampled with respect to site, taxa level identified, and biotic index.

    11

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 13/21

    Figure 10. Phylogenetic tree

    which diagrams the genetic

    relationship between the

    Chironomidae samples from six

    sites with the most variation.

    The following phylogenetic tree was used to analyze the genetic relationships between selections of the

    Chironomidae sampled with respect taxa level identified (e.g. subfamily, genus, or species).

    Identification down to species level indicates a match in the sequence databases. Identification to genus

    or subfamily indicates gaps in the sequence database that can be filled with a widespread barcoding

    initiative. The gaps could also allude to potential novel species.

    12

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 14/21

    Figure 11. Phylogenetic tree which diagrams

    the genetic relationship between Chironomidae

    samples with the taxa level identified (e.g,

    subfamily, genus, species).

    The family biotic index for Chironomidae is 6. This masks an underlying variability as the genera

    sampled for this study range in biotic index from 2 to 10

    Figure 12. Phylogenetic tree which diagrams

    the genetic relationship between Chironomidae

    samples with the Hilsenhoff tolerance value for

    each genera.

    13

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 15/21

    4. Discussion

    4.1 DNA Barcoding for Bioassessment: Highly detailed genus and species level data is more accurate

    and precise but difficult to obtain due to cost, specimen condition, incomplete taxonomic knowledge,

    poor taxonomic keys, lack of trained taxonomists. Error rates of genus and species in samples identified

    by experts have been found to be as high as 65%. [4] This demonstrated the value of DNA Barcoding,

    especially for identifying such versatile and phenotypically similar specimens as Chironomidae.

    Hilsenhoff tolerance scores of the Chironomidae genera sampled and identified using the DNA

    Barcoding method developed here were used with GIS software to provide a water quality overview

    map. Visualizations from this project’s data were used in community land use decision making. In

    addition to the value of making data readily available data to communities, it is important to note that

    DNA Barcoding enables an increase in the amount and accuracy of data available for community and

    land use decision making.

    4.2 Comparison of Molecular Analysis Methods: An important step to developing a methodology for

    use of Chironomidae in bioassessment was comparing and evaluating molecular analysis methods. Four

    approaches were evaluated: eDNA Metabarcoding Extraction and eDNA Metabarcoding Primer, Rapid

    Method (chromatography paper) Extraction and PCR Bead, Silica Resin Extraction and PCR Bead,

    Silica Resin Extraction and MM Primer. Silica resin and PCR bead successfully amplified 100% of the

    samples. This result also verified that the appropriate laboratory and field practices and techniques had

    been used, and that the techniques and methods were not excessively cumbersome.

    4.3 Selection of Chironomidae as a Global Common Denominator: Various macroinvertebrate

    families were identified by DNA Barcoding with silica resin and PCR beads. Selecting one family to

    focus on provided a natural limit that allowed effects of differences in extraction and amplification of

    DNA to be minimized, for example macroinvertebrates with tough exoskeletons or shells can be more

    difficult to extract DNA from, and many mollusks contain PCR inhibitors. The response of various

    macroinvertebrate families to DNA Barcoding and success at identification was compared using

    measures DNA sequence quality: visual analysis of electropherograms, Phred score, undetermined

    nucleotides, peak quality, sequence quality. The Chironomidae were identified as the best option with

    the best sequence quality as they had the best Phred score least undetermined nucleotides, best peak

    quality, and best sequence quality. The Gammaridae also responded very well to barcoding, with a Phred

    14

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 16/21

    score of 98% vs 99% for Chironomidae, however the gammaridae do not have the range of geography

    and biotic indices that the Chironomidae do.

    4.4 Chironomidae and Surface Geology Variation: The Chironomidae sampled here aligned by

    genera with either high gradient streams in piedmont geology, or sandy soils and coastal plain geology.

    Only 13% of the genera sampled were found evenly in both geologies.

    4.5 Comparison of Genera Tolerance Values and Nutrient Pollution: This analysis compared the

    relationship between the weighted average Hilsenhoff scores of the Chironomidae genera sampled at

    each site and the nutrient pollution. The value for nutrient pollution was calculated from the average

    ppm of nitrate and orthophosphate sampled at each site, which was normalized to a value between 0 and

    10. This shows a statistically significant relationship with p

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 17/21

    4.7 Phylogenetic Tree Analysis: Phylogenetic trees were used to analyze the genetic relationships

    between selections of the Chironomidae sampled with respect to site, taxa level identified, and biotic

    index. The phylogenetic tree in Figure 11 was used to analyze the genetic relationships between

    selections of the Chironomidae sampled with respect taxa level identified (e.g. subfamily, genus, or

    species). Identification down to species level indicates a match in the sequence databases. Identification

    to genus or subfamily indicates gaps in the sequence database that can be filled with a widespread

    barcoding initiative. The gaps could also allude to potential novel species. The phylogenetic tree in

    Figure 12 diagrams the genetic relationship between Chironomidae samples with the Hilsenhoff

    tolerance value for each genera. The Hilsenhoff family biotic index for Chironomids is 6. The genera

    sampled range in Hilsenhoff Biotic index from 2 to 10.

    4.8 Statistical Tools and Analysis: In Phase I, a two-sample t-test was used to compare Phred sequence

    quality scores between Chironomidae and other macroinvertebrates sampled to a 0.01 significance level.

    The two-sample t-test was selected because because the sample quantity n was less than 30. The

    significance of 0.01 was chosen to emphasize the very low p value obtained for the Physidae and

    Haliplidae.

    When the Chironomidae were compared with Physidae, Haliplidae, and Gammaridae, the null

    hypothesis was that the mean proportion of ideal Phred scores for chironomids would be equal to that of

    Physidae. The alternative hypothesis was that the mean proportion of ideal Phred scores would be

    greater for Chironomidae than, for example Physidae. Because p= 1.01 x 10 -6 and is lower than the

    significance level of 0.01, the null was rejected, indicating that the Chironomidae DNA sequence quality

    was significantly better than the Physidae sequence quality. For Chironomidae vs. Haliplidae p = 7.37 x

    10 -8 < 0.01 indicating a statistically significant sequence quality improvement. For Chironomidae vs.

    Gammaridae p = .053 indicating a difference that is not significant, however Gammaridae were not

    chosen due to their more limited number of species, geographic range, and biotic index range.

    In Phase II bioassessment measurement systems were compared. In order to compare waterway

    ecosystem bioassessment by weighted average tolerance values of the Chironomidae genera barcoded,

    and the standard method that uses manual taxonomic identification by morphology, a Bland-Altman

    analysis was used. [1] The Bland-Altman test was selected as this is a common statistical tool used to

    compare a new measurement method to an existing standard of measurement when a true value or

    calibration standard is not available, and measurements must be made indirectly.

    16

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 18/21

    Comparing two measurement systems by running a regression and calculating a correlation

    coefficient r value is not sufficient to compare measurement systems, as two methods of measuring the

    same value are nearly guaranteed to be correlated. Additionally, they can be correlated without being in

    agreement, such as a measurement of length in inches, and in centimeters. [1] Bland-Altman analysis

    determines the level of agreement between two measurement systems. This comparison showed limits of

    agreement of -0.853 and 0.868 between the weighted average tolerance values of the Chironomidae

    genera barcoded and the standard method that uses manual taxonomic identification by morphology.

    This indicates that the new method proposed here of DNA barcoding Chironomidae is in agreement with

    the current standard to within 1.72. This is a significant finding, especially considering that waterway

    health data is often reported as good / fair / poor, and leads to the conclusion that the measurement

    method is sensitive enough, and waterway ecosystem bioassessment by DNA Barcoding of

    Chironomidae is a viable option for bioassessment globally.

    Statistical power is the sensitivity of a test, or the ability of a test to find an effect if there is one

    to be found, or in other words the probability that the test will correctly reject a false null hypothesis.

    Statistical power = 1 – β, where β is the probability of making a Type II error. (alpha α is the probability

    of making a Type I error.) Statistical power is an function of the sample size, alpha, and effect size.

    Increasing the sample size increases statistical power, but there is typically a cost or challenge to

    obtaining more samples. Increasing alpha also increases statistical power, however this merely

    exchanges this risk of a Type II error (β-risk) for the risk of a Type I error (α-risk). Where statistical

    significance determines if there is a difference between two groups, effect size quantifies the difference

    between two groups. Bigger effects are easier to detect than smaller effects. If the data being sampled

    has a large amount of variance, both from the value being measured and the noise in the data, this will

    decrease the statistical power. [2] Measurement error is also a source of noise. The goal of using DNA

    Barcoding to resolve taxa in more detail to the genus and species level, is to reduce variability and

    therefore increase statistical power. Increasing the precision of the measurement increases the statistical

    power and/or decreases sample size. A statistical power of 0.80 is typical, and indicates a 4:1 trade off

    between β-risk and α-risk. Highly consistent systems in engineering and physical sciences, as well as

    medical tests where the risk of a false negative (not detecting a disease) can have higher statistical power

    such as 0.90.

    17

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 19/21

    DNA Barcoding increases resolution from family level, to genus and species, as well as reducing

    error from manual taxonomic identification by morphology. In the case of Chironomidae this means that

    genus level tolerance values ranging from 0 to 10 can be used instead of the family level tolerance value

    of 6. This increases the statistical power of the bioassessment method.

    5. Conclusions :

    5.1 Based on Bland-Altman analysis waterway ecosystem bioassessment by DNA Barcoding of

    Chironomidae is sensitive enough to be a viable option for bioassessment globally. Manual taxonomic

    identification under magnification is typically only performed to the family level. Stream health data

    from Chironomidae genera matched historical health data. (Statistically significant p

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 20/21

    to databases used by the scientific community. Phylogenetic tree groupings match geography and

    historical health data. Samples from the healthiest sites are nearly genetically identical. The most

    sensitive genus of Chironomid was only found in the healthiest sites.

    5.5 DNA Barcoding of Chironomidae can be faster and lower cost than the current method. This method

    is robust, reproducible, and suitable for augmenting citizen science initiatives.

    5.6 In analyzing the distribution of Chironomidae genera between streams with urban vs. open space

    catchment areas, there was not a statistical correlation. This may require further study with more detailed

    land use data. (Not statistically significant p>.05)

    5.7 Finally, the investigation into the Chironomidae family shows that DNA Barcode analysis can result

    in waterway health data that is both more accurate and more precise, and therefore increase statistical

    power and significant value for monitoring an increasingly scarce water resource.

    6. References: [1] Bland, J. Martin, and Douglas G. Altman (2010). Statistical Methods for Assessing Agreement between Two Methods of Clinical Measurement. International Journal of Nursing Studies, vol. 47, no. 8, 931–936. [2] Coe, Robert (2002). It's the Effect Size, Stupid: What Effect Size Is and Why It Is Important. University of Leeds, Education-Line, www.leeds.ac.uk/educol/documents/00002182.htm. [3] “Competing for Clean Water Has Led to a Crisis: Clean Water Crisis Facts and Information”, National Geographic, 27 Jan. 2017 [4] Epler, John H. (2001). Identification Manual for the Larval Chironomidae (Diptera) of North and South Carolina. EPA Region 4 and Human Health and Ecological Criteria Division [5] GIS waterway and World Light Gray Reference, (2019) Esri Data & Maps [6] “The Global Risks 2015 Report”, World Economic Forum, 2015 [7] Hilsenhoff, William L. (1977), Use of Arthropods to Evaluate Water Quality of Streams. Wisconsin Department of Natural Resources, Technical Bulletin No. 100, Madison, WI [8] Land Acquisition, www1.nyc.gov/html/dep/html/watershed_protection/land_acquisition.shtml, 2018 [9] NJ Division of Water Monitoring and Standards. http://www.nj.gov/dep/wms/, 2018 [10] Public Supply Water Use, www.usgs.gov/special-topic/water-science-school/science/ public-supply-water-use?qt-science_center_objects=0#qt-science_center_objects, 2018 [11] Rueb, Emily S. “How New York Gets Its Water”, New York Times (New York, NY), 24 Mar. 2016 [12] Salazar, Cristian. “How Does New York City Get Its Water?” Am New York, (New York, NY) www.amny.com/lifestyle/how-nyc-gets-its-water-1.9205765, 18 April 2018 [13] “Scarcity.” UN-Water, www.unwater.org/water-facts/scarcity/, 2018 [14] “Water Scarcity.” WWF, World Wildlife Fund, www.worldwildlife.org/threats/water-scarcity, 2018 [15] Stein, E. D., White, B. P., Mazor, R. D., Jackson, J. K., Battle, J. M., Miller, P. E.,. Sweeney, B. W. (2014). Does DNA barcoding improve performance of traditional stream bioassessment metrics? Freshwater Science.

    19

  • 5/14/2019 2019 Stockholm National Paper 2 - Google Docs

    https://docs.google.com/document/d/117or8f6OcEMlj5wtGSII_lWN--ucQiM9IK7nPB05ET0/edit 21/21

    7. Bibliography Cañedo-Argüelles, Miguel, et al. (2016). Are Chironomidae (Diptera) Good Indicators of Water Scarcity? Dryland Streams as a Case Study. Ecological Indicators, vol. 71, 2016, 155–162. Carew, M. E., Pettigrove, V. J., Metzeling, L., & Hoffmann, A. A. (2013). Environmental monitoring using next generation sequencing: rapid identification of macroinvertebrate bioindicator species. Frontiers in Zoology. Ekrem, Torbjørn, et al. (2010). Females Do Count: Documenting Chironomidae (Diptera) Species Diversity Using DNA Barcoding. Organisms Diversity & Evolution, vol. 10, no. 5, 2010, 397–408 Ekrem, Torbjørn, et al. (2018). DNA Barcode Data Reveal Biogeographic Trends in Arctic Non-Biting Midges. Genome, vol. 61, no. 11, 787–796. Folmer, Ole & Black, Michael & Wr, Hoeh & Lutz, R & Vrijenhoek, Robert. (1994). DNA primers for amplification of mitochondrial Cytochrome C oxidase subunit I from diverse metazoan invertebrates. Molecular marine biology and biotechnology. 294-9. Hilsenhoff, William L. (1982). Using a Biotic Index to Evaluate Water Quality in Streams. Wisconsin Department of Natural Resources, Technical Bulletin No. 132, Madison, WI. Lin, Xiao-Long, et al. (2018). DNA Barcodes and Morphology Reveal Unrecognized Species in Chironomidae (Diptera). Insect Systematics & Evolution, vol. 49, no. 4, 2018, 329–398. MacCafferty, W. P. (1981). Aquatic Entomology. Jones and Bartlett. Sudbury, MA. Merritt, R. W. (1984). An introduction to the aquatic insects of North America. Kendall Hunt, Dubuque, IA. Micklos, D. A. (2003). DNA SCIENCE: A First Course. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Micklos, D. A., Nash, B., Hilgert, U. (2013). GENOME SCIENCE: A Practical and Conceptual Introduction to Molecular Genetic Analysis in Eukaryotes. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY. Pilgrim, E. M., Jackson, S. A., Swenson, S., Turcsanyi, I., Friedman, E., Weigt, L., & Bagley, M. J. (2011). Incorporation of DNA barcoding into a large-scale biomonitoring program: opportunities and pitfalls. Journal of the North American Benthological Society, 30(1), 217-231. Silva, Fabio & Ekrem, Torbjorn & Fonseca-Gessner, Alaide. (2013). DNA barcodes for species delimitation in Chironomidae (Diptera): A case study on the genus Labrundinia. The Canadian Entomologist. 145. Singh, P., Rawal, D. (2016). Molecular phylogeny of Einfeldia (Diptera: Chironomidae) inferred from sequencing of mitochondrial cytochrome oxidase subunit 1 (COX1) gene, International Journal of Entomology Research, Volume 1; Issue 6; 11-15 “Volunteer Stream Monitoring: A Methods Manual”, United States Environmental Protection Agency, https://nepis.epa.gov/Exe/ZyPDF.cgi/P100MRC3.PDF?Dockey=P100MRC3.PDF, (2018). Walton, Brett, and Brett Walton. “Cape Town's Water Panic Was Years in the Making.” CityLab, 17 July 2018, www.citylab.com/environment/2018/07/how-cape-town-got-to-the-brink-of-water-catastrophe/564800/. Images credited to author unless otherwise noted. Original artwork by author

    20