A Medley of Potpourri

Sunday, January 5, 2020

Innocence Project

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/Innocence_Project


Formation	1992
Founder	Barry Scheck Peter Neufeld
Founded at	Cardozo School of Law Yeshiva University
Type	Nonprofit organization
Tax ID no.	32-0077563
Legal status	501(c)(3)
Purpose	Exoneration Justice reform "The Innocence Project's mission is to free the staggering number of innocent people who remain incarcerated, and to bring reform to the system responsible for their unjust imprisonment."
Headquarters	40 Worth Street, Suite 701 New York, NY 10013
Region	United States
Executive Director	Maddy deLone
Chair	Vered Rabia
Affiliations	The Innocence Network
Revenue (2018)	$13,426,018
Expenses (2018)	$13,608,849
Endowment	$21,620,304 ₍₂₀₁₈₎
Employees (2017)	88
Volunteers (2017)	17
Website	www.innocenceproject.org

The Innocence Project is a 501(c)(3) nonprofit legal organization that is committed to exonerating wrongly convicted people through the use of DNA testing and to reforming the criminal justice system to prevent future injustice. The group cites various studies estimating that in the United States, between 2.3% and 5% of all prisoners are innocent. The Innocence Project was founded in 1992 by Barry Scheck and Peter Neufeld.

As of November 17, 2019, the Innocence Project has worked on 189 successful DNA-based exonerations.

Founding

The Innocence Project was established in the wake of a study by the United States Department of Justice and United States Senate, in conjunction with the Benjamin N. Cardozo School of Law, which found that incorrect identification by eyewitnesses was a factor in over 70% of wrongful convictions. The original Innocence Project was founded in 1992 by Scheck and Neufeld as part of the Cardozo School of Law of Yeshiva University in New York City. It became an independent 501(c)(3) nonprofit organization on January 28, 2003, but it maintains institutional connections with Cardozo. As of September 5, 2018, the executive director of the Innocence Project is Madeline deLone.

The Innocence Project has become widespread as countries are using scientific data to overturn wrongful convictions and in turn freeing those wrongly convicted. One such example exists in the Republic of Ireland where in 2009 a project was set up at Griffith College Dublin.

Mission

The Innocence Project focuses on cases in which DNA evidence is available to be tested or retested. DNA testing is possible in 5–10% of criminal cases. Other members of the Innocence Network also help to exonerate those in whose cases DNA testing is not possible.

In addition to working on behalf of those who may have been wrongfully convicted of crimes throughout the United States, those working for the Innocence Project perform research and advocacy related to the causes of wrongful convictions.

Some of the Innocence Project's successes have resulted in releasing people from death row. The successes of the project have fueled American opposition to the death penalty and have likely been a factor in the decision by some American states to institute moratoria on criminal executions.

In District Attorney's Office v. Osborne (2009), US Supreme Court Chief Justice Roberts wrote that post-conviction challenge "poses questions to our criminal justice systems and our traditional notions of finality better left to elected officials than federal judges." In the opinion, another justice wrote that forensic science has "serious deficiencies". Roberts also said that post-conviction DNA testing risks "unnecessarily overthrowing the established system of criminal justice." Law professor Kevin Jon Heller wrote: "It might lead to a reasonably accurate one."

Overturned convictions

As of November 2019, 367 people previously convicted of serious crimes in the United States had been exonerated by DNA testing since 1989, 21 of whom had been sentenced to death. Almost all (99%) of the wrongful convictions were of males, with minority groups constituting approximately 70% (61% African American and 8% Latino). The National Registry of Exonerations lists 1,579 convicted defendants who were exonerated through DNA and non-DNA evidence from January 1, 1989 through April 12, 2015. According to a study published in 2014, more than 4% of persons overall sentenced to death from 1973 to 2004 are probably innocent. The following are examples of notable exonerations:

In 2003, Steven Avery was exonerated after serving 18 years in prison for a sexual assault charge. After his release, he was convicted of murder.
In 2004, Darryl Hunt was exonerated after serving 19 1/2 years in prison of a life sentence for the rape and murder of a newspaper copy editor, Deborah Sykes.
In 2007, after an investigation begun by The Innocence Project, James Calvin Tillman was exonerated after serving 16 1/2 years in prison for a rape he did not commit. His sentence was 45 years.
In 2014, Glenn Ford was exonerated in the murder of Isadore Newman. Ford, an African American, had been convicted by an all-white jury without any physical evidence linking him to the crime, and with testimony withheld. He served 30 years on death row in Angola Prison before his release.

Work

The Innocence Project originated in New York City but accepts cases from any part of the United States. The majority of clients helped are of low socio-economic status and have used all possible legal options for justice. Many clients hope that DNA evidence will prove their innocence, as the emergence of DNA testing allows those who have been wrongly convicted of crimes to challenge their cases. The Innocence Project also works with the local, state and federal levels of law enforcement, legislators, and other programs to prevent further wrongful convictions.

About 3,000 prisoners write to the Innocence Project annually, and at any given time the Innocence Project is evaluating 6,000 to 8,000 potential cases.

All potential clients go through an extensive screening process to determine whether or not they are likely to be innocent. If they pass the process, the Innocence Project takes up their case. In roughly half of the cases that the Innocence Project takes on, the clients' guilt is reconfirmed by DNA testing. Of all the cases taken on by the Innocence Project, about 43% of clients were proven innocent, 42% were confirmed guilty, and evidence was inconclusive and not probative in 15% of cases. In about 40% of all DNA exoneration cases, law enforcement officials identified the actual perpetrator based on the same DNA test results that led to an exoneration.

Funding

The Innocence Project receives 45% of its funding from individual contributions, 30% from foundations, 15% from an annual benefit dinner, 7% from the Cardozo School of Law, and the rest from corporations.

Innocence Network

The Innocence Project is a founder of the Innocence Network, an organization of law and journalism schools, and public defense offices that collaborate to help convicted felons prove their innocence. 46 American states along with several other countries are a part of the network. In 2010, 29 people were exonerated worldwide from the work of the members of this organization.

The Innocence Network brings together a growing number of innocence organizations from across the United States as well as includes members from other English-speaking common law countries: Australia, Canada, Ireland, New Zealand, and the United Kingdom.

In South Africa, the Wits Justice Project investigates South African incarcerations. In partnership with the Wits Law Clinic, the Julia Mashele Trust, the Legal Resource Centre (LRC), the Open Democracy Advice Centre (ODAC), and the US Innocence Project, the Justice Project investigates individual cases of prisoners wrongly convicted or awaiting trial.

Causes of wrongful conviction

There are many reasons why wrongful convictions occur. The most common reason is false eyewitness identification, which played a role in more than 75% of wrongful convictions overturned by the Innocence Project. Often assumed to be incontrovertible, a growing body of evidence suggests that eyewitness identifications are unreliable. Another cause for misidentification is when a "show-up" procedure occurs. This is where a suspect is shown at the scene of a crime in a poorly lit lot or in a police car. Someone might also misidentify when they learn more about the suspect; it may cause them to change their description.

Unreliable or improper forensic science played a role in some 50% of Innocence Project cases. Scientific techniques such as bite-mark comparison, once widely used, are now known to be subjective. Many forensic science techniques also lack uniform scientific standards.

In about 25% of DNA exoneration cases, innocent people were coerced into making false confessions. Many of these false confessors went on to plead guilty to crimes they did not commit (usually to avoid a harsher sentence, or even the death penalty).

Government misconduct, inadequate legal counsel, and the improper use of informants also contributed to many of the wrongful convictions since overturned by the Innocence Project.

In popular culture

Film

After Innocence (2005) is a documentary that features the Innocence Project.
Conviction (2010), is a film about the exoneration of Kenneth Waters, who was a client of the Innocence Project. Hilary Swank plays Waters' sister Betty Anne, who went to college and law school to fight for his freedom, and Sam Rockwell plays Waters. Barry Scheck is portrayed by Peter Gallagher.

Literature

In the non-fiction book, The Innocent Man: Murder and Injustice in a Small Town (2006), John Grisham recounted the cases of Ron Williamson and Dennis Fritz, who were assisted on appeal by the Innocence Project and freed by DNA evidence, after being wrongfully convicted of the murder of Debra Ann Carter.

Podcasts

Serial Season 1 referenced the Innocence Project in episode 7 where Deirdre Enright, director of investigation for the Innocence Project at the University of Virginia School of Law, and a team of law students analyzed the case against Adnan Syed.

Stage productions

The Exonerated (2002) is a play by Erik Jensen and Jessica Blank about six people who had been wrongly convicted and sentenced to death, but were exonerated.

Television

In Justice is an American television series with a similar premise.
Castle is an American television series; in the episode "Like Father, Like Daughter" (season 6, episode 7), the Innocence Project was mentioned, as well as Frank Henson who was wrongfully convicted in 1998 of the death of Kimberly Tolbert.
The Innocence Project, a BBC One drama series that aired from 2006 to 2007, is based on a UK version of the Innocence Project.
The Innocence Project was discussed in season 2, episode 9 of The Good Wife, "Nine Hours" (December 14, 2010). Innocence Project co-founder Barry Scheck played himself in the episode, which was largely based on the actual Innocence Project case of Cameron Todd Willingham. Cary Agos, a recurring character on The Good Wife, is said to have worked for the Innocence Project after law school (and is a family friend of Scheck's).
In season six of Suits, a US legal dramedy, law student and paralegal Rachel Zane takes on an Innocence Project for a man wrongfully accused of murder.
In season three of Riverdale, a dark reimagining of the Archie Comics universe, Veronica Lodge mentions starting a chapter of the Innocence Project to help free her boyfriend Archie Andrews from prison following being falsely convicted of murder.

DNA barcoding

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/DNA_barcoding

DNA barcoding scheme

DNA barcoding is a method of species identification using a short section of DNA from a specific gene or genes. The premise of DNA barcoding is that, by comparison with a reference library of such DNA sections (also called "sequences"), an individual sequence can be used to uniquely identify an organism to species, in the same way that a supermarket scanner uses the familiar black stripes of the UPC barcode to identify an item in its stock against its reference database. These "barcodes" are sometimes used in an effort to identify unknown species, parts of an organism, or simply to catalog as many taxa as possible, or to compare with traditional taxonomy in an effort to determine species boundaries.

Different gene regions are used to identify the different organismal groups using barcoding. The most commonly used barcode region for animals and some protists is a portion of the cytochrome c oxidase I (COI or COX1) gene, found in mitochondrial DNA. Other genes suitable for DNA barcoding are the Internal transcribed spacer (ITS) rRNA often used for fungi and RuBisCO used for plants. Microorganisms are detected using different gene regions. The 16S rRNA gene for example is widely used in identification of prokaryotes, whereas the 18S rRNA gene is mostly used for detecting microbial eukaryotes. These gene regions are chosen because they have less intraspecific (within species) variation than interspecific (between species) variation, which is known as the "Barcoding Gap".

Some applications of DNA barcoding include: identifying plant leaves even when flowers or fruits are not available; identifying pollen collected on the bodies of pollinating animals; identifying insect larvae which may have fewer diagnostic characters than adults; or investigating the diet of an animal based on its stomach content, saliva or feces. When barcoding is used to identify organisms from a sample containing DNA from more than one organism, the term DNA metabarcoding is used, e.g. DNA metabarcoding of diatom communities in rivers and streams, which is used to assess water quality.

Background

DNA barcoding techniques were developed from early DNA sequencing work on microbial communities using the 5S rRNA gene. In 2003, specific methods and terminology of modern DNA barcoding were proposed as a standardized method for identifying species, as well as potentially allocating unknown sequences to higher taxa such as orders and phyla, in a paper by Paul D.N. Hebert et al. from the University of Guelph, Ontario, Canada. Hebert and his colleagues demonstrated the utility of the cytochrome c oxidase I (COI) gene, first utilized by Folmer et al. in 1994, using their published DNA primers as a tool for phylogenetic analyses at the species levels as a suitable discriminatory tool between metazoan invertebrates. The "Folmer region" of the COI gene is commonly used for distinction between taxa based on its patterns of variation at the DNA level. The relative ease of retrieving the sequence, and variability mixed with conservation between species, are some of the benefits of COI. Calling the profiles "barcodes", Hebert et al. envisaged the development of a COI database that could serve as the basis for a "global bioidentification system".

Methodology

Sampling and preservation

Barcoding can be done from tissue from a target specimen, from a mixture of organisms (bulk sample), or from DNA present in environmental samples (e.g. water or soil). The methods for sampling, preservation or analysis differ between those different types of sample.

Tissue samples

To barcode a tissue sample from the target specimen, a small piece of skin, a scale, a leg or antenna is likely to be sufficient (depending on the size of the specimen). To avoid contamination, it is necessary to sterilize used tools between samples. It is recommended to collect two samples from one specimen, one to archive, and one for the barcoding process. Sample preservation is crucial to overcome the issue of DNA degradation.

Bulk samples

A bulk sample is a type of environmental sample containing several organisms from the taxonomic group under study. The difference between bulk samples (in the sense used here) and other environmental samples is that the bulk sample usually provides a large quantity of good-quality DNA. Examples of bulk samples include aquatic macroinvertebrate sample collected by kick-net, or insect samples collected with a Malaise trap. Filtered or size-fractionated water samples containing whole organisms like unicellular eukaryotes are also sometimes defined as bulk samples. Such samples can be collected by the same techniques as used to obtain traditional samples for morphology-based identification.

eDNA samples

The environmental DNA (eDNA) method is a non-invasive approach to detect and identify species from cellular debris or extracellular DNA present in environmental samples (e.g. water or soil) through barcoding or metabarcoding. The approach is based on the fact that every living organism leave DNA in the environment, and this environmental DNA can be detected even for organisms that are at very low abundance. Thus, for field sampling, the most crucial part is to use DNA-free material and tools on each sampling site or sample to avoid contamination, if the DNA of the target organism(s) is likely to be present in low quantities. On the other hand, an eDNA sample always includes the DNA of whole-cell, living microorganisms, which are often present in large quantities. Therefore, microorganism samples taken in the natural environment also are called eDNA samples, but contamination is less problematic in this context due to the large quantity of target organisms. The eDNA method is applied on most sample types, like water, sediment, soil, animal feces, stomach content or blood from e.g. leeches.

DNA extraction, amplification and sequencing

DNA barcoding requires that DNA in the sample is extracted. Several different DNA extraction methods exist, and factors like cost, time, sample type and yield affect the selection of the optimal method.

When DNA from organismal or eDNA samples is amplified using polymerase chain reaction (PCR), the reaction can be affected negatively by inhibitor molecules contained in the sample. Removal of these inhibitors is crucial to ensure that high quality DNA is available for subsequent analyzing.

Amplification of the extracted DNA is a required step in DNA barcoding. Typically, only a small fragment of the total DNA material is sequenced (typically 400–800 base pairs) to obtain the DNA barcode. Amplification of eDNA material is usually focused on smaller fragment sizes (<200 amplicon="" and="" argue="" as="" base="" be="" between="" detection="" dna="" edna.="" edna="" fragmented="" from="" however="" is="" likely="" material="" more="" no="" of="" other="" p="" pairs="" rate="" relationship="" size="" some="" sources.="" studies="" than="" that="" there="" to="">

HiSeq sequencers at SciLIfeLab in Uppsala, Sweden. The photo was taken during the excursion of SLU course PNS0169 in March 2019.

When the DNA barcode marker region has been amplified, the next step is to sequence the marker region using DNA sequencing methods. Many different sequencing platforms are available, and technical development is proceeding rapidly.

Marker selection

A schematic view of primers and target region, demonstrated on 16S rRNA gene in Pseudomonas. As primers, one typically selects short conserved sequences with low variability, which can thus amplify most or all species in the chosen target group. The primers are used to amplify a highly variable target region in between the two primers, which is then used for species discrimination. Modified from »Variable Copy Number, Intra-Genomic Heterogeneities and Lateral Transfers of the 16S rRNA Gene in Pseudomonas« by Bodilis, Josselin; Nsigue-Meilo, Sandrine; Besaury, Ludovic; Quillet, Laurent, used under CC BY, available from: https://www.researchgate.net/figure/Hypervariable-regions-within-the-16S-rRNA-gene-in-Pseudomonas-The-plotted-line-reflects_fig2_224832532.

Markers used for DNA barcoding are called barcodes. In order to successfully characterize species based on DNA barcodes, selection of informative DNA regions is crucial. A good DNA barcode should have low intra-specific and high inter-specific variability and possess conserved flanking sites for developing universal PCR primers for wide taxonomic application. The goal is to design primers that will detect and distinguish most or all the species in the studied group of organisms (high taxonomic resolution). The length of the barcode sequence should be short enough to be used with current sampling source, DNA extraction, amplification and sequencing methods.

Ideally, one gene sequence would be used for all taxonomic groups, from viruses to plants and animals. However, no such gene region has been found yet, so different barcodes are used for different groups of organisms, or depending on the study question.

For animals, the most widely used barcode is mitochondrial cytochrome C oxidase I (COI) locus. Other mitochondrial genes, such as Cytb, 12S or 16S are also used. Mitochondrial genes are preferred over nuclear genes because of their lack of introns, their haploid mode of inheritance and their limited recombination. Moreover, each cell has various mitochondria (up to several thousand) and each of them contains several circular DNA molecules. Mitochondria can therefore offer abundant source of DNA even when sample tissue is limited.

In plants, however, mitochondrial genes are not appropriate for DNA barcoding because they exhibit low mutation rates. A few candidate genes have been found in the chloroplast genome, the most promising being maturase K gene (matK) by itself or in association with other genes. Multi-locus markers such as ribosomal internal transcribed spacers (ITS DNA) along with matK, rbcL, trnH or other genes have also been used for species identification. The best discrimination between plant species has been achieved when using two or more chloroplast barcodes.

For bacteria, the small subunit of ribosomal RNA (16S) gene can be used for different taxa, as it is highly conserved. Some studies suggest COI, type II chaperonin (cpn60) or β subunit of RNA polymerase (rpoB) also could serve as bacterial DNA barcodes.

Barcoding fungi is more challenging, and more than one primer combination might be required. The COI marker performs well in certain fungi groups, but not equally well in others. Therefore, additional markers are being used, such as ITS rDNA and the large subunit of nuclear ribosomal RNA (LSU).

Within the group of protists, various barcodes have been proposed, such as the D1–D2 or D2–D3 regions of 28S rDNA, V4 subregion of 18S rRNA gene, ITS rDNA and COI. Additionally, some specific barcodes can be used for photosynthetic protists, for example the large subunit of ribulose-1,5-bisphosphate carboxylase-oxygenase gene (rbcL) and the chloroplastic 23S rRNA gene.

**Markers that have been used for DNA barcoding in different organism groups, modified from Purty and Chatterjee.**
Organism group	Marker gene/locus
Animals	COI, Cytb, 12S, 16S
Plants	matK, rbcL, psbA-trnH, ITS
Bacteria	COI, rpoB, 16S, cpn60, tuf, RIF, gnd
Fungi	ITS, RPB1 (LSU), RPB2 (LSU), 18S (SSU)
Protists	ITS, COI, rbcL, 18S, 28S

Reference libraries and bioinformatics

Reference libraries are used for the taxonomic identification, also called annotation, of sequences obtained from barcoding or metabarcoding. These databases contain the DNA barcodes assigned to previously identified taxa. Most reference libraries do not cover all species within an organism group, and new entries are continually created. In the case of macro- and many microorganisms (such as algae), these reference libraries require detailed documentation (sampling location and date, person who collected it, image, etc.) and authoritative taxonomic identification of the voucher specimen, as well as submission of sequences in a particular format. The process also requires the storage of voucher specimens in museum collections and other collaborating institutions. Both taxonomically comprehensive coverage and content quality are important for identification accuracy. Several reference databases exist depending on the organism group and the genetic marker used. There are smaller, national databases (e.g. FinBOL), and large consortia like the International Barcode of Life Project (iBOL).

BOLD

Launched in 2007, the Barcode of Life Data System (BOLD) is one of the biggest databases, containing more than 450 000 BINs (Barcode Index Numbers) in 2019. It is a freely accessible repository for the specimen and sequence records for barcode studies, and it is also a workbench aiding the management, quality assurance and analysis of barcode data. The database mainly contains BIN records for animals based on the COI genetic marker.

UNITE

The UNITE database was launched in 2003 and is a reference database for the molecular identification of fungal species with the internal transcribed spacer (ITS) genetic marker region. This database is based on the concept of species hypotheses: you choose the % at which you want to work, and the sequences are sorted in comparison to sequences obtained from voucher specimens identified by experts.

Diat.barcode

Diat.barcode database was first published under the name R-syst::diatom in 2016 starting with data from two sources: the Thonon culture collection (TCC) in the hydrobiological station of the French National Institute for Agricultural Research (INRA), and from the NCBI (National Center for Biotechnology Information) nucleotide database. Diat.barcode provides data for two genetic markers, rbcL (Ribulose-1,5-bisphosphate carboxylase/oxygenase) and 18S (18S ribosomal RNA). The database also involves additional, trait information of species, like morphological characteristics (biovolume, size dimensions, etc.), life-forms (mobility, colony-type, etc.) or ecological features (pollution sensitivity, etc.).

Bioinformatic analysis

In order to obtain well structured, clean and interpretable data, raw sequencing data must be processed using bioinformatic analysis. The FASTQ file with the sequencing data contains two types of information: the sequences detected in the sample (FASTA file) and a quality file with quality scores (PHRED scores) associated with each nucleotide of each DNA sequence. The PHRED scores indicate the probability with which the associated nucleotide has been correctly scored.

**PHRED quality score and the associated certainty level**
10	90%
20	99%
30	99.9%
40	99.99%
50	99.999%

In general, the PHRED score decreases towards the end of each DNA sequence. Thus some bioinformatics pipelines simply cut the end of the sequences at a defined threshold.

Some sequencing technologies, like MiSeq, use paired-end sequencing during which sequencing is performed from both directions producing better quality. The overlapping sequences are then aligned into contigs and merged. Usually, several samples are pooled in one run, and each sample is characterized by a short DNA fragment, the tag. In a demultiplexing step, sequences are sorted using these tags to reassemble the separate samples. Before further analysis, tags and other adapters are removed from the barcoding sequence DNA fragment. During trimming, the bad quality sequences (low PHRED scores), or sequences that are much shorter or longer than the targeted DNA barcode, are removed. The following dereplication step is the process where all of the quality-filtered sequences are collapsed into a set of unique reads (individual sequence units ISUs) with the information of their abundance in the samples. After that, chimeras (i.e. compound sequences formed from pieces of mixed origin) are detected and removed. Finally, the sequences are clustered into OTUs (Operational Taxonomic Units), using one of many clustering strategies. The most frequently used bioinformatic softwares include Mothur, Uparse, Qiime, Galaxy, Obitools, JAMP, and DADA2.

Comparing the abundance of reads, i.e. sequences, between different samples is still a challenge because both the total number of reads in a sample as well as the relative amount of reads for a species can vary between samples, methods, or other variables. For comparison, one may then reduce the number of reads of each sample to the minimal number of reads of the samples to be compared – a process called rarefaction. Another way is to use the relative abundance of reads.

Species identification and taxonomic assignment

The taxonomic assignment of the OTUs to species is achieved by matching of sequences to reference libraries. The Basic Local Alignment Search Tool (BLAST) is commonly used to identify regions of similarity between sequences by comparing sequence reads from the sample to sequences in reference databases. If the reference database contains sequences of the relevant species, then the sample sequences can be identified to species level. If a sequence cannot be matched to an existing reference library entry, DNA barcoding can be used to create a new entry.

In some cases, due to the incompleteness of reference databases, identification can only be achieved at higher taxonomic levels, such as assignment to a family or class. In some organism groups such as bacteria, taxonomic assignment to species level is often not possible. In such cases, a sample may be assigned to a particular operational taxonomic unit (OTU).

Applications

Applications of DNA barcoding include identification of new species, safety assessment of food, identification and assessment of cryptic species, detection of alien species, identification of endangered and threatened species, linking egg and larval stages to adult species, securing intellectual property rights for bioresources, framing global management plans for conservation strategies and elucidate feeding niches. DNA barcode markers can be applied to address basic questions in systematics, ecology, evolutionary biology and conservation, including community assembly, species interaction networks, taxonomic discovery, and assessing priority areas for environmental protection.

Identification of species

Specific short DNA sequences or markers from a standardized region of the genome can provide a DNA barcode for identifying species. Molecular methods are especially useful when traditional methods are not applicable. DNA barcoding has great applicability in identification of larvae for which there are generally few diagnostic characters available, and in association of different life stages (e.g. larval and adult) in many animals. Identification of species listed in the Convention of the International Trade of Endangered Species (CITES) appendixes using barcoding techniques is used in monitoring of illegal trade.

Detection of invasive species

Alien species can be detected via barcoding. Barcoding can be suitable for detection of species in e.g. border control, where rapid and accurate morphological identification is often not possible due to similarities between different species, lack of sufficient diagnostic characteristics and/or lack of taxonomic expertise. Barcoding and metabarcoding can also be used to screen ecosystems for invasive species, and to distinguish between an invasive species and native, morphologically similar, species.

Delimiting cryptic species

DNA barcoding enables the identification and recognition of cryptic species. The results of DNA barcoding analyses depend however upon the choice of analytical methods, so the process of delimiting cryptic species using DNA barcodes can be as subjective as any other form of taxonomy. Hebert et al.(2004) concluded that the butterfly Astraptes fulgerator in north-western Costa Rica actually consists of 10 different species. These results, however, were subsequently challenged by Brower (2006), who pointed out numerous serious flaws in the analysis, and concluded that the original data could support no more than the possibility of three to seven cryptic taxa rather than ten cryptic species. Smith et al. (2007) used cytochrome c oxidase I DNA barcodes for species identification of the 20 morphospecies of Belvosia parasitoid flies (Diptera: Tachinidae) reared from caterpillars (Lepidoptera) in Area de Conservación Guanacaste (ACG), northwestern Costa Rica. These authors discovered that barcoding raises the species count to 32, by revealing that each of the three parasitoid species, previously considered as generalists, actually are arrays of highly host-specific cryptic species. For 15 morphospecies of polychaetes within the deep Antarctic benthos studied through DNA barcoding, cryptic diversity was found in 50% of the cases. Furthermore, 10 previously overlooked morphospecies were detected, increasing the total species richness in the sample by 233%.

Barcoding is a tool to vouch for food quality. Here, DNA from traditional Norwegian Christmas food is extracted at the molecular systematic lab at NTNU University Museum.

Diet analysis and food web application

DNA barcoding and metabarcoding can be useful in diet analysis studies, and is typically used if prey specimens cannot be identified based on morphological characters. There is a range of sampling approaches in diet analysis: DNA metabarcoding can be conducted on stomach contents, feces, saliva or whole body analysis. In fecal samples or highly digested stomach contents, it is often not possible to distinguish tissue from single species, and therefore metabarcoding can be applied instead. Feces or saliva represent non-invasive sampling approaches, while whole body analysis often means that the individual needs to be killed first. For smaller organisms, sequencing for stomach content is then often done by sequencing the entire animal.

Barcoding for food safety

DNA barcoding represents an essential tool to evaluate the quality of food products. The purpose is to guarantee food traceability, to minimize food piracy, and to valuate local and typical agro-food production. Another purpose is to safeguard public health; for example, metabarcoding offers the possibility to identify groupers causing Ciguatera fish poisoning from meal remnants, or to separate poisonous mushrooms from edible ones (Ref).

Biomonitoring and ecological assessment

DNA barcoding can be used to assess the presence of endangered species for conservation efforts (Ref), or the presence of indicator species reflective to specific ecological conditions (Ref), for example excess nutrients or low oxygen levels.

Potentials and shortcomings

Potentials

Traditional bioassessment methods are well established internationally, and serve biomonitoring well, as for example for aquatic bioassessment within the EU Directives WFD and MSFD. However, DNA barcoding could improve traditional methods for the following reasons; DNA barcoding (i) can increase taxonomic resolution and harmonize the identification of taxa which are difficult to identify or lack experts, (ii) can more accurately/precisely relate environmental factors to specific taxa (iii) can increase comparability among regions, (iv) allows for the inclusion of early life stages and fragmented specimens, (v) allows delimitation of cryptic/rare species (vi) allows for development of new indices e.g. rare/cryptic species which may be sensitive/tolerant to stressors, (vii) increases the number of samples which can be processed and reduces processing time resulting in increased knowledge of species ecology, (viii) is a non-invasive way of monitoring when using eDNA methods.

Time and cost

DNA barcoding is faster than traditional morphological methods all the way from training through to taxonomic assignment. It takes less time to gain expertise in DNA methods than becoming an expert in taxonomy. In addition, the DNA barcoding workflow (i.e. from sample to result) is generally quicker than traditional morphological workflow and allows the processing of more samples.

Taxonomic resolution

DNA barcoding allows the resolution of taxa from higher (e.g. family) to lower (e.g. species) taxonomic levels, that are otherwise too difficult to identify using traditional morphological methods, like e.g. identification via microscopy. For example, Chironomidae (the non-biting midge) are widely distributed in both terrestrial and freshwater ecosystems. Their richness and abundance make them important for ecological processes and networks, and they are one of many invertebrate groups used in biomonitoring. Invertebrate samples can contain as many as 100 species of chironomids which often make up as much as 50% of a sample. Despite this, they are usually not identified below the family level because of the taxonomic expertise and time required. This may result in different chironomid species with different ecological preferences grouped together, resulting in inaccurate assessment of water quality.

DNA barcoding provides the opportunity to resolve taxa, and directly relate stressor effects to specific taxa such as individual chironomid species. For example, Beermann et al. (2018) DNA barcoded Chironomidae to investigate their response to multiple stressors; reduced flow, increased fine-sediment and increased salinity. After barcoding, it was found that the chironomid sample consisted of 183 Operational Taxonomic Units (OTUs), i.e. barcodes (sequences) that are often equivalent to morphological species. These 183 OTUs displayed 15 response types rather than the previously reported two response types recorded when all chironomids were grouped together in the same multiple stressor study. A similar trend was discovered in a study by Macher et al. (2016) which discovered cryptic diversity within the New Zealand mayfly species Deleatidium sp. This study found different response patterns of 12 molecular distinct OTUs to stressors which may change the consensus that this mayfly is sensitive to pollution.

Shortcomings

Despite the advantages offered by DNA barcoding, it has also been suggested that DNA barcoding is best used as a complement to traditional morphological methods. This recommendation is based on multiple perceived challenges.

Physical parameters

It is not completely straightforward to connect DNA barcodes with ecological preferences of the barcoded taxon in question, as is needed if barcoding is to be used for biomonitoring. For example, detecting target DNA in aquatic systems depends on the concentration of DNA molecules at a site, which in turn can be affected by many factors. The presence of DNA molecules also depends on dispersion at a site, e.g. direction or strength of currents. It is not really known how DNA moves around in streams and lakes, which makes sampling difficult. Another factor might be the behavior of the target species, e.g. fish can have seasonal changes of movements, crayfish or mussels will release DNA in larger amounts just at certain times of their life (moulting, spawning). For DNA in soil, even less is known about distribution, quantity or quality.

Incomplete barcode reference libraries

The major limitation of the barcoding method is that it relies on barcode reference libraries for the taxonomic identification of the sequences. The taxonomic identification is accurate only if a reliable reference is available. However, most databases are still incomplete, especially for smaller organisms e.g. fungi, phytoplankton, nematoda etc. In addition, current databases contain misidentifications, spelling mistakes and other errors. There is massive curation and completion effort around the databases for all organisms necessary, involving large barcoding projects (for example the iBOL project for the Barcode of Life Data Systems (BOLD) reference database). However, completion and curation are difficult and time-consuming. Without vouchered specimens, there can be no certainty about whether the sequence used as a reference is correct. DNA sequence databases like GenBank contain many sequences that are not tied to vouchered specimens (for example, herbarium specimens, cultured cell lines, or sometimes images). This is problematic in the face of taxonomic issues such as whether several species should be split or combined, or whether past identifications were sound. Therefore, best practice for DNA barcoding is to sequence vouchered specimens. For many taxa, it can be however difficult to obtain reference specimens, for example with specimens that are difficult to catch, available specimens are poorly conserved, or adequate taxonomic expertise is lacking. Importantly, DNA barcodes can also be used to create interim taxonomy, in which case OTUs can be used as substitutes for traditional Latin binomials – thus significantly reducing dependency on fully populated reference databases.

Technological bias

DNA barcoding also carries methodological bias, from sampling to bioinformatics data analysis. Beside the risk of contamination of the DNA sample by PCR inhibitors, primer bias is one of the major sources of errors in DNA barcoding. The isolation of an efficient DNA marker and the design of primers is a complex process and considerable effort has been made to develop primers for DNA barcoding in different taxonomic groups. However, primers will often bind preferentially to some sequences, leading to differential primer efficiency and specificity and unrepresentative communities’ assessment and richness inflation. Thus, the composition of the sample's communities sequences is mainly altered at the PCR step. Besides, PCR replication is often required, but leads to an exponential increase in the risk of contamination. Several studies have highlighted the possibility to use mitochondria-enriched samples or PCR-free approaches to avoid these biases, but as of today, the DNA metabarcoding technique is still based on the sequencing of amplicons. Other bias enter the picture during the sequencing and during the bioinformatic processing of the sequences, like the creation of chimeras.

Lack of standardization

Even as DNA barcoding is more widely used and applied, there is no agreement concerning the methods for DNA preservation or extraction, the choices of DNA markers and primers set, or PCR protocols. The parameters of bioinformatics pipelines (for example OTU clustering, taxonomic assignment algorithms or thresholds etc.) are at the origin of much debate among DNA barcoding users. Sequencing technologies are also rapidly evolving, together with the tools for the analysis of the massive amounts of DNA data generated, and standardization of the methods is urgently needed to enable collaboration and data sharing at greater spatial and time-scale. This standardisation of barcoding methods at the European scale is part of the objectives of the European COST Action DNAqua-net and is also addressed by CEN (the European Committee for Standardization).

Another criticism of DNA barcoding is its limited efficiency for accurate discrimination below species level (for example, to distinguish between varieties), for hybrid detection, and that it can be affected by evolutionary rates (Ref needed).

Mismatches between conventional (morphological) and barcode based identification

It is important to know that taxa lists derived by conventional (morphological) identification are not, and maybe never will be, directly comparable to taxa lists derived from barcode based iendtification because of several reasons. The most important cause is probably the incompleteness and lack of accuracy of the molecular reference databases preventing a correct taxonomic assignment of eDNA sequences. Taxa not present in reference databases will not be found by eDNA, and sequences linked to a wrong name will lead to incorrect identification. Other known causes are a different sampling scale and size between a traditional and a molecular sample, the possible analysis of dead organisms, which can happen in different ways for both methods depending on organism group, and the specific selection of identification in either method, i.e. varying taxonomical expertise or possibility to identify certain organism groups, respectively primer bias leading also to a potential biased analysis of taxa.

Estimates of richness/diversity

DNA Barcoding can result in an over or underestimate of species richness and diversity. Some studies suggest that artifacts (identification of species not present in a community) are a major cause of inflated biodiversity. The most problematic issue are taxa represented by low numbers of sequencing reads. These reads are usually removed during the data filtering process, since different studies suggest that most of these low-frequency reads may be artifacts. However, real rare taxa may exist among these low-abundance reads. Rare sequences can reflect unique lineages in communities which make them informative and valuable sequences. Thus, there is a strong need for more robust bioinformatics algorithms that allow the differentiation between informative reads and artifacts. Complete reference libraries would also allow a better testing of bioinformatics algorithms, by permitting a better filtering of artifacts (i.e. the removal of sequences lacking a counterpart among extant species) and therefore, it would be possible obtain a more accurate species assignment. Cryptic diversity can also result in inflated biodiversity as one morphological species may actually split into many distinct molecular sequences.

DNA metabarcoding

Differences in the standard methods for DNA barcoding & metabarcoding. While DNA barcoding points to find a specific species, metabarcoding looks for the whole community.

DNA metabarcoding is defined as the barcoding of DNA or eDNA (environmental DNA) that allows for simultaneous identification of many taxa within the same (environmental) sample, however often within the same organism group. The main difference between the approaches is that metabarcoding, in contrast to barcoding, does not focus on one specific organism, but instead aims to determine species composition within a sample.

Methodology

The metabarcoding procedure, like general barcoding, covers the steps of DNA extraction, PCR amplification, sequencing and data analysis. A barcode consists of a short variable gene region (for example, see different markers/barcodes) which is useful for taxonomic assignment flanked by highly conserved gene regions which can be used for primer design. Different genes are used depending if the aim is to barcode single species or metabarcoding several species. In the latter case, a more universal gene is used. Metabarcoding does not use single species DNA/RNA as a starting point, but DNA/RNA from several different organisms derived from one environmental or bulk sample.

Applications

Metabarcoding has the potential to complement biodiversity measures, and even replace them in some instances, especially as the technology advances and procedures gradually become cheaper, more optimized and widespread.

DNA metabarcoding applications include:

Biodiversity monitoring in terrestrial and aquatic environments

Paleontology and ancient ecosystems

Plant-pollinator interactions

Diet analysis

Food safety

Advantages and challenges

The general advantages and shortcomings for barcoding reviewed above are valid also for metabarcoding. One particular drawback for metabarcoding studies is that there is no consensus yet regarding the optimal experimental design and bioinformatics criteria to be applied in eDNA metabarcoding. However, there are current joined attempts, like e.g. the EU COST network DNAqua-Net, to move forward by exchanging experience and knowledge to establish best-practice standards for biomonitoring.

Conservation genetics

From Wikipedia, the free encyclopedia

https://en.wikipedia.org/wiki/Conservation_genetics

Conservation genetics is an interdisciplinary subfield of Population Genetics that aims to understand the dynamics of genes in populations principally to avoid extinction. Therefore, it applies genetic methods to the conservation and restoration of biodiversity. Researchers involved in conservation genetics come from a variety of fields including population genetics, molecular ecology, biology, evolutionary biology, and systematics. Genetic diversity is one of the three fundamental levels of biodiversity, so it is directly important in conservation. Genetic variability influences both the health and long-term survival of populations because decreased genetic diversity has been associated with reduced fitness, such as high juvenile mortality, diminished population growth, reduced immunity, and ultimately, higher extinction risk.

Genetic diversity

Genetic diversity is the variability of genes in a species. A number of means can express the level of genetic diversity: observed heterozygosity, expected heterozygosity, the mean number of alleles per locus, or the percentage of polymorphic loci.

Importance of genetic diversity

Genetic diversity determines the potential fitness of a population and ultimately its long-term persistence, because genes encode phenotypic information. Extinction risk has been associated with low genetic diversity and several researchers have documented reduced fitness in populations with low genetic diversity. For example, low heterozigosity has been associated with low juvenile survival, reduced population growth, low body size, and diminished adult lifespan.

Heterozygosity, a fundamental measurement of genetic diversity in population genetics, plays an important role in determining the chance of a population surviving environmental change, novel pathogens not previously encountered, as well as the average fitness of a population over successive generations. Heterozygosity is also deeply connected, in population genetics theory, to population size (which itself clearly has a fundamental importance to conservation). All things being equal, small populations will be less heterozygous - across their whole genomes - than comparable, but larger, populations. This lower heterozygosity (i.e. low genetic diversity) renders small populations more susceptible to the challenges mentioned above.

In a small population, over successive generations and without gene flow, the probability of mating with close relatives becomes very high, leading to inbreeding depression - a reduction in fitness of the population. The reduced fitness of the offspring of closely-related individuals is fundamentally tied to the concept of heterozygosity, as the offspring of these kinds of pairings are, by necessity, less heterozygous (more homozygous) across their whole genomes than outbred individuals. A diploid individual with the same maternal and paternal grandfather, for example, will have a much higher chance of being homozygous at any loci inherited from the paternal copies of each of their parents' genomes than would an individual with unrelated maternal and paternal grandfathers (each diploid individual inherits one copy of their genome from their mother and one from their father).

High homozygosity (low heterozygosity) reduces fitness because it exposes the phenotypic effects of recessive alleles at homozygous sites. Selection can favour the maintenance of alleles which reduce the fitness of homozygotes, the textbook example being the sickle-cell beta-globin allele, which is maintained at high frequencies in populations where malaria is endemic due to the highly adaptive heterozygous phenotype (resistance to the malarial parasite, Plasmodium falciparum).

Low genetic diversity also reduces the opportunities for chromosomal crossover during meiosis to create new combinations of alleles on chromosomes, effectively increasing the average length of unrecombined tracts of chromosomes inherited from parents. This in turn reduces the efficacy of selection, across successive generations, to remove fitness-reducing alleles and promote fitness-enhancing allelels from a population. (A simple hypothetical example would be two adjacent genes - A and B - on the same chromosome in an individual. If the allele at A promotes fitness "one point", while the allele at B reduces fitness "one point", but the two genes are inherited together, then selection can't favour the allele at A while penalising the allele at B - the fitness balance is "zero points". Recombination can swap out alternative alleles at A and B, allowing selection to promote the optimal alleles to the optimal frequencies in the population - but only if there are alternative alleles to choose between!)

The fundamental connection between genetic diversity and population size in population genetics theory can be clearly seen in the classic population genetics measure of genetic diversity, the Watterson estimator, in which genetic diversity is measured as a function of effective population size and mutation rate. Given the relationship between population size, mutation rate, and genetic diversity, it is clearly important to recognise populations at risk of losing genetic diversity before problems arise as a result of the loss of that genetic diversity. Once lost, genetic diversity can only be restored by mutation and gene flow. If a species is already on the brink of extinction there will likely be no populations to use to restore diversity by gene flow, and any given population will (by definition) be small and therefore diversity will accumulate in that population by mutation much more slowly than it would in a comparable, but bigger, population (since there are fewer individuals whose genomes are mutating in a smaller population than a bigger population).

Contributors to extinction

Inbreeding and inbreeding depression.
The accumulation of deleterious mutations
A decrease in frequency of heterozygotes in a population, or heterozygosity, which decreases a species' ability to evolve to deal with change in the environment.
Outbreeding depression
Fragmented populations
Taxonomic uncertainties, which can lead to a reprioritization of conservation efforts
Genetic drift as the main evolutionary process, instead of natural selection
Management units within species

Techniques

Specific genetic techniques are used to assess the genomes of a species regarding specific conservation issues as well as general population structure. This analysis can be done in two ways, with current DNA of individuals or historic DNA.

Techniques for analysing the differences between individuals and populations include

These different techniques focus on different variable areas of the genomes within animals and plants. The specific information that is required determines which techniques are used and which parts of the genome are analysed. For example, mitochondrial DNA in animals has a high substitution rate, which makes it useful for identifying differences between individuals. However, it is only inherited in the female line, and the mitochondrial genome is relatively small. In plants, the mitochondrial DNA has very high rates of structural mutations, so is rarely used for genetic markers, as the chloroplast genome can be used instead. Other sites in the genome that are subject to high mutation rates such as the major histocompatibility complex, and the microsatellites and minisatellites are also frequently used.

These techniques can provide information on long-term conservation of genetic diversity and expound demographic and ecological matters such as taxonomy.

Another technique is using historic DNA for genetic analysis. Historic DNA is important because it allows geneticists to understand how species reacted to changes to conditions in the past. This is a key to understanding the reactions of similar species in the future.

Techniques using historic DNA include looking at preserved remains found in museums and caves. Museums are used because there is a wide range of species that are available to scientists all over the world. The problem with museums is that, historical perspectives are important because understanding how species reacted to changes in conditions in the past is a key to understanding reactions of similar species in the future. Evidence found in caves provides a longer perspective and does not disturb the animals.

Another technique that relies on specific genetics of an individual is noninvasive monitoring, which uses extracted DNA from organic material that an individual leaves behind, such as a feather. This too avoids disrupting the animals and can provide information about the sex, movement, kinship and diet of an individual.

Other more general techniques can be used to correct genetic factors that lead to extinction and risk of extinction. For example, when minimizing inbreeding and increasing genetic variation multiple steps can be taken. Increasing heterozygosity through immigration, increasing the generational interval through cryopreservation or breeding from older animals, and increasing the effective population size through equalization of family size all helps minimize inbreeding and its effects. Deleterious alleles arise through mutation, however certain recessive ones can become more prevalent due to inbreeding. Deleterious mutations that arise from inbreeding can be removed by purging, or natural selection. Populations raised in captivity with the intent of being reintroduced in the wild suffer from adaptations to captivity.

Inbreeding depression, loss of genetic diversity, and genetic adaptation to captivity are disadvantageous in the wild, and many of these issues can be dealt with through the aforementioned techniques aimed at increasing heterozygosity. In addition creating a captive environment that closely resembles the wild and fragmenting the populations so there is less response to selection also help reduce adaptation to captivity.

Solutions to minimize the factors that lead to extinction and risk of extinction often overlap because the factors themselves overlap. For example, deleterious mutations are added to populations through mutation, however the deleterious mutations conservation biologists are concerned with are ones that are brought about by inbreeding, because those are the ones that can be taken care of by reducing inbreeding. Here the techniques to reduce inbreeding also help decrease the accumulation of deleterious mutations.

Applications

These techniques have wide ranging applications. One application of these specific molecular techniques is in defining species and sub-species of salmonids. Hybridization is an especially important issue in salmonids and this has wide ranging conservation, political, social and economic implications. In Cutthroat Trout mtDNA and alloenzyme analysis, hybridization between native and non-native species was shown to be one of the major factors contributing to the decline in their populations. This led to efforts to remove some hybridized populations so native populations could breed more readily. Cases like these impact everything from the economy of local fishermen to larger companies, such as timber. Specific molecular techniques led to a closer analysis of taxonomic relationships, which is one factor that can lead to extinctions if unclear.

Implications

New technology in conservation genetics has many implications for the future of conservation biology. At the molecular level, new technologies are advancing. Some of these techniques include the analysis of minisatellites and MHC. These molecular techniques have wider effects from clarifying taxonomic relationships, as in the previous example, to determining the best individuals to reintroduce to a population for recovery by determining kinship. These effects then have consequences that reach even further. Conservation of species has implications for humans in the economic, social, and political realms. In the biological realm increased genotypic diversity has been shown to help ecosystem recovery, as seen in a community of grasses which was able to resist disturbance to grazing geese through greater genotypic diversity. Because species diversity increases ecosystem function, increasing biodiversity through new conservation genetic techniques has wider reaching effects than before.

A short list of studies a conservation geneticist may research include:

Phylogenetic classification of species, subspecies, geographic races, and populations, and measures of phylogenetic diversity and uniqueness.
Identifying hybrid species, hybridization in natural populations, and assessing the history and extent of introgression between species.
Population genetic structure of natural and managed populations, including identification of Evolutionary Significant Units (ESUs) and management units for conservation.
Assessing genetic variation within a species or population, including small or endangered populations, and estimates such as effective population size (Ne).
Measuring the impact of inbreeding and outbreeding depression, and the relationship between heterozygosity and measures of fitness (see Fisher's fundamental theorem of natural selection).
Evidence of disrupted mate choice and reproductive strategy in disturbed populations.
Forensic applications, especially for the control of trade in endangered species.
Practical methods for monitoring and maximizing genetic diversity during captive breeding programs and re-introduction schemes, including mathematical models and case studies.
Conservation issues related to the introduction of genetically modified organisms.
The interaction between environmental contaminants and the biology and health of an organism, including changes in mutation rates and adaptation to local changes in the environment (e.g. industrial melanism).
New techniques for noninvasive genotyping.

Search This Blog

Sunday, January 5, 2020

Innocence Project

Founding

Mission

Overturned convictions

Work

Funding

Innocence Network

Causes of wrongful conviction

In popular culture

Film

Literature

Podcasts

Stage productions

Television

DNA barcoding

Background

Methodology

Sampling and preservation

DNA extraction, amplification and sequencing

Marker selection

Reference libraries and bioinformatics

Bioinformatic analysis

Species identification and taxonomic assignment

Applications

Identification of species

Detection of invasive species

Delimiting cryptic species

Diet analysis and food web application

Barcoding for food safety

Biomonitoring and ecological assessment

Potentials and shortcomings

Potentials

Time and cost

Taxonomic resolution

Shortcomings

Physical parameters

Incomplete barcode reference libraries

Technological bias

Lack of standardization

Mismatches between conventional (morphological) and barcode based identification

Estimates of richness/diversity

DNA metabarcoding

Methodology

Applications

Advantages and challenges

Conservation genetics

Genetic diversity

Importance of genetic diversity

Contributors to extinction

Techniques

Applications

Implications

Pure mathematics