Nanoinformatics is the application of informatics to nanotechnology.
It is an interdisciplinary field that develops methods and software
tools for understanding nanomaterials, their properties, and their
interactions with biological entities, and using that information more
efficiently. It differs from cheminformatics in that nanomaterials usually involve nonuniform
collections of particles that have distributions of physical properties
that must be specified. The nanoinformatics infrastructure includes ontologies for nanomaterials, file formats, and data repositories.
Nanoinformatics has applications for improving workflows in fundamental research, manufacturing, and environmental health, allowing the use of high-throughput data-driven methods to analyze broad sets of experimental results. Nanomedicine applications include analysis of nanoparticle-based pharmaceuticals for structure–activity relationships in a similar manner to bioinformatics.
Background
Context
of nanoinformatics as a convergence of science and practice at the
nexus of safety, health, well-being, and productivity; risk management;
and emerging nanotechnology.
While conventional chemicals are specified by their chemical composition, and concentration, nanoparticles have other physical properties that must be measured for a complete description, such as size, shape, surface properties, crystallinity, and dispersion state. In addition, preparations of nanoparticles are often non-uniform,
having distributions of these properties that must also be specified.
These molecular-scale properties influence their macroscopic chemical
and physical properties, as well as their biological effects. They are
important in both the experimental characterization of nanoparticles and their representation in an informatics system. The context of nanoinformatics is that effective development and
implementation of potential applications of nanotechnology requires the
harnessing of information at the intersection of safety, health,
well-being, and productivity; risk management; and emerging nanotechnology.
A graphical representation of a working definition of nanoinformatics as a life-cycle process
One working definition of nanoinformatics developed through the community-based Nanoinformatics 2020 Roadmap and subsequently expanded is:
Determining which information is relevant to meeting the safety,
health, well-being, and productivity objectives of the nanoscale
science, engineering, and technology community;
Developing and implementing effective mechanisms for collecting,
validating, storing, sharing, analyzing, modeling, and applying the
information;
Confirming that appropriate decisions were made and that desired
mission outcomes were achieved as a result of that information; and
finally
Conveying experience to the broader community, contributing to generalized knowledge, and updating standards and training.
Data representations
Although nanotechnology is the subject of significant
experimentation, much of the data are not stored in standardized formats
or broadly accessible. Nanoinformatics initiatives seek to coordinate
developments of data standards and informatics methods.
Ontologies
An overview of the eNanoMapper nanomaterial ontology
In the context of information science, an ontology is a formal representation of knowledge within a domain,
using hierarchies of terms including their definitions, attributes, and
relations. Ontologies provide a common terminology in a
machine-readable framework that facilitates sharing and discovery of data. Having an established ontology for nanoparticles is important for cancer nanomedicine due to the need of researchers to search, access, and analyze large amounts of data.
The NanoParticle Ontology is an ontology for the preparation,
chemical composition, and characterization of nanomaterials involved in
cancer research. It uses the Basic Formal Ontology framework and is implemented in the Web Ontology Language. It is hosted by the National Center for Biomedical Ontology and maintained on GitHub. The eNanoMapper Ontology is more recent and reuses wherever possible
already existing domain ontologies. As such, it reuses and extends the
NanoParticle Ontology, but also the BioAssay Ontology, Experimental Factor Ontology, Unit Ontology, and ChEBI.
File formats
Flowchart
depicting the ways to identify different components of a material
sample to guide the creation of an ISA-TAB-Nano Material file
ISA-TAB-Nano is a set of four spreadsheet-based file formats for representing and sharing nanomaterial data based on the ISA-TAB metadata standard. In Europe, other templates have been adopted that were developed by the Institute of Occupational Medicine, and by the Joint Research Centre for the NANoREG project.
Tools
Nanoinformatics is not limited to aggregating and sharing information
about nanotechnologies, but has many complementary tools, some
originating from chemoinformatics and bioinformatics.
Databases and repositories
Over the past decase, various publicly available nanomaterials
databases and repositories have been constructed to support
nanoinformatics and toxicology modelling. These databases often store standardised physicochemical, biological,
and toxicological data on engineered nanomaterials and offer model-ready
datasets to the scientific community to enable data reuse. Given the
limitation of data availability in nanoinformatics tasks, the curation
of large datasets and storage into accessible repositories is
prioritised to support computational modelling, regulatory assessment,
and data-driven research.
caNanoLab, developed by the U.S. National Cancer Institute, focuses on nanotechnologies related to biomedicine. The NanoMaterials Registry, maintained by RTI International, is a curated database of nanomaterials, and includes data from caNanoLab.
The eNanoMapper database, a project of the EU NanoSafety Cluster,
is a deployment of the database software developed in the eNanoMapper
project. It has since been used in other settings, such as the EU Observatory for NanoMaterials (EUON).
Other databases include the Center for the Environmental Implications of NanoTechnology's NanoInformatics Knowledge Commons (NIKC) and NanoDatabank, PEROSH's Nano Exposure & Contextual Information Database (NECID), Data and Knowledge on Nanomaterials (DaNa), NanoPharos and Springer Nature's Nano database.
Applications
Nanoinformatics has applications for improving workflows in fundamental research, manufacturing, and environmental health, allowing the use of high-throughput data-driven methods to analyze broad sets of experimental results.
Nanoinformatics is especially useful in nanoparticle-based cancer
diagnostics and therapeutics. They are very diverse in nature due to
the combinatorially large numbers of chemical and physical modifications
that can be made to them, which can cause drastic changes in their
functional properties. This leads to a combinatorial complexity that
far exceeds, for example, genomic data. Nanoinformatics can enable structure–activity relationship modelling for nanoparticle-based drugs. Nanoinformatics and biomolecular nanomodeling provide a route for effective cancer treatment. Nanoinformatics also enables a data-driven approach to the design of materials to meet health and environmental needs.
Modeling and NanoQSAR
Viewed as a workflow process, nanoinformatics deconstructs experimental studies using data, metadata, controlled vocabularies and ontologies
to populate databases so that trends, regularities and theories will be
uncovered for use as predictive computational tools. Models are
involved at each stage, some material (experiments, reference materials, model organisms)
and some abstract (ontology, mathematical formulae), and all intended
as a representation of the target system. Models can be used in
experimental design, may substitute for experiment or may simulate how a
complex system changes over time.
At present, nanoinformatics is an extension of bioinformatics
due to the great opportunities for nanotechnology in medical
applications, as well as to the importance of regulatory approvals to
product commercialization. In these cases, the models target, their
purposes, may be physico-chemical, estimating a property based on
structure (quantitative structure–property relationship, QSPR); or
biological, predicting biological activity based on molecular structure (quantitative structure–activity relationship, QSAR) or the time-course development of a simulation (physiologically based toxicokinetics, PBTK). Each of these has been explored for small moleculedrug development with a supporting body of literature.
Particles differ from molecular entities, especially in having
surfaces that challenge nomenclature system and QSAR/PBTK model
development. For example, particles do not exhibit an octanol–water partition coefficient, which acts as a motive force in QSAR/PBTK models; and they may dissolve in vivo or have band gaps. Illustrative of current QSAR and PBTK models are those of Puzyn et al. and Bachler et al. The OECD has codified regulatory acceptance criteria, and there are guidance roadmaps with supporting workshops to coordinate international efforts.
Communities
Communities active in nanoinformatics include the European UnionNanoSafety Cluster, The U.S. National Cancer Institute National Cancer Informatics Program's Nanotechnology Working Group, and the US–EU Nanotechnology Communities of Research.
Nanoinformatics roles, responsibilities, and communication interfaces
Individuals who engage in nanoinformatics can be viewed as fitting
across four categories of roles and responsibilities for nanoinformatics
methods and data:
Customers, who need either the methods to create the data, the
data itself, or both, and who specify the scientific applications and
characterization methods and data needs for their intended purposes;
Creators, who develop relevant and reliable methods and data to meet the needs of customers in the nanotechnology community;
Curators, who maintain and ensure the quality of the methods and associated data; and
Analysts, who develop and apply methods and models for data analysis
and interpretation that are consistent with the quality and quantity of
the data and that meet customers' needs.
In some instances, the same individuals perform all four roles. More
often, many individuals must interact, with their roles and
responsibilities extending over significant distances, organizations,
and time. Effective communication is important across each of the twelve
links (in both directions across each of the six pairwise interactions)
that exist among the various customers, creators, curators, and
analysts.
History
One of the first mentions of nanoinformatics was in the context of handling information about nanotechnology.
An early international workshop with substantial discussion of
the need for sharing all types of information on nanotechnology and
nanomaterials was the First International Symposium on Occupational
Health Implications of Nanomaterials held 12–14 October 2004 at the
Palace Hotel, Buxton, Derbyshire, UK. The workshop report included a presentation on Information Management for Nanotechnology Safety and Health that described the development of a Nanoparticle Information Library
(NIL) and noted that efforts to ensure the health and safety of
nanotechnology workers and members of the public could be substantially
enhanced by a coordinated approach to information management. The NIL
subsequently served as an example for web-based sharing of
characterization data for nanomaterials.
The National Cancer Institute prepared in 2009 a rough vision of, what was then still called, nanotechnology informatics, outlining various aspects of what nanoinformatics should comprise. This
was later followed by two roadmaps, detailing existing solutions,
needs, and ideas on how the field should further develop: the Nanoinformatics 2020 Roadmap and the EU US Roadmap Nanoinformatics 2030.
A 2013 workshop on nanoinformatics described current resources,
community needs and the proposal of a collaborative framework for data
sharing and information integration.
Stages in the origin of life process range from the well understood, such as the habitable Earth and the abiotic synthesis of simple molecules, to the largely unknown, like the derivation of the last universal common ancestor (LUCA) with its complex molecular functionalities.
Abiogenesis or the origin of life (sometimes called biopoesis) is the natural process by which life arises from non-living matter, such as simple organic compounds. The prevailing scientific hypothesis is that the transition from non-living to living entities on Earth was not a single event, but a process of increasing complexity involving the formation of a habitable planet, the prebiotic synthesis of organic molecules, molecular self-replication, self-assembly, autocatalysis, and the emergence of cell membranes.
The transition from non-life to life has not been observed
experimentally, but many proposals have been made for different stages
of the process.
The study of abiogenesis aims to determine how pre-life chemical reactions gave rise to life under conditions strikingly different from those on Earth today. It uses tools from biology and chemistry, attempting a synthesis of many sciences. Life functions through the chemistry of carbon and water, and builds on four chemical families: lipids for cell membranes, carbohydrates such as sugars, amino acids for protein metabolism, and the nucleic acidsDNA and RNA for heredity. A theory of abiogenesis must explain the origins and interactions of these classes of molecules.
Many approaches investigate how self-replicating molecules came into existence. Researchers think that life descends from an RNA world, although other self-replicating and self-catalyzing molecules may have preceded RNA. Other approaches ("metabolism-first" hypotheses) focus on how catalysis on the early Earth might have provided the precursor molecules for self-replication. The 1952 Miller–Urey experiment demonstrated that amino acids can be synthesized from inorganic compounds under conditions like early Earth's. Subsequently, amino acids have been found in meteorites, comets, asteroids, and star-forming regions of space.
While the last universal common ancestor
of all modern organisms (LUCA) existed millions of years after the
origin of life, its study can guide research into early universal
characteristics. A genomics approach has sought to characterize LUCA by identifying the genes shared by Archaea and Bacteria, major branches of life. It appears there are 60 proteins common to all life and 355 prokaryotic genes that trace to LUCA; their functions imply that LUCA was anaerobic with the Wood–Ljungdahl pathway, deriving energy by chemiosmosis, and used DNA, the genetic code, and ribosomes. Earlier cells might have had a leaky membrane and been powered by a naturally occurring proton gradient near a deep-sea white smoker hydrothermal vent; or, life may have originated inside the continental crust or in water at Earth's surface.
Although Earth is the only place known to harbor life, astrobiologists assume that life exists and came into being by similar processes on other planets. Geochemical and fossil evidence informs most studies. The Earth was formed at 4.54 Gya, and the earliest evidence of life on Earth dates from 3.8 Gya from Western Australia. Fossil micro-organisms may have lived in hydrothermal vent precipitates from Quebec, soon after ocean formation during the Hadean, so the process appears to have been relatively rapid in terms of geological time.
NASA's 2015 strategy for astrobiology
aimed to solve the puzzle of the origin of life – how a fully
functioning living system could emerge from non-living components –
through research on the prebiotic origin of life's chemicals, both in space and on planets, as well as the functioning of early biomolecules to catalyse reactions and support inheritance.
The challenge for origin of life researchers is to explain how such a complex and tightly interlinked
system could develop by evolutionary steps, as at first sight all its parts are necessary
to enable it to function. For example, a cell, whether the LUCA or in a
modern organism, copies its DNA with the DNA polymerase enzyme, which
is itself produced by translating the DNA polymerase gene in the DNA.
Neither the enzyme nor the DNA can be produced without the other. The evolutionary process could have started with molecular self-replication, self-assembly such as of cell membranes, and autocatalysis via RNA ribozymes in an RNA world environment. The transition of non-life to life has not been observed experimentally. Some scientists see both life and the origin of life as aspects of the same process.
The preconditions to the development of a living cell like the
LUCA are known, though disputed in detail: a habitable world is formed
with a supply of minerals and liquid water. Prebiotic synthesis creates a
range of simple organic compounds, which are assembled into polymers
such as proteins and RNA. On the other side, the process after the LUCA
is readily understood: biological evolution caused the development of a
wide range of species with varied forms and biochemical capabilities.
However, the derivation of the LUCA from simple components is far from
understood.
Although Earth remains the only place where life is known, the science of astrobiology
seeks evidence of life on other planets. The 2015 NASA strategy on the
origin of life aimed to solve the puzzle by identifying interactions,
intermediary structures and functions, energy sources, and environmental
factors that contributed to evolvable macromolecular systems, and mapping the chemical landscape of potential primordial informational polymers. The advent of such polymers was most likely a critical step in prebiotic chemical evolution. Those polymers derived, in turn, from simple organic compounds such as nucleobases, amino acids, and sugars, likely formed by reactions in the environment. A successful theory of the origin of life must explain how all these chemicals came into being.
The Miller–Urey experiment
was a synthesis of small organic molecules in a mixture of simple gases
in a thermal gradient created by heating (right) and cooling (left) the
mixture at the same time, with electrical discharges.
One ancient view of the origin of life, from Aristotle until the 19th century, was of spontaneous generation. This held that "lower" animals such as insects were generated by decaying organic substances, and that life arose by chance. This was questioned from the 17th century, in works like Thomas Browne's Pseudodoxia Epidemica. In 1665, Robert Hooke published the first drawings of a microorganism. In 1676, Antonie van Leeuwenhoek drew and described microorganisms, probably protozoa and bacteria. Van Leeuwenhoek disagreed with spontaneous generation, and by the 1680s
convinced himself, using experiments ranging from sealed and open meat
incubation and the close study of insect reproduction, that the theory
was incorrect. In 1668 Francesco Redi showed that no maggots appeared in meat when flies were prevented from laying eggs. By the middle of the 19th century, spontaneous generation was considered disproven.
Dating back to Anaxagoras in the 5th century BC, panspermia is the idea that life originated elsewhere in the universe and came to Earth. The modern version of panspermia holds that life may have been distributed to Earth by meteoroids, asteroids, comets or planetoids. This shifts the origin of life to another heavenly body. The advantage
is that life is not required to have formed on each planet it occurs on,
but in a more limited set of locations, and then spread about the galaxy to other star systems. There is some interest in the possibility that life originated on Mars and later transferred to Earth.
The idea that life originated from non-living matter in slow stages appeared in Herbert Spencer's 1864–1867 book Principles of Biology, and in William Turner Thiselton-Dyer's 1879 paper "On spontaneous generation and evolution". On 1 February 1871 Charles Darwin wrote about these publications to Joseph Hooker, and set out his own speculation that the original spark of life may have been in a "warm little pond, with all sorts of ammonia and phosphoric salts,—light,
heat, electricity &c present, that a protein compound was
chemically formed". Darwin explained that "at the present day such
matter would be instantly devoured or absorbed, which would not have
been the case before living creatures were formed."
Alexander Oparin in 1924 and J. B. S. Haldane in 1929 proposed that the earliest cells slowly self-organized from a primordial soup, the Oparin–Haldane hypothesis. Haldane suggested that the Earth's prebiotic oceans consisted of a "hot
dilute soup" in which organic compounds could have formed. J. D. Bernal showed that such mechanisms could form most of the necessary molecules for life from inorganic precursors. In 1967, he suggested three "stages": the origin of biological monomers; the origin of biological polymers; and the evolution from molecules to cells.
In 1952, Stanley Miller and Harold Urey
carried out a chemical experiment to demonstrate how organic molecules
could have formed spontaneously from inorganic precursors under prebiotic conditions like those posited by the Oparin–Haldane hypothesis. It used a highly reducing (lacking oxygen) mixture of gases—methane, ammonia, and hydrogen, with water vapor—to form organic monomers such as amino acids. Bernal said of the Miller–Urey experiment that "it is not enough to
explain the formation of such molecules, what is necessary, is a
physical-chemical explanation of the origins of these molecules that
suggests the presence of suitable sources and sinks for free energy." However, current scientific consensus describes the primitive atmosphere as weakly reducing or neutral, diminishing the amount and variety of amino acids that could be produced. The addition of iron and carbonate minerals, present in early oceans, produces a diverse array of amino acids. Later work has focused on two other potential reducing environments: outer space and deep-sea hydrothermal vents.
Soon after the Big Bang, roughly 14 Gya, the only chemical elements present in the universe were hydrogen, helium, and lithium,
the three lightest atoms in the periodic table. These elements
gradually accreted and began orbiting in disks of gas and dust.
Gravitational accretion of material at the hot and dense centers of
these protoplanetary disks formed stars by the fusion of hydrogen. Early stars were massive and short-lived, producing all the heavier elements by stellar nucleosynthesis. Such element formation proceeds to its most stable element Iron-56. Heavier elements were formed during supernovae at the end of a star's lifecycle. Carbon, currently the fourth most abundant element in the universe, was formed mainly in white dwarf stars. As these stars reached the end of their lifecycles,
they ejected heavier elements, including carbon and oxygen, throughout
the universe. These allowed for the formation of rocky planets. According to the nebular hypothesis, the Solar System began to form 4.6 Gya with the gravitational collapse of part of a giant molecular cloud. Most of the collapsing mass collected in the center, forming the Sun, while the rest flattened into a protoplanetary disk out of which the planets formed.
The age of the Earth is 4.54 Gya as found by radiometric dating of calcium-aluminium-rich inclusions in carbonaceous chrondrite meteorites, the oldest material in the Solar System. Earth, during the Hadean
eon (from its formation until 4.031 Gya,) was at first inhospitable to
life. During its formation, the Earth lost much of its initial mass, and
so lacked the gravity to hold molecular hydrogen and the bulk of the original inert gases. Soon after initial accretion of Earth at 4.48 Gya, its collision with Theia, a hypothesised impactor, is thought to have created the ejected debris that eventually formed the Moon. This impact removed the Earth's primary atmosphere, leaving behind
clouds of viscous silicates and carbon dioxide. This unstable atmosphere
was short-lived, soon condensing to form the bulk silicate Earth,
leaving behind an atmosphere largely consisting of water vapor, nitrogen, and carbon dioxide, with smaller amounts of carbon monoxide, hydrogen, and sulfur compounds. The solution of carbon dioxide in water is thought to have made the seas slightly acidic, with a pH of about 5.5.
Condensation to form liquid oceans is theorised to have occurred as early as the Moon-forming impact. This scenario is supported by the dating of 4.404 Gya zircon crystals with high δ18O values from metamorphosed quartzite of Mount Narryer in Western Australia. The Hadean atmosphere has been characterized as a "gigantic, productive
outdoor chemical laboratory," similar to volcanic gases today which
still support some abiotic chemistry. Despite the likely increased
volcanism from early plate tectonics, the Earth may have been a
predominantly water world between 4.4 and 4.3 Gya. It is debated whether
crust was exposed above this ocean. Immediately after the Moon-forming impact, Earth likely had little if any continental crust, a turbulent atmosphere, and a hydrosphere subject to intense ultraviolet light from a T Tauri stage Sun. It was also affected by cosmic radiation, and continued asteroid and comet impacts.
The Late Heavy Bombardment hypothesis posits that a period of intense impact occurred at 4.1 to 3.8 Gya during the Hadean and early Archean eons. Originally it was thought that the Late Heavy Bombardment was a single
cataclysmic impact event occurring at 3.9 Gya; this would have had the
potential to sterilize Earth by volatilizing liquid oceans and blocking
sunlight needed for photosynthesis, delaying the earliest possible
emergence of life. More recent research questioned the intensity of the Late Heavy
Bombardment and its potential for sterilisation. If it was not one giant
impact but a period of raised impact rate, it would have had much less
destructive power. The 3.9 Gya date arose from dating of Apollo mission sample returns collected mostly near the Imbrium Basin, biasing the age of recorded impacts. Impact modelling of the lunar surface reveals that rather than a
cataclysmic event at 3.9 Gya, multiple small-scale, short-lived periods
of bombardment likely occurred. Terrestrial data backs this idea by showing multiple periods of ejecta
in the rock record both before and after the 3.9 Gya marker, suggesting
that the early Earth was subject to continuous impacts with less impact
on extinction.
If life evolved in the ocean at depths of more than ten meters,
it would have been shielded both from late impacts and the then high
levels of ultraviolet radiation from the sun. The available energy is
maximized at 100–150 °C, the temperatures at which hyperthermophilic bacteria and thermoacidophilicarchaea live.
Based on evidence from the geologic record,
life most likely emerged on Earth between 4.32 and 3.48 Gya. In 2017,
the earliest physical evidence of life was reported to consist of microbialites in the Nuvvuagittuq Greenstone Belt of Northern Quebec, in banded iron formation
rocks at least 3.77 and possibly as old as 4.32 Gya. The
micro-organisms could have lived within hydrothermal vent precipitates,
soon after the 4.4 Gya formation of oceans
during the Hadean. The microbes resemble modern hydrothermal vent
bacteria, supporting the view that abiogenesis began in such an
environment. Later research disputed this interpretation of the data, stating that
the observations may be better explained by abiotic processes in
silica-rich waters, "chemical gardens," circulating hydrothermal fluids, or volcanic ejecta.
Biogenic graphite has been found in 3.7 Gya metasedimentary rocks from southwestern Greenland and in microbial mat fossils from 3.49 Gya cherts in the Pilbara region of Western Australia. Evidence of early life in rocks from Akilia Island, near the Isua supracrustal belt in southwestern Greenland, dating to 3.7 Gya, have shown biogenic carbon isotopes. In other parts of the Isua supracrustal belt, graphite inclusions trapped within garnet crystals are connected to the other elements of life: oxygen, nitrogen, and possibly phosphorus in the form of phosphate, providing further evidence for life 3.7 ;Gya. In the Pilbara region of Western Australia, compelling evidence of early life was found in pyrite-bearing
sandstone in a fossilized beach, with rounded tubular cells that
oxidized sulfur by photosynthesis in the absence of oxygen. Carbon isotope ratios on graphite inclusions from the Jack Hills
zircons suggest that life could have existed on Earth from 4.1 Gya. A 2024 study inferred LUCA's age as around 4.2 Gya (4.09–4.33 Gya) by
analysing pre-LUCA gene duplicates, with calibration from fossil
micro-organisms, much sooner after the origin of life than previously
thought.
The Pilbara region of Western Australia contains the Dresser Formation with rocks 3.48 Gya, including layered structures called stromatolites. Their modern counterparts are created by photosynthetic micro-organisms including cyanobacteria. These lie within undeformed hydrothermal-sedimentary strata; their
texture indicates a biogenic origin. Parts of the Dresser formation
preserve hot springs on land, but other regions seem to have been shallow seas. A molecular clock analysis suggests the LUCA emerged prior to 3.9 Gya.
All chemical elements derive from stellar nucleosynthesis except for hydrogen and some helium and lithium. Basic chemical ingredients of life – the carbon-hydrogen molecule (CH), the carbon-hydrogen positive ion (CH+) and the carbon ion (C+) – can be produced by ultraviolet light from stars. Complex molecules, including organic molecules, form naturally both in space and on planets. Organic molecules on the early Earth could have had either terrestrial
origins, with organic molecule synthesis driven by impact shocks or by
other energy sources, such as ultraviolet light, redox coupling, or electrical discharges; or extraterrestrial origins (pseudo-panspermia), with organic molecules formed in interstellar dust clouds raining down on to the planet.
An organic compound is a chemical whose molecules contain carbon. Carbon is abundant in the Sun, stars, comets, and in the atmospheres of most planets of the Solar System. Organic compounds are relatively common in space, formed by "factories
of complex molecular synthesis" which occur in molecular clouds and circumstellar envelopes, and chemically evolve after reactions are initiated mostly by ionizing radiation. Purine and pyrimidine nucleobases including guanine, adenine, cytosine, uracil, and thymine, as well as sugars, have been found in meteorites. These could have provided the materials for DNA and RNA to form on the early Earth. The amino acid glycine was found in material ejected from comet Wild 2; it had earlier been detected in meteorites. Comets are encrusted with dark material, thought to be a tar-like
organic substance formed from simple carbon compounds under ionizing
radiation. A rain of material from comets could have brought such
complex organic molecules to Earth. During the Late Heavy Bombardment, meteorites may have delivered up to five million tons of organic prebiotic elements to Earth per year. Currently 40,000 tons of cosmic dust falls to Earth each year.
Polycyclic aromatic hydrocarbons (PAH) are the most common and abundant polyatomic molecules in the observable universe, and are a major store of carbon. They seem to have formed shortly after the Big Bang, and are associated with new stars and exoplanets. They are a likely constituent of Earth's primordial sea. PAHs have been detected in nebulae, and in the interstellar medium, in comets, and in meteorites.
A star, HH 46-IR, resembling the sun early in its life, is
surrounded by a disk of material which contains molecules including
cyanide compounds, hydrocarbons, and carbon monoxide. PAHs in the interstellar medium can be transformed through hydrogenation, oxygenation, and hydroxylation to more complex organic compounds used in living cells.
Organic compounds introduced on Earth by interstellar dust particles can help to form complex molecules, thanks to their peculiar surface-catalytic activities. The RNA component uracil and related molecules, including xanthine, in the Murchison meteorite were likely formed extraterrestrially, as suggested by studies of 12C/13C isotopic ratios. NASA studies of meteorites suggest that all four DNA nucleobases
(adenine, guanine and related organic molecules) have been formed in
outer space. The cosmic dust permeating the universe contains complex organics ("amorphous organic solids with a mixed aromatic–aliphatic structure") that could be created rapidly by stars. Glycolaldehyde, a sugar molecule and RNA precursor, has been detected in regions of space including around protostars and on meteorites.
Laboratory synthesis
As early as the 1860s, experiments demonstrated that biologically
relevant molecules can be produced from interaction of simple carbon
sources with abundant inorganic catalysts. The spontaneous formation of
complex polymers from abiotically generated monomers under the
conditions posited by the "soup" theory is not straightforward. Besides
the necessary basic organic monomers, compounds that would have
prohibited the formation of polymers were also formed in high
concentration during the Miller–Urey experiment and Joan Oró experiments. Biology uses essentially 20 amino acids for its coded protein enzymes,
representing a very small subset of the structurally possible products.
Since life tends to use whatever is available, an explanation is needed
for why the set used is so small. Formamide is attractive as a medium that potentially provided a source
of amino acid derivatives from simple aldehyde and nitrile feedstocks.
The Breslow catalytic cycle for formaldehyde dimerization and C2-C6 sugar formation
Alexander Butlerov showed in 1861 that the formose reaction created sugars including tetroses, pentoses, and hexoses when formaldehyde
is heated under basic conditions with divalent metal ions like calcium.
R. Breslow proposed that the reaction was autocatalytic in 1959.
Nucleobases
Nucleobases, such as guanine and adenine, can be synthesized from simple carbon and nitrogen sources, such as hydrogen cyanide (HCN) and ammonia. On early Earth, HCN has been shown in modelling experiments to have
likely been supplied via photochemical production in transient, highly
reducing atmospheres (see Prebiotic atmosphere) following major impacts. Formamide,
produced by the reaction of water and HCN, is ubiquitous and produces
all four ribonucleotides when warmed with terrestrial minerals. It can
be concentrated by the evaporation of water. HCN is poisonous only to aerobic organisms,
which did not exist during the earliest phases of life's origin. It can
contribute to chemical processes such as the synthesis of the amino
acid glycine.
DNA and RNA components including uracil, cytosine and thymine can
be synthesized under outer space conditions, using starting chemicals
such as pyrimidine found in meteorites. Pyrimidine may have been formed
in red giant stars, in interstellar dust and gas clouds, or may have been synthesized on Earth via precursors such as cyanoacetylene and other intermediates made available following early asteroid impacts. All four RNA-bases may be synthesized from formamide in high-energy density events like extraterrestrial impacts. Several ribonucleotides for RNA formation have been synthesized in a laboratory environment which replicates prebiotic conditions via autocatalytic formose reaction.
Other pathways for synthesizing bases from inorganic materials have been reported. Freezing temperatures assist the synthesis of purines, by concentrating key precursors such as HCN. However, while adenine and guanine require freezing conditions, cytosine and uracil may require boiling temperatures. Seven amino acids and eleven types of nucleobases formed in ice when ammonia and cyanide were left in a freezer for 25 years. S-triazines
(alternative nucleobases), pyrimidines including cytosine and uracil,
and adenine can be synthesized by subjecting a urea solution to
freeze-thaw cycles under a reductive atmosphere with spark discharges. The unusual speed of these low-temperature reactions is due to eutectic freezing, which crowds impurities in microscopic pockets of liquid within the ice.
Peptides
Prebiotic peptide synthesis could have occurred by several routes.
Some center on high temperature/concentration conditions in which
condensation becomes energetically favorable, while others use plausible
prebiotic condensing agents.
Experimental evidence for the formation of peptides in uniquely
concentrated environments is bolstered by work suggesting that wet-dry
cycles and the presence of specific salts can greatly increase
spontaneous condensation of glycine into poly-glycine chains. Other work suggests that while mineral surfaces, such as those of
pyrite, calcite, and rutile catalyze peptide condensation, they also
catalyze their hydrolysis. The authors suggest that additional chemical
activation or coupling would be necessary to produce peptides at
sufficient concentrations. Thus, mineral surface catalysis, while
important, is not sufficient alone for peptide synthesis.
Many prebiotically plausible condensing/activating agents have
been identified, including the following: cyanamide, dicyanamide,
dicyandiamide, diaminomaleonitrile, urea, trimetaphosphate, NaCl, CuCl2, (Ni,Fe)S, CO, carbonyl sulfide (COS), carbon disulfide (CS2), SO2, and diammonium phosphate (DAP).
A 2024 experiment used a sapphire substrate with a web of thin cracks under a heat flow, mimicking deep-ocean vents,
to concentrate prebiotically-relevant building blocks from a dilute
mixture by up to three orders of magnitude. This could help to create
biopolymers such as peptides. A similar role has been suggested for clays, though this speculation has not been supported through experimental evidence.
The prebiotic synthesis of peptides from simpler molecules such as CO, NH3 and C, skipping the step of amino acid formation, is also very efficient.
The largest unanswered question in evolution is how simple protocells first arose and differed in reproductive contribution to the following generation, thus initiating evolution. The lipid world theory postulates that the first self-replicating object was lipid-like. Phospholipids form lipid bilayers
(as in cell membranes) in water while under agitation. These molecules
were not present on early Earth, but other membrane-forming amphiphilic long-chain molecules were. These bodies may expand by insertion of additional lipids, and may spontaneously split into two offspring
of similar size and composition. Lipid bodies may have provided
sheltering envelopes for information storage, allowing the evolution of
information-storing polymers like RNA. Only one or two types of
vesicle-forming amphiphiles have been studied. There is an enormous number of possible arrangements of lipid bilayer
membranes, and those with the best reproductive characteristics would
have converged toward a hypercycle reaction, a positive feedback
composed of two mutual catalysts represented by a membrane site and a
specific compound trapped in the vesicle. Such site/compound pairs are
transmissible to the daughter vesicles, leading to the emergence of
distinct lineages of vesicles, subject to natural selection.
A protocell is a self-organized, self-ordered, spherical collection of lipids proposed as a stepping-stone to life. A functional protocell has (as of 2014) not yet been achieved in a laboratory setting. Self-assembled vesicles are essential components of primitive cells. The theory of classical irreversible thermodynamics treats
self-assembly under a generalized chemical potential within the
framework of dissipative systems. The second law of thermodynamics requires that overall entropy increases, yet life is distinguished by its great degree of organization. Therefore, a boundary is needed to separate ordered life processes from chaotic non-living matter.
Irene Chen and Jack W. Szostak
suggest that elementary protocells can give rise to cellular behaviors
including primitive forms of differential reproduction, competition, and
energy storage. Competition for membrane molecules would favor stabilized membranes,
suggesting a selective advantage for cross-linked fatty acids and even
modern phospholipids. Such micro-encapsulation
would allow for metabolism within the membrane and the exchange of
small molecules, while retaining large biomolecules inside. Such a
membrane is needed for a cell to create its own electrochemical gradient. Fatty acid vesicles in alkaline hydrothermal vent conditions can be
stabilized by isoprenoids, synthesized by the formose reaction; the
advantages and disadvantages of isoprenoids within the lipid bilayer in
different microenvironments might have led to the divergence of the
membranes of archaea and bacteria.
Vesicles can undergo an evolutionary process under pressure cycling conditions. Simulating the systemic environment in tectonic fault zones within the Earth's crust, pressure cycling forms vesicles periodically, as well as random peptide
chains which are selected for ability to integrate into the vesicle
membrane. Further selection of vesicles for stability could lead to
functional peptide structures, increasing vesicle survival rate.
Life requires a loss of entropy, or disorder, as molecules organize
themselves into living matter. At the same time, the emergence of life
is associated with the formation of structures beyond a certain
threshold of complexity. The emergence of life with increasing order and complexity does not
contradict the second law of thermodynamics, which states that overall
entropy never decreases, since a living organism creates order in some
places (e.g. its living body) at the expense of an increase of entropy
elsewhere (e.g. heat and waste production).
Multiple sources of energy were available for chemical reactions on the early Earth. Heat from geothermal processes is a standard energy source for chemistry. Other examples include sunlight, lightning, atmospheric entries of micro-meteorites, and implosion of bubbles in sea and ocean waves. This has been confirmed by experiments and simulations. Unfavorable reactions can be driven by highly favorable ones, as in the
case of iron-sulfur chemistry. For example, this was probably important
for carbon fixation. Carbon fixation by reaction of CO2 with H2S
via iron-sulfur chemistry is favorable, and occurs at neutral pH and
100 °C. Iron-sulfur surfaces, which are abundant near hydrothermal
vents, can drive the production of small amounts of amino acids and
other biomolecules.
In 1961, Peter Mitchell proposed chemiosmosis
as a cell's primary system of energy conversion. The mechanism, now
ubiquitous in living cells, powers energy conversion in micro-organisms
and in the mitochondria of eukaryotes, making it a likely candidate for early life. Mitochondria produce adenosine triphosphate
(ATP), the energy currency of the cell used to drive cellular processes
such as chemical syntheses. The mechanism of ATP synthesis involves a
closed membrane in which the ATP synthase enzyme is embedded. The energy required to release strongly bound ATP has its origin in protons that move across the membrane. In modern cells, those proton movements are caused by the pumping of
ions across the membrane, maintaining an electrochemical gradient. In
the first organisms, the gradient could have been provided by the
difference in chemical composition between the flow from a hydrothermal
vent and the surrounding seawater, or perhaps meteoric quinones that were conducive to the development of
chemiosmotic energy across lipid membranes if at a terrestrial origin.
The RNA world hypothesis proposes that undirected polymerisation led to the emergence of ribozymes, and in turn to an RNA replicase.
The RNA world hypothesis describes an early Earth with self-replicating and catalytic RNA but no DNA or proteins. It was proposed in 1962 by Alexander Rich; the term was coined by Walter Gilbert in 1986. Many researchers concur that an RNA world must have preceded modern DNA-based life. However, it may not have been the first to exist. There may have been over 30 chemical events between pre-RNA world to near-LUCA, just involving RNA.
RNA is central to the translation process. Small RNAs can
catalyze all the chemical groups and information transfers required for
life. RNA both expresses and maintains genetic information in modern
organisms; its components are easily synthesized under early Earth
conditions. The structure of the ribosome has been called the "smoking gun", with a central core of RNA and no amino acid side chains within 18 Ã… of the active site that catalyzes peptide bond formation.
RNA replicase
can both code and catalyse further RNA replication, i.e. it is
autocatalytic. Some catalytic RNAs can link smaller RNA sequences
together, enabling self-replication.Natural selection would then favor the proliferation of such autocatalytic sets. Self-assembly of RNA may occur spontaneously in hydrothermal vents. A preliminary form of tRNA could have assembled into a replicator molecule. When this began to replicate, it may have had all three mechanisms of Darwinian selection: heritability, variation, and differential reproduction. Its fitness would have depended on its ability to adapt, determined by its nucleotide sequence, and resource availability.
From RNA to directed protein synthesis
In line with the RNA world hypothesis, much of modern biology's
templated protein biosynthesis is done by RNA molecules—namely tRNAs and
the ribosome (consisting of both protein and rRNA). The most central
reaction of peptide bond synthesis is carried out by base catalysis by
the 23S rRNA domain V. Di- and tripeptides can be synthesized with a system consisting of only aminoacyl phosphate adaptors and RNA guides. Aminoacylation ribozymes that can charge tRNAs with their cognate amino acids have been selected in in vitro experimentation.
Early functional peptides
The first proteins had to arise without a fully-fledged system of
protein biosynthesis. Random sequence peptides would not have had
biological function. Thus, significant study has gone into exploring how
early functional proteins could have arisen from random sequences.
Evidence on hydrolysis rates shows that abiotically plausible peptides
likely contained significant "nearest-neighbor" biases. This could have had some effect on early protein sequence diversity. A search found that approximately 1 in 1011 random sequences had ATP binding function.
Starting with the work of Carl Woese from 1977, genomics
studies have placed the last universal common ancestor (LUCA) of all
modern life-forms between Bacteria and a clade formed by Archaea and Eukaryota in the phylogenetic tree of life. It lived over 4 Gya. A minority of studies have placed the LUCA in Bacteria, proposing that
Archaea and Eukaryota are evolutionarily derived from within Eubacteria; Thomas Cavalier-Smith suggested in 2006 that the phenotypically diverse bacterial phylum Chloroflexota contained the LUCA.
In 2016, a set of 355 genes likely present in the LUCA was
identified. A total of 6.1 million prokaryotic genes from Bacteria and
Archaea were sequenced, identifying 355 protein clusters from among
286,514 protein clusters that were probably common to the LUCA. The
results suggest that the LUCA was anaerobic with a Wood–Ljungdahl (reductive Acetyl-CoA) pathway, nitrogen- and carbon-fixing, thermophilic. Its cofactors suggest dependence upon an environment rich in hydrogen, carbon dioxide, iron, and transition metals.
Its genetic material was probably DNA, requiring the 4-nucleotide
genetic code, messenger RNA, transfer RNA, and ribosomes to translate
the code into proteins such as enzymes. LUCA likely inhabited an
anaerobic hydrothermal vent setting in a geochemically active
environment. It was evidently already a complex organism, and must have
had precursors; it was not the first living thing. The physiology of LUCA has been in dispute.Previous research identified 60 proteins common to all life. Metabolic reactions inferred in LUCA are the incomplete reverse Krebs cycle, gluconeogenesis, the pentose phosphate pathway, glycolysis, reductive amination, and transamination.
A variety of geologic and environmental settings have been proposed
for an origin of life. These theories are often in competition with one
another as there are many views of prebiotic compound availability,
geophysical setting, and early life characteristics. The first organism
on Earth likely differed from LUCA.
Between the first appearance of life and where all modern phylogenies
began branching, an unknown amount of time passed, with unknown gene
transfers, extinctions, and adaptation to environmental niches. Modern phylogenies provide more genetic evidence about LUCA than about its precursors.
Early micro-fossils may have come from a hot world of gases such as methane, ammonia, carbon dioxide, and hydrogen sulfide, toxic to much current life. Analysis of the tree of life
places thermophilic and hyperthermophilic bacteria and archaea closest
to the root, suggesting that life may have evolved in a hot environment. The deep sea or alkaline hydrothermal vent theory posits that life began at submarine hydrothermal vents. William Martin and Michael Russell have suggested that this could have been in metal-sulphide-walled compartments acting as precursors for cell walls.
These form where hydrogen-rich fluids emerge from below the sea floor, as a result of serpentinization of ultra-maficolivine
with seawater and a pH interface with carbon dioxide-rich ocean water.
The vents form a sustained chemical energy source derived from redox
reactions, in which electron donors (molecular hydrogen) react with
electron acceptors (carbon dioxide); see iron–sulfur world theory. These are exothermic reactions.
Proposed
model of an early cell powered by external proton gradient near a
deep-sea hydrothermal vent. As long as the membrane (or passive ion
channels within it) is permeable to protons, the mechanism can function
without ion pumps.
Russell demonstrated that alkaline vents create an abiogenic proton motive force chemiosmotic gradient, ideal for abiogenesis. Their microscopic compartments "provide a
natural means of concentrating organic molecules," composed of
iron-sulfur minerals such as mackinawite, endowed these mineral cells with the catalytic properties envisaged by Günter Wächtershäuser. This movement of ions across the membrane depends on two factors:
Diffusion force caused by concentration gradient—all particles including ions diffuse from higher concentration to lower.
Electrostatic force caused by electrical potential gradient—cations like protons H+ diffuse down the electrical potential, anions in the opposite direction.
These two gradients together can be expressed as an electrochemical
gradient, providing energy for abiogenic synthesis. The proton motive
force measures the potential energy stored as proton and voltage
gradients across a membrane (differences in proton concentration and
electrical potential).
The surfaces of mineral particles inside deep-ocean hydrothermal
vents have catalytic properties similar to those of enzymes, and can
create simple organic molecules, such as methanol (CH3OH) and formic, acetic, and pyruvic acids out of the dissolved CO2 in the water, if driven by an applied voltage or by reaction with H2 or H2S.
Starting in 1981, researchers proposed that life might have started at hydrothermal vents, that spontaneous chemistry in the Earth's crust driven by rock–water
interactions at disequilibrium thermodynamically underpinned life's
origin,and that the founding lineages of the archaea and bacteria were H2-dependent autotrophs that used CO2 as their terminal acceptor in energy metabolism. In 2016, Martin suggested that the LUCA "may have depended heavily on the geothermal energy of the vent to survive". That same year, RNA was produced in synthetic alkaline hydrothermal chimneys simulating deep-sea vents. Researchers were able to generate RNA oligomers of up to 4 units in
length. This RNA was synthesized using activated ribonucleotides.
Additionally, these RNA oligomers could only be synthesized under
certain conditions.
Pores at deep sea hydrothermal vents are suggested to have been
occupied by membrane-bound compartments which promoted biochemical
reactions. Metabolic intermediates in the Krebs cycle, gluconeogenesis, amino acid
bio-synthetic pathways, glycolysis, the pentose phosphate pathway, and
including sugars like ribose, and lipid precursors can occur
non-enzymatically at conditions relevant to deep-sea alkaline
hydrothermal vents.
If the deep marine hydrothermal setting was the site, then life could have arisen as early as 4.0–4.2 Gya.
If life evolved in the ocean at depths of more than ten meters, it
would have been shielded both from impacts and the then high levels of
solar ultraviolet radiation. The available energy in hydrothermal vents
is maximized at 100–150 °C, the temperatures at which hyperthermophilic bacteria and thermoacidophilicarchaea live.
Arguments against a vent setting
Arguments against a hydrothermal origin of life state that hyperthermophily was a result of convergent evolution in bacteria and archaea, and that a mesophilic environment is more likely.
Production of prebiotic organic compounds at hydrothermal vents is estimated to be 108 kg/yr. While a large amount of key prebiotic compounds, such as methane, are
found at vents, they are in far lower concentrations than in a
Miller-Urey Experiment environment. Additionally, some organic compounds
originally thought to have been formed at vents are now understood to
have been formed by other geological processes and later inherited by
vents. Methane at alkaline vents, for example, was once thought to have
been synthesized from catalytic synthesis after serpentinization, but is
now understood to more likely come from leached fluid inclusions formed
deeper in oceanic crust from magmatic carbon. The concentrations of methane the rate is 2–4 orders of magnitude lower than those in Miller-Urey experiments.
Other counter-arguments include the inability to concentrate
prebiotic materials, due to strong dilution by seawater. This open
system cycles compounds through vent minerals, leaving little residence
time to accumulate. All modern cells rely on phosphates and potassium for nucleotide
backbone and protein formation respectively, making it likely that the
first life forms shared these functions. These elements were not
available in high quantities in the Archaean oceans, as both primarily
come from the weathering of continental rocks on land, far from vents,
and phosphate is lost into relatively insoluble apatite (calcium
phosphate). However, phosphate can be concentrated in lakes, and modern
analogs exist, such as the most phosphate-rich natural body of water in
the world, Last Chance Lake, Canada. Submarine hydrothermal vents are not conducive to condensation reactions needed for polymerisation of macromolecules.
An older argument was that key polymers were encapsulated in
vesicles after condensation, which supposedly would not happen in
saltwater. However, while salinity inhibits vesicle formation from
low-diversity mixtures of fatty acids, vesicle formation from a broader, more realistic mix of fatty-acid and 1-alkanol species is more resilient.
Importantly, no studies to date have been able to experimentally
demonstrate synthesis of de novo sugars, amino acids, nucleases,
nucleosides, nucleotides, or membrane-forming fatty acids under
plausible vent conditions.
Surface bodies of water provide environments that dry out and rewet. Wet-dry cycles concentrate prebiotic compounds and enable condensation reactions to polymerise macromolecules. Moreover, lakes and ponds receive detrital input from weathering of continental apatite-containing
rocks, the most common source of phosphates. The amount of exposed
continental crust in the Hadean is unknown, but models of early ocean
depths and rates of ocean island and continental crust growth make it
plausible that there was exposed land. Another line of evidence for a surface start to life is the requirement for Ultraviolet radiation (UV) for organism function. UV is necessary for the formation of the U+C nucleotide base pair by partial hydrolysis and nucleobase loss.[260]
Simultaneously, UV can be harmful and sterilising to life, especially
for simple early lifeforms with little ability to repair radiation
damage. Radiation levels from a young Sun were likely greater, and, with
no ozone layer,
harmful shortwave UV rays would reach the surface of Earth. For life to
begin, a shielded environment with influx from UV-exposed sources is
necessary to both benefit and protect from UV. Shielding under ice,
liquid water, mineral surfaces (e.g. clay) or regolith is possible in a
range of surface water settings.
Most branching phylogenies are thermophilic or hyperthermophilic,
making it possible that LUCA and preceding lifeforms were similarly
thermophilic. Hot springs are formed from the heating of groundwater by
geothermal activity. This intersection allows for influxes of material
from deep penetrating waters and from surface runoff that transports
eroded continental sediments. Interconnected groundwater systems create a
mechanism for distribution of life to wider area.
Mulkidjanian and co-authors argue that marine environments did
not provide the ionic balance and composition universally found in
cells, or the ions required by essential proteins and ribozymes,
especially with respect to high K+/Na+ ratio, Mn2+, Zn2+ and phosphate concentrations. They argue that the only environments that do this are hot springs similar to ones at Kamchatka. Mineral deposits in these environments under an anoxic atmosphere would
have suitable pH, contain precipitates of photocatalytic sulfide
minerals that absorb harmful ultraviolet radiation, and have wet-dry
cycles that concentrate substrate solutions enough for spontaneous
formation of biopolymers created both by chemical reactions in the hydrothermal environment, and by exposure to UV light during transport from vents to adjacent pools. The hypothesized pre-biotic environments are similar to hydrothermal
vents, with additional components that help explain peculiarities of the
LUCA.
A phylogenomic and geochemical analysis of proteins plausibly
traced to the LUCA shows that the ionic composition of its intracellular
fluid is identical to that of hot springs. The LUCA likely was
dependent upon synthesized organic matter for its growth. Experiments show that RNA-like polymers can be synthesized in wet-dry
cycling and UV light exposure. These polymers were encapsulated in
vesicles after condensation. Potential sources of organics at hot springs might have been transport
by interplanetary dust particles, extraterrestrial projectiles, or
atmospheric or geochemical synthesis. Hot springs could have been
abundant in volcanic landmasses during the Hadean.
Temperate surface bodies of water
A mesophilic start in surface bodies of waters hypothesis has evolved from Darwin's concept of a 'warm little pond' and the Oparin-Haldane hypothesis.
Freshwater bodies under temperate climates can accumulate prebiotic
materials while providing suitable environmental conditions conducive to
simple life forms. The Archaean climate is uncertain. Atmospheric
reconstructions from geochemical proxies and models suggest that
sufficient greenhouse gases were present to maintain surface
temperatures between 0–40 °C. If so, the temperature was suitable for
life could begin.
Evidence for mesophily from biomolecular studies includes Galtier's G+C
nucleotide thermometer. G+C are more abundant in thermophiles due to
the added stability of an additional hydrogen bond not present between
A+T nucleotides. rRNA sequencing of modern lifeforms shows that LUCA's reconstructed G+C content was likely representative of moderate temperatures.
The diversity of thermophiles today could be a product of
convergent evolution and horizontal gene transfer rather than an
inherited trait from LUCA.[267] The reverse gyrasetopoisomerase is found exclusively in thermophiles and hyperthermophiles, as it allows for coiling of DNA. This enzyme requires the complex molecule ATP
to function. If an origin of life is hypothesised to involve a simple
organism that had not yet evolved a membrane, let alone ATP, this would
make the existence of reverse gyrase improbable. Moreover, phylogenetic
studies show that reverse gyrase originated in archaea, and transferred
to bacteria by horizontal gene transfer, implying it was not present in
the LUCA.
Icy surface bodies of water
Cold-start theories presuppose large ice-covered regions. Stellar
evolution models predict that the Sun's luminosity was ≈25% weaker than
it is today. Fuelner states that although this significant decrease in
solar energy would have formed an icy planet, there is strong evidence
for the presence of liquid water, possibly driven by a greenhouse
effect. This would mean an early Earth with both liquid oceans and icy
poles.
Ice melts that form from ice sheets or glacier melts create
freshwater pools, another niche capable of wet-dry cycles. While surface
pools would be exposed to intense UV radiation, bodies of water within
and under ice would be shielded, while remaining connected to exposed
areas through ice cracks. Impact melting would allow freshwater and
meteoritic input, creating prebiotic components. Near-seawater levels of sodium chloride destabilize fatty acid membrane
self-assembly, making freshwater settings appealing for early
membranous life.
Icy environments would trade the faster reaction rates that occur
in warm environments for increased stability and accumulation of larger
polymers. Experiments simulating Europa-like conditions of ≈20 °C have
synthesised amino acids and adenine, showing that Miller-Urey type
syntheses can occur at low temperatures. In an RNA world,
the ribozyme would have had even more functions than in a later
DNA-RNA-protein-world. For RNA to function, it must be able to fold, a
process hindered by temperatures above 30 °C. While RNA folding in psychrophilic
organisms is slower, so is hydrolysis, so folding is more successful.
Shorter nucleotides would not suffer from higher temperatures.
Inside the continental crust
An alternative geological environment has been proposed by the
geologist Ulrich Schreiber and the physical chemist Christian Mayer: the
continental crust. Tectonic fault
zones could present a stable and well-protected environment for
long-term prebiotic evolution. Inside these systems of cracks and
cavities, water and carbon dioxide present the bulk solvents. Their
phase state could vary between liquid, gaseous and supercritical,
depending on pressure and temperature. When forming two separate phases
(e.g. liquid water and supercritical carbon dioxide in depths of little
more than 1 km), the system provides optimal conditions for phase transfer reactions.
Concurrently, the contents of the tectonic fault zones are being
supplied by a multitude of inorganic educts (e.g. carbon monoxide,
hydrogen, ammonia, hydrogen cyanide, nitrogen, and even phosphate from
dissolved apatite) and simple organic molecules formed by hydrothermal
chemistry (e.g. amino acids, long-chain amines, fatty acids, long-chain
aldehydes).
Part of the tectonic fault zones is at a depth of around 1000 m.
For the carbon dioxide part of the bulk solvent, it provides temperature
and pressure conditions near the phase transition point between the supercritical and the gaseous state. This allows lipophilic organic molecules that dissolve well in supercritical CO2 to accumulate, but not in its gaseous state, leading to their local precipitation. Periodic pressure variations such as caused by geysers or tidal influences result in periodic phase transitions, keeping the local reaction environment in a constant non-equilibrium state. In presence of amphiphilic compounds (such as the long chain amines and fatty acids), subsequent generations of vesicles are formed that are constantly selected for their stability.
Many biomolecules, such as L-glutamic acid, are asymmetric, and occur in living systems in only one of the two possible forms, in the case of amino acids the left-handed form. Prebiotic chemistry would produce both forms, creating a puzzle for abiogenesis researchers.
Homochirality is the uniformity of materials composed of chiral (non-mirror-symmetric) units. Living organisms use molecules with the same chirality: with almost no exceptions, amino acids are left-handed while nucleotides and sugars
are right-handed. Chiral molecules can be synthesized, but in the
absence of a chiral source or a chiral catalyst, they are formed in a
50/50 (racemic) mixture of both forms. Non-racemic mixtures can arise from racemic materials by asymmetric physical laws such as the electroweak interaction or asymmetric environments such as circularly polarized light.
Once established, chirality would be selected for. A small bias in the population can be amplified by asymmetric autocatalysis, as in the Soai reaction, where a chiral molecule catalyzes its own production.