From Wikipedia, the free encyclopedia
OpenAI is an American artificial intelligence (AI) research laboratory consisting of the non-profit OpenAI Incorporated (OpenAI Inc.) and its for-profit subsidiary corporation OpenAI Limited Partnership (OpenAI LP). OpenAI conducts AI research to promote and develop friendly AI in a way that benefits all humanity. The organization was founded in San Francisco in 2015 by Elon Musk, Sam Altman, Peter Thiel, Reid Hoffman, Jessica Livingston and others, who collectively pledged US$1 billion. Musk resigned from the board in 2018 but remained a donor. Microsoft
provided OpenAI LP a $1 billion investment in 2019 and a second
multi-year investment in January 2023 reported to be $10 billion.
History
In December 2015, Sam Altman, Elon Musk, Greg Brockman, Reid Hoffman, Jessica Livingston, Peter Thiel, Amazon Web Services (AWS), Infosys, and YC Research announced the formation of OpenAI and pledged over US$1 billion
to the venture. The organization stated it would "freely collaborate"
with other institutions and researchers by making its patents and
research open to the public. OpenAI is headquartered at the Pioneer Building in Mission District, San Francisco.
In April 2016, OpenAI released a public beta of "OpenAI Gym", its platform for reinforcement learning research.
In December 2016, OpenAI released "Universe", a software platform for
measuring and training an AI's general intelligence across the world's
supply of games, websites, and other applications.
In 2018, Musk resigned his board seat, citing "a potential future conflict (of interest)" with Tesla AI development for self-driving cars, but remained a donor.
In 2019, OpenAI transitioned from non-profit to "capped" for-profit, with the profit cap set to 100x any investment. The company distributed equity to its employees and partnered with Microsoft and Matthew Brown Companies,
who announced an investment package of $1 billion into the company.
OpenAI then announced its intention to commercially license its
technologies.
In 2020, OpenAI announced GPT-3, a language model trained on large datasets from the Internet. It also announced that an associated API,
named simply "the API", would form the heart of its first commercial
product. GPT-3 is aimed at natural language answering of questions, but
it can also translate between languages and coherently generate
improvised text.
In 2021, OpenAI introduced DALL-E, a deep learning model that can generate digital images from natural language descriptions.
Around December 2022, OpenAI received widespread media coverage after launching a free preview of ChatGPT, its new AI chatbot based on GPT-3.5. According to OpenAI, the preview received over a million signups within the first five days.
According to anonymous sources cited by Reuters in December 2022,
OpenAI was projecting a $200 million revenue for 2023 and $1 billion
revenue for 2024.
As of January 2023, OpenAI was in talks for funding that would
value the company at $29 billion, double the value of the company in
2021.
On January 23, 2023, Microsoft announced a new multi-year,
multi-billion dollar (reported to be $10 billion) investment in OpenAI.
Participants
Key employees:
- CEO and co-founder: Sam Altman, former president of the startup accelerator Y Combinator
- President and co-founder: Greg Brockman, former CTO, 3rd employee of Stripe
- Chief Scientist and co-founder: Ilya Sutskever, a former Google expert on machine learning
- Chief Technology Officer: Mira Murati, previously at Leap Motion and Tesla, Inc.
- Chief Operating Officer: Brad Lightcap, previously at Y Combinator and JPMorgan Chase
Other backers of the project include:
- Reid Hoffman, LinkedIn co-founder
- Peter Thiel, PayPal co-founder
- Jessica Livingston, a founding partner of Y Combinator
The group started in early January 2016 with nine researchers. According to Wired, Brockman met with Yoshua Bengio, one of the "founding fathers" of the deep learning movement, and drew up a list of the "best researchers in the field". Microsoft's Peter Lee stated that the cost of a top AI researcher exceeds the cost of a top NFL
quarterback prospect. While OpenAI pays corporate-level (rather than
nonprofit-level) salaries, it doesn't currently pay AI researchers
salaries comparable to those of Facebook or Google.
Nevertheless, Sutskever stated that he was willing to leave Google for
OpenAI "partly because of the very strong group of people and, to a very
large extent, because of its mission." Brockman stated that "the best
thing that I could imagine doing was moving humanity closer to building
real AI in a safe way." OpenAI researcher Wojciech Zaremba stated that
he turned down "borderline crazy" offers of two to three times his
market value to join OpenAI instead.
Motives
Some scientists, such as Stephen Hawking and Stuart Russell,
have articulated concerns that if advanced AI someday gains the ability
to re-design itself at an ever-increasing rate, an unstoppable "intelligence explosion" could lead to human extinction. Musk characterizes AI as humanity's "biggest existential threat."
OpenAI's founders structured it as a non-profit so that they could
focus its research on making positive long-term contributions to
humanity.
Musk and Altman have stated they are partly motivated by concerns about AI safety and the existential risk from artificial general intelligence.
OpenAI states that "it's hard to fathom how much human-level AI could
benefit society," and that it is equally difficult to comprehend "how
much it could damage society if built or used incorrectly".
Research on safety cannot safely be postponed: "because of AI's
surprising history, it's hard to predict when human-level AI might come
within reach."
OpenAI states that AI "should be an extension of individual human wills
and, in the spirit of liberty, as broadly and evenly distributed as
possible...". Co-chair Sam Altman expects the decades-long project to surpass human intelligence.
Vishal Sikka,
former CEO of Infosys, stated that an "openness" where the endeavor
would "produce results generally in the greater interest of humanity"
was a fundamental requirement for his support, and that OpenAI "aligns
very nicely with our long-held values" and their "endeavor to do
purposeful work". Cade Metz of Wired suggests that corporations such as Amazon
may be motivated by a desire to use open-source software and data to
level the playing field against corporations such as Google and Facebook
that own enormous supplies of proprietary data. Altman states that Y
Combinator companies will share their data with OpenAI.
In 2019, OpenAI became a for-profit company called OpenAI LP to
secure additional funding while staying controlled by a non-profit
called OpenAI Inc in a structure that OpenAI calls "capped-profit", having previously been a 501(c)(3) nonprofit organization.
Strategy
Musk
posed the question: "What is the best thing we can do to ensure the
future is good? We could sit on the sidelines or we can encourage
regulatory oversight, or we could participate with the right structure
with people who care deeply about developing AI in a way that is safe
and is beneficial to humanity." Musk acknowledged that "there is always
some risk that in actually trying to advance (friendly) AI we may create
the thing we are concerned about"; nonetheless, the best defense is "to
empower as many people as possible to have AI. If everyone has AI
powers, then there's not any one person or a small set of individuals
who can have AI superpower."
Musk and Altman's counterintuitive strategy of trying to reduce the risk that AI will cause overall harm by giving AI to everyone is
controversial among those who are concerned with existential risk from artificial intelligence. Philosopher Nick Bostrom
is skeptical of Musk's approach: "If you have a button that could do
bad things to the world, you don't want to give it to everyone."
During a 2016 conversation about the technological singularity, Altman
said that "we don't plan to release all of our source code" and
mentioned a plan to "allow wide swaths of the world to elect
representatives to a new governance board". Greg Brockman stated that
"Our goal right now... is to do the best thing there is to do. It's a
little vague."
Conversely, OpenAI's initial decision to withhold GPT-2, out of a wish to "err on the side of caution" given its potential for misuse, has been criticized by advocates of openness. Delip
Rao, an expert in text generation, stated "I don't think [OpenAI] spent
enough time proving [GPT-2] was actually dangerous." Other critics
argued that open publication is necessary to replicate the research and
to be able to come up with countermeasures.
In the 2017 tax year, OpenAI spent $7.9 million, or a quarter of its functional expenses, on cloud computing alone. In comparison, DeepMind's total expenses in 2017 were much larger, totaling $442 million. In Summer 2018, simply training OpenAI's Dota 2
bots required renting 128,000 CPUs and 256 GPUs from Google for
multiple weeks. According to OpenAI, the capped-profit model adopted in
March 2019 allows OpenAI LP to legally attract investment from venture
funds, and in addition, to grant employees stakes in the company, the
goal being that they can say "I'm going to OpenAI, but in the long term it's not going to be disadvantageous to us as a family." Many top researchers work for Google Brain, DeepMind, or Facebook, which offer stock options that a nonprofit would be unable to match.
In June 2019, OpenAI LP raised a billion dollars from Microsoft, a sum
which OpenAI plans to have spent "within five years, and possibly much
faster".
Altman has stated that even a billion dollars may turn out to be
insufficient, and that the lab may ultimately need "more capital than
any non-profit has ever raised" to achieve artificial general intelligence.
The transition from a nonprofit to a capped-profit company was viewed with skepticism by Oren Etzioni of the nonprofit Allen Institute for AI,
who agreed that wooing top researchers to a nonprofit is difficult, but
stated "I disagree with the notion that a nonprofit can't compete" and
pointed to successful low-budget projects by OpenAI and others. "If
bigger and better funded was always better, then IBM
would still be number one." Following the transition, public disclosure
of the compensation of top employees at OpenAI LP is no longer legally
required. The nonprofit, OpenAI Inc., is the sole controlling shareholder of OpenAI LP. OpenAI LP, despite being a for-profit company, retains a formal fiduciary responsibility to OpenAI Inc.'s nonprofit charter. A majority of OpenAI Inc.'s board is barred from having financial stakes in OpenAI LP. In addition, minority members with a stake in OpenAI LP are barred from certain votes due to conflict of interest.
Some researchers have argued that OpenAI LP's switch to for-profit
status is inconsistent with OpenAI's claims to be "democratizing" AI. A journalist at Vice News wrote that "generally, we've never been able to rely on venture capitalists to better humanity".
Products and applications
OpenAI's research tends to focus on reinforcement learning (RL). OpenAI is viewed as an important competitor to DeepMind.
Gym
Gym aims to provide an easy-to-set-up, general-intelligence benchmark with a wide variety of environments—somewhat akin to, but broader than, the ImageNet Large Scale Visual Recognition Challenge used in supervised learning research—and hopes to standardize the way environments are defined in AI research publications, so that published research becomes more easily reproducible. The project claims to provide the user with a simple interface. As of June 2017, Gym can only be used with Python. As of September 2017, the Gym documentation site was not maintained, and active work focused instead on its GitHub page.
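The interface Gym standardizes is deliberately small: an environment exposes a reset method that returns an initial observation, and a step method that returns an observation, a reward, a done flag, and diagnostic info. A minimal sketch of that contract, using a hypothetical toy environment rather than Gym itself:

```python
import random

class CoinFlipEnv:
    """Toy environment following the reset/step interface that Gym
    standardizes: reset() returns an initial observation, and
    step(action) returns (observation, reward, done, info)."""

    def __init__(self, episode_length=10):
        self.episode_length = episode_length
        self.t = 0

    def reset(self):
        self.t = 0
        self.state = random.randint(0, 1)  # observation: 0 or 1
        return self.state

    def step(self, action):
        # reward 1 if the agent's guess matches the hidden state
        reward = 1.0 if action == self.state else 0.0
        self.t += 1
        done = self.t >= self.episode_length
        self.state = random.randint(0, 1)
        return self.state, reward, done, {}

# a random agent interacting through the standard loop
env = CoinFlipEnv()
obs, total_reward, done = env.reset(), 0.0, False
while not done:
    obs, reward, done, info = env.step(random.choice([0, 1]))
    total_reward += reward
print(total_reward)
```

Because every environment exposes the same loop, an agent written against this interface can be benchmarked across many environments unchanged, which is what makes the platform useful for reproducibility.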
In "RoboSumo", virtual humanoid "metalearning"
robots initially lack knowledge of how to even walk, and are given the
goals of learning to move around, and pushing the opposing agent out of
the ring. Through this adversarial learning process, the agents learn
how to adapt to changing conditions; when an agent is then removed from
this virtual environment and placed in a new virtual environment with
high winds, the agent braces to remain upright, suggesting it had
learned how to balance in a generalized way.
OpenAI's Igor Mordatch argues that competition between agents can
create an intelligence "arms race" that can increase an agent's ability
to function, even outside the context of the competition.
Debate Game
In
2018, OpenAI launched the Debate Game, which teaches machines to debate
toy problems in front of a human judge. The purpose is to research
whether such an approach may assist in auditing AI decisions and in
developing explainable AI.
Dactyl
Dactyl uses machine learning to train a Shadow Hand,
a human-like robot hand, to manipulate physical objects. It learns
entirely in simulation using the same RL algorithms and training code as
OpenAI Five. OpenAI tackled the object orientation problem by using domain randomization,
a simulation approach which exposes the learner to a variety of
experiences rather than trying to fit to reality. The set-up for Dactyl,
aside from having motion tracking cameras, also has RGB cameras to
allow the robot to manipulate an arbitrary object by seeing it. In 2018,
OpenAI showed that the system was able to manipulate a cube and an
octagonal prism.
In 2019, OpenAI demonstrated that Dactyl could solve a Rubik's Cube.
The robot was able to solve the puzzle 60% of the time. Objects like
the Rubik's Cube introduce complex physics that is harder to model.
OpenAI solved this by improving the robustness of Dactyl to
perturbations; they employed a technique called Automatic Domain
Randomization (ADR), a simulation approach where progressively more
difficult environments are endlessly generated. ADR differs from manual
domain randomization by not needing there to be a human to specify
randomization ranges.
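The idea can be sketched as a training loop in which the width of a randomization range is adjusted automatically from measured performance; the function names and thresholds below are illustrative assumptions, not OpenAI's implementation:

```python
import random

def train_episode(range_width):
    """Stand-in for one training episode at the edge of the current
    randomization range; returns True on success. (Hypothetical: real
    ADR measures the actual policy's performance in simulation.)"""
    return random.random() < 1.0 / (1.0 + range_width)

def automatic_domain_randomization(steps=1000, window=50,
                                   expand_at=0.6, shrink_at=0.2,
                                   increment=0.05):
    """Minimal ADR loop: the randomization range for an environment
    parameter is widened whenever recent performance at its boundary
    is good enough, and narrowed when it is poor, so no human ever
    specifies the ranges by hand."""
    range_width = 0.1  # start with a narrow, easy range
    results = []
    for _ in range(steps):
        results.append(train_episode(range_width))
        if len(results) == window:
            success_rate = sum(results) / window
            if success_rate >= expand_at:
                range_width += increment            # environments get harder
            elif success_rate <= shrink_at:
                range_width = max(0.0, range_width - increment)
            results = []
    return range_width
```

The loop naturally settles near the widest range the policy can still handle, which is how progressively more difficult environments are "endlessly generated".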
Generative models
GPT
The original paper on generative pre-training (GPT) of a language
model was written by Alec Radford and his colleagues, and published in
preprint on OpenAI's website on June 11, 2018. It showed how a generative model
of language is able to acquire world knowledge and process long-range
dependencies by pre-training on a diverse corpus with long stretches of
contiguous text.
GPT-2
An instance of GPT-2 writing a paragraph based on a prompt from its own Wikipedia article in February 2021
Generative Pre-trained Transformer 2, commonly known by its abbreviated form GPT-2, is an unsupervised transformer language model
and the successor to GPT. GPT-2 was first announced in February 2019,
with only limited demonstrative versions initially released to the
public. The full version of GPT-2 was not immediately released out of
concern over potential misuse, including applications for writing fake news.
Some experts expressed skepticism that GPT-2 posed a significant
threat. The Allen Institute for Artificial Intelligence responded to
GPT-2 with a tool to detect "neural fake news".
Other researchers, such as Jeremy Howard, warned of "the technology to
totally fill Twitter, email, and the web up with reasonable-sounding,
context-appropriate prose, which would drown out all other speech and be
impossible to filter". In November 2019, OpenAI released the complete version of the GPT-2 language model. Several websites host interactive demonstrations of different instances of GPT-2 and other transformer models.
GPT-2's authors argue that unsupervised language models are general-purpose learners, as illustrated by GPT-2 achieving state-of-the-art accuracy and perplexity on 7 of 8 zero-shot tasks (i.e. the model was not further trained on any task-specific input-output examples). The corpus it was trained on, called WebText, contains slightly over 8 million documents for a total of 40 GB of text from URLs shared in Reddit submissions with at least 3 upvotes. It avoids certain issues of encoding vocabulary with word tokens by using byte pair encoding, which allows any string of characters to be represented by encoding both individual characters and multiple-character tokens.
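Byte pair encoding builds its vocabulary by repeatedly merging the most frequent pair of adjacent symbols into a new symbol. A minimal sketch of the merge-learning step on a toy corpus (GPT-2's actual tokenizer operates on bytes and a vastly larger corpus):

```python
from collections import Counter

def learn_bpe_merges(words, num_merges):
    """Learn byte-pair-encoding merges: repeatedly replace the most
    frequent adjacent symbol pair with a single merged symbol."""
    # each word starts as a sequence of single characters
    vocab = Counter(tuple(w) for w in words)
    merges = []
    for _ in range(num_merges):
        # count every adjacent symbol pair, weighted by word frequency
        pairs = Counter()
        for symbols, freq in vocab.items():
            for a, b in zip(symbols, symbols[1:]):
                pairs[(a, b)] += freq
        if not pairs:
            break
        best = max(pairs, key=pairs.get)
        merges.append(best)
        merged = best[0] + best[1]
        # rewrite every word with the new merged symbol
        new_vocab = Counter()
        for symbols, freq in vocab.items():
            out, i = [], 0
            while i < len(symbols):
                if i < len(symbols) - 1 and (symbols[i], symbols[i + 1]) == best:
                    out.append(merged)
                    i += 2
                else:
                    out.append(symbols[i])
                    i += 1
            new_vocab[tuple(out)] += freq
        vocab = new_vocab
    return merges

print(learn_bpe_merges(["low", "lower", "lowest", "low"], 2))
```

After enough merges, frequent words become single tokens while rare strings fall back to shorter fragments or single characters, so any input remains representable.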
GPT-3
Generative Pre-trained Transformer 3, commonly known by its abbreviated form GPT-3, is an unsupervised transformer language model and the successor to GPT-2. It was first described in May 2020. OpenAI stated that the full version of GPT-3 contains 175 billion parameters, two orders of magnitude more than the 1.5 billion parameters in the full version of GPT-2 (although GPT-3 models with as few as 125 million parameters were also trained).
OpenAI stated that GPT-3 succeeds at certain "meta-learning" tasks. It can generalize the purpose of a single input-output pair.
The paper gives an example of translation and cross-linguistic transfer
learning between English and Romanian, and between English and German.
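This "meta-learning" behavior amounts to specifying a task entirely in the prompt as input-output pairs, with no gradient updates; the model infers the task from the examples and continues the pattern. A sketch of such a few-shot prompt (the English-French pairs here are illustrative):

```python
# Few-shot task specification: the task is described only by
# input-output examples placed in the prompt itself; the model is
# never fine-tuned on them. (Illustrative prompt, not from the paper.)
examples = [
    ("sea otter", "loutre de mer"),
    ("cheese", "fromage"),
]
prompt = "Translate English to French:\n"
for english, french in examples:
    prompt += f"{english} => {french}\n"
prompt += "plush giraffe =>"  # the new input the model should complete
print(prompt)
```

Generalizing the purpose of a single input-output pair, as described above, corresponds to the one-shot case of this format.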
GPT-3 dramatically improved benchmark results over GPT-2. OpenAI
cautioned that such scaling up of language models could be approaching
or encountering the fundamental capability limitations of predictive
language models. Pre-training GPT-3 required several thousand petaflop/s-days of compute, compared to tens of petaflop/s-days for the full GPT-2 model. Like that of its predecessor,
GPT-3's fully trained model was not immediately released to the public
on the grounds of possible abuse, though OpenAI planned to allow access
through a paid cloud API after a two-month free private beta that began in June 2020.
On September 23, 2020, GPT-3 was licensed exclusively to Microsoft.
ChatGPT
ChatGPT is an artificial intelligence tool that provides a conversational interface, allowing users to ask questions in natural language. The system then responds with an answer within seconds. ChatGPT was launched in November 2022 and reached 1 million users only 5 days after its initial launch.
Music
OpenAI's MuseNet (2019) is a deep neural net trained to predict subsequent musical notes in MIDI music files. It can generate songs with ten different instruments in fifteen different styles. According to The Verge, a song generated by MuseNet tends to start reasonably but then fall into chaos the longer it plays.
OpenAI's Jukebox (2020) is an open-sourced algorithm to generate music
with vocals. After training on 1.2 million samples, the system accepts a
genre, artist, and a snippet of lyrics and outputs song samples. OpenAI
stated the songs "show local musical coherence, follow traditional
chord patterns" but acknowledged that the songs lack "familiar larger
musical structures such as choruses that repeat" and that "there is a
significant gap" between Jukebox and human-generated music. The Verge
stated "It's technologically impressive, even if the results sound like
mushy versions of songs that might feel familiar", while Business Insider stated "surprisingly, some of the resulting songs are catchy and sound legitimate".
Whisper
Whisper
is a general-purpose speech recognition model. It is trained on a large
dataset of diverse audio and is also a multi-task model that can
perform multilingual speech recognition as well as speech translation
and language identification.
API
In June 2020, OpenAI announced a multi-purpose API
which it said was "for accessing new AI models developed by OpenAI" to
let developers call on it for "any English language AI task."
DALL-E and CLIP
Images
produced by DALL-E when given the text prompt "a professional
high-quality illustration of a giraffe dragon chimera. a giraffe
imitating a dragon. a giraffe made of dragon."
DALL-E is a Transformer model that creates images from textual descriptions, revealed by OpenAI in January 2021.
CLIP does the opposite: it creates a description for a given image.
DALL-E uses a 12-billion-parameter version of GPT-3 to interpret
natural language inputs (such as "a green leather purse shaped like a
pentagon" or "an isometric view of a sad capybara") and generate
corresponding images. It can create images of realistic objects ("a
stained-glass window with an image of a blue strawberry") as well as
objects that do not exist in reality ("a cube with the texture of a
porcupine"). As of March 2021, no API or code is available.
In March 2021, OpenAI released a paper titled Multimodal Neurons in Artificial Neural Networks,
where they showed a detailed analysis of CLIP (and GPT) models and
their vulnerabilities. The new type of attacks on such models was
described in this work.
We refer to these attacks as
typographic attacks. We believe attacks such as those described above
are far from simply an academic concern. By exploiting the model's
ability to read text robustly, we find that even photographs of
hand-written text can often fool the model.
— Multimodal Neurons in Artificial Neural Networks, OpenAI
In April 2022, OpenAI announced DALL-E 2, an updated version of the model with more realistic results.
In December 2022, OpenAI published on GitHub software for Point-E, a
new rudimentary system for converting a text description into a
3-dimensional model.
Microscope
OpenAI Microscope
is a collection of visualizations of every significant layer and neuron
of eight different neural network models which are often studied in
interpretability. Microscope was created to make it easier to analyze the features that form inside these neural networks.
The models included are AlexNet, VGG 19, different versions of Inception, and different versions of CLIP Resnet.
Codex
OpenAI Codex is a descendant of GPT-3 that has additionally been trained on code from 54 million GitHub repositories. It was announced in mid-2021 as the AI powering the code autocompletion tool GitHub Copilot. In August 2021, an API was released in private beta.
According to OpenAI, the model is able to create working code in over a
dozen programming languages, most effectively in Python.
Several issues with glitches, design flaws, and security vulnerabilities have been brought up.
Video game bots and benchmarks
OpenAI Five
OpenAI Five is the name of a team of five OpenAI-curated bots that are used in the competitive five-on-five video game Dota 2,
which learn to play against human players at a high skill level entirely
through trial-and-error algorithms. Before becoming a team of five, the
first public demonstration occurred at The International 2017, the annual premiere championship tournament for the game, where Dendi, a professional Ukrainian player, lost against a bot in a live 1v1 matchup. After the match, CTO Greg Brockman explained that the bot had learned by playing against itself for two weeks of real time, and that the learning software was a step in the direction of creating software that can handle complex tasks like a surgeon. The system uses a form of reinforcement learning,
as the bots learn over time by playing against themselves hundreds of
times a day for months, and are rewarded for actions such as killing an
enemy and taking map objectives.
By June 2018, the ability of the bots expanded to play together
as a full team of five, and they were able to defeat teams of amateur
and semi-professional players. At The International 2018, OpenAI Five played in two exhibition matches against professional players, but ended up losing both games. In April 2019, OpenAI Five defeated OG, the reigning world champions of the game at the time, 2:0 in a live exhibition match in San Francisco.
The bots' final public appearance came later that month, where they
played in 42,729 total games in a four-day open online competition,
winning 99.4% of those games.
Gym Retro
Gym
Retro is a platform for RL research on video games. Gym Retro is used to
research RL algorithms and study generalization. Prior research in RL
has focused chiefly on optimizing agents to solve single tasks. Gym
Retro gives the ability to generalize between games with similar
concepts but different appearances.