As with many companies over the last couple of years, InfoSys is seeing a major shift away from “big data” toward an emphasis on machine learning and AI research. But unlike its competitors, which are investing heavily in proprietary solutions such as Microsoft’s Azure Machine Learning Studio, InfoSys decided that a cooperative approach would be more efficient.
The result of this decision is OpenAI, a non-profit artificial intelligence research company. Officially launched in December 2015, the research group has a billion dollars in committed funding from InfoSys, Amazon Web Services, and several private donors.
The reason we’re talking about OpenAI today is that they just released the public beta of OpenAI Gym. This toolkit is used to develop and compare reinforcement learning (RL) algorithms, a cornerstone of modern machine learning research. The announcement cites two main reasons for focusing on reinforcement learning:
RL is very general, encompassing all problems that involve making a sequence of decisions: for example, controlling a robot's motors so that it's able to run and jump, making business decisions like pricing and inventory management, or playing video games and board games. RL can even be applied to supervised learning problems with sequential or structured outputs.
RL algorithms have started to achieve good results in many difficult environments. RL has a long history, but until recent advances in deep learning, it required lots of problem-specific engineering. DeepMind's Atari results, BRETT from Pieter Abbeel's group, and AlphaGo all used deep RL algorithms which did not make too many assumptions about their environment, and thus can be applied in other settings.
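To make the “sequence of decisions” framing concrete, here is a toy sketch (our own illustration, not code from the announcement) of tabular Q-learning on a five-state chain, where an agent has to learn to walk right to reach a reward:

```python
# Toy illustration of sequential decision making: tabular Q-learning on a
# tiny chain of states. All names and numbers here are illustrative only.
import random

N_STATES = 5          # states 0..4; reaching state 4 ends the episode
ACTIONS = [-1, +1]    # step left or step right
ALPHA, GAMMA, EPSILON = 0.1, 0.9, 0.1

# Q[state][action_index] -> estimated long-term value of taking that action there
Q = [[0.0, 0.0] for _ in range(N_STATES)]

for episode in range(500):
    state = 0
    while state != N_STATES - 1:
        # epsilon-greedy: explore occasionally (and on ties), otherwise exploit
        if random.random() < EPSILON or Q[state][0] == Q[state][1]:
            a = random.randrange(2)
        else:
            a = 0 if Q[state][0] > Q[state][1] else 1
        next_state = min(max(state + ACTIONS[a], 0), N_STATES - 1)
        reward = 1.0 if next_state == N_STATES - 1 else 0.0
        # Q-learning update: nudge the estimate toward reward + discounted best future value
        Q[state][a] += ALPHA * (reward + GAMMA * max(Q[next_state]) - Q[state][a])
        state = next_state

print(Q)  # value estimates grow toward the rewarding end of the chain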
Currently, RL research is hampered by the need for better benchmarks and by the “lack of standardization of environments used in publications”. As you can imagine, it is hard to reproduce another scientist’s results when their paper presumes access to a proprietary set of tools, or worse, to an internally built toolkit that isn’t available at any price.
An important aspect of machine learning research is having an experimental environment to work in. Not only is there a significant development cost in creating such an environment, but one also can’t meaningfully compare two algorithms unless they share a common environment. So out of the box, OpenAI Gym offers these environments: Classic control, Toy text, Algorithmic, Atari (based on the Arcade Learning Environment), Board games, and 2D/3D robots. (The last one requires a MuJoCo physics engine license.)
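The interaction pattern is the same across all of these environments. Here is a minimal sketch, following Gym’s documented interface (after a pip install gym), of a random agent flailing away at the classic-control CartPole task:

```python
# Minimal agent-environment loop with OpenAI Gym: a random agent on CartPole.
import gym

env = gym.make("CartPole-v0")   # one of the built-in classic control environments
observation = env.reset()        # start a new episode, get the initial observation

for _ in range(1000):
    env.render()                                # draw the cart and pole
    action = env.action_space.sample()          # a random (not learned) action
    observation, reward, done, info = env.step(action)  # apply it, observe the result
    if done:                                    # the pole fell over or the episode timed out
        observation = env.reset()
```

Swapping in a real RL algorithm just means replacing the random action with one chosen by the agent and feeding the returned observation and reward back into its learning rule.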
OpenAI Gym currently supports Python 2.7 on Linux and OSX. If there is sufficient interest, Python 3 and Windows support will also be considered. The code is offered under the MIT license.