InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Enhancing AI Capabilities: Google Cloud Integrates Vector Search in Managed Databases
Google Cloud recently added support for the pgvector on Cloud SQL for PostgreSQL and AlloyDB for PostgreSQL. The extension brings vector search operations to the managed databases, allowing developers to store vector embeddings generated by large language models (LLMs) and perform similarity searches.
-
OpenAI Introduces Superalignment to Address Rogue Superintelligent AI
OpenAI announced the formation of a specialized Superalignment team with the objective of preventing the emergence of rogue Superintelligent AI. OpenAI highlighted the need to align AI systems with human values and emphasized the importance of proactive measures to prevent potential harm.
-
UC Berkeley Researchers Open-Source API-Calling Language Model Gorilla
Researchers from UC Berkeley and Microsoft have open-sourced Gorilla, a large language model (LLM) that can write code to call APIs. In experiments measuring generated code accuracy, Gorilla outperforms several baseline models, including GPT-4.
-
How LinkedIn Serves over 4.8 Million Member Profiles per Second
LinkedIn introduced Couchbase as a centralized caching tier for scaling member profile reads to handle increasing traffic that has outgrown their existing database cluster. The new solution achieved over 99% hit rate, helped reduce tail latencies by more than 60% and costs by 10% annually.
-
Microsoft Empowers Government Agencies with Secure Access to Generative AI Capabilities
Microsoft continues to prioritize the development of cloud services that align with US regulatory standards and cater to government requirements for security and compliance. The latest addition to their tools is the integration of generative AI capabilities through Microsoft Azure OpenAI Service, which aims to enhance government agencies' efficiency, productivity, and data insights.
-
QCon New York 2023: Living on the Edge with Erica Pisani
Erica Pisani, Sr. software engineer at Netlify, presented Living on the Edge at QCon New York 2023. Pisani discussed: edge computing anatomy; communication between users, edge locations and origin servers; various edge computing challenges with corresponding solutions; and introduced the AWS Snowball Edge Device. Pisani spoke to InfoQ to learn more about edge computing.
-
Microsoft Guidance Offers Language for Controlling Large Language Models
Microsoft has recently introduced a domain-specific language called Guidance, to improve developers' ability to manage contemporary language models. The new framework integrates aspects such as generation, prompting, and logical control into a unified process for developers. The inclusion of regex pattern guides ensures the enforcement of formats, allowing for the natural completion of prompts.
-
Google's PaLM-E Combines Vision and Language AI for Robot Control
Researchers from Google's Robotics team recently announced PaLM-E, a combination of their PaLM and Vision Transformer (ViT) models designed for controlling robots. PaLM-E handles multimodal input data from robotic sensor and outputs text commands to control the robot's actuators. Besides performing well on several robotics tasks, PaLM-E also outperforms other models on the OK-VQA benchmark.
-
QCon New York 2023: Day Three Recap
Day Three of the 9th annual QCon New York conference was held on June 15th, 2023, at the New York Marriott at the Brooklyn Bridge in Brooklyn, New York. This three-day event, organized by C4Media, included keynote addresses by Suhail Patel and Sarah Bird, and presentations from four conference tracks.
-
QCon New York 2023 Panel Discussion: Navigating the Future - LLM in Production
The recent QCon New York conference featured a panel discussion titled "Navigating the Future: LLM in Production." Some key takeaways are that there are two trends in LLMS: closed models behind APIs and open-source models, and that organizations using LLMs will need to think deeply about testing and evaluating the models themselves, with a strong emphasis on risk mitigation.
-
Voxel51 Open-Sources Computer Vision Dataset Assistant VoxelGPT - Q&A with Jason Corso
Voxel51 recently open-sourced VoxelGPT, an AI assistant that interfaces with GPT-3.5 to produce Python code for querying computer vision datasets. InfoQ spoke with Jason Corso, co-founder and CSO of Voxel51, who shared their lessons and insights gained while developing VoxelGPT.
-
AI, ML, Data Engineering News Round Up: Vertex, AlphaDev, Function Calling, Gorilla, and Falcon
The latest update, spanning from June 12th, 2023, highlights the recent advancements and announcements in the domains of data science, machine learning, and artificial intelligence. This week's spotlight is on notable entities such as Google, OpenAI, UC Berkeley, and AWS.
-
QCon New York 2023: Day Two Recap
Day Two of the 9th annual QCon New York conference was held on June 14th, 2023, at the New York Marriott at the Brooklyn Bridge in Brooklyn, New York. This three-day event, organized by C4Media, included a keynote address by Alicia Dwyer Cianciolo and presentations from four conference tracks and one sponsored track.
-
QCon New York 2023: Day One Recap
Day One of the 9th annual QCon New York conference was held on June 13th, 2023, at the New York Marriott at the Brooklyn Bridge in Brooklyn, New York. This three-day event, organized by C4Media, included a keynote address by Radia Perlman and presentations from four conference tracks and one sponsored track.
-
OpenAI Announces Function Calling, Allowing Developers to Describe Functions
OpenAI has introduced updates to the API, including a capability called function calling, which allows developers to describe functions to GPT-4 and GPT-3.5 and have the models create code to execute those functions.