AI, ML & Data Engineering Content on InfoQ
-
Alibaba Releases Two Open-Weight Language Models for Math and Voice Chat
Alibaba released two open-weight language model families: Qwen2-Math, a series of LLMs tuned for solving mathematical problems; and Qwen2-Audio, a family of multi-modal LLMs that can accept voice or text input. Both families are based on Alibaba's Qwen2 LLM series, and all but the largest version of Qwen2-Math are available under the Apache 2.0 license.
-
Grok-2 Beta Version Released on X Platform
The Grok-2 language model has been released in beta on the X platform, alongside Grok-2 mini. The model, tested under the designation "sus-column-r" on the LMSYS leaderboard, has achieved a higher Elo score than Claude 3.5 Sonnet and GPT-4-Turbo. Grok-2 mini, a smaller variant, is also part of the beta release and is designed to balance speed and performance.
-
Concerns Rise in Open-Source Community as CockroachDB Ends Core Free Edition
Cockroach Labs has recently announced a change to the license model of its distributed SQL database, CockroachDB, discontinuing the free Core version and making the Enterprise version the only option. Because the company had previously moved away from an open-source license, this latest change has raised further questions in the community about the future of open-source solutions managed by a single vendor.
-
Microsoft Launches Open-Source Phi-3.5 Models for Advanced AI Development
Microsoft launched three new open-source AI models in its Phi-3.5 series: Phi-3.5-mini-instruct, Phi-3.5-MoE-instruct, and Phi-3.5-vision-instruct. Available under a permissive MIT license, these models offer developers powerful tools for various tasks, including reasoning, multilingual processing, and image and video analysis.
-
Apple Unveils Apple Foundation Models Powering Apple Intelligence
Apple published the details of its new Apple Foundation Models (AFM), a family of large language models (LLMs) that power several features in its Apple Intelligence suite. AFM comes in two sizes: a 3B parameter on-device version and a larger cloud-based version.
-
LLMs and Agents as Team Enablers
Eric Naiburg and Birgitta Böckeler published articles on the benefits and challenges of using AI as a multiplier in dev teams. We report on their insights for scenarios such as simplifying the germane cognitive load of a domain, automating code migrations, and coaching scrum masters on team facilitation. We also cover Böckeler's experiments with using LLMs to onboard onto a complex project.
-
Spanner Graph: Google Introduces Graph Database on Spanner
At the recent Cloud Next conference in Tokyo, Google announced Spanner Graph, a managed feature that integrates graph, relational, search, and AI capabilities within Spanner. This new database supports a graph query interface compatible with ISO GQL (Graph Query Language) standards while avoiding the need for a standalone graph database.
-
How Amazon Aurora Serverless Manages Resources and Scaling for Fleets of 10K+ Instances
AWS engineers published a paper describing the evolution and latest design of resource management and scaling for the Amazon Aurora Serverless platform. Aurora Serverless uses a combination of components at different levels to create a holistic approach for dynamically scaling and adjusting resources to satisfy the needs of customer workloads.
-
InfoQ Dev Summit Munich 2024 Summer Sale: Learn about GenAI, Secure Supply Chains, and Scalable Arch
With technology developing at a fast pace, staying ahead means continually leveling up your skills and knowledge. At InfoQ Dev Summit Munich (September 26-27), learn directly from senior software practitioners driving innovation and change in software.
-
Meta's Research SuperCluster for Real-Time Voice Translation AI Systems
A recent article from Engineering at Meta describes how the company is building its Research SuperCluster (RSC) infrastructure, which is used to advance real-time voice translation, language processing, computer vision, and augmented reality (AR).
-
University Researchers Create New Type of Interpretable Neural Network
Researchers from Massachusetts Institute of Technology, California Institute of Technology, and Northeastern University created a new type of neural network: Kolmogorov–Arnold Networks (KAN). KAN models outperform larger perceptron-based models on physics modeling tasks and provide a more interpretable visualization.
-
MariaDB Introduces Open-Source Vector Preview, Aiming to Become Default MySQL Option
With the release of MariaDB 11.6, the MariaDB Foundation has announced the public preview of vector search for the open-source fork of the MySQL engine. Database experts and open-source advocates see vector support as an opportunity for MariaDB to lead the MySQL ecosystem, especially since Oracle reserves most new features for its enterprise editions.
-
University of Pennsylvania Researchers Develop Processorless Learning Circuitry
Researchers from the University of Pennsylvania have designed an electrical circuit, similar to a neural network, that can learn tasks such as nonlinear regression. The circuit operates at low power levels and can be trained without a computer.
-
NVIDIA NIM Now Available on Hugging Face with Inference-as-a-Service
Hugging Face has announced the launch of an inference-as-a-service capability powered by NVIDIA NIM. This new service will provide developers easy access to NVIDIA-accelerated inference for popular AI models.
-
Amazon MemoryDB Provides Fastest Vector Search on AWS
AWS recently announced the general availability of vector search for Amazon MemoryDB, the managed in-memory database with Multi-AZ availability. The new capability provides ultra-low latency and the fastest vector search performance at the highest recall rates among vector databases on AWS.