InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Meta's Research SuperCluster for Real-Time Voice Translation AI Systems
A recent article from Engineering at Meta reveals how the company is building Research SuperCluster (RSC) infrastructure that is used for advancements in real-time voice translations, language processing, computer vision, and augmented reality (AR).
-
University Researchers Create New Type of Interpretable Neural Network
Researchers from Massachusetts Institute of Technology, California Institute of Technology, and Northeastern University created a new type of neural network: Kolmogorov–Arnold Networks (KAN). KAN models outperform larger perceptron-based models on physics modeling tasks and provide a more interpretable visualization.
-
MariaDB Introduces Open-Source Vector Preview, Aiming to Become Default MySQL Option
With the release of MariaDB 11.6, the MariaDB Foundation has announced the public preview of Vector search for the open-source fork of the MySQL engine. Database experts and open-source advocates see vector support as an opportunity for MariaDB to lead the MySQL ecosystem, especially since Oracle reserves most new features for its enterprise editions only.
-
University of Pennsylvania Researchers Develop Processorless Learning Circuitry
Researchers from the University of Pennsylvania have designed an electrical circuit, similar to a neural network, that can learn tasks such as nonlinear regression. The circuit operates at low power levels and can be trained without a computer.
-
NVIDIA NIM Now Available on Hugging Face with Inference-as-a-Service
Hugging Face has announced the launch of an inference-as-a-service capability powered by NVIDIA NIM. This new service will provide developers easy access to NVIDIA-accelerated inference for popular AI models.
-
Amazon MemoryDB Provides Fastest Vector Search on AWS
AWS recently announced the general availability of vector search for Amazon MemoryDB, the managed in-memory database with Multi-AZ availability. The new capability provides ultra-low latency and the fastest vector search performance at the highest recall rates among vector databases on AWS.
-
GitHub Models Brings New AI Playground and Tight Integration with Other GitHub Tools
GitHub has launched GitHub Models, a free capability aimed at letting developers explore various AI models from within the GitHub tool ecosystem and make it easier to deploy AI-based services using Azure AI. GitHub Models includes both private and public models and is currently in closed preview.
-
Canva Opts for Amazon KDS over SNS+SQS to Save 85% with 25 Billion Events per Day
Canva evaluated different data massaging solutions for its Product Analytics Platform, including the combination of AWS SNS and SQS, MKS, and Amazon KDS, and eventually chose the latter, primarily based on its much lower costs. The company compared many aspects of these solutions, like performance, maintenance effort, and cost.
-
Mistral AI Releases Three Open-Weight Language Models
Mistral AI released three open-weight language models: Mistral NeMo, a 12B parameter general-purpose LLM; Codestral Mamba, a 7B parameter code-generation model; and Mathstral, a 7B parameter model fine-tuned for math and reasoning. All three models are available under the Apache 2.0 license.
-
Meta Releases Llama 3.1 405B, Largest Open-Source Model to Date
Meta recently unveiled its latest language model, Llama 3.1 405B. This AI model is the largest of the new Llama models, which also include 8B and 70B versions. With 405 billion parameters, 15 trillion tokens, and 16,000 GPUs, Llama 3.1 405B offers a range of impressive features.
-
Google's JEST Algorithm Automates AI Training Dataset Curation and Reduces Training Compute
Google DeepMind recently published a new algorithm for curating AI training datasets: multimodal contrastive learning with joint example selection (JEST), which uses a pre-trained model to score the learnability of batches of data. Google's experiments show that image-text models trained with JEST-curated data require 10x less computation than baseline methods.
-
Google Introduces Free Trial for AlloyDB PostgreSQL Database
Google has announced a free trial program for AlloyDB, its fully-managed PostgreSQL-compatible database service. The trial allows users to test AlloyDB's capabilities with their own workloads for up to 30 days. AlloyDB is designed to provide high performance, scalability, and reliability, while maintaining full compatibility with open-source PostgreSQL.
-
Gen AI Increases Workloads and Decreases Productivity, Upwork Study Finds
A controversial survey by Upwork Research Institute found that while 96% of C-suite leaders expect the use of generative AI tools to increase overall productivity levels, 77% of surveyed employees say they have actually decreased their productivity. In fact, the survey contradicts previous research showing a positive correlation.
-
Java News Roundup: WildFly 33, Spring Cloud Data Flow, Apache TomEE, LangChain4j, Micronaut
This week's Java roundup for July 22nd, 2024, features news highlighting: the release of WildFly 33; Spring Cloud Data Flow 2.11.4; the second milestone release of Apache TomEE 10.0; LangChain4j 0.33; Micronaut 4.5.1; Eclipse Store 1.4; and an update on Jakarta EE 11.
-
AWS Discontinues Amazon Quantum Ledger Database (QLDB)
AWS recently announced that new customers can no longer sign up for Amazon Quantum Ledger Database (QLDB), a managed service providing an immutable transaction log maintained by a central trusted authority. All existing databases will be shut down in one year, and current users are encouraged to migrate to Aurora PostgreSQL.