InfoQ Homepage News
-
Databases in 2024: Growth, Change and Controversy
Andrew Pavlo’s annual retrospective on the database world has recently been released, covering trends and innovations from the past year. The opinionated report, "Databases in 2024: A Year in Review," highlights that while we may indeed be in the "golden era of databases," last year brought significant license changes, the rapid growth of DuckDB, and some surprising new releases.
-
Microsoft Phi-4 is a Small Language Model Specialized for Complex Math Reasoning
Phi-4 is 14B parameter model from Microsoft Research that aims to improve the state of the art for math reasoning. Previously available on Azure AI Foundry, Phi-4 has recently become available on Hugging Face under the MIT license.
-
Amazon Bedrock Introduces Multi-Agent Systems (MAS) with Open Source Framework Integration
Amazon Web Services has released a multi-agent collaboration capability for Amazon Bedrock, introducing a framework for deploying and managing multiple AI agents that collaborate on complex tasks. The system enables specialized agents to work together under a supervisor agent's coordination, addressing challenges developers face with agent orchestration in distributed AI systems.
-
Using Machine Learning on Microcontrollers: Decreasing Memory and CPU Usage to Save Power and Cost
According to Eirik Midttun, artificial intelligence (AI) and machine learning (ML) are useful tools for interpreting sensor data, especially when the input is complex, such as vibration, voice, and vision. The main challenges of using machine learning on microcontrollers are the constraints in computing power available and cost-related requirements that come with microcontroller-based designs,
-
Enhancing Compliance with AWS Declarative Policies
AWS recently introduced declarative policies, a feature designed to help organizations define and enforce desired configurations for AWS services at scale. This capability addresses common challenges faced by customers who need to establish standards for cloud resource configurations, such as blocking public access to Amazon EBS snapshots.
-
Microsoft Research Unveils rStar-Math: Advancing Mathematical Reasoning in Small Language Models
Microsoft Research unveiled rStar-Math, a framework that demonstrates the ability of small language models (SLMs) to achieve mathematical reasoning capabilities comparable to, and in some cases exceeding, larger models like OpenAI's o1-mini. This is accomplished without the need for more advanced models, representing a novel approach to enhancing the inference capabilities of AI.
-
Nvidia Ingest Aims to Make it Easier to Extract Structured Information from Documents
Nvidia Ingest is a new microservice aimed at processing document content and extracting metadata into a well-defined JSON schema. Ingest is able to process PDFs, Word, and PowerPoint documents and extract structured information from tables, charts, images, and text using optical character recognition.
-
Microsoft Research AI Frontiers Lab Launches AutoGen v0.4 Library
Microsoft Research’s AI Frontiers Lab has announced the release of AutoGen version 0.4, an open-source framework designed to build advanced AI agent systems. This latest version as stated marks the complete redesign of the AutoGen library, focusing on enhancing code quality, robustness, usability, and the scalability of agent workflows.
-
Agoda Integrates macOS Infrastructure with Kubernetes
Agoda recently detailed how they integrated their macOS infrastructure with Kubernetes. This was accomplished by developing the macOS Virtualization Kubelet (macOS-vz-Kubelet), a tool designed to manage macOS workloads within a Kubernetes environment.
-
DeepSeek Open-Sources DeepSeek-V3, a 671B Parameter Mixture of Experts LLM
DeepSeek open-sourced DeepSeek-V3, a Mixture-of-Experts (MoE) LLM containing 671B parameters. It was pre-trained on 14.8T tokens using 2.788M GPU hours and outperforms other open-source models on a range of LLM benchmarks, including MMLU, MMLU-Pro, and GPQA.
-
Google Releases Experimental AI Reasoning Model
Google has introduced Gemini 2.0 Flash Thinking Experimental, an AI reasoning model available in its AI Studio platform.
-
Google Vertex AI Provides RAG Engine for Large Language Model Grounding
Vertex AI RAG Engine is a managed orchestration service aimed to make it easier to connect large language models (LLMs) to external data sources to be more up-to-date, generate more relevant responses, and hallucinate less.
-
AWS Adds Container Insights with Enhanced Observability to Elastic Container Service
AWS recently announced the launch of Container Insights with Enhanced Observability for Amazon Elastic Container Service (ECS). This goes on the lines of a similar feature previously introduced for Amazon Elastic Kubernetes Service (EKS). This new capability aims to improve monitoring and troubleshooting for container workloads.
-
Terraform Fork OpenTofu Adds Provider Iteration and Exclusions
Version 1.9.0 of OpenTofu - the infrastructure-as-code tool forked a year ago from Hashicorp's Terraform - has been released. This release brings several significant features, including provider iteration capabilities through `for_each`, to enable simplified multi-zone and multi-region deployments.
-
Java News Roundup: JDK 24 in Rampdown Phase Two, Spring Framework, JobRunr, Commonhaus Foundation
This week's Java roundup for January 13th, 2025, features news highlighting: JDK 24 in Rampdown Phase Two; Spring Framework 6.2.2; JobRunr 7.4.0; Micrometer Metrics 1.15.0-M1 and Micrometer Tracing 1.5.0-M1; and Infinispan joins the Commonhaus Foundation.