InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Qwen Team Unveils QwQ-32B-Preview: Advancing AI Reasoning and Analytics

Qwen Team introduced QwQ-32B-Preview, an experimental research model designed to improve AI reasoning and analytical capabilities. Featuring a 32,768-token context and cutting-edge transformer architecture, it excels in math, programming, and scientific benchmarks like GPQA and MATH-500. Available on Hugging Face, it invites researchers to explore its features and contribute to its development.

Robert Krzaczyński
on Dec 31, 2024
AI, ML & Data Engineering

InstaDeep Open-Sources Genomics AI Model Nucleotide Transformers

Researchers from InstaDeep and NVIDIA have open-sourced Nucleotide Transformers (NT), a set of foundation models for genomics data. The largest NT model has 2.5 billion parameters and was trained on genetic sequence data from 850 species. It outperforms other state-of-the-art genomics foundation models on several genomics benchmarks.

Anthony Alford
on Dec 31, 2024
Development

AWS Adds News Amazon Q Developer Agent Capabilities: Doc Generation, Code Reviews, and Unit Tests

AWS has enhanced its generative AI-powered Amazon Q Developer, streamlining software development with new agent capabilities. Key features include automated documentation, code reviews, and unit test generation, allowing developers to focus on coding. Available in all AWS Regions, Amazon Q Developer simplifies processes in IDEs like Visual Studio Code and IntelliJ IDEA.

Steef-Jan Wiggers
on Dec 31, 2024
Cloud

Google Cloud Launches Sixth Generation Trillium TPUs: More Performance, Scalability and Efficiency

Google Cloud's Trillium, its sixth-generation TPU, is now available. It enhances AI workloads with unmatched performance and 67% better energy efficiency. Integral to the AI Hypercomputer, Trillium boasts training speeds over 4x faster and triples inference throughput. This leap positions Google as a contender against Nvidia in the AI data center market.

Steef-Jan Wiggers
on Dec 28, 2024
AI, ML & Data Engineering

EuroLLM-9B Aims to Improve State of the Art LLM Support for European Languages

EuroLLM-9B is an open-source large language model built in Europe and tailored to European languages, including all the official EU languages as well as 11 other non-official albeit commercially important languages. According to the team behind it, its performance makes it one of the best European-made LLM of this size.

Sergio De Simone
on Dec 27, 2024
AI, ML & Data Engineering

OpenAI Announces ‘o3’ Reasoning Model

OpenAI has launched the O3 and O3 Mini models, setting a new standard in AI with enhanced reasoning capabilities. Notable achievements include 71.7% accuracy on SWE-Bench and 96.7% on the AIME benchmark. While these models excel in coding and mathematics, challenges remain. O3 Mini offers scalable options for developers, prioritizing safety and adaptability.

Andrew Hoblitzell
on Dec 25, 2024
Cloud

Azure Boost DPU: Microsoft's New Silicon Solution for Enhanced Cloud Performance

At Ignite 2024, Microsoft unveiled the Azure Boost DPU, its first in-house solution for low-power, data-centric workloads. This innovative chip optimizes cloud performance and security, offering triple the efficiency of CPUs. With a robust hardware-software design, Microsoft’s advancements position it to redefine AI and cloud infrastructure.

Steef-Jan Wiggers
on Dec 25, 2024
AI, ML & Data Engineering

Anthropic Publishes Model Context Protocol Specification for LLM App Integration

Anthropic recently released their Model Context Protocol (MCP), an open standard describing a protocol for integrating external resources and tools with LLM apps. The release includes SDKs implementing the protocol, as well as an open-source repository of reference implementations of MCP.

Anthony Alford
on Dec 24, 2024
AI, ML & Data Engineering

Google Willow Sets New Quantum Supremacy Milestone

Google has announced its new 105-qubit superconducting chip, code-named Willow, which solved a quantum supremacy experiment that would take at least 300 million years to simulate on a classical computer. More importantly, the chip shows how quantum hardware may achieve fault tolerance in such a way to seemingly unleash its scalability.

Sergio De Simone
on Dec 21, 2024
AI, ML & Data Engineering

Recap of OpenAI Highlights Key Updates in 12-Day "Shipmas"

OpenAI's "12 Days of Shipmas" event featured daily announcements of new AI features and tools. Below is a summary of the key developments.

Daniel Dominguez
on Dec 21, 2024
AI, ML & Data Engineering

NVIDIA Unveils Jetson Orin Nano Generative AI Supercomputer

NVIDIA has released the Jetson Orin Nano Super Developer Kit, a compact generative AI supercomputer. The device, which measures small enough to fit in one's hand, provides increased performance for generative AI capabilities.

Daniel Dominguez
on Dec 20, 2024
AI, ML & Data Engineering

Hugging Face and Entalpic Unveil LeMaterial: Transforming Materials Science through AI

Entalpic, in collaboration with Hugging Face, has launched LeMaterial, an open-source initiative to tackle key challenges in materials science. By unifying data from major resources into LeMat-Bulk, a harmonized dataset with 6.7 million entries, LeMaterial aims to streamline materials discovery and accelerate innovation in areas such as LEDs, batteries, and photovoltaic cells.

Robert Krzaczyński
on Dec 19, 2024
Cloud

Azure AI Agent Service in Public Preview: Automation of Routine Tasks

Unveiling at Ignite, Microsoft's Azure AI Agent Service empowers developers to build and scale AI agents seamlessly. With secure integration, flexible use cases, and support for multiple frameworks, it automates workflows across platforms like Teams and Excel. Experience the future of business automation—innovate efficiently with Azure AI today!

Steef-Jan Wiggers
on Dec 19, 2024
AI, ML & Data Engineering

PydanticAI: a New Python Framework for Streamlined Generative AI Development

The team behind Pydantic, widely used for data validation in Python, has announced the release of PydanticAI, a Python-based agent framework designed to ease the development of production-ready Generative AI applications. Positioned as a potential competitor to LangChain, PydanticAI introduces a type-safe, model-agnostic approach inspired by the design principles of FastAPI.

Robert Krzaczyński
on Dec 18, 2024
Architecture & Design

Key Takeaways from QCon & InfoQ Dev Summits with a Look ahead to 2025 Conferences

As we reflect on 2024, one thing is clear: senior developers, architects, and team leaders face challenges that benefit from real-world insights shared by other senior practitioners. This year, the InfoQ Dev Summits in Boston and Munich, and the QCon conferences in London and San Francisco provided curated topics and talks from software practitioners working through demanding challenges.

Artenisa Chatziou
on Dec 18, 2024

Newer News

Older News

InfoQ Software Architects' Newsletter

News