AI, ML & Data Engineering Content on InfoQ
-
NVIDIA Unveils Jetson Orin Nano Generative AI Supercomputer
NVIDIA has released the Jetson Orin Nano Super Developer Kit, a compact generative AI supercomputer. Small enough to fit in the palm of a hand, the device provides increased performance for generative AI workloads.
-
Hugging Face and Entalpic Unveil LeMaterial: Transforming Materials Science through AI
Entalpic, in collaboration with Hugging Face, has launched LeMaterial, an open-source initiative to tackle key challenges in materials science. By unifying data from major resources into LeMat-Bulk, a harmonized dataset with 6.7 million entries, LeMaterial aims to streamline materials discovery and accelerate innovation in areas such as LEDs, batteries, and photovoltaic cells.
-
Azure AI Agent Service in Public Preview: Automation of Routine Tasks
Unveiled at Ignite, Microsoft's Azure AI Agent Service enables developers to build and scale AI agents. With secure integration, flexible use cases, and support for multiple frameworks, it automates workflows across platforms such as Teams and Excel, currently in public preview.
-
PydanticAI: a New Python Framework for Streamlined Generative AI Development
The team behind Pydantic, widely used for data validation in Python, has announced the release of PydanticAI, a Python-based agent framework designed to ease the development of production-ready Generative AI applications. Positioned as a potential competitor to LangChain, PydanticAI introduces a type-safe, model-agnostic approach inspired by the design principles of FastAPI.
-
Key Takeaways from QCon & InfoQ Dev Summits with a Look Ahead to 2025 Conferences
As we reflect on 2024, one thing is clear: senior developers, architects, and team leaders face challenges that benefit from real-world insights shared by other senior practitioners. This year, the InfoQ Dev Summits in Boston and Munich, and the QCon conferences in London and San Francisco provided curated topics and talks from software practitioners working through demanding challenges.
-
Amazon Aurora DSQL: Distributed SQL Database with Active-Active High Availability
At the recent AWS re:Invent conference in Las Vegas, Amazon announced the public preview of Aurora DSQL, a serverless, distributed SQL database featuring active-active high availability. This new PostgreSQL-compatible database option has generated significant excitement within the AWS community and was widely regarded by attendees as the standout announcement of the conference.
-
Google Introduces Veo and Imagen 3 for Advanced Media Generation on Vertex AI
Google Cloud has introduced Veo and Imagen 3, two new generative AI models available on its Vertex AI platform. Veo generates high-definition videos from text or image prompts, while Imagen 3 creates detailed, lifelike images. Both models include customization and editing tools to support application development, along with safety measures such as digital watermarking and built-in data governance.
-
OpenAI Releases Sora and Full Version of o1 Reasoning Model with Fine-Tuning
OpenAI has unveiled the full version of its o1 reasoning model and the video generation model Sora, enhancing complex reasoning and video creation capabilities. Sora produces high-quality videos using diffusion techniques, while o1 focuses on nuanced reasoning and safety. Together, they bridge creative generation and rigorous reasoning.
-
Meta Releases Llama 3.3: a Multilingual Model with Enhanced Performance and Efficiency
Meta has released Llama 3.3, a multilingual large language model aimed at supporting a range of AI applications in research and industry. Featuring a 128k-token context window and architectural improvements for efficiency, the model demonstrates strong performance in benchmarks for reasoning, coding, and multilingual tasks. It is available under a community license on Hugging Face.
-
Google AI Agent Jules Aims at Helping Developers with Their GitHub-Based Workflows
As part of Gemini 2.0, Google has launched a new AI-based coding assistant in closed preview. Dubbed "Jules", the assistant aims to help developers work with Python and JavaScript issues and pull requests, handle bug fixes, and perform other related tasks.
-
New LangChain Report Reveals Growing Adoption of AI Agents
LangChain has published its State of AI Agents report, which examines the current state of AI agent adoption across industries, gathering insights from over 1,300 professionals, including engineers, product managers, and executives. The findings provide a detailed view of how AI agents are being integrated into workflows and the challenges companies face in deploying these systems effectively.
-
Google DeepMind Unveils Gemini 2.0: a Leap in AI Performance and Multimodal Integration
Google DeepMind has introduced Gemini 2.0, an AI model that outperforms its predecessor, Gemini 1.5 Pro, with double the processing speed. The model supports complex multimodal tasks, combining text, images, and other inputs for advanced reasoning. Built on the JAX/XLA framework, Gemini 2.0 is optimized at scale and includes new features like Deep Research for exploring complex topics.
-
Amazon Introduces Amazon Nova, a Series of Foundation Models
Amazon has announced Amazon Nova, a family of foundation models designed for generative AI tasks. The announcement, made during AWS re:Invent, highlights the models' capabilities in tasks such as document and video analysis, chart comprehension, video content generation, and AI agent development.
-
From Aurora DSQL to Amazon Nova: Highlights of re:Invent 2024
The 2024 edition of re:Invent has just ended in Las Vegas. As anticipated, AI was a key focus of the conference, with Amazon Nova and a new version of SageMaker among the most significant highlights. However, the announcement that generated the most excitement in the community was the preview of Amazon Aurora DSQL, a serverless, distributed SQL database with active-active high availability.
-
Micro Metrics for LLM System Evaluation at QCon SF 2024
Denys Linkov's QCon San Francisco 2024 talk dissected the complexities of evaluating large language models (LLMs). He advocated for nuanced micro-metrics, robust observability, and alignment with business objectives to enhance model performance. Linkov’s insights highlight the need for multidimensional evaluation and actionable metrics that drive meaningful decisions.