InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Microsoft Announces Small Language Model Phi-2
Microsoft Research announced Phi-2, a 2.7 billion-parameter Transformer-based language model. Phi-2 is trained on 1.4T tokens of synthetic data generated by GPT-3.5 and outperforms larger models on a variety of benchmarks.
-
Cloudflare's Journey in ML and AI: MLOps Platform and Best Practices
Cloudflare's blog described its MLOps platform and best practices for running Artificial Intelligence (AI) deployment at scale. Cloudflare's products, including WAF attack scoring, bot management, and global threat identification, rely on constantly evolving Machine Learning (ML) models. These models are pivotal in enhancing customer protection and augmenting support services.
-
Expedia Uses WebSockets and Kafka to Query Near Real-Time Streaming Data
Expedia created a solution to support querying the clickstream data from their platform in near-real time to enable their product and engineering teams to explore live data while working on new and enhancing existing data-driven functional use cases. The team used a combination of WebSockets, Apache Kafka, and PostgreSQL to allow streaming query results continuously to users’ browsers.
-
Privacy Engineering at Scale: DoorDash’s Journey in Geomasking and Data Protection
DoorDash recently published how it proactively embeds privacy into its products. It explains the importance of Privacy Engineering, an often overlooked software architecture practice, and provides an example of geomasking users' address data to protect their privacy better.
-
Apple Open-sources Apple Silicon-Optimized Machine Learning Framework MLX
Apple's MLX combines familiar APIs, composable function transformations, and lazy computation to create a machine learning framework inspired by NumPy and PyTorch that is optimized for Apple Silicon. Implemented in Python and C++, the framework aims to provide a user-friendly and efficient solution to train and deploy machine learning models on Apple Silicon.
-
Microsoft Released ML.NET 3.0: Deep Learning and Data Processing Improvements
ML.NET version 3.0 has been officially released, introducing new features and enhancements. Notably, deep learning capabilities have been significantly expanded with advancements in Object Detection, Named Entity Recognition, and Question Answering and more.
-
Amazon RDS Now Supports IBM Db2
During the recent re:Invent conference, AWS announced that Amazon RDS now offers support for IBM Db2, including both the Standard Edition and Advanced Edition. Db2 becomes the sixth engine supported by this managed database service, joining PostgreSQL, MySQL, MariaDB, SQL Server, and Oracle.
-
Spring Boot 3.2 Delivers Support for Virtual Threads and Modern RestClient and JdbcClient
The Spring Boot team released Spring Boot 3.2 in November 2023. This latest release is packed with features, the most noteworthy of which is the support for Virtual Threads and Coordinated Restore at Checkpoint (CRaC). Other features include SSL Bundle reloading, Rest Client and JDBC Client interface support, observability improvements, and many others.
-
Meta and IBM Lead Formation of AI Alliance to Drive Open-Source Innovation
A new consortium, led by Meta and IBM, has been formed to support open-source AI. The AI Alliance comprises numerous organizations from various sectors, including software, hardware, nonprofit, public, and academic, with the goal of collaboratively developing tools and programs to facilitate open development.
-
Microsoft's Orca 2 LLM Outperforms Models That Are 10x Larger
Microsoft Research released its Orca 2 LLM, a fine-tuned version of Llama 2 that performs as well as or better than models that contain 10x the number of parameters. Orca 2 uses a synthetic training dataset and a new technique called Prompt Erasure to achieve this performance.
-
Java News Roundup: JDK 22 in Rampdown, JDK 23 Expert Group, Spring Cloud 2023.0, TornadoVM 1.0
This week's Java roundup for December 4th, 2023, features news highlighting: JDK 22 having moved to Rampdown Phase One; formation of the JDK 23 expert group; JEP 464, Scope Values (Second Preview) targeted for JDK 22; Spring Cloud 2023.0.0; TornadoVM 1.0.0; and JHipster Lite 1.0.0.
-
Apache Pinot 1.0 Provides a Realtime Distributed OLAP Datastore
Apache Pinot is an open source column-oriented distributed data store written in Java. Pinot is designed to use Online Analytical processing (OLAP) in order to answer multi-dimensional analytical (MDA) queries with low latency.
-
Google Launches New Multi-Modal Gemini AI Model
On December 6, Alphabet released the first phase of its next-generation AI model, Gemini. Gemini was overseen and driven by its CEO, Sundar Pichai and Google DeepMind. Gemini is the first model to outperform human experts on MMLU (Massive Multitask Language Understanding), one of the most popular methods to test the performance of language models.
-
Microsoft Integrates Copilot with Azure Quantum Elements for Advancing Scientific Research
At the recent Ignite conference, Microsoft demonstrated the advantages of combining AI and Azure Quantum Elements, a suite of services and tools for quantum chemistry and materials science. With the integration of Copilot, the new solution allows researchers to explore more materials, speed up chemistry simulations, and experiment with existing quantum hardware.
-
Amazon Unveils Titan AI Image Generator
Amazon unveiled Titan Image Generator, currently in preview for AWS customers on Bedrock, Amazon's AI development platform. As a member of Amazon's Titan family of generative AI models, Titan Image Generator has the capability to generate new images based on a text description or customize existing images.