InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
GPT-3.5 Turbo Can Now Be Fine-Tuned for Improved Quality and Performance
OpenAI has made GPT-3.5 Turbo available to developers, with the added bonus of allowing them to customize the model to improve performance for their specific use cases. According to OpenAI, fine-tuning GPT-3.5 Turbo can even outperform base GPT-4 for certain tasks.
-
MetaGPT Leverages Human Collaboration Techniques for Multi-Agent-Based Software Engineering
Created by a team of researchers from Chinese and US universities, MetaGPT is a new LLM-based meta programming framework aiming to enable collaboration in multi-agent systems by leveraging human procedural knowledge to enhance robustness, reduce errors, and engineer software solutions for complex tasks.
-
LMSYS Org Releases Chatbot Arena and LLM Evaluation Datasets
Large Model Systems Organization (LMSYS Org) recently released Chatbot Arena, a comparison platform for large language models (LLMs), where users can pick the better response from a pair of chatbots. LMSYS also released a dataset containing conversations from the Arena as well as a dataset of human annotations of results from evaluating LLMs on the MT-Bench benchmark.
-
Vector Engine for Amazon Opensearch Serverless Now in Preview
AWS announced the preview release of vector storage and search capability within Amazon OpenSearch Serverless. The capability is intended to support machine learning augmented search experiences and generative AI applications.
-
Mountpoint for Amazon S3 Now GA to Access Bucket Like Local File System
During the latest AWS Storage Day event, Amazon announced the general availability of Mountpoint for Amazon S3. The new open-source file client provides through a file interface the elastic storage and throughput of Amazon S3, supporting data transfer at up to 100 Gb/second between each EC2 instance and the object storage.
-
Running Apache Flink Applications on AWS KDA: Lessons Learnt at Deliveroo
Deliveroo introduced Apache Flink into its technology stack for enriching and merging events consumed from Apache Kafka or Kinesis Streams. The company opted to use AWS Kinesis Data Analytics (KDA) service to manage Apache Flink clusters on AWS and shared its experiences from running Flink applications on KDA.
-
Ai4 2023: Generative AI Testing Lessons from Hussein Mehanna of Cruise
The recent Ai4 conference featured a talk by Hussein Mehanna of Cruise titled "How Autonomous Vehicles Will Inform and Improve AI Model Testing." Some key takeaways are that systems should handle the "long tail," developers should measure model output quality, and developers should push their systems to fail.
-
Inside InfluxDB 3.0: Exploring InfluxDB’s Scalable and Decoupled Architecture
InfluxData recently unveiled the system architecture for InfluxDB 3.0, its newest time-series DB. Its architecture encompasses four major components responsible for data ingestion, querying, compaction, and garbage collection and includes two main storage types. The architecture caters to operating the DB on-premise and natively on major cloud providers.
-
New Google Cloud H3 Virtual Machine Series for High-Performance Computing Workloads in Preview
Recently Google launched a new H3 Virtual Machine (VM) Series designed for High-Performance Computing (HPC) workloads. The series of VMs are available in public preview for Compute Engine and Google Kubernetes Engine (GKE) users and offers 88 cores (Simultaneous multi-threading disabled) and 352 GB of memory.
-
Jupyter AI Brings Generative AI to Notebooks
The open-source Project Jupyter, used by millions for data science and machine learning, has released Jupyter AI, a free tool bringing powerful generative AI capabilities to Jupyter notebooks.
-
AI, ML, Data Engineering News Roundup: Jupyter AI, AudioCraft, OverflowAI, StableCode and Tabnine
The latest update, which covers developments until August 7, 2023, highlights significant accomplishments and statements made in the fields of artificial intelligence, machine learning, and data science. This week's major news involved Jupyter, Meta AI, Overflow, Stability AI and Tabnine.
-
Ai4 2023 Panel Discussion: Generative AI in Business and Society
The recent Ai4 conference featured a panel discussion titled "Generative AI in Business and Society." Some key takeaways are that generative AI offers many opportunities for operational efficiency and product personalization, that companies need to balance privacy concerns with personalization, and they need to understand how generative AI is used across their organization.
-
MySQL Changes Versioning Model, Adds Innovation and LTS Releases
Oracle recently announced a change in the versioning model for MySQL, introducing the MySQL Innovation and Long-Term Supported releases. The first innovation release is MySQL 8.1.0 , which includes InnoDB cluster read replicas.
-
The Challenges of AI Product Development
Developing artificial intelligence (AI) products involves creating models and feeding data to train them, testing the models, and deploying them. Software engineers can support the adoption of AI and machine learning (ML) in companies by building an understanding of the technologies, encouraging experimentation, and ensuring compliance with regulations and ethical standards.
-
Ai4 2023 Summary Day Two: AI Legal Issues, AI in Education & Deploying AI
Day Two of Ai4 2023 conference was held on August 9th, 2023, at the MGM Grand hotel in Las Vegas, Nevada. This two-day event is organized by Fora Group and includes tracks focused on various industries, including automotive, financial, healthcare, and government. The day began with six mainstage presentations from leaders in AI technology.