Generative AI Content on InfoQ

News

AI, ML & Data Engineering

AWS Introduces Amazon Q Developer in SageMaker Studio to Streamline ML Workflows

AWS announced that Amazon SageMaker Studio now includes Amazon Q Developer as a new capability. This generative AI-powered assistant is built natively into SageMaker’s JupyterLab experience and provides recommendations for the best tools for each task, step-by-step guidance, code generation, and troubleshooting assistance.

Daniel Dominguez on Jul 24, 2024

AI, ML & Data Engineering

Redis Improves Performance of Vector Semantic Search with Multi-Threaded Query Engine

Redis, the in-memory data structure store, has recently released its enhanced Redis Query Engine. This comes at a time when vector databases are gaining prominence due to their importance in retrieval-augmented generation (RAG) for GenAI applications. Redis announced significant improvements to its Query Engine, using multi-threading to enhance query throughput while maintaining low latency.

Vinod Goje on Jul 19, 2024

AI, ML & Data Engineering

Google Open Sources 27B Parameter Gemma 2 Language Model

Google DeepMind recently open-sourced Gemma 2, the next generation of their family of small language models. Google made several improvements to the Gemma architecture and used knowledge distillation to give the models state-of-the-art performance: Gemma 2 outperforms other models of comparable size and is competitive with models 2x larger.

Anthony Alford on Jul 16, 2024

AI, ML & Data Engineering

Amazon Brings AI Assistant to Software Development as Part of Amazon Q Suite

Amazon has recently released Amazon Q Developer Agent, an AI-powered assistant that uses natural language input from developers to generate features, bug fixes, and unit tests within an integrated development environment (IDE). It employs large language models and generative AI to understand a developer's natural language request, and then generate the necessary code changes.

Vinod Goje on Jun 28, 2024

AI, ML & Data Engineering

Meta's Chameleon AI Model Outperforms GPT-4 on Mixed Image-Text Tasks

The Fundamental AI Research (FAIR) team at Meta recently released Chameleon, a mixed-modal AI model that can understand and generate mixed text and image content. In experiments rated by human judges, Chameleon's generated output was preferred over GPT-4 in 51.6% of trials, and over Gemini Pro in 60.4%.

Anthony Alford on Jun 25, 2024

Cloud

Generative AI Capabilities for Logic Apps Standard with Azure OpenAI and AI Search Connectors

Microsoft has announced that the Azure OpenAI and Azure AI Search connectors for Logic Apps Standard are now generally available, following an earlier public preview. These connectors are fully integrated into Azure Integration Services, providing developers with powerful tools to enhance application functionality with advanced AI capabilities.

Steef-Jan Wiggers on Jun 19, 2024

AI, ML & Data Engineering

Meta Open-Sources MEGALODON LLM for Efficient Long Sequence Modeling

Researchers from Meta, the University of Southern California, Carnegie Mellon University, and the University of California San Diego recently open-sourced MEGALODON, a large language model (LLM) with an unlimited context length. MEGALODON has linear computational complexity and outperforms a similarly sized Llama 2 model on a range of benchmarks.

Anthony Alford on Jun 11, 2024

AI, ML & Data Engineering

AI and Software Development: Preview of Sessions at InfoQ Events

Explore the transformative impact of AI on software development at InfoQ's upcoming events. Senior software developers will share practical applications and ethical considerations of AI technology through technical talks.

Ian Robins on Jun 07, 2024

AI, ML & Data Engineering

OpenAI Publishes GPT Model Specification for Fine-Tuning Behavior

OpenAI recently published their Model Spec, a document that describes rules and objectives for the behavior of their GPT models. The spec is intended for use by data labelers and AI researchers when creating data for fine-tuning the models.

Anthony Alford on Jun 04, 2024

Development

InfoQ Dev Summit Munich: Learn from German Automotive, Banking, and TelCo Software Practitioners

InfoQ Dev Summit Munich is a two-day in-person software development conference for senior software engineers, architects, and team leaders in the Bavarian capital on September 26th and 27th. The sessions will cover critical topics such as generative AI and platform engineering, with use cases from the German automotive, banking, and telecommunication industries.

Renato Losio on Jun 04, 2024

AI, ML & Data Engineering

Cloudflare AI Gateway Now Generally Available

Cloudflare has recently announced that AI Gateway is now generally available. Described as a unified interface for managing and scaling generative AI workloads, AI Gateway allows developers to gain visibility and control over AI applications.

Renato Losio on Jun 02, 2024

AI, ML & Data Engineering

Stanford AI Index 2024 Report: Growth of AI Regulations and Generative AI Investment

Stanford University’s Institute for Human-Centered Artificial Intelligence (HAI) has published its 2024 AI Index annual report. The report identifies top trends in AI, such as 8x growth in Generative AI investment since 2022.

Anthony Alford on May 28, 2024

AI, ML & Data Engineering

NIST Launches Program to Discriminate How Far from "Human-Quality" are Gen AI Generated Summaries

NIST launched a public Gen AI evaluation program for systems developed by the international research community. The pilot program focuses on systems that can generate human-like summaries from multiple documents, and on discriminators that identify whether a summary was AI-generated. For now, information about the text-to-text modality is available. Registration closes in May.

Olimpiu Pop on May 28, 2024

AI, ML & Data Engineering

AWS Introduces Amazon Bedrock Studio for Building Generative AI Applications

AWS has recently announced Amazon Bedrock Studio, a web interface for developers to collaborate and build generative AI applications. Currently in public preview, the rapid prototyping environment provides access to multiple foundation models, knowledge bases, agents, and guardrails.

Renato Losio on May 24, 2024

AI, ML & Data Engineering

OpenAI Announces New Flagship Model GPT-4o

OpenAI recently announced the latest version of their GPT AI foundation model, GPT-4o. GPT-4o is faster than the previous version of GPT-4 and has improved capabilities in handling speech, vision, and multilingual tasks, outperforming all models except Google's Gemini on several benchmarks.

Anthony Alford on May 21, 2024
