Generative AI Content on InfoQ
-
Amazon Q Business and Amazon Q Developer Now Generally Available
AWS has recently announced the general availability of Amazon Q, a generative AI-powered assistant tailored for businesses and developers. Amazon Q Developer provides code suggestions and recommendations in real time, while Amazon Q Business enables companies to get insights from structured and unstructured data.
-
Meta Releases Llama 3 Open-Source LLM
Meta AI released Llama 3, the latest generation of their open-source large language model (LLM) family. The model is available in 8B and 70B parameter sizes, each with a base and instruction-tuned variant. Llama 3 outperforms other LLMs of the same parameter size on standard LLM benchmarks.
-
Ines Montani at QCon London: Economies of Scale Can’t Monopolise the AI Revolution
During her presentation at QCon London, Ines Montani, co-founder and CEO of explosion.ai (the maker of spaCy), stated that economies of scale are not enough to create monopolies in the AI space and that open-source techniques and models will allow everybody to keep up with the “Gen AI revolution”.
-
Stability AI Releases 3D Model Generation AI Stable Video 3D
Stability AI recently released Stable Video 3D (SV3D), an AI model that can generate 3D mesh object models from a single 2D image. SV3D is based on the Stable Video Diffusion model and produces state-of-the-art results on 3D object generation benchmarks.
-
Fix Your SDLC before Adopting Gen AI in Your Organisation: Bannon’s Call to Action at QCon London
During her keynote at QCon London, Tracy Bannon, architect and researcher at MITRE, argued that AI will be able to enhance the software development lifecycle, though currently it’s at the “code completion” rather than “code generation” phase. Throughout her presentation, she repeatedly stressed the importance of keeping humans in the loop and fixing your company’s SDLC before embracing AI.
-
KubeCon EU Keynotes: a Call to Action to Innovate Responsibly with Generative AI
The KubeCon EU morning keynotes were a veritable call to action encouraging the cloud-native community's involvement in building the scalable infrastructure needed by generative AI. This call was balanced with encouragement to make a cloud-native platform’s “golden path” green and sustainable, ensuring that any innovation is also responsible.
-
Researchers Open-Source LLM Jailbreak Defense Algorithm SafeDecoding
Researchers from the University of Washington, the Pennsylvania State University, and the Allen Institute for AI have open-sourced SafeDecoding, a technique for protecting large language models (LLMs) against jailbreak attacks. SafeDecoding outperforms baseline jailbreak defenses without incurring significant computational overhead.
-
eBay’s Lessons Learned about Generative AI in Software Development Productivity
eBay recently shared lessons learned from applying generative AI in its development process. The company's work uncovered three avenues for improving developer productivity: integrating commercial offerings, fine-tuning existing large language models (LLMs), and harnessing an internal knowledge network.
-
Google BigQuery Introduces Vector Search
Google recently announced that BigQuery now supports vector search. The new functionality enables vector similarity search required by data and AI use cases such as semantic search, similarity detection, and retrieval-augmented generation (RAG) with a large language model (LLM).
-
Amazon Q Data Integration in AWS Glue Simplifies Data Transformation on AWS
Recently, AWS announced the preview of a new feature for AWS Glue, enabling customers to use natural language for authoring and troubleshooting data integration jobs. With Amazon Q data integration in AWS Glue, developers can provide a description of their data integration workload, and the service will generate an ETL script.
-
NVIDIA Unveils Chat with RTX, a Locally Run AI Chatbot
NVIDIA has introduced Chat with RTX, allowing users to build their own personalized chatbot experience. Unlike many cloud-based solutions, Chat with RTX operates entirely on a local Windows PC or workstation, offering enhanced data privacy and control.
-
InfoQ & QCon Events: Level up on Generative AI, Security, Platform Engineering, and More Upcoming
As we navigate through these transformative times, the upcoming InfoQ events stand as a platform to help you stay ahead, learn valuable insights, and find practical solutions to your development challenges in 2024 and beyond. The events are carefully curated for senior software engineers, architects, and team leaders, offering practitioner insights into emerging trends, patterns, and practices.
-
Harnessing AI-Generated CloudFormation with Application Composer
The AWS Toolkit for VS Code has recently extended its support to include AWS Application Composer, introduced a year ago in the AWS Management Console. This enhancement lets users craft Infrastructure as Code (IaC) for more than 1,100 AWS CloudFormation resources.
-
Mistral AI's Open-Source Mixtral 8x7B Outperforms GPT-3.5
Mistral AI recently released Mixtral 8x7B, a sparse mixture of experts (SMoE) large language model (LLM). The model contains 46.7B total parameters, but performs inference at the same speed and cost as models one-third that size. On several LLM benchmarks, it outperformed both Llama 2 70B and GPT-3.5, the model powering ChatGPT.
-
Google Announces Video Generation LLM VideoPoet
Google Research recently published their work on VideoPoet, a large language model (LLM) that can generate video. VideoPoet was trained on 2 trillion tokens of text, audio, image, and video data, and in evaluations by human judges, its output was preferred over that of other models.