InfoQ Homepage Machine Learning Content on InfoQ
-
Meta Releases Llama 3.2 with Vision, Voice, and Open Customizable Models
Meta recently announced Llama 3.2, the latest version of Meta's open-source language model, which includes vision, voice, and open customizable models. This is the first multimodal version of the model, which will allow users to interact with visual data in ways like identifying objects in photos or editing images with natural language commands among other use cases.
-
OpenAI Releases Stable Version of .NET Library with GPT-4o Support and API Enhancements
OpenAI has released the stable version of its official .NET library, following June's beta launch. Available as a NuGet package, it supports the latest models like GPT-4o and GPT-4o mini, and the full OpenAI REST API. The release includes both sync and async APIs, streaming chat completions, and key-breaking changes for improved API consistency.
-
PyTorch Conference 2024: PyTorch 2.4/Upcoming 2.5, and Llama 3.1
The PyTorch Conference 2024, held by The Linux Foundation, showcased groundbreaking advancements in AI, featuring insights on PyTorch 2.4, Llama 3.1, and open-source projects like OLMo. Key discussions on LLM deployment, ethical AI, and innovative libraries like Torchtune and TorchChat emphasized collaboration and responsible practices in the evolving landscape of generative AI.
-
Microsoft Launches Azure AI Inference SDK for .NET
Microsoft launched Azure AI Inference SDK for .NET, streamlining access to generative AI models in the Azure AI Studio model catalog. This catalog includes models from providers like Azure OpenAI Service, Mistral, Meta, Cohere, NVIDIA, and Hugging Face, organized into three collections: Curated by Azure AI, Azure OpenAI Models, and Open Models from Hugging Face Hub.
-
AWS Announces General Availability of EC2 P5e Instances, Powered by NVIDIA H100 Tensor Core GPUs
Amazon Web Services (AWS) has launched EC2 P5e instances featuring NVIDIA H100 Tensor Core GPUs, substantially boosting AI and HPC performance. With enhanced memory bandwidth, these instances reduce latency for real-time applications. Ideal for tasks like LLM training and simulations, they offer improved scalability and cost-efficiency, making them pivotal for modern cloud computing.
-
Leveraging the Transformer Architecture for Music Recommendation on YouTube
Google has described an approach to use transformer models, which ignited the current generative AI boom, for music recommendation. This approach, which is currently being applied experimentally on YouTube, aims to build a recommender that can understand sequences of user actions when listening to music to better predict user preferences based on their context.
-
Pinterest Modernises Machine Learning Infrastructure with Ray
Pinterest, the visual discovery platform, has revealed details about its journey to modernise its machine learning infrastructure using Ray, an open-source distributed computing framework. In a recent blog post, the company shared insights into the challenges faced and solutions implemented as they integrated Ray into their large-scale production environment.
-
Meta Releases Llama 3.1 405B, Largest Open-Source Model to Date
Meta recently unveiled its latest language model, Llama 3.1 405B. This AI model is the largest of the new Llama models, which also include 8B and 70B versions. With 405 billion parameters, 15 trillion tokens, and 16,000 GPUs, Llama 3.1 405B offers a range of impressive features.
-
AWS Introduces Amazon Q Developer in SageMaker Studio to Streamline ML Workflows
AWS announced that Amazon SageMaker Studio now includes Amazon Q Developer as a new capability. This generative AI-powered assistant is built natively into SageMaker’s JupyterLab experience and provides recommendations for the best tools for each task, step-by-step guidance, code generation, and troubleshooting assistance.
-
Amazon SageMaker Now Offers Managed MLflow Capability for Enhanced Experiment Tracking
AWS has announced the general availability of MLflow capability in Amazon SageMaker. MLflow is an open-source tool commonly used for managing ML experiments. Users can now compare model performance, parameters, and metrics across experiments in the MLflow UI, keep track of their best models in the MLflow Model Registry, and automatically register them as a SageMaker model.
-
Apple WWDC: iOS18 and Apple Intelligence Announcements
At WWDC 2024 Apple unveiled "Apple Intelligence," a suite of AI features coming to iOS 18, iPadOS 18, and macOS Sequoia. Apple’s aim with Apple Intelligence is to seamlessly integrate AI into the core of the iPhone, iPad, and Mac experience.
-
AI and Software Development: Preview of Sessions at InfoQ Events
Explore the transformative impact of AI on software development at InfoQ's upcoming events. Senior software developers will share practical applications and ethical considerations of AI technology through technical talks.
-
AWS Introduces Amazon Bedrock Studio for Building Generative AI Applications
AWS has recently announced Amazon Bedrock Studio, a web interface for developers to collaborate and build generative AI applications. Currently in public preview, the rapid prototyping environment provides access to multiple foundation models, knowledge bases, agents, and guardrails.
-
Enhanced Security for Enterprises: Google Launches Google Threat Intelligence
At the recent RSA Conference in San Francisco, Google Cloud introduced Google Threat Intelligence, a new security offering for large organizations. The new solution provides users with actionable insights, external threat monitoring, attack surface management, digital risk protection, and in-depth analysis of Indicators of Compromise (IOC).
-
Hugging Face Unveils LeRobot, an Open-Source Machine Learning Model for Robotics
Hugging Face has unveiled LeRobot, a new machine learning model trained for real-world robotics applications. LeRobot functions as a platform, offering a versatile library for data sharing, visualization, and training of advanced models.