InfoQ Homepage News
-
QCon London: Efficient Serverless Development
At QCon London, Yan Cui, a serverless advocate at Lumigo, shared patterns for effective local development with AWS serverless technologies. The focus areas were testing approaches, deployment practices, and application environments.
-
Large Language Models for Code by Loubna Ben Allal at QCon London
At QCon London, Loubna Ben Allal discussed Large Language Models (LLMs) for code. She discussed the lifecycle of code completion models, which consists of pre-training on vast codebases and finetuning and continuous adaptation. She specifically discussed open-source models, which are powered by platforms like Hugging Face.
-
NVIDIA Announces Next-Generation AI Superchip Blackwell
NVIDIA recently announced their next generation GPU architecture, Blackwell. Blackwell is the largest GPU ever built, with over 200 billion transistors, and can train large language models (LLMs) up to 4x faster than previous generation hardware.
-
QCon London: Meta Used Monolithic Architecture to Ship Threads in Only Five Months
Zahan Malkani talked during QCon London 2024 about Meta’s journey from identifying the opportunity in the market to shipping the Threads application only five months later. The company leveraged Instagram's existing monolithic architecture and quickly iterated to create a new text-first microblogging service in record time.
-
Efficient DevSecOps Workflows with a Little Help from AI: Q&A with Michael Friedrich
At QCon London, Michael Friedrich, senior developer advocate at GitLab, discussed how AI can help in DevSecOps workflows. His session was part of the Cloud-Native Engineering track on the first day of the conference. InfoQ interviewed Friedrich after the session.
-
Navigating LLM Deployment: Tips, Tricks and Techniques by Meryem Arik at QCon London
At QCon London, Meryem Arik discussed deploying Large Language Models (LLMs). While initial proofs of concept benefit from hosted solutions, scaling demands self-hosting to cut costs, enhance performance with tailored models, and meet privacy and security requirements. She emphasized understanding deployment limits, quantization for efficiency, and optimizing inference to fully use GPU resources.
-
Java News Roundup: New JEP Candidates, Project Bisbane, Ktor Plugin Repository, JDKUpdater
This week's Java roundup for April 1st, 2024, features news highlighting: new JEP Candidates: JEP 469, Vector API (Eighth Incubator); JEP 473, Stream Gatherers (Second Preview); and JEP 474, ZGC: Generational Mode by Default, Project Bisbane, and introducing the Ktor Plugin Repository and JDKUpdater.
-
Will C++ Become a Safe Language Like Rust and Others?
In a recent article, C++ expert and ISO C++ Committee Chair Herb Sutter expressed his views about what it takes to make C++ a safe language in the guise of Rust and other memory-safe languages (MSLs). His recipes include relying on tooling, as is the case with other MSLs, promoting safe language features, pushing unsafe features behind compiler flags, and more.
-
AWS Deadline Cloud: Q&A on the Cloud-Based Render Farm with Antony Passemard
AWS has recently introduced Deadline Cloud, a new service designed to help creative teams manage rendering tasks more efficiently. The service is particularly useful for customers in the media & entertainment (M&E) and architecture, engineering, & construction (AEC) industries who need to generate final frames for film, TV, games, industrial design visualizations, and other digital media.
-
Visual Studio 17.10 Preview 2: GitHub Copilot-Powered Pull Requests, SSDT Support for VS in ARM64
Microsoft has released Visual Studio 17.10 Preview 2. This release contains features regarding GitHub Copilot-powered pull requests, support for SQL Server Developer Tools in VS on ARM64 and support for Garbage Collection Insights in Managed Memory Window.
-
Enabling Software Platform Adoption with Self-Service and User Engagement
In order to scale a platform, it has to become a self-service product with software engineers and managers engaged, taking advantage of new technologies. A stakeholder engagement program was established with senior engineers and managers across the company, explaining how the new tools can increase developers' productivity and team velocity.
-
Cloudflare D1, Workers Analytics Engine and Hyperdrive: Q&A with Rita Kozlov and Matt Silverlock
Cloudflare recently announced the “production ready” switch on three products: their serverless SQL database D1, time series Workers Analytics Engine, and Hyperdrive.
-
Microsoft Announces Garnet: a New Open-Source Cache-Store and Redis Alternative
Microsoft Research has recently announced Garnet, an open-source cache-store designed to accelerate applications and services. Using the RESP wire protocol, Garnet is a faster alternative to cache-stores and is compatible with existing Redis clients.
-
Nvidia Announces Robotics-Oriented AI Foundational Model
At its recent GTC 2024 event, Nvidia announced a new foundational model to build intelligent humanoid robots. Dubbed GR00T, short for Generalist Robot 00 Technology, the model will understand natural language and be able to observe human actions and emulate human movements.
-
Microsoft AI-Driven Security Tool Copilot for Security is Now GA
Microsoft recently announced the general availability of Copilot for Security, a generative Artificial Intelligence (AI) security product designed to help security and IT teams with the capabilities to protect their digital assets.