InfoQ Homepage Optimization Content on InfoQ
-
Hugging Face's Guide to Optimizing LLMs in Production
When it comes to deploying Large Language Models (LLMs) in production, the two major challenges originate from the huge amount of parameters they require and the necessity of handling very long input sequences to represent contextual information. Hugging Face has documented a list of techniques to tackle those hurdles based on their experience serving such models.
-
Cloudflare Introduces Smart Hints to Automate Web Performance Optimisation
Cloudflare has introduced Smart Hints to eliminate the need for manual configuration of Early Hints and Fetch Priorities, By leveraging Cloudflare's network, Smart Hints streamline the optimization process and aim to provide performance improvements without requiring manual configuration.
-
ETTrace is an Open-Source Profiler for iOS Aiming to Simplify Performance Optimization
Recently open-sourced by Emerge Tools, maker of several analysis tools for iOS apps, ETTrace aims to simplify iOS performance profiling by providing intuitive visualizations and straightforward operation.
-
AWS Introduces Athena Provisioned Capacity
AWS recently announced a new feature Provisioned Capacity for Athena, that allows users to run SQL queries on fully-managed compute capacity for a fixed price and no long-term commitments.
-
Green Software Development - What Can You Do Now, and Where is the Industry Heading?
Making code more efficient often ends up saving carbon. Storing less information and compressing it can also lower your carbon footprint. There are open-source projects and standards and guides available that can be used to increase sustainability in software development. Measurement standardization is needed to compare the environmental impact of cloud suppliers.
-
Developing Software to Manage Distributed Energy Systems at Scale
Functional programming techniques can make software more composable, reliable, and testable. For systems at scale, trade-offs in edge vs. cloud computing can impact speed and security.
-
AWS Lambda SnapStart Accelerates Java Functions
At the recent re:Invent, AWS announced an update to its FaaS offering Lambda with SnapStart feature that reduces the cold start for Java Functions.
-
Facebook MemLab Helps Finding JavaScript Memory Leaks
While JavaScript runtimes use garbage collection to manage memory, this does not rule out the possibility of leaking memory. To detect all possible cases where memory is leaked, Facebook created and open-sourced MemLab, a tool able to run predefined, automatable test scenarios and analyze heapshot diffs.
-
Spotify's Ruler Helps Optimize Android App Size
Ruler is an open-source tool developed at Spotify to analyze Android apps and guide developers in identifying which of their components contribute the most to the app size.
-
Improve Android App Startup Times Like Facebook and Google Do
In a recent article, engineers from Google and Facebook provided their hints at what matters the most when reducing the time it takes for an Android app to become responsive on launch and ensuring an optimal user experience.
-
OpenAI Releases Triton, Python-Based Programming Language for AI Workload Optimization
OpenAI released their newest language, Triton, an open-source programming language that enables researchers to write highly efficient GPU code for AI workloads. Triton is Python-compatible and allows new users to achieve expert-quality results in only 25 lines of code. The code is written in Python using Triton’s libraries, which are then JIT-compiled to run on the GPU.
-
V8 Gets a Non-Optimizing Compiler Stage to Improve Performance
The latest version of the JavaScript V8 engine, V8 9.1, introduces a new intermediate compiler stage, called Sparkplug, that improves performance on real-world benchmarks by 5-15%, says V8 engineer Leszek Swirski. It will be available in the upcoming Chrome 91.
-
The Preact Team Releases WMR to Simplify Modern Web App Development
Jason Miller, creator of 3KB Preact alternative to React, recently released WMR, a *tiny all-in-one development tool for modern web apps*. WMR, in a single 2MB file with no dependencies, supports importing npm packages without prior installation, and many of the productivity and production features customary to developers (e.g, hot module reloading, JSX, CSS modules, TypeScript, and more).
-
Webpack vs. Rollup vs. Parcel vs. Browserify: a Detailed Benchmark
The Google's web.dev team recently released a detailed benchmark comparing popular web application bundlers. The first release tests the browserify, parcel, rollup, and webpack bundlers across six dimensions and 61 feature tests. The benchmark aims at giving developers a relevant and structured comparison basis from which to pick a bundler that fits the specific needs of a given project.
-
Optimization Strategies for the New Facebook.com - Ashley Watkins at React Conf
Ashley Watkins discussed at React Conf some of the technologies and strategies powering FB5, the new facebook.com, addressing topics such as data-driven dependencies, phased code and data downloading, and more.