InfoQ Homepage Machine Learning Content on InfoQ
-
The AI Revolution Is Just Getting Started: Leslie Miley Bids Us to Act Now against Its Bias and CO2
At his inaugural keynote of the QCON London conference, Leslie Miley, technical advisor for the CTO at Microsoft, spoke about AI Bias and Sustainability, and how the march towards transformative technologies, like large-scale AI and even crypto, has an inherent cost in the increased CO2 that comes with deployment at scale. More than just context and impact, he suggests mitigation techniques.
-
The Sentence Similarity Scenario in ML.NET Model Builder
Microsoft recently published information about adding the Sentence Similarity scenario in Model Builder. This scenario allows the training of custom sentence similarity models. Together with the addition of this scenario to the Model Builder, it is no longer necessary to install the Model Builder GPU extension.
-
Stack Overflow Sentiment Survey Aims to Identify Experimental and Proven Technologies
Stack Overflow has published its 2023 Emergent Tech Sentiment Survey, aiming to gauge how developers perceive utility and impact of a range of "buzz-worthy" technologies like quantum computing, nanotechnology, blockchain, and others, as well as other more established ones including cloud computing, machine learning, robotics, and more.
-
Meta AI’s Large Language Model with 10x Fewer Parameters
Meta AI recently released a new large language model called Language Large Models Meta AI (LLaMA) that outperforms foundational models such as GPT-3 and is competitive with PaLM, despite having 10 times fewer parameters. LLaMA has better performance in language tasks such as natural questions, common-sense reasoning and mathematical reasoning.
-
Google AI Updates Universal Speech Model to Scale Automatic Speech Recognition beyond 100 Languages
Google AI has recently unveiled a new update for their Universal Speech Model (USM), to support the 1,000 Languages Initiative. The new model performs better than OpenAI Whisper for all segments of automation speech recognition.
-
Zero-Copy In-Memory Sharing of Large Distributed Data: V6d
Zero-copy and in-memory data manager Vineyard (v6d) is maintained as a CNCF sandbox project and provides distributed operators that can be utilized to share immutable data within or across cluster nodes. V6d is of interest particularly for deep network training on big (sharded) datasets such as large language and graph models.
-
Hugging Face and AWS Join Forces to Democratize AI
Hugging Face, a top supplier of open-source machine learning tools, and AWS have joined together to increase the access to artificial intelligence (AI). Hugging Face's cutting-edge transformers and natural language processing (NLP) models will be made available to AWS customers as a result of the cooperation, making it simpler for them to develop and deploy AI applications.
-
MariaDB 11.0 Brings a New Optimizer Cost Model and More
After 10 years since the last release, MariaDB Server 11.0 has been released, bringing a new optimizer cost model which aims to predict more accurately the actual cost of each query execution plan, removed InnoDB change buffer, and so on.
-
Unsupervised Object Detection and Semantic Segmentation Using Deep Learning
Meta AI released CutLER, a state-of-the-art zero-shot unsupervised object detector which improves detection performance by over 2.7 times on 11 benchmark datasets for different domains like video frames, painting, sketches, etc. This model’s simplicity allows compatibility with different object-detection architectures across different domains.
-
Google Unveils MusicLM, an AI That Can Generate Music from Text Prompts
Google researchers have introduced MusicLM, an AI model that can generate high-fidelity music from text. MusicLM creates music at a constant 24 kHz throughout a number of minutes by modeling the conditional music generating process as a hierarchical sequence-to-sequence modeling problem.
-
Doordash Introduces ML to Understand the Marketplaces Status
DoorDash introduces an ML model to predict the operational status of a store in order to increase the user experience and save thousands of orders cancellation. Understanding the merchant’s operational status and the ability to receive and fulfill orders is crucial for the DoorDash platform.
-
GitHub Releases Copilot for Business amid Ongoing Legal Controversy
GitHub has announced Copilot for Business, a business plan for their OpenAI-powered coding assistant Copilot. The release follows a recent class action lawsuit against Microsoft, GitHub, and OpenAI for violating open-source licenses.
-
Grafana Adds Outlier Detection to Its Machine Learning Toolkit
Grafana has released outlier detection as part of their Grafana Machine Learning toolkit. Outlier detection can be used to monitor a group of similar things and be alerted when some of them start to behave differently than the norm.
-
eBay New Recommendations Model with Three Billion Item Titles
eBay developed a new recommendations model based on Natural Language Processing (NLP) techniques and in particular on BERT model. This new model, called “ranker,” uses the distance score between the embeddings as a feature; in this way the information in the titles of the products is analyzed from the semantic points of view.
-
Waymo Developed Collision Avoid Test to Evaluate Its Autonomous Driver
Waymo developed a testing framework called Collision Avoidance Test (CAT) to evaluate the ability to avoid crush or potential hazard situations of its Waymo Driver, compared to a human driver.