InfoQ Homepage Data Science Content on InfoQ
-
Apple Open Sources GCGC: a Tool to Analyze Java GC Logs
Apple has open-sourced GCGC, a tool for Java Garbage Collector (GC) logs visualization based on Python 3 and pandas. GCGC uses a Jupyter notebook to analyze and visualize GC log files.
-
Julia 1.7 Extends its Threading Capabilities, Improves Type Inference, and More
Julia 1.7 brings a number of significant enhancements, including new threading capabilities, new Package Manager features, improved type inference, and new syntactic features. It is also the first release to run natively on Apple Silicon.
-
BasisAI Open Source Boxkite Machine Learning Monitoring Tool
Boxkite is an open source instrumentation library designed to track concept drift in highly available model servers. It integrates with DevOps tools such as Grafana, Prometheus, fluentd and kubeflow, scaling horizontally to multiple replicas without needing changes to code or infrastructure. The project claims to be fast, correct and simple.
-
Pyodide Brings Python and Its Scientific Stack to the Browser with WebAssembly
Mozilla announced that Pyodide, which aims at providing a full Python data science stack running entirely in the browser, has become an independent community-driven project. Pyodide uses the CPython 3.8 interpreter compiled to WebAssembly, and thus allows using Python, NumPy, Pandas, Matplotlib, SciPy, and more in Iodide, an experimental interactive scientific computing environment for the web.
-
AWS Announces a Data Management and Analytics Solution Called Amazon FinSpace
Recently, AWS announced a data management and analytics solution purpose-built for the Financial Services Industry (FSI) called Amazon FinSpace. The service aims to reduce the time it takes for financial analysts to find and access all types of financial data for analysis.
-
Kaggle Publishes 2020 State of Machine Learning and Data Science Report
Kaggle has published a report on the State of Machine Learning and Data Science for 2020. The report is based on survey responses from over two thousand users currently employed as data scientists. The report notes that the "vast majority" of data scientists are under 35 years of age, two-thirds have a graduate degree, and most have less than 10 years coding experience.
-
Using Agile with a Data Science Team
Agile helped a data science team to better collaborate with their stakeholders and increase their productivity. As priorities became clear, the team was able to focus and deliver. Buy-in of the data science team by taking them through a journey of agile was crucial to making it work.
-
NVIDIA Releases a $59 Jetson Nano 2GB Kit to Make AI More Accessible to Developers
With the Jetson series of devices and software SDKs, NVIDIA creates a coherent development environment to learn and develop GPU-based AI applications.
-
Is Julia Production Ready? Q&A with Bogumił Kamiński
On the heels of JuliaCon 2020, SGH Warsaw School of Economics professor and DataFrames.jl maintainer Bogumił Kamiński summarized the status of the language and its ecosystem and stated that Julia is finally production-ready. InfoQ has taken the chance to speak with professor Kamiński.
-
COVID-19 and AI: Virtual Conference at Stanford Discusses the Future
The Stanford Institute For Human-Centered Artificial Intelligence approaches COVID-19 from a wide variety of perspectives.
-
AlphaFold Algorithm Predicts COVID-19 Protein Structures
DeepMind uses AlphaFold to predict 3D protein structures straight from amino acid sequences for novel coronavirus 2019 (NCOVID-2019).
-
Data Science Community Reacts to COVID-19 Pandemic
The data science community has reacted with fervor to the COVID-19 pandemic, with numerous articles from a data-oriented perspective and both official and grassroot efforts to provide access to data and utilize ML techniques to help deal with the crises across industry, academia and governmental organizations worldwide.
-
.NET Interactive: Jupyter Notebooks with .NET Core - Preview 2
.NET Interactive is the new name for Try .Net. This suite of APIs and command-line tools allows you to create interactive notebooks and documents combining rich text, code and data. One of the main use cases for .NET Interactive is embedding code into a Jupyter Notebook.
-
Oracle Cloud Now Offers Data Science and Machine Learning Services
Oracle recently announced the availability of its Cloud Data Science Platform, a native service on Oracle Cloud Infrastructure (OCI), which the software designed to let teams of data scientists collaborate on the development, deployment and maintenance of machine learning models.
-
Michael Berthold on End-to-End Data Science Using KNIME Software
Open source data analytics platform KNIME CEO and co-founder Michael Berthold gave the keynote presentation at this year's KNIME Fall Summit 2019 Conference. He spoke about the end-to-end data science cycle. The data science process lifecycle mainly involves create and productionize categories.