On conscious AI and other AI stuff. If you’re in London it’s gonna be a scorching hot day. Find some shade, grab a beer or a cider. If you have time you should watch: Lex in conversation with Demis CEO @Deepmind. They talk about chess, Turing Test, video games, solving intelligence, quantum simulation, aliens, conscious AI and more…
The cost of LLMs: Since the Attention is All you Need paper, and BERT Transformers, LLMs are trending. I’ve lost count of so many LLMs: GPT-3, Gopher, PaLM, GLaM, Megatron, Chinchilla, Pari, DALL-E, Parti, Flamingo, Gato… But here’s the thing: e.g. Google’s LaMDA 137Billion parameters, takes 1024 TPUs cards and 57.7 days to train, which costs ca. $6 Million! Tom has more details on the costs of LLM’s
Python for Data Analysis: In 2012, I organised a workshop with Wes on pandas in London. Back then, we gave away a paperback of Wes’ book. A decade later! Wes just released this: An open access (html) version + code of Python for Data Analysis 3rd ed.
Let’s Unify All ML Frameworks… Any model, Any Code
A Collection of [Mostly] Pen & Paper ML Exercises
[Tutorial] Conversational RecSys ( 78 slides & vid)
[Great] AutoRegEx: English to RegEx Using GPT-3 NLP
Bandits for Recommender Systems
Our Machine Learning Stack at Monzo Bank
[Tutorial] Topological Data Analysis for ML
[Free Book] Probabilistic Numerics for ML
Etsy #purgatory: Markov Chat Bot Disaster Story
Irreproducible ML: A List of Failures & Bogus Claims in ML
Share Data Machina
DoWhy: Open Source Lib for Causal Inference
pyprobml: All Colab Notebooks from Probabilistic ML
DeepChecks: Test & Validate ML Models Effortlessly
KelpNet : A Pure C# ML Framework
A C++ Lib for Implementing AI & ML Algos
Facebook AI’s Flashlight: Fast, Flexible ML in C++
RemixAutoML: Automate Your ML Workflow
distillML: Interpretable ML Methods
rtemis: A Platform for Advanced ML & Dataviz in R
State of Machine Learning in Julia
Combinatorial Optimization Algos within ML Pipelines
MLJ: Alan Turing Institute’s ML Framework for Julia
An Idiomatic Clojure ML Library
DataLinguist: Clojure Wrapper for Standford CoreNLP
k-means ++ & k-means parallel Implemented in Clojure
Metarank: Real Time Personalization as a Service
Microsoft’s SynapseML: Simple & Distributed ML
BigDL: Large-Scale AI Apps for Distributed Big Data
A Collection of Dataviz Projects in R
Visualising Data Structures & Algos Via Animation
Vizzu: Opensource JS/ C++ Lib for Animated Data Stories
Presto Analytics on Apache Kafka at Uber Scale
Powering Real-time Data Analytics with Druid at Twitter
Spotify’s Infrastructure for Running User Forecasts
Generative Anomaly Detection for Time Series Datasets
Causality-Based Multivariate Time Series Anomaly Detection
Intrinsic Anomaly Detection for Multi-Variate Time Series
Visualise Algos from Code
Algos for Competitive Programming
Sorting Algos Visualised with Blender Python API
[Free Course] – Evolutionary Robotics (26 Videos)
DayDreamer: World Models for Physical Robot Learning
VAPAR: Visual Attention Prediction for Drone Racing
Algorithmic Trading with Deep Reinforcement Learning
Intel AI’s AnomaLib: A DL Lib for SotA Anomaly Detection
[Vids & Course Notes] MIT Intro to Deep Learning (2022)
AI Dungeon: Create & Play Infinite AI Adventures
Hazy: Synthetic Financial Data without Restrictions
Weights & Biases: Faster, Better MLOps Platform
Facebok AI’s Casual Conversation Dataset
Amazon’s Massive Dataset for Natural Language Understanding
LAION: 400 Million English Text-Image Pairs, 100% Open, Free
Share Data Machina
Tips? Suggestions? Feedback? Send email to Carlos
Curated by Carlos @ds_ldn in the middle of the night.
Great to see the newsletter back!
Carlos, you made my day! So good to get this blast from the past on a sunny Sunday morning. And with your reference to the 2012 session with Wes, that feels like yesterday yet so much has happened since! Thank you for all that you do for us!