Stars
Productive, portable, and performant GPU programming in Python.
10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)
Samples for CUDA Developers which demonstrates features in CUDA Toolkit
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
openpilot is an operating system for robotics. Currently, it upgrades the driver assistance system on 275+ supported cars.
A list of awesome compiler projects and papers for tensor computation and deep learning.
A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.
higher is a pytorch library allowing users to obtain higher order gradients over losses spanning training loops rather than individual training steps.
automatic differentiation made easier for C++
A pure, low-level tensor program representation enabling tensor program optimization via program rewriting. See the web demo at https://gussmith23.github.io/glenside-web-demo/
Re-implementation of the TASO compiler using equality saturation
Deep universal probabilistic programming with Python and PyTorch
egg is a flexible, high-performance e-graph library
A self-contained version of the tutorial which can be easily cloned and viewed by others.
Dynamic Tensor Rematerialization prototype (modified PyTorch) and simulator. Paper: https://arxiv.org/abs/2006.09616
Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
Open deep learning compiler stack for cpu, gpu and specialized accelerators
ODK Collect is an Android app for filling out forms. It's been used to collect billions of data points in challenging environments around the world. Contribute and make the world a better place! ✨📋✨
Build cross-platform desktop apps with JavaScript, HTML, and CSS
A toolkit for developing and comparing reinforcement learning algorithms.