amrzv / awesome-colab-notebooks

Collection of google colaboratory notebooks for fast and easy experiments
MIT License
1.35k stars 254 forks source link
cnn colab-notebooks deep-learning deep-neural-networks generative-adversarial-network google-colab google-colab-notebook google-colab-notebooks google-colab-tutorial google-colaboratory google-colabs jupyter-notebooks machine-learning pytorch tensorflow tensorflow-tutorials

Hits awesome-colab-notebooks

The page might not be rendered properly. Please open README.md file directly

Awesome colab notebooks collection for ML experiments

Trending

repositories papers
  • facebookresearch/co-tracker
  • iterative/datachain
  • callummcdougall/ARENA_3.0
  • ToTheBeginning/PuLID
  • ZhengPeng7/BiRefNet
  • ultralytics/ultralytics
  • unslothai/unsloth
  • facebookresearch/segment-anything-2
  • lllyasviel/IC-Light
  • gemelo-ai/vocos
  • comfyanonymous/ComfyUI
  • TransformerLensOrg/TransformerLens
  • HongwenZhang/PyMAF-X
  • roboflow/supervision
  • KwaiVGI/LivePortrait
  • piddnad/DDColor
  • TencentARC/InstantMesh
  • LAION-AI/aesthetic-predictor
  • Doubiiu/DynamiCrafter
  • facebookresearch/home-robot
  • KillianLucas/open-interpreter
  • jxnl/instructor
  • LIDA
  • Gaussian Splatting
  • Tune-A-Video
  • FollowYourPose
  • Text2Video-Zero
  • GLIP
  • UniFormerV2
  • SadTalker
  • OWL-ViT
  • VideoReTalking
  • LDM
  • Dream Fields
  • Detic
  • GraphCast
  • DragGAN
  • VRT
  • Thin-Plate Spline Motion Model
  • PyMAF-X
  • FateZero
  • py-irt
  • VQ-Diffusion
  • ECON

Research

name description authors links colaboratory update
CoTracker Architecture that jointly tracks multiple points throughout an entire video Open In Colab 16.10.2024
PIFu Pixel-Aligned Implicit Function for High-Resolution Clothed Human Digitization
  • arxiv
  • yt
Open In Colab 08.10.2024
DifFace Method that is capable of coping with unseen and complex degradations more gracefully without complicated loss designs
  • arxiv
  • git, git, git, git
  • hf
Open In Colab 05.10.2024
Segment Anything 2 Foundation model towards solving promptable visual segmentation in images and videos Open In Colab 01.10.2024
Open-Unmix A deep neural network reference implementation for music source separation, applicable for researchers, audio engineers and artists Open In Colab 25.09.2024
Deep Painterly Harmonization Algorithm produces significantly better results than photo compositing or global stylization techniques and that it enables creative painterly edits that would be otherwise difficult to achieve
  • arxiv, arxiv
  • git, git, git
Open In Colab 23.09.2024
audio2photoreal Framework for generating full-bodied photorealistic avatars that gesture according to the conversational dynamics of a dyadic interaction Open In Colab 13.09.2024
Fast Segment Anything CNN Segment Anything Model trained using only 2% of the SA-1B dataset published by SAM authors
  • arxiv, arxiv
  • git
  • medium
  • yt, yt, yt
Open In Colab 10.09.2024
Neuralangelo Framework for high-fidelity 3D surface reconstruction from RGB video captures Open In Colab 02.09.2024
BiRefNet Bilateral reference framework for high-resolution dichotomous image segmentation Open In Colab 23.08.2024
SPIN Learning to Reconstruct 3D Human Pose and Shape via Model-fitting in the Loop Open In Colab 21.08.2024
YOLOv10 Aim to further advance the performance-efficiency boundary of YOLOs from both the post-processing and model architecture Open In Colab 20.08.2024
SpecVQGAN Taming the visually guided sound generation by shrinking a training dataset to a set of representative vectors Open In Colab 12.07.2024
LivePortrait Video-driven portrait animation framework with a focus on better generalization, controllability, and efficiency for practical usage Open In Colab 10.07.2024
TAPIR Tracking Any Point with per-frame Initialization and temporal Refinement Open In Colab 05.07.2024
Wav2Lip A Lip Sync Expert Is All You Need for Speech to Lip Generation In the Wild Open In Colab 27.06.2024
DeepLabCut Efficient method for markerless pose estimation based on transfer learning with deep neural networks that achieves excellent results with minimal training data Open In Colab 05.06.2024
PoolFormer MetaFormer Is Actually What You Need for Vision
  • arxiv
  • git, git, git
  • hf
Open In Colab 01.06.2024
StoryDiffusion Way of self-attention calculation, termed Consistent Self-Attention, that significantly boosts the consistency between the generated images and augments prevalent pretrained diffusion-based text-to-image models in a zero-shot manner Open In Colab 04.05.2024
PuLID Pure and Lightning ID customization, a tuning-free ID customization method for text-to-image generation
  • arxiv
  • git, git, git
  • reddit
Open In Colab 03.05.2024
FILM A frame interpolation algorithm that synthesizes multiple intermediate frames from two input images with large in-between motion Open In Colab 03.05.2024
VoiceCraft token infilling neural codec language model, that achieves state-of-the-art performance on both speech editing and zero-shot text-to-speech on audiobooks, internet videos, and podcasts Open In Colab 21.04.2024
ZeST Method for zero-shot material transfer to an object in the input image given a material exemplar image Open In Colab 16.04.2024
InstantMesh Feed-forward framework for instant 3D mesh generation from a single image, featuring state-of-the-art generation quality and significant training scalability
  • arxiv
  • git, git, git
  • hf
  • reddit
  • yt
Open In Colab 16.04.2024
AlphaFold Highly accurate protein structure prediction Open In Colab 15.04.2024
Würstchen Architecture for text-to-image synthesis that combines competitive performance with unprecedented cost-effectiveness for large-scale text-to-image diffusion models
  • arxiv
  • hf
  • reddit
  • yt
Open In Colab 06.04.2024
AQLM Extreme Compression of Large Language Models via Additive Quantization
  • arxiv
  • hf, hf, hf
  • reddit
  • yt, yt
Open In Colab 08.03.2024
YOLOv9 Learning What You Want to Learn Using Programmable Gradient Information Open In Colab 05.03.2024
Multi-LoRA Composition LoRA Switch and LoRA Composite, approaches that aim to surpass traditional techniques in terms of accuracy and image quality, especially in complex compositions Open In Colab 03.03.2024
AMARETTO Multiscale and multimodal inference of regulatory networks to identify cell circuits and their drivers shared and distinct within and across biological systems of human disease Open In Colab 28.02.2024
LIDA Tool for generating grammar-agnostic visualizations and infographics Victor Dibia Open In Colab 06.02.2024
ViT Vision Transformer and MLP-Mixer Architectures Open In Colab 06.02.2024
3D Ken Burns A reference implementation of 3D Ken Burns Effect from a Single Image using PyTorch - given a single input image, it animates this still image with a virtual camera scan and zoom subject to motion parallax Manuel Romero
  • arxiv
  • yt
Open In Colab 24.01.2024
VALL-E X Cross-lingual neural codec language model for cross-lingual speech synthesis Open In Colab 19.01.2024
PhotoMaker Efficient personalized text-to-image generation method, which mainly encodes an arbitrary number of input ID images into a stack ID embedding for preserving ID information Open In Colab 18.01.2024
DDColor End-to-end method with dual decoders for image colorization
  • arxiv
  • git, git
Open In Colab 15.01.2024
PASD Pixel-aware stable diffusion network to achieve robust Real-ISR as well as personalized stylization
  • arxiv
  • git
  • hf, hf
  • reddit
Open In Colab 12.01.2024
HandRefiner Refining Malformed Hands in Generated Images by Diffusion-based Conditional Inpainting
  • arxiv
  • git, git, git
  • reddit
  • yt
Open In Colab 08.01.2024
GraphCast Learning skillful medium-range global weather forecasting
  • arxiv
  • data
  • deepmind
  • git, git, git, git, git
  • medium
  • yt, yt, yt, yt, yt
Open In Colab 04.01.2024
ESM Evolutionary Scale Modeling: Pretrained language models for proteins Open In Colab 28.12.2023
LLaVA Large Language and Vision Assistant, an end-to-end trained large multimodal model that connects a vision encoder and LLM for general-purpose visual and language understanding Open In Colab 22.12.2023
Background Matting V2 Real-time, high-resolution background replacement technique which operates at 30fps in 4K resolution, and 60fps for HD on a modern GPU Open In Colab 22.12.2023
Gaussian Splatting State-of-the-art visual quality while maintaining competitive training times and importantly allow high-quality real-time (≥ 100 fps) novel-view synthesis at 1080p resolution Open In Colab 19.12.2023
SMPLer-X Scaling up EHPS towards the first generalist foundation model, with up to ViT-Huge as the backbone and training with up to 4.5M instances from diverse data sources Open In Colab 18.12.2023
DeepCache Training-free paradigm that accelerates diffusion models from the perspective of model architecture Open In Colab 18.12.2023
MagicAnimate Diffusion-based framework that aims at enhancing temporal consistency, preserving reference image faithfully, and improving animation fidelity Open In Colab 18.12.2023
DiffBIR Towards Blind Image Restoration with Generative Diffusion Prior Open In Colab 18.12.2023
AudioLDM Text-to-audio system that is built on a latent space to learn the continuous audio representations from contrastive language-audio pretraining latents Open In Colab 02.12.2023
TabPFN Neural network that learned to do tabular data prediction Open In Colab 29.11.2023
Concept Sliders Plug-and-play low rank adaptors applied on top of pretrained models Open In Colab 26.11.2023
Qwen-VL Set of large-scale vision-language models designed to perceive and understand both text and images Open In Colab 24.11.2023
AnimeGANv3 Double-tail generative adversarial network for fast photo animation Open In Colab 23.11.2023
Ithaca First Deep Neural Network for the textual restoration, geographical and chronological attribution of ancient Greek inscriptions Open In Colab 21.11.2023
PixArt-Σ Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation Open In Colab 07.11.2023
Zero123++ Image-conditioned diffusion model for generating 3D-consistent multi-view images from a single input view
  • arxiv
  • git, git
  • hf, hf
  • medium
  • reddit
  • yt
Open In Colab 26.10.2023
UniFormerV2 Unified Transformer for Efficient Spatiotemporal Representation Learning
  • arxiv
  • git, git, git, git
  • hf
  • pwc, pwc, pwc, pwc, pwc
Open In Colab 20.10.2023
Show-1 Hybrid model, dubbed as Show-1, which marries pixel-based and latent-based VDMs for text-to-video generation Open In Colab 15.10.2023
AudioSep Foundation model for open-domain audio source separation with natural language queries Open In Colab 12.10.2023
DA-CLIP Degradation-aware vision-language model to better transfer pretrained vision-language models to low-level vision tasks as a universal framework for image restoration Open In Colab 11.10.2023
SadTalker Generates 3D motion coefficients of the 3DMM from audio and implicitly modulates a novel 3D-aware face render for talking head generation Open In Colab 10.10.2023
Musika Music generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU Open In Colab 09.10.2023
YOLOv6 Single-stage object detection framework dedicated to industrial applications Open In Colab 08.10.2023
DreamGaussian Algorithm to convert 3D Gaussians into textured meshes and apply a fine-tuning stage to refine the details Open In Colab 04.10.2023
ICON Given a set of images, method estimates a detailed 3D surface from each image and then combines these into an animatable avatar Open In Colab 31.08.2023
DINOv2 Produce high-performance visual features that can be directly employed with classifiers as simple as linear layers on a variety of computer vision tasks; these visual features are robust and perform well across domains without any requirement for fine-tuning Open In Colab 31.08.2023
OWL-ViT Simple Open-Vocabulary Object Detection with Vision Transformers
  • arxiv
  • hf
Open In Colab 21.08.2023
StyleGAN3 Alias-Free Generative Adversarial Networks Open In Colab 13.08.2023
FateZero Zero-shot text-based editing method on real-world videos without per-prompt training or use-specific mask Open In Colab 13.08.2023
Big GAN Large Scale GAN Training for High Fidelity Natural Image Synthesis
  • arxiv
Open In Colab 03.08.2023
LaMa Resolution-robust Large Mask Inpainting with Fourier Convolutions Open In Colab 02.08.2023
MakeItTalk A method that generates expressive talking-head videos from a single facial image with audio as the only input Open In Colab 27.07.2023
HiDT A generative image-to-image model and a new upsampling scheme that allows to apply image translation at high resolution Open In Colab 24.07.2023
CutLER Simple approach for training unsupervised object detection and segmentation models Open In Colab 24.07.2023
Recognize Anything & Tag2Text Vision language pre-training framework, which introduces image tagging into vision-language models to guide the learning of visual-linguistic features Open In Colab 09.07.2023
Thin-Plate Spline Motion Model End-to-end unsupervised motion transfer framework Open In Colab 07.07.2023
DragGAN Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold Open In Colab 03.07.2023
MobileSAM Towards Lightweight SAM for Mobile Applications
  • arxiv
  • git, git, git, git, git, git, git, git
  • twitter
  • yt
Open In Colab 30.06.2023
Grounding DINO Marrying DINO with Grounded Pre-Training for Open-Set Object Detection
  • arxiv
  • git, git, git, git, git, git, git
  • pwc, pwc, pwc, pwc
  • yt, yt, yt, yt
Open In Colab 28.06.2023
T5X Modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models at many scales
  • arxiv, arxiv
  • docs
  • git, git
  • tf, tf, tf
Open In Colab 27.06.2023
CodeTalker Cast speech-driven facial animation as a code query task in a finite proxy space of the learned codebook, which effectively promotes the vividness of the generated motions by reducing the cross-modal mapping uncertainty
  • [](), [](), [](), [](), [](), [](), [](), [](), [](), []()
  • arxiv, arxiv
  • git, git, git, git, git, git
  • project
  • yt
Open In Colab 16.06.2023
First Order Motion Model for Image Animation Transferring facial movements from video to image Aliaksandr Siarohin Open In Colab 04.06.2023
Parallel WaveGAN State-of-the-art non-autoregressive models to build your own great vocoder Tomoki Hayashi Open In Colab 01.06.2023
ECON designed for "Human digitization from a color image", which combines the best properties of implicit and explicit representations, to infer high-fidelity 3D clothed humans from in-the-wild images, even with loose clothing or in challenging poses
  • arxiv
  • discord
  • docker
  • git, git, git, git, git, git, git
  • reddit
  • twitter
  • yt, yt, yt, yt
Open In Colab 31.05.2023
MMS The Massively Multilingual Speech project expands speech technology from about 100 languages to over 1000 by building a single multilingual speech recognition model supporting over 1100 languages, language identification models able to identify over 4000 languages, pretrained models supporting over 1400 languages, and text-to-speech models for over 1100 languages
  • arxiv
  • hf, hf, hf
  • meta
  • yt, yt
Open In Colab 26.05.2023
FAB Flow AIS Bootstrap uses AIS to generate samples in regions where the flow is a poor approximation of the target, facilitating the discovery of new modes
  • arxiv
  • git, git
  • yt
Open In Colab 29.04.2023
CodeFormer Transformer-based prediction network to model global composition and context of the low-quality faces for code prediction, enabling the discovery of natural faces that closely approximate the target faces even when the inputs are severely degraded Open In Colab 21.04.2023
Text2Video-Zero Text-to-Image Diffusion Models are Zero-Shot Video Generators Open In Colab 11.04.2023
Segment Anything The Segment Anything Model produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image Open In Colab 10.04.2023
FollowYourPose Two-stage training scheme that can utilize image pose pair and pose-free video datasets and the pre-trained text-to-image model to obtain the pose-controllable character videos Open In Colab 07.04.2023
EVA3D High-quality unconditional 3D human generative model that only requires 2D image collections for training Open In Colab 06.04.2023
Stable Dreamfusion Using a pretrained 2D text-to-image diffusion model to perform text-to-3D synthesis Open In Colab 04.04.2023
PIFuHD Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization
  • arxiv
  • yt, yt
Open In Colab 26.03.2023
VideoReTalking System to edit the faces of a real-world talking head video according to input audio, producing a high-quality and lip-syncing output video even with a different emotion Open In Colab 19.03.2023
Visual ChatGPT Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting
  • arxiv
  • git, git, git, git
  • yt, yt
Open In Colab 15.03.2023
Tune-A-Video One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation Open In Colab 23.02.2023
GPEN GAN Prior Embedded Network for Blind Face Restoration in the Wild Open In Colab 15.02.2023
PyMAF-X Кegression-based approach to recovering parametric full-body models from monocular images Open In Colab 14.02.2023
Disco Diffusion A frankensteinian amalgamation of notebooks, models and techniques for the generation of AI Art and Animations
  • git
  • yt, yt, yt
Open In Colab 11.02.2023
GrooVAE Some applications of machine learning for generating and manipulating beats and drum performances Open In Colab 02.02.2023
Multitrack MusicVAE The models in this notebook are capable of encoding and decoding single measures of up to 8 tracks, optionally conditioned on an underlying chord Open In Colab 02.02.2023
MusicVAE A Hierarchical Latent Vector Model for Learning Long-Term Structure in Music Open In Colab 02.02.2023
Learning to Paint Learning to Paint With Model-based Deep Reinforcement Learning Manuel Romero
  • arxiv
  • reddit
  • yt
Open In Colab 01.02.2023
Instant-NGP Instant Neural Graphics Primitives with a Multiresolution Hash Encoding Open In Colab 18.01.2023
Fourier Feature Networks Fourier Features Let Networks Learn High Frequency Functions in Low Dimensional Domains Open In Colab 17.01.2023
AlphaPose Whole-Body Regional Multi-Person Pose Estimation and Tracking in Real-Time Open In Colab 07.01.2023
HybrIK Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation Open In Colab 01.01.2023
Score Jacobian Chaining Apply chain rule on the learned gradients, and back-propagate the score of a diffusion model through the Jacobian of a differentiable renderer, which we instantiate to be a voxel radiance field Open In Colab 05.12.2022
Demucs Hybrid Spectrogram and Waveform Source Separation Alexandre Défossez
  • arxiv, arxiv, arxiv, arxiv
  • git, git, git, git
Open In Colab 21.11.2022
StyleCLIP Text-Driven Manipulation of StyleGAN Imager
  • arxiv, arxiv
  • git
  • yt, yt, yt, yt
Open In Colab 30.10.2022
MotionDiffuse The first diffusion model-based text-driven motion generation framework, which demonstrates several desired properties over existing methods Open In Colab 13.10.2022
VToonify Leverages the mid- and high-resolution layers of StyleGAN to render high-quality artistic portraits based on the multi-scale content features extracted by an encoder to better preserve the frame details Open In Colab 07.10.2022
PyMAF Pyramidal Mesh Alignment Feedback loop in regression network for well-aligned body mesh recovery and extend it for the recovery of expressive full-body models Open In Colab 06.10.2022
AlphaTensor Discovering faster matrix multiplication algorithms with reinforcement learning
  • deepmind
  • yt, yt, yt, yt
Open In Colab 04.10.2022
Swin2SR Novel Swin Transformer V2, to improve SwinIR for image super-resolution, and in particular, the compressed input scenario
  • arxiv, arxiv, arxiv, arxiv
  • git, git, git
  • hf
  • kaggle, kaggle, kaggle
Open In Colab 03.10.2022
Functa From data to functa: Your data point is a function and you can treat it like one
  • arxiv
  • git, git
  • tf
Open In Colab 24.09.2022
Whisper Automatic speech recognition system trained on 680,000 hours of multilingual and multitask supervised data collected from the web Open In Colab 21.09.2022
DeOldify (video) Colorize your own videos! Jason Antic Open In Colab 19.09.2022
DeOldify (photo) Colorize your own photos! Open In Colab 19.09.2022
Real-ESRGAN Extend the powerful ESRGAN to a practical restoration application, which is trained with pure synthetic data
  • arxiv
  • git, git, git, git, git
Open In Colab 18.09.2022
IDE-3D Interactive Disentangled Editing for High-Resolution 3D-aware Portrait Synthesis
  • git, git, git, git
  • yt
Open In Colab 08.09.2022
Decision Transformers An architecture that casts the problem of RL as conditional sequence modeling Open In Colab 06.09.2022
textual-inversion An Image is Worth One Word: Personalizing Text-to-Image Generation using Textual Inversion Open In Colab 21.08.2022
StyleGAN-Human A Data-Centric Odyssey of Human Generation Open In Colab 19.08.2022
Make-A-Scene Scene-Based Text-to-Image Generation with Human Priors
  • arxiv
  • yt
Open In Colab 12.08.2022
StyleGAN-NADA Zero-Shot non-adversarial domain adaptation of pre-trained generators Open In Colab 09.08.2022
YOLOv7 Trainable bag-of-freebies sets new state-of-the-art for real-time object detectors Open In Colab 09.08.2022
GLIP Grounded language-image pre-training model for learning object-level, language-aware, and semantic-rich visual representations Open In Colab 30.07.2022
Anycost GAN Interactive natural image editing Open In Colab 20.07.2022
GFPGAN Towards Real-World Blind Face Restoration with Generative Facial Prior Open In Colab 13.07.2022
EPro-PnP Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation Open In Colab 12.07.2022
Text2Human Text-driven controllable framework for a high-quality and diverse human generation Open In Colab 04.07.2022
VQ-Diffusion Based on a VQ-VAE whose latent space is modeled by a conditional variant of the recently developed Denoising Diffusion Probabilistic Model
  • arxiv, arxiv
  • git, git
Open In Colab 30.06.2022
OPT Open Pre-trained Transformers is a family of NLP models trained on billions of tokens of text obtained from the internet Open In Colab 29.06.2022
Customizing a Transformer Encoder We will learn how to customize the encoder to employ new network architectures Chen Chen
  • arxiv
  • git
Open In Colab 22.06.2022
MTTR End-to-End Referring Video Object Segmentation with Multimodal Transformers
  • arxiv, arxiv, arxiv
  • git
  • hf
  • yt
Open In Colab 20.06.2022
SwinIR Image Restoration Using Swin Transformer
  • arxiv, arxiv
  • git, git, git
Open In Colab 17.06.2022
VRT A Video Restoration Transformer
  • arxiv
  • git, git, git
Open In Colab 15.06.2022
Omnivore A single model which excels at classifying images, videos, and single-view 3D data using exactly the same model parameters Open In Colab 14.06.2022
Dream Fields Zero-Shot Text-Guided Object Generation Open In Colab 10.06.2022
Detic Detecting Twenty-thousand Classes using Image-level Supervision
  • arxiv
  • git
Open In Colab 07.06.2022
T0 Multitask Prompted Training Enables Zero-Shot Task Generalization
  • arxiv
  • yt, yt
Open In Colab 29.05.2022
AvatarCLIP A zero-shot text-driven framework for 3D avatar generation and animation Open In Colab 15.05.2022
Text2Mesh Text-Driven Neural Stylization for Meshes Open In Colab 14.05.2022
T5 Text-To-Text Transfer Transformer
  • arxiv
  • git
  • tf
Open In Colab 11.05.2022
XLS-R Self-supervised Cross-lingual Speech Representation Learning at Scale Open In Colab 10.05.2022
DiffCSE Unsupervised contrastive learning framework for learning sentence embeddings
  • arxiv, arxiv, arxiv
  • git
  • hf
  • twitter
Open In Colab 24.04.2022
ViDT+ An Extendable, Efficient and Effective Transformer-based Object Detector
  • arxiv, arxiv
  • git, git
Open In Colab 20.04.2022
BasicVSR++ Redesign BasicVSR by proposing second-order grid propagation and flow-guided deformable alignment Open In Colab 18.04.2022
NAFNet Nonlinear Activation Free Network for Image Restoration
  • arxiv, arxiv
  • pwc, pwc
Open In Colab 15.04.2022
Panini-Net GAN Prior based Degradation-Aware Feature Interpolation for Face Restoration
  • arxiv
  • git, git
Open In Colab 13.04.2022
E2FGVI An End-to-End framework for Flow-Guided Video Inpainting through elaborately designed three trainable modules, namely, flow completion, feature propagation, and content hallucination modules Open In Colab 06.04.2022
LDM High-Resolution Image Synthesis with Latent Diffusion Models
  • arxiv, arxiv, arxiv
  • git, git, git, git
  • hf
Open In Colab 04.04.2022
GP-UNIT Novel framework, Generative Prior-guided UNsupervised Image-to-image Translation, to improve the overall quality and applicability of the translation algorithm Open In Colab 02.04.2022
DualStyleGAN More challenging exemplar-based high-resolution portrait style transfer by introducing a novel DualStyleGAN with flexible control of dual styles of the original face domain and the extended artistic portrait domain Open In Colab 24.03.2022
CLIPasso Semantically-Aware Object Sketching Open In Colab 21.03.2022
StyleSDF A high resolution, 3D-consistent image and shape generation technique Open In Colab 05.03.2022
Disentangled Lifespan Face Synthesis LFS model is proposed to disentangle the key face characteristics including shape, texture and identity so that the unique shape and texture age transformations can be modeled effectively Open In Colab 22.02.2022
ClipCap CLIP Prefix for Image Captioning Open In Colab 15.02.2022
ROMP Monocular, One-stage, Regression of Multiple 3D People
  • arxiv, arxiv, arxiv
  • git, git, git
  • yt, yt, yt
Open In Colab 11.02.2022
Mask2Former Masked-attention Mask Transformer for Universal Image Segmentation Open In Colab 09.02.2022
JoJoGAN One Shot Face Stylization
  • arxiv
  • git, git
Open In Colab 02.02.2022
Pose with Style Detail-Preserving Pose-Guided Image Synthesis with Conditional StyleGAN Open In Colab 19.01.2022
ConvNeXt A pure ConvNet model constructed entirely from standard ConvNet modules
  • arxiv
  • git, git, git
  • hf
  • yt, yt, yt
Open In Colab 19.01.2022
diffsort Differentiable Sorting Networks
  • arxiv, arxiv
  • yt
Open In Colab 17.01.2022
Taming Transformers for High-Resolution Image Synthesis We combine the efficiancy of convolutional approaches with the expressivity of transformers by introducing a convolutional VQGAN, which learns a codebook of context-rich visual parts, whose composition is modeled with an autoregressive transformer Open In Colab 13.01.2022
RealBasicVSR Investigating Tradeoffs in Real-World Video Super-Resolution
  • arxiv
  • hf
  • reddit
Open In Colab 25.12.2021
GLIDE Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
  • arxiv
  • yt
Open In Colab 22.12.2021
Nerfies First method capable of photorealistically reconstructing deformable scenes using photos/videos captured casually from mobile phones Open In Colab 06.12.2021
HyperStyle A hypernetwork that learns to modulate StyleGAN's weights to faithfully express a given image in editable regions of the latent space Open In Colab 03.12.2021
encoder4editing Designing an Encoder for StyleGAN Image Manipulation
  • arxiv
  • git
Open In Colab 02.12.2021
StyleCariGAN Caricature Generation via StyleGAN Feature Map Modulation Open In Colab 30.11.2021
CartoonGAN The implementation of the cartoon GAN model with PyTorch Tobias Sunderdiek Open In Colab 24.11.2021
SimSwap An efficient framework, called Simple Swap, aiming for generalized and high fidelity face swapping
  • arxiv
  • git
Open In Colab 24.11.2021
RVM Robust High-Resolution Video Matting with Temporal Guidance Open In Colab 24.11.2021
RVM Robust, real-time, high-resolution human video matting method that achieves new state-of-the-art performance Open In Colab 24.11.2021
AnimeGANv2 An improved version of AnimeGAN - it prevents the generation of high-frequency artifacts by simply changing the normalization of features in the network Open In Colab 17.11.2021
SOAT StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN
  • arxiv
  • git, git
  • hf
Open In Colab 13.11.2021
Arnheim Generative Art Using Neural Visual Grammars and Dual Encoders
  • arxiv, arxiv, arxiv, arxiv, arxiv
  • git
  • wiki
  • yt, yt, yt, yt
Open In Colab 11.11.2021
StyleGAN 2 Generation of faces, cars, etc. Mikael Christensen
  • arxiv
  • git
  • yt
Open In Colab 05.11.2021
ByteTrack Multi-Object Tracking by Associating Every Detection Box Open In Colab 30.10.2021
GPT-2 Retrain an advanced text generating neural network on any text dataset using gpt-2-simple! Max Woolf Open In Colab 18.10.2021
ConvMixer An extremely simple model that is similar in spirit to the ViT and the even-more-basic MLP-Mixer in that it operates directly on patches as input, separates the mixing of spatial and channel dimensions, and maintains equal size and resolution throughout the network
  • arxiv
  • git, git
  • medium
  • yt
Open In Colab 06.10.2021
IC-GAN Instance-Conditioned GAN Open In Colab 01.10.2021
Skillful Precipitation Nowcasting Using Deep Generative Models of Radar Open-sourced dataset and model snapshot for precipitation nowcasting Open In Colab 29.09.2021
Live Speech Portraits Real-Time Photorealistic Talking-Head Animation Open In Colab 26.09.2021
StylEx Training a GAN to explain a classifier in StyleSpace Open In Colab 25.08.2021
VITS Parallel end-to-end TTS method that generates more natural sounding audio than current two-stage models Open In Colab 23.08.2021
Bringing Old Photo Back to Life Restoring old photos that suffer from severe degradation through a deep learning approach Open In Colab 13.07.2021
PTI Pivotal Tuning Inversion enables employing off-the-shelf latent based semantic editing techniques on real images using StyleGAN
  • arxiv
  • git, git
Open In Colab 01.07.2021
TediGAN Framework for multi-modal image generation and manipulation with textual descriptions
  • arxiv, arxiv
  • git, git, git, git
  • yt
Open In Colab 30.06.2021
SCALE Modeling Clothed Humans with a Surface Codec of Articulated Local Elements Open In Colab 26.06.2021
CogView Mastering Text-to-Image Generation via Transformers Open In Colab 21.06.2021
GANs N' Roses Stable, Controllable, Diverse Image to Image Translation
  • arxiv, arxiv
  • git, git
  • yt
Open In Colab 19.06.2021
Rethinking Style Transfer: From Pixels to Parameterized Brushstrokes A method to stylize images by optimizing parameterized brushstrokes instead of pixels Open In Colab 02.06.2021
Pixel2Style2Pixel Encoding in Style: A StyleGAN Encoder for Image-to-Image Translation Open In Colab 01.06.2021
Fine-tuning a BERT We will work through fine-tuning a BERT model using the tensorflow-models PIP package
  • arxiv
  • tf
Open In Colab 25.05.2021
ReStyle A Residual-Based StyleGAN Encoder via Iterative Refinement Open In Colab 21.05.2021
Motion Representations for Articulated Animation Novel motion representations for animating articulated objects consisting of distinct parts Open In Colab 29.04.2021
SAM Age Transformation Using a Style-Based Regression Model Open In Colab 26.04.2021
Geometry-Free View Synthesis Is a geometric model required to synthesize novel views from a single image? Open In Colab 22.04.2021
NeRViS An algorithm for full-frame video stabilization by first estimating dense warp fields Open In Colab 11.04.2021
NeX View synthesis based on enhancements of multiplane image that can reproduce NeXt-level view-dependent effects in real time Open In Colab 25.03.2021
Score SDE Score-Based Generative Modeling through Stochastic Differential Equations
  • arxiv, arxiv, arxiv, arxiv
  • git, git
  • yt
Open In Colab 18.03.2021
Talking Head Anime from a Single Image The network takes as input an image of an anime character's face and a desired pose, and it outputs another image of the same character in the given pose Pramook Khungurn Open In Colab 23.02.2021
NFNet An adaptive gradient clipping technique, a significantly improved class of Normalizer-Free ResNets
  • arxiv, arxiv
  • git
  • yt, yt
Open In Colab 17.02.2021
RITM Simple feedforward model for click-based interactive segmentation that employs the segmentation masks from previous steps
  • arxiv
  • git
  • pwc, pwc
Open In Colab 13.02.2021
CLIP A neural network which efficiently learns visual concepts from natural language supervision Open In Colab 29.01.2021
Adversarial Patch A method to create universal, robust, targeted adversarial image patches in the real world Tom Brown
  • arxiv
Open In Colab 27.01.2021
MSG-Net Multi-style Generative Network with a novel Inspiration Layer, which retains the functionality of optimization-based approaches and has the fast speed of feed-forward networks Open In Colab 25.01.2021
f-BRS Feature backpropagating refinement scheme that solves an optimization problem with respect to auxiliary variables instead of the network inputs, and requires running forward and backward pass just for a small part of a network
  • arxiv
  • git
  • yt, yt
Open In Colab 25.01.2021
Neural Style Transfer Implementation of Neural Style Transfer in Keras 2.0+ Somshubra Majumdar
  • arxiv, arxiv, arxiv
Open In Colab 22.01.2021
SkyAR A vision-based method for video sky replacement and harmonization, which can automatically generate realistic and dramatic sky backgrounds in videos with controllable styles Zhengxia Zou Open In Colab 18.01.2021
MusicXML Documentation The goal of this notebook is to explore one of the magenta libraries for music Open In Colab 08.01.2021
SVG VAE A colab demo for the SVG VAE model Raphael Gontijo Lopes Open In Colab 08.01.2021
Neural Magic Eye Learning to See and Understand the Scene Behind an Autostereogram Open In Colab 01.01.2021
FGVC Method first extracts and completes motion edges, and then uses them to guide piecewise-smooth flow completion with sharp edges Open In Colab 30.12.2020
VIBE Video Inference for Body Pose and Shape Estimation, which makes use of an existing large-scale motion capture dataset together with unpaired, in-the-wild, 2D keypoint annotations
  • arxiv
  • git, git, git, git, git
  • pwc
  • yt, yt, yt, yt, yt, yt, yt, yt, yt
Open In Colab 23.12.2020
SeFa A closed-form approach for unsupervised latent semantic factorization in GANs Open In Colab 06.12.2020
Stylized Neural Painting An image-to-painting translation method that generates vivid and realistic painting artworks with controllable styles Open In Colab 01.12.2020
BiT Big Transfer: General Visual Representation Learning
  • arxiv, arxiv
  • hf
  • medium
  • yt, yt, yt
Open In Colab 12.11.2020
LaSAFT Latent Source Attentive Frequency Transformation for Conditioned Source Separation Woosung Choi Open In Colab 01.11.2020
Lifespan Age Transformation Synthesis Multi-domain image-to-image generative adversarial network architecture, whose learned latent space models a continuous bi-directional aging process Open In Colab 31.10.2020
HiGAN Semantic Hierarchy Emerges in Deep Generative Representations for Scene Synthesis Open In Colab 14.10.2020
InterFaceGAN Interpreting the Latent Space of GANs for Semantic Face Editing Open In Colab 13.10.2020
Instance-aware Image Colorization Novel deep learning framework to achieve instance-aware colorization Jheng-Wei Su Open In Colab 30.08.2020
MoCo Momentum Contrast for unsupervised visual representation learning
  • arxiv, arxiv, arxiv
  • git
  • yt, yt, yt
Open In Colab 20.08.2020
CAPE Learning to Dress 3D People in Generative Clothing Open In Colab 05.08.2020
Rewriting a Deep Generative Model We ask if a deep network can be reprogrammed to follow different rules, by enabling a user to directly change the weights, instead of training with a data set Open In Colab 01.08.2020
SIREN Implicit Neural Representations with Periodic Activation Functions Open In Colab 25.06.2020
3D Photo Inpainting Method for converting a single RGB-D input image into a 3D photo, i.e., a multi-layer representation for novel view synthesis that contains hallucinated color and depth structures in regions occluded in the original view Open In Colab 04.05.2020
Motion Supervised co-part Segmentation A self-supervised deep learning method for co-part segmentation
  • arxiv
  • git
  • yt
Open In Colab 07.04.2020
Onsets and Frames Onsets and Frames is an automatic music transcription framework with piano and drums models Open In Colab 02.04.2020
FBA Matting Low-cost modification to alpha matting networks to also predict the foreground and background colours
  • arxiv
  • git
  • hf
  • pwc
Open In Colab 19.03.2020
BERT score An automatic evaluation metric for text generation Tianyi Zhang
  • arxiv
Open In Colab 05.03.2020
Generating Piano Music with Transformer This Colab notebook lets you play with pretrained Transformer models for piano music generation, based on the Music Transformer Open In Colab 16.09.2019
HMR End-to-end framework for reconstructing a full 3D mesh of a human body from a single RGB image Open In Colab 15.03.2019
GANSynth This notebook is a demo GANSynth, which generates audio with Generative Adversarial Networks Jesse Engel Open In Colab 25.02.2019
Latent Constraints Conditional Generation from Unconditional Generative Models Open In Colab 27.11.2017
Performance RNN This notebook shows you how to generate new performed compositions from a trained model Open In Colab 11.07.2017
NSynth This colab notebook has everything you need to upload your own sounds and use NSynth models to reconstruct and interpolate between them Open In Colab 06.04.2017

Tutorials

name description authors links colaboratory update
Building Your Own Federated Learning Algorithm We discuss how to implement federated learning algorithms without deferring to the tff.learning API Zachary Charles Open In Colab 01.11.2024
Federated Learning for Image Classification We use the classic MNIST training example to introduce the Federated Learning API layer of TFF, tff.learning - a set of higher-level interfaces that can be used to perform common types of federated learning tasks, such as federated training, against user-supplied models implemented in TensorFlow Krzysztof Ostrowski Open In Colab 01.11.2024
Federated Learning for Text Generation We start with a RNN that generates ASCII characters, and refine it via federated learning Krzysztof Ostrowski Open In Colab 01.11.2024
Custom Federated Algorithms, Part 1: Introduction to the Federated Core This tutorial is the first part of a two-part series that demonstrates how to implement custom types of federated algorithms in TensorFlow Federated using the Federated Core - a set of lower-level interfaces that serve as a foundation upon which we have implemented the Federated Learning layer Krzysztof Ostrowski
  • arxiv
  • pwc
  • tf, tf
Open In Colab 01.11.2024
Custom Federated Algorithms, Part 2: Implementing Federated Averaging This tutorial is the second part of a two-part series that demonstrates how to implement custom types of federated algorithms in TFF using the Federated Core, which serves as a foundation for the Federated Learning layer Krzysztof Ostrowski
  • pwc
  • tf, tf
Open In Colab 01.11.2024
High-performance simulations with TFF This tutorial will describe how to setup high-performance simulations with TFF in a variety of common scenarios Krzysztof Ostrowski
  • pwc
Open In Colab 01.11.2024
Autodistill Uses big, slower foundation models to train small, faster supervised models autodistill
  • blog post
  • docs
  • git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git, git
  • yt, yt, yt
Open In Colab 01.11.2024
Kornia Library is composed by a subset of packages containing operators that can be inserted within neural networks to train models to perform image transformations, epipolar geometry, depth estimation, and low-level image processing such as filtering and edge detection that operate directly on tensors Open In Colab 31.10.2024
LightAutoML Allows you create machine learning models using just a few lines of code, or build your own custom pipeline using ready blocks Open In Colab 31.10.2024
Llama 3.1 First openly available model that rivals the top AI models when it comes to state-of-the-art capabilities in general knowledge, steerability, math, tool use, and multilingual translation unsloth Open In Colab 31.10.2024
Phi-3.5 3.8 billion parameter language model trained on 3.3 trillion tokens, whose overall performance, as measured by both academic benchmarks and internal testing, rivals that of models such as Mixtral 8x7B and GPT-3.5, despite being small enough to be deployed on a phone unsloth Open In Colab 31.10.2024
Mistral Small Enterprise-grade small model unsloth Open In Colab 31.10.2024
Gemma 2 New addition to the Gemma family of lightweight, state-of-the-art open models, ranging in scale from 2 billion to 27 billion parameters unsloth Open In Colab 31.10.2024
NotebookLlama Open Source version of NotebookLM Meta Open In Colab 29.10.2024
MuJoCo A general purpose physics engine that aims to facilitate research and development in robotics, biomechanics, graphics and animation, machine learning, and other areas which demand fast and accurate simulation of articulated structures interacting with their environment Open In Colab 28.10.2024
YOLOv8 State-of-the-art model that builds upon the success of previous YOLO versions and introduces new features and improvements to further boost performance and flexibility Glenn Jocher Open In Colab 25.10.2024
AutoGen Framework that enables development of LLM applications using multiple agents that can converse with each other to solve tasks microsoft Open In Colab 22.10.2024
XGBoost Optimized distributed gradient boosting library designed to be highly efficient, flexible and portable
  • docs
  • pypi
  • twitter
  • wiki, wiki
  • yt, yt, yt, yt, yt, yt, yt
Open In Colab 22.10.2024
ARENA Provide talented individuals with the skills, tools, and environment necessary for upskilling in ML engineering, for the purpose of contributing directly to AI alignment in technical roles Callum McDougall Open In Colab 21.10.2024
YOLOv5 You Only Look Once Glenn Jocher Open In Colab 19.10.2024
YOLOv3 You Only Look Once Glenn Jocher Open In Colab 19.10.2024
dm_control DeepMind Infrastructure for Physics-Based Simulation Open In Colab 17.10.2024
LangGraph Library for building stateful, multi-actor applications with LLMs, used to create agent and multi-agent workflows LangChain Open In Colab 10.10.2024
SAE Lens Training Sparse Autoencoders on Language Models
  • docs
  • pypi
  • slack
Open In Colab 07.10.2024
LM Evaluation Harness Framework for few-shot evaluation of language models. EleutherAI Open In Colab 04.10.2024
Multimodal Maestro Gives you more control over large multimodal models to get the outputs you want Roboflow Open In Colab 26.09.2024
TRL Set of tools to train transformer language models with Reinforcement Learning, from the Supervised Fine-tuning step, Reward Modeling step to the Proximal Policy Optimization step
  • arxiv
  • docs
  • git
  • yt, yt
Open In Colab 24.09.2024
The Autodiff Cookbook You'll go through a whole bunch of neat autodiff ideas that you can cherry pick for your own work, starting with the basics Open In Colab 20.09.2024
Supervision Reusable computer vision tools Roboflow Open In Colab 19.09.2024
PEFT Parameter-Efficient Fine-Tuning methods enable efficient adaptation of pre-trained language models to various downstream applications without fine-tuning all the model's parameters Open In Colab 13.09.2024
SAA+ Framework, Segment Any Anomaly +, for zero-shot anomaly segmentation with hybrid prompt regularization to improve the adaptability of modern foundation models
  • arxiv
  • git, git
  • hf
Open In Colab 13.09.2024
TensorRT SDK for high-performance deep learning inference, includes a deep learning inference optimizer and runtime that delivers low latency and high throughput for inference applications nvidia Open In Colab 12.09.2024
DataChain AI-dataframe to enrich, transform and analyze data from cloud storages for ML training and LLM apps Iterative
  • discord
  • docs
  • pypi
  • twitter
  • yt, yt
Open In Colab 09.09.2024
TFF for Federated Learning Research: Model and Update Compression We use the EMNIST dataset to demonstrate how to enable lossy compression algorithms to reduce communication cost in the Federated Averaging algorithm Weikang Song Open In Colab 05.09.2024
LlamaIndex Data framework for your LLM application Jerry Liu Open In Colab 05.09.2024
VC Client software for performing real-time voice conversion using various Voice Conversion AI w-okada
  • git
  • hf
  • yt, yt, yt, yt, yt, yt, yt, yt, yt
Open In Colab 02.09.2024
Deforum Stable Diffusion Open source project is designed to be free to use and easy to modify for custom needs and pipelines Open In Colab 30.08.2024
ComfyUI Powerful and modular stable diffusion GUI and backend comfyanonymous Open In Colab 30.08.2024
Machine Learning Simplified A Gentle Introduction to Supervised Learning Andrew Wolf Open In Colab 29.08.2024
Anomalib Deep learning library that aims to collect state-of-the-art anomaly detection algorithms for benchmarking on both public and private datasets Open In Colab 29.08.2024
Nerfstudio API that allows for a simplified end-to-end process of creating, training, and testing NeRFs Open In Colab 19.08.2024
mlcourse.ai Open Machine Learning Course Yury Kashnitsky Open In Colab 19.08.2024
PyTerrier A Python framework for performing information retrieval experiments
  • arxiv
  • docs
  • git, git, git, git, git, git, git
Open In Colab 16.08.2024
highway-env A collection of environments for autonomous driving and tactical decision-making tasks Edouard Leurent
  • arxiv, arxiv, arxiv
  • docs
  • git, git, git
Open In Colab 09.08.2024
GNN Production-tested library for building GNNs at large scale
  • arxiv
  • kaggle
  • medium
  • tf, tf
  • yt, yt, yt, yt, yt, yt
Open In Colab 09.08.2024
Pix2Pix This notebook demonstrates image to image translation using conditional GAN's Billy Lamberta Open In Colab 24.07.2024
Image classification This tutorial shows how to classify images of flowers Billy Lamberta
  • pwc
Open In Colab 24.07.2024
TransformerLens Library for doing mechanistic interpretability of GPT-2 Style language models
  • arxiv, arxiv
  • docs
  • git
  • medium
  • pypi
  • slack
  • yt, yt
Open In Colab 23.07.2024
Kor Half-baked prototype that "helps" you extract structured data from text using LLMs Eugene Yurtsev
  • discord
  • docs
Open In Colab 20.07.2024
PyTorch3D Library for deep learning with 3D data Open In Colab 11.07.2024
Stable Diffusion Videos Create videos with Stable Diffusion by exploring the latent space and morphing between text prompts Nathan Raw
  • git, git
Open In Colab 11.07.2024
Transfer learning and fine-tuning You will learn how to classify images of cats and dogs by using transfer learning from a pre-trained network François Chollet
  • pwc
  • wiki
Open In Colab 26.06.2024
MARS5 Speech model for insane prosody CAMB.AI Open In Colab 25.06.2024
Deep RL Course The Hugging Face Deep Reinforcement Learning Course Open In Colab 24.06.2024
ToonCrafter Can interpolate two cartoon images by leveraging the pre-trained image-to-video diffusion priors Open In Colab 20.06.2024
Brax A differentiable physics engine that simulates environments made up of rigid bodies, joints, and actuators
  • arxiv
  • neurips
Open In Colab 07.06.2024
DiffSynth Restructured architectures including Text Encoder, UNet, VAE, among others, maintaining compatibility with models from the open-source community while enhancing computational performance Artiprocher
  • arxiv
  • hf, hf
Open In Colab 06.06.2024
Transformer This tutorial trains a Transformer model to translate Portuguese to English Billy Lamberta Open In Colab 31.05.2024
NeMo A conversational AI toolkit built for researchers working on automatic speech recognition, natural language processing, and text-to-speech synthesis Open In Colab 25.05.2024
SentencePiece An unsupervised text tokenizer and detokenizer mainly for Neural Network-based text generation systems where the vocabulary size is predetermined prior to the neural model training
  • arxiv, arxiv, arxiv, arxiv, arxiv
  • git, git, git, git
  • medium
  • yt
Open In Colab 21.05.2024
Llama3 from scratch Llama3 from scratch, one tensor and matrix multiplication at a time Nishant Aklecha
  • git
  • twitter, twitter
  • yt
Open In Colab 19.05.2024
Hello, many worlds This tutorial shows how a classical neural network can learn to correct qubit calibration errors Michael Broughton
  • tf, tf, tf
  • wiki
  • yt
Open In Colab 17.05.2024
IC-Light Manipulate the illumination of images
  • arxiv, arxiv
  • yt, yt, yt
Open In Colab 09.05.2024
Neural style transfer This tutorial uses deep learning to compose one image in the style of another image Billy Lamberta
  • arxiv
Open In Colab 06.05.2024
TorchGeo PyTorch domain library that provides datasets, transforms, samplers, and pre-trained models specific to geospatial data Open In Colab 03.05.2024
Autoencoders This tutorial introduces autoencoders with three examples: the basics, image denoising, and anomaly detection Google Open In Colab 15.04.2024
MagicTime Metamorphic time-lapse video generation model, which learns real-world physics knowledge from time-lapse videos and implements metamorphic generation Open In Colab 14.04.2024
SAGE Methodology for generative spelling correction, which was tested on English and Russian languages and potentially can be extended to any language with minor changes
  • arxiv
  • git
  • hf, hf, hf, hf, hf
  • wiki
  • yt
Open In Colab 11.04.2024
Image segmentation This tutorial focuses on the task of image segmentation, using a modified U-Net Billy Lamberta Open In Colab 09.04.2024
Open-Sora Plan Simple and efficient design along with remarkable performance in text-to-video generation YUAN Lab at PKU
  • arxiv
  • discord
  • git, git, git
  • hf, hf
  • yt, yt
Open In Colab 07.04.2024
Gorilla Finetuned LLaMA-based model that surpasses the performance of GPT-4 on writing API calls Open In Colab 06.04.2024
Cleanlab Helps you clean data and labels by automatically detecting issues in a ML dataset Open In Colab 30.03.2024
AniPortrait Framework for generating high-quality animation driven by audio and a reference portrait image
  • arxiv
  • git, git, git, git, git
  • hf, hf, hf, hf, hf
  • reddit
  • yt, yt
Open In Colab 27.03.2024
OpenVINO Open-source toolkit for optimizing and deploying AI inference intel Open In Colab 25.03.2024
Gazelle Joint Speech Language Model Tincans Open In Colab 20.03.2024
Intel® Extension for Transformers Transformer-based Toolkit to Accelerate GenAI/LLM Everywhere intel
  • arxiv, arxiv, arxiv, arxiv, arxiv
  • discord
  • docs
  • git, git, git, git, git, git, git
  • hf, hf, hf
  • medium, medium, medium, medium, medium
  • yt, yt, yt, yt, yt
Open In Colab 19.03.2024
Datasets A Community Library for Natural Language Processing
  • arxiv
  • docs
  • hf
  • kaggle
  • yt
Open In Colab 18.03.2024
Evidently An open-source framework to evaluate, test and monitor ML models in production Open In Colab 15.03.2024
Instructor Library that makes it a breeze to work with structured outputs from large language models Jason Liu
  • discord
  • docs
  • twitter
  • yt, yt, yt
Open In Colab 13.03.2024
Feast An open source feature store for machine learning Open In Colab 28.02.2024
FiftyOne Open-source tool for building high-quality datasets and computer vision models Open In Colab 27.02.2024
MetaVoice 1.2B parameter base model trained on 100K hours of speech for TTS MetaVoice Open In Colab 26.02.2024
Generative AI for Beginners - A Course A 12 Lesson course teaching everything you need to know to start building Generative AI applications microsoft Open In Colab 22.02.2024
OmegaConf Hierarchical configuration system, with support for merging configurations from multiple sources providing a consistent API regardless of how the configuration was created Omry Yadan Open In Colab 15.02.2024
Optuna An automatic hyperparameter optimization software framework, particularly designed for machine learning Open In Colab 15.02.2024
Data augmentation This tutorial demonstrates data augmentation: a technique to increase the diversity of your training set by applying random transformations such as image rotation Billy Lamberta
  • pwc
  • tf
  • wiki
Open In Colab 14.02.2024
Stable Cascade Text to image model introduces an interesting three-stage approach, setting new benchmarks for quality, flexibility, fine-tuning, and efficiency with a focus on further eliminating hardware barriers Stability AI Open In Colab 14.02.2024
CleanVision Automatically detects potential issues in image datasets like images that are: blurry, under/over-exposed, (near) duplicates, etc cleanlab Open In Colab 13.02.2024
DynamiCrafter Animating Open-domain Images with Video Diffusion Priors Open In Colab 12.02.2024
XLA Accelerated Linear Algebra is an open-source machine learning compiler for GPUs, CPUs, and ML accelerators OpenXLA
  • medium, medium
  • pt
  • tf
  • wiki
  • yt, yt, yt, yt
Open In Colab 02.02.2024
Composer PyTorch library that enables you to train neural networks faster, at lower cost, and to higher accuracy The Mosaic ML Team Open In Colab 01.02.2024
CycleGAN This notebook demonstrates unpaired image to image translation using conditional GAN's Billy Lamberta
  • arxiv
  • tf
Open In Colab 17.01.2024
Integrated gradients This tutorial demonstrates how to implement Integrated Gradients, an Explainable AI technique Google Open In Colab 17.01.2024
MAGNeT Masked generative sequence modeling method that operates directly over several streams of audio tokens Open In Colab 16.01.2024
AutoFaiss Automatically create Faiss knn indices with the most optimal similarity search parameters Ctiteo
  • docs
  • git
  • medium
  • pypi
Open In Colab 12.01.2024
Retrieval based Voice Conversion WebUI An easy-to-use Voice Conversion framework based on VITS RVC-Project
  • discord
  • git, git, git, git, git, git
  • hf
  • medium
  • yt, yt, yt, yt, yt
Open In Colab 11.01.2024
Flax Neural network library and ecosystem for JAX designed for flexibility
  • docs
  • hf
  • medium
  • reddit
  • yt, yt, yt
Open In Colab 10.01.2024
Big Vision This codebase is designed for training large-scale vision models using Cloud TPU VMs or GPU machines
  • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
  • tf, tf
Open In Colab 03.01.2024
Open Interpreter An open-source, locally running implementation of OpenAI's Code Interpreter Killian Lucas Open In Colab 03.01.2024
Seamless Communication Family of AI models that enable more natural and authentic communication across languages Open In Colab 14.12.2023
colab2pdf Convert your Colab notebook to a PDF Drengskapur Open In Colab 11.12.2023
Sentence Transformers Multilingual Sentence, Paragraph, and Image Embeddings using BERT & Co
  • arxiv, arxiv, arxiv
  • docs
Open In Colab 07.12.2023
CleanRL Deep Reinforcement Learning library that provides high-quality single-file implementation with research-friendly features
  • arxiv, arxiv, arxiv, arxiv, arxiv, arxiv, arxiv
  • docs
  • git, git, git, git
  • hf
  • paper
  • yt, yt
Open In Colab 28.11.2023
Vocos Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis Hubert Siuzdak Open In Colab 21.11.2023
X—LLM Easy LLM Finetuning using the most advanced methods Boris Zubarev
  • arxiv
  • discord
  • git, git, git
  • hf, hf
  • pypi
Open In Colab 15.11.2023
Distil-Whisper Maintains the robustness of the Whisper model to difficult acoustic conditions, while being less prone to hallucination errors on long-form audio
  • arxiv, arxiv
  • git, git
  • hf, hf, hf, hf, hf, hf, hf, hf
  • medium
  • reddit
  • yt, yt, yt
Open In Colab 08.11.2023
AnimateDiff Practical framework to animate most of the existing personalized text-to-image models once and for all, saving efforts in model-specific tuning Open In Colab 30.10.2023
Intel® Neural Compressor Aims to provide popular model compression techniques such as quantization, pruning (sparsity), distillation, and neural architecture search on mainstream frameworks such as TensorFlow, PyTorch, ONNX Runtime, and MXNet, as well as Intel extensions such as Intel Extension for TensorFlow and Intel Extension for PyTorch intel
  • arxiv, arxiv, arxiv
  • discord
  • docs
  • [<img src="images/git.svg" alt="git" hei