issues
search
gatheluck
/
PaperReading
Notes about papers (in Japanese)
0
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[2024] Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning
#685
gatheluck
opened
6 months ago
0
[2024] The pitfalls of next-token prediction
#684
gatheluck
opened
6 months ago
0
[2024] Ask Your Distribution Shift if Pre-Training is Right for You
#683
gatheluck
opened
7 months ago
0
[2024] Deep Networks Always Grok and Here is Why
#682
gatheluck
opened
7 months ago
0
[2024] MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
#681
gatheluck
opened
7 months ago
0
[2024] Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots
#680
gatheluck
opened
7 months ago
0
[2024] Multilinear Operator Networks
#679
gatheluck
opened
7 months ago
0
[2024] Neural Networks Learn Statistics of Increasing Complexity
#678
gatheluck
opened
7 months ago
0
Sora
#677
gatheluck
opened
7 months ago
0
[2023] Data-Efficient Contrastive Self-supervised Learning: Most Beneficial Examples for Supervised Learning Contribute the Least
#676
gatheluck
opened
7 months ago
0
[2024] Universal Neural Functionals
#675
gatheluck
opened
7 months ago
0
[2024] Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video
#674
gatheluck
opened
8 months ago
0
[2024] Knowledge Fusion of Large Language Models
#673
gatheluck
opened
8 months ago
0
[2024] Deconstructing Denoising Diffusion Models for Self-Supervised Learning
#672
gatheluck
opened
8 months ago
0
[2024] DsDm: Model-Aware Dataset Selection with Datamodels
#671
gatheluck
opened
8 months ago
0
[2021] Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines
#670
gatheluck
opened
8 months ago
0
[2023] Intriguing Properties of Generative Classifiers
#669
gatheluck
opened
8 months ago
0
[2023] Why Shallow Networks Struggle with Approximating and Learning High Frequency: A Numerical Study
#668
gatheluck
opened
8 months ago
0
[2023] Revisiting Adversarial Training at Scale
#667
gatheluck
opened
8 months ago
0
[2023] Denoising Vision Transformers
#666
gatheluck
opened
8 months ago
0
[2023] Why Do We Need Weight Decay in Modern Deep Learning?
#665
gatheluck
opened
8 months ago
0
[2023] Towards Predicting Equilibrium Distributions for Molecular Systems with Deep Learning
#664
gatheluck
opened
9 months ago
0
[2023] Hyena Hierarchy: Towards Larger Convolutional Language Models
#663
gatheluck
opened
9 months ago
0
[2023] Perspectives on the State and Future of Deep Learning - 2023
#662
gatheluck
opened
9 months ago
0
[2023] Implicit Identity Driven Deepfake Face Swapping Detection
#661
gatheluck
opened
9 months ago
0
[2023] The Journey, Not the Destination: How Data Guides Diffusion Models
#660
gatheluck
opened
9 months ago
0
[2023] Adversarial Attacks on GPT-4 via Simple Random Search
#659
gatheluck
opened
9 months ago
0
[2023] On the explainable properties of 1-Lipschitz Neural Networks: An Optimal Transport Perspective
#658
gatheluck
opened
9 months ago
0
[2023] FaceStudio: Put Your Face Everywhere in Seconds
#657
gatheluck
opened
9 months ago
0
[2023] Enabling Calibration In The Zero-Shot Inference of Large Vision-Language Models
#656
gatheluck
opened
10 months ago
0
[2023] Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference
#655
gatheluck
opened
10 months ago
0
[2023] TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement
#654
gatheluck
opened
10 months ago
0
[2023] GANDiffFace: Controllable Generation of Synthetic Datasets for Face Recognition with Realistic Variations
#653
gatheluck
opened
10 months ago
0
[2023] Distilled Feature Fields Enable Few-Shot Language-Guided Manipulation
#652
gatheluck
opened
11 months ago
0
[2023] ConvNets Match Vision Transformers at Scale
#651
gatheluck
opened
11 months ago
0
[2023] Is Conditional Generative Modeling all you need for Decision-Making?
#650
gatheluck
opened
11 months ago
0
[2023] LeanDojo: Theorem Proving with Retrieval-Augmented Language Models
#649
gatheluck
opened
11 months ago
0
[2023] FreeU: Free Lunch in Diffusion U-Net
#648
gatheluck
opened
11 months ago
0
[2023] Rosetta Neurons: Mining the Common Units in a Model Zoo
#647
gatheluck
opened
11 months ago
0
[2023] SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation
#646
gatheluck
opened
1 year ago
0
[2023] Additive Decoders for Latent Variables Identification and Cartesian-Product Extrapolation
#645
gatheluck
opened
1 year ago
0
[2023] Vision Transformers Need Registers
#644
gatheluck
opened
1 year ago
0
[2023] Decaf: Monocular Deformation Capture for Face and Hand Interactions
#643
gatheluck
opened
1 year ago
0
[2023] DualToken-ViT: Position-aware Efficient Vision Transformer with Dual Token Fusion
#642
gatheluck
opened
1 year ago
0
[2023] RMT: Retentive Networks Meet Vision Transformers
#641
gatheluck
opened
1 year ago
0
[2023] Chain-of-Verification Reduces Hallucination in Large Language Models
#640
gatheluck
opened
1 year ago
0
[2023] FlowCam: Training Generalizable 3D Radiance Fields without Camera Poses via Pixel-Aligned Scene Flow
#639
gatheluck
opened
1 year ago
0
[2023] Transferable Adversarial Robustness for Categorical Data via Universal Robust Embeddings
#638
gatheluck
opened
1 year ago
0
[2019] Deep Double Descent: Where Bigger Models and More Data Hurt
#637
gatheluck
opened
1 year ago
0
[2023] DreamStyler: Paint by Style Inversion with Text-to-Image Diffusion Models
#636
gatheluck
opened
1 year ago
0
Previous
Next