No. | Keyword | arXiv# & Subjects |
---|---|---|
1 | Superalignment | 2312.09390 Weak-To-Strong Generalization - Eliciting Strong Capabilities With Weak Supervision |
2 | HUGS | 2311.17910 HUGS Human Gaussian Splats |
3 | LCM-LORA | 2311.05556 LCM-LoRA - A Universal Stable-Diffusion Acceleration Module |
4 | Table-GPT | 2310.09263 Table-tuned GPT for Diverse Table Tasks |
5 | DALLE 3 | 2310.07653 Mini DALL·E 3 Interactive Text to Image by Prompting Large Language Models |
6 | Mistral 7B | 2310.06825 Mistral 7B |
7 | DALLE 3 | 2310 DALL·E 3 System Card |
8 | GPT-4, 4V | 2309.17421 The Dawn of LMMs Preliminary Explorations with GPT-4V(ision) |
9 | Hallucinating | 2305.18248v1 Do Language Models Know When They’re Hallucinating References |
10 | LATM | 2305.17126v1 Large Language Models as Tool Makers |
11 | GPT-4, 4V | 2305.16291v1 VOYAGER An Open-Ended Embodied Agent with Large Language Models |
12 | Chinchilla scaling laws | 2305.16264v1 Scaling Data-Constrained Language Models |
13 | Transformer | 2305.16130v1 Language Models Implement Simple Word2Vec-style Vector Arithmetic |
14 | GPT-4, 4V | 2305.15486v1 SPRING GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning |
15 | Gorilla | 2305.15334v1 Gorilla Large Language Model Connected with Massive APIs |
16 | Transformer | 2305.14699v1 Can Transformers Learn to Solve Problems Recursively |
17 | ViT | 2305.13035v1 Getting ViT in Shape Scaling Laws for Compute-Optimal Model Design |
18 | PaLM | 2305.10266v Searching for Needles in a Haystack On the Role of Incidental Bilingualism in PaLM’s Translation Capability |
19 | SuperICL | 2305.08848v1 Small Models are Valuable Plug-ins for Large Language Models |
20 | FrugalGPT | 2305.05176v1 FrugalGPT How to Use Large Language Models While Reducing Cost and Improving Performance |
21 | ZipIt | 2305.03053v1 ZipIt Merging Models from Different Tasks without Training |
22 | Dromedary | 2305.03047v1 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision |
23 | GPT-4, 4V | 2303.12712v1 Sparks of Artificial General Intelligence Early experiments with GPT-4 |
24 | GPT-4, 4V | 2303.08774v3 GPT-4 Technical Report |
25 | Consistency Models | 2303.01469 Consistency Models |
26 | GLIGEN | 2301.07093 GLIGEN Open-Set Grounded Text-to-Image Generation |
27 | Wide Transformer | 2210.00640v1 Wide Attention Is The Way Forward For Transformers |
28 | Diffusion | 2209.04747 Diffusion Models in Vision - A Survey |
29 | Diffusion | 2209.00796 Diffusion Models - A Comprehensive Survey of Methods and Applications |
30 | Classifier-Free Diffusion | 2207.12598 Classifier-Free Diffusion Guidance |
31 | GLIDE | 2112.10741 GLIDE - Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models |
32 | ALiBi | 2108.12409v2 Train Short, Test Long - Attention With Linear Biases Enables Input Length Extrapolation |
33 | DMBG | 2105.05233 Diffusion Models Beat GANs on Image Synthesis |
34 | Improved DDPM | 2102.09672 Improved Denoising Diffusion Probabilistic Models |
35 | DDIM | 2010.02502 Denoising Diffusion Implicit Models |
36 | Big Bird | 2007.14062v2 Big Bird Transformers for Longer Sequences |
37 | DDPM | 2006.11239 Denoising Diffusion Probabilistic Models |
38 | Taskonomy | 1804.08328v1 Taskonomy Disentangling Task Transfer Learning |
39 | Transformer | 1706.03762 Attention Is All You Need |
40 | Mask R-CNN | 1703.0687 Mask R-CNN |
41 | TGAN | 1611.06624 Temporal Generative Adversarial Nets with Singular Value Clipping |
42 | Grad-CAM | 1610.02391 Grad-CAM Visual Explanations from Deep Networks via Gradient-based Localization |
43 | DeepLab | 1606.00915 DeepLab Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs |
44 | DCGAN | 1511.06434 DCGAN Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks |
45 | Style Transfer | 1508.06576 A Neural Algorithm of Artistic Style |
46 | YOLO | 1506.0264 You Only Look Once Unified, Real-Time Object Detection |
47 | U-Net | 1505.04597 U-Net Convolutional Networks for Biomedical Image Segmentation |
48 | Fast R-CNN | 1504.08083 Fast R-CNN |
49 | FaceNet | 1503.0383 FaceNet - A Unified Embedding for Face Recognition and Clustering |
50 | Diffusion | 1503.03585 Deep Unsupervised Learning using Nonequilibrium Thermodynamics |
51 | Fast R-CNN | 1411.4038 Fully Convolutional Networks for Semantic Segmentation |
52 | Inception | 1409.4842 Going deeper with convolutions |
53 | DeepFace | 1406 DeepFace - Closing the Gap to Human-Level Performance in Face Verification |
54 | CNN | 1311.2901 Visualizing and Understanding Convolutional Networks |
Keyword | arXiv# & Subjects
ALiBi
• 2108.12409v2 Train Short, Test Long - Attention With Linear Biases Enables Input Length Extrapolation
Big Bird
• 2007.14062v2 Big Bird Transformers for Longer Sequences
Chinchilla scaling laws
• 2305.16264v1 Scaling Data-Constrained Language Models
Classifier-Free Diffusion
• 2207.12598 Classifier-Free Diffusion Guidance
CNN
• 1311.2901 Visualizing and Understanding Convolutional Networks
Consistency Models
• 2303.01469 Consistency Models
DALLE 3
• 2310 DALL·E 3 System Card
• 2310.07653 Mini DALL·E 3 Interactive Text to Image by Prompting Large Language Models
DCGAN
• 1511.06434 DCGAN Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks
DDIM
• 2010.02502 Denoising Diffusion Implicit Models
DDPM
• 2006.11239 Denoising Diffusion Probabilistic Models
DeepFace
• 1406 DeepFace - Closing the Gap to Human-Level Performance in Face Verification
DeepLab
• 1606.00915 DeepLab Semantic Image Segmentation with Deep Convolutional Nets, Atrous Convolution, and Fully Connected CRFs
Diffusion
• 1503.03585 Deep Unsupervised Learning using Nonequilibrium Thermodynamics
• 2209.00796 Diffusion Models - A Comprehensive Survey of Methods and Applications
• 2209.04747 Diffusion Models in Vision - A Survey
DMBG
• 2105.05233 Diffusion Models Beat GANs on Image Synthesis
Dromedary
• 2305.03047v1 Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
FaceNet
• 1503.0383 FaceNet - A Unified Embedding for Face Recognition and Clustering
Fast R-CNN
• 1411.4038 Fully Convolutional Networks for Semantic Segmentation
• 1504.08083 Fast R-CNN
FrugalGPT
• 2305.05176v1 FrugalGPT How to Use Large Language Models While Reducing Cost and Improving Performance
GLIDE
• 2112.10741 GLIDE - Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models
GLIGEN
• 2301.07093 GLIGEN Open-Set Grounded Text-to-Image Generation
Gorilla
• 2305.15334v1 Gorilla Large Language Model Connected with Massive APIs
GPT-4, 4V
• 2303.08774v3 GPT-4 Technical Report
• 2303.12712v1 Sparks of Artificial General Intelligence Early experiments with GPT-4
• 2305.15486v1 SPRING GPT-4 Out-performs RL Algorithms by Studying Papers and Reasoning
• 2305.16291v1 VOYAGER An Open-Ended Embodied Agent with Large Language Models
• 2309.17421 The Dawn of LMMs Preliminary Explorations with GPT-4V(ision)
Grad-CAM
• 1610.02391 Grad-CAM Visual Explanations from Deep Networks via Gradient-based Localization
Hallucinating
• 2305.18248v1 Do Language Models Know When They’re Hallucinating References
HUGS
• 2311.17910 HUGS Human Gaussian Splats
Improved DDPM
• 2102.09672 Improved Denoising Diffusion Probabilistic Models
Inception
• 1409.4842 Going deeper with convolutions
LATM
• 2305.17126v1 Large Language Models as Tool Makers
LCM-LORA
• 2311.05556 LCM-LoRA - A Universal Stable-Diffusion Acceleration Module
Mask R-CNN
• 1703.0687 Mask R-CNN
Mistral 7B
• 2310.06825 Mistral 7B
PaLM
• 2305.10266v Searching for Needles in a Haystack On the Role of Incidental Bilingualism in PaLM’s Translation Capability
Style Transfer
• 1508.06576 A Neural Algorithm of Artistic Style
Superalignment
• 2312.09390 Weak-To-Strong Generalization - Eliciting Strong Capabilities With Weak Supervision
SuperICL
• 2305.08848v1 Small Models are Valuable Plug-ins for Large Language Models
Table-GPT
• 2310.09263 Table-tuned GPT for Diverse Table Tasks
Taskonomy
• 1804.08328v1 Taskonomy Disentangling Task Transfer Learning
TGAN
• 1611.06624 Temporal Generative Adversarial Nets with Singular Value Clipping
Transformer
• 1706.03762 Attention Is All You Need
• 2305.14699v1 Can Transformers Learn to Solve Problems Recursively
• 2305.16130v1 Language Models Implement Simple Word2Vec-style Vector Arithmetic
U-Net
• 1505.04597 U-Net Convolutional Networks for Biomedical Image Segmentation
ViT
• 2305.13035v1 Getting ViT in Shape Scaling Laws for Compute-Optimal Model Design
Wide Transformer
• 2210.00640v1 Wide Attention Is The Way Forward For Transformers
YOLO
• 1506.0264 You Only Look Once Unified, Real-Time Object Detection
ZipIt
• 2305.03053v1 ZipIt Merging Models from Different Tasks without Training