Reference

AI Glossary

A comprehensive collection of technical terms, concepts, and definitions in AI, machine learning, and software development. Powered by Papers with Code.

Try:

Attentive Normalization

Normalization

Inverse Q-Learning

Imitation-Learning-Methods

OverFeat

Convolutional-Neural-Networks

Subformer

Transformers

GeGLU

Activation-Functions

FSAF

Feature-Extractors

Pix2Pix

Generative-Models

Generative-Adversarial-Networks

Conditional-Image-to-Image-Translation-Models

Auxiliary Classifier

Miscellaneous-Components

Neural Architecture Search

Neural-Architecture-Search

Double Q-learning

Off-Policy-TD-Control

PipeDream-2BW

Distributed-Methods

Model-Parallel-Methods

Asynchronous-Pipeline-Parallel

TaxoExpan

Graph-Models

Taxonomy-Expansion-Models

YOLOX

One-Stage-Object-Detection-Models

Feature Intertwiner

Feature-Extractors

Cluster-GCN

Graph-Models

gMLP

Image-Models

Additive Angular Margin Loss

Loss-Functions

E-Branchformer

Transformers

Rung Kutta optimization

Optimization

InternVideo: General Video Foundation Models via Generative and Discriminative Learning

Vision-and-Language-Pre-Trained-Models

Policy Similarity Metric

State-Similarity-Metrics

PP-OCR

Convolutions

OCR-Models

ShuffleNet Block

Image-Model-Blocks

Skip-Connection-Blocks

node2vec

Graph-Embeddings

Dot-Product Attention

Attention-Mechanisms

First Integer Neighbor Clustering Hierarchy (FINCH))

Clustering

CDCC-NET

Convolutional-Neural-Networks

Video Panoptic Segmentation Network

Video-Panoptic-Segmentation-Models

SEED RL

Distributed-Methods

Distributed-Reinforcement-Learning

Object-Aware Loss

Loss-Functions

Bottleneck Transformer Block

Image-Model-Blocks

Attention-Modules

MPNet

Language-Model-Pre-Training

Gait Emotion Recognition

Global second-order pooling convolutional networks

Attention-Mechanisms

Receptive Field Block

Feature-Extractors

Wide Residual Block

Skip-Connection-Blocks

Image-Model-Blocks

Dropout

Regularization

Taylor Expansion Policy Optimization

Policy-Gradient-Methods

StyleGAN

Generative-Models

Generative-Adversarial-Networks

LayoutReader

Sequence-To-Sequence-Models

Reading-Order-Detection-Models

Depthwise Fire Module

Image-Model-Blocks

RevNet

Convolutional-Neural-Networks

Bilateral Guided Aggregation Layer

Semantic-Segmentation-Modules

Levenshtein Transformer

Transformers

Autoregressive-Transformers

Morphence

Adversarial-Attacks

DV3 Attention Block

Audio-Model-Blocks

Attention-Modules

Constrained Pairwise k-Means

Clustering

DetNAS

Neural-Architecture-Search

Mix-FFN

Feedforward-Networks

Bottleneck Attention Module

Attention-Mechanisms

Graph Convolutional Networks for Fake News Detection

Graph-Models

ADAHESSIAN

Optimization

PointAugment

Point-Cloud-Augmentation

Generic RoI Extractor

RoI-Feature-Extractors

Pathways Language Model

Transformers

PrivacyNet

Face-Privacy

Generative-Adversarial-Networks

CLIPort

Imitation-Learning-Methods

WGAN-GP Loss

Loss-Functions

Voxel Transformer

3D-Object-Detection-Models

Nonuniform Quantization for Stochastic Gradient Descent

Data-Parallel-Methods

SRU++

Recurrent-Neural-Networks

Gradient Clipping

Optimization

TSDAE

Sentence-Embeddings

Lecun's Tanh

Activation-Functions

Switchable Atrous Convolution

Convolutions

1-bit Adam

Stochastic-Optimization

Large-Batch-Optimization

Boost-GNN

Deep-Tabular-Learning

Root Mean Square Layer Normalization

Normalization

Noisy Linear Layer

Randomized-Value-Functions

GAN Least Squares Loss

Loss-Functions

OPT

Language-Models

mBARTHez

Language-Models

Sequence-To-Sequence-Models

Enhanced-Multimodal Fuzzy Framework

Time-Series-Analysis

Seesaw Loss

Loss-Functions

Orthogonal Regularization

Regularization

WaveNet

Generative-Audio-Models

Progressive Growing Channel Attentive Non-Local Network

Attention-Mechanisms

ReGLU

Activation-Functions

Structurally Regularized Deep Clustering

Domain-Adaptation

Twins-SVT

Vision-Transformers

Modular Interactive VOS

Video-Object-Segmentation-Models

Voxel R-CNN

3D-Object-Detection-Models

Point-Cloud-Models

Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment

Multi-Modal-Methods

PipeTransformer

Distributed-Methods

Hybrid-Parallel-Methods

2D-Parallel-Distributed-Methods

Weight excitation

Attention

MushroomRL

Reinforcement-Learning-Frameworks

XLM

Transformers

Autoencoding-Transformers

Lib

DBlock

Audio-Model-Blocks

Skip-Connection-Blocks

Data-efficient Image Transformer

Image-Models

Vision-Transformers

Temporal Adaptive Module

Attention-Mechanisms

ELMo

Word-Embeddings

Contextualized-Word-Embeddings

Language-Models

Improved Gravitational Search algorithm

Heuristic-Search-Algorithms

Hamburger

Image-Feature-Extractors

Global-Context-Modules

CS-GAN

Generative-Models

Generative-Adversarial-Networks

TuckER with Relation Prediction

Graph-Embeddings

Inception-B

Image-Model-Blocks

Surface Nomral-based Spatial Propagation

Stereo-Depth-Estimation-Models

Generative Adversarial Imitation Learning

Adversarial-Training

Visformer

Vision-Transformers

SpineNet

Convolutional-Neural-Networks

Feedback Memory

Attention-Modules

Content-based Attention

Attention-Mechanisms

ShakeDrop

Regularization

Hermite Polynomial Activation

Activation-Functions

Linear Warmup With Cosine Annealing

Learning-Rate-Schedules

Directed Acyclic Graph Neural Network

Graph-Embeddings

SERLU

Activation-Functions

CSPResNeXt Block

Skip-Connection-Blocks

Image-Model-Blocks

LayerScale

Regularization

Normalization

Twins-PCPVT

Vision-Transformers

FixMatch

Semi-Supervised-Learning-Methods

Bottom-up Path Augmentation

Feature-Extractors

Feature-Pyramid-Blocks

Kernel Inducing Points

Meta-Learning-Algorithms

Language-driven Scene Synthesis using Multi-conditional Diffusion Model

3D-Representations

Diffusion-Models

Generative Adversarial Transformer

Transformers

ReasonBERT

Language-Model-Pre-Training

Continuously Differentiable Exponential Linear Units

Activation-Functions

Class Activation Guided Attention Mechanism (CAGAM)

Attention-Mechanisms

Wizard: Unsupervised goats tracking algorithm

Multi-Object-Tracking-Models

LipGAN

Generative-Adversarial-Networks

Conditional-Image-to-Image-Translation-Models

Face-to-Face-Translation

Implicit PointRend

Instance-Segmentation-Modules

UNet Transformer

Medical-Image-Models

LARS

Large-Batch-Optimization

Problem Agnostic Speech Encoder +

Self-Supervised-Learning

fastText

Word-Embeddings

Static-Word-Embeddings

Explanation vs Attention: A Two-Player Game to Obtain Attention for VQA

Adversarial-Training

Instance Normalization

Normalization

Depthwise Convolution

Convolutions

Faster R-CNN

Object-Detection-Models

BlendMask

Instance-Segmentation-Models

Replacing Eligibility Trace

Eligibility-Traces

mT5

Language-Models

Autoencoding-Transformers

k-Sparse Autoencoder

Generative-Models

Inception-v4

Convolutional-Neural-Networks

Elastic ResNeXt Block

Skip-Connection-Blocks

Image-Model-Blocks

DALL·E 2

Image-Generation-Models

A2C

Policy-Gradient-Methods

Contextual Word Vectors

Word-Embeddings

Contextualized-Word-Embeddings

Scaled Dot-Product Attention

Attention-Mechanisms

Multi-DConv-Head Attention

Attention-Modules

GridMask

Image-Data-Augmentation

Contextual Graph Markov Model

Graph-Models

Extremely Efficient Spatial Pyramid of Depth-wise Dilated Separable Convolutions

Image-Model-Blocks

Skip-Connection-Blocks

Contrastive Predictive Coding

Self-Supervised-Learning

Semi-Supervised-Learning-Methods

Unitary RNN

Recurrent-Neural-Networks

Dice Loss

Loss-Functions

mT0

Language-Models

Blender

Attention-Modules

Instance-Segmentation-Modules

Factorized Dense Synthesized Attention

Attention-Mechanisms

Synthesized-Attention-Mechanisms

Conditional Instance Normalization

Normalization

Deactivable Skip Connection

Skip-Connections

Randomized Leaky Rectified Linear Units

Activation-Functions

Non-Local Block

Image-Model-Blocks

Skip-Connection-Blocks

CenterMask

Instance-Segmentation-Models

Self-Attention GAN

Generative-Adversarial-Networks

RoIPool

RoI-Feature-Extractors

Graph Path Feature Learning

Rule-Learners

DynaBERT

Language-Models

Autoencoding-Transformers

Precise RoI Pooling

RoI-Feature-Extractors

Shrink and Fine-Tune

Knowledge-Distillation

Distillation

Layer-Sequential Unit-Variance Initialization

Initialization

XGPT

Vision-and-Language-Pre-Trained-Models

Serf

Activation-Functions

Pyramid Pooling Module

Semantic-Segmentation-Modules

Scattering Transform

Image-Representations

SqueezeNet

Convolutional-Neural-Networks

Light-weight-neural-networks

Gradient Harmonizing Mechanism R

Loss-Functions

Approximation of Personalized Propagation of Neural Predictions

Graph-Representation-Learning

Stochastic Gradient Descent

Stochastic-Optimization

Global-Local Attention

Attention-Mechanisms

Amplifying Sine Unit: An Oscillatory Activation Function for Deep Neural Networks to Recover Nonlinear Oscillations Efficiently

Activation-Functions

Epsilon Greedy Exploration

Behaviour-Policies

Cross-Scale Non-Local Attention

Attention-Modules

Audiovisual SlowFast Network

Video-Recognition-Models

Multi-Modal-Methods

Hybrid Firefly and Particle Swarm Optimization

Optimization

Hybrid-Optimization

Heuristic-Search-Algorithms

I-BERT

Transformers

Autoencoding-Transformers

DropPathway

Regularization

Hi-LANDER

Graph-Models

Groupwise Point Convolution

Convolutions

Adaptive NMS

Proposal-Filtering

Bidirectional LSTM

Deep-Tabular-Learning

Darknet-53

Convolutional-Neural-Networks

Sliding Window Attention

Attention-Patterns

Denoised Smoothing

Robustness-Methods

Autoencoders

Dimensionality-Reduction

Population Based Augmentation

Image-Data-Augmentation

FashionCLIP

Vision-and-Language-Pre-Trained-Models

Nouveau VAE

Likelihood-Based-Generative-Models

Generative-Models

TabNet

Deep-Tabular-Learning

Vulnerability-constrained Decoding

Sequence-Decoding-Methods

Hopfield Layer

Pooling-Operations

Recurrent-Neural-Networks

Attention-Modules

HS-ResNet

Convolutional-Neural-Networks

Joint Learning Architecture

Multi-Object-Tracking-Models

Parrot

Cache-Replacement-Models

Imitation-Learning-Methods

Child-Tuning

Fine-Tuning

ClassSR

Image-Super-Resolution-Models

ConvBERT

Transformers

Autoencoding-Transformers

Generalized Focal Loss

Loss-Functions

Visual-Spatial-Graph Network

Human-Object-Interaction-Detectors

DVD-GAN

Generative-Models

Generative-Adversarial-Networks

Generative-Video-Models

Hierarchical Transferability Calibration Network

Object-Detection-Models

ENet Bottleneck

Image-Model-Blocks

Temporal ROIAlign

RoI-Feature-Extractors

Inception v2

Convolutional-Neural-Networks

Triplet Loss

Loss-Functions

NeuroTactic

Graph-Models

Theorem-Proving-Models

Adaptive Radial Projection on Fourier Magnitude Spectrum

Image-Denoising-Models

Path Planning and Motion Control

Motion-Control

Path-Planning

Control-and-Decision-Systems

GeniePath

Graph-Models

ARMA GNN

Graph-Models

RoBERTa

Transformers

Autoencoding-Transformers

Corner Pooling

Pooling-Operations

VoVNet

Convolutional-Neural-Networks

Aging Evolution

Neural-Architecture-Search

KnowPrompt

Prompt-Engineering

Gradient Harmonizing Mechanism C

Loss-Functions

MT-PET

Exaggeration-Detection-Models

Domain Adaptative Neighborhood Clustering via Entropy Optimization

Domain-Adaptation

Inception-ResNet-v2-B

Image-Model-Blocks

Skip-Connection-Blocks

Target Policy Smoothing

Regularization

Concurrent Spatial and Channel Squeeze & Excitation (scSE)

Attention-Mechanisms

Sparse Sinkhorn Attention

Attention-Mechanisms

Hunger Games Search

Optimization

Stochastic-Optimization

OFA

Vision-and-Language-Pre-Trained-Models

PipeMare

Distributed-Methods

Model-Parallel-Methods

Asynchronous-Pipeline-Parallel

DROID-SLAM

SLAM-Methods

ZoomNet

Pose-Estimation-Models

Convolution-enhanced image Transformer

Vision-Transformers

Conditional Relation Network

Video-Model-Blocks

DU-GAN

Generative-Adversarial-Networks

Image-Denoising-Models

Lifelong Infinite Mixture

Lifelong-Learning

Embedded Dot Product Affinity

Affinity-Functions

Sample Redistribution

Image-Data-Augmentation

Polynomial Convolution

Convolutional-Neural-Networks

Deep Convolutional GAN

Generative-Models

Generative-Adversarial-Networks

BiSeNet V2

Semantic-Segmentation-Models

SPP-Net

Convolutional-Neural-Networks

Wasserstein GAN

Generative-Adversarial-Networks

ManifoldPlus

Graphics-Models

Contextual Residual Aggregation

Image-Model-Blocks

Image-Inpainting-Modules

Movement Pruning

Pruning

Self-Supervised Deep Supervision

Self-Supervised-Learning

Feature Pyramid Grid

Feature-Pyramid-Blocks

ProxylessNet-GPU

Image-Models

Convolutional-Neural-Networks

Enhanced Fusion Framework

Time-Series-Analysis

Paddle Anchor Free Network

Object-Detection-Models

context2vec

Word-Embeddings

Contextualized-Word-Embeddings

UNet++

Semantic-Segmentation-Models

SGDW

Stochastic-Optimization

Hybrid-deconvolution

Convolutional-Neural-Networks

Connectionist Temporal Classification Loss

Loss-Functions

Protagonist Antagonist Induced Regret Environment Design

Adversarial-Training

Environment-Design-Methods

Window-based Discriminator

Discriminators

DeepLab

Semantic-Segmentation-Models

Gated Graph Sequence Neural Networks

Graph-Models

MNN

Inference-Engines

LightGCN

Recommendation-Systems

Graph-Models

Mixed Attention Block

Attention-Modules

Gradient Sign Dropout

Regularization

Performer

Transformers

EvoNorms

Normalization

Activation-Functions

Inverse Square Root Schedule

Learning-Rate-Schedules

Revision Network

Style-Transfer-Modules

TABBIE

Deep-Tabular-Learning

Probability Guided Maxout

Regularization

Randomized Adversarial Solarization

Adversarial-Attacks

Parametric UMAP

Dimensionality-Reduction

Fully Convolutional Network

Semantic-Segmentation-Models

Pointer Sentinel-LSTM

Recurrent-Neural-Networks

Neural Cache

Language-Model-Components

Compact Global Descriptor

Image-Model-Blocks

Attention-Modules

SentencePiece

Tokenizers

SKEP

Semi-Supervised-Learning-Methods

ALDEN

Text-Classification-Models

Active-Learning

StruBERT: Structure-aware BERT for Table Search and Matching

Deep-Tabular-Learning

Skip-gram Word2Vec

Word-Embeddings

Static-Word-Embeddings

Big-Little Net

Convolutional-Neural-Networks

Affordance Correspondence

Instance-Segmentation-Models

Multiscale Dilated Convolution Block

Image-Model-Blocks

Sequence to Sequence

Sequence-To-Sequence-Models

Machine-Translation-Models

Channel Squeeze and Spatial Excitation (sSE)

Attention-Mechanisms

RoIWarp

RoI-Feature-Extractors

ZeRO-Offload

Distributed-Methods

Data-Parallel-Methods

Sharded-Data-Parallel-Methods

Rectified Linear Units

Activation-Functions

InstaBoost

Image-Data-Augmentation

LAPGAN

Generative-Models

Generative-Adversarial-Networks

VOS

Video-Object-Segmentation-Models

T-Fixup

Initialization

Contrastive Video Representation Learning

Generative-Video-Models

Self-Supervised-Learning

Global Local Attention Module

Image-Model-Blocks

Triplet Entropy Loss

Loss-Functions

PULSE

Image-Super-Resolution-Models

Strip Pooling Network

Attention-Mechanisms

OSCAR

Vision-and-Language-Pre-Trained-Models

Model-Free Episodic Control

Non-Parametric-Regression

GRLIA

Incident-Aggregation-Models

Computation Redistribution

Neural-Architecture-Search

Tacotron

Text-to-Speech-Models

Sequence-To-Sequence-Models

VATT

Vision-Transformers

Multi-Modal-Methods

Location-based Attention

Attention-Mechanisms

Multi-scale Progressive Fusion Network

Deraining-Models

BLANC

Document-Summary-Evaluation

Self-Supervised Motion Disentanglement

Action-Recognition-Models

Distribution-induced Bidirectional Generative Adversarial Network for Graph Representation Learning

Graph-Embeddings

Weight Standardization

Normalization

Human Robot Interaction Pipeline

Clustering

Object-Detection-Models

Soft Pooling

Pooling-Operations

NAS-FCOS

Object-Detection-Models

CenterPoint

3D-Object-Detection-Models

YOLOv1

Object-Detection-Models

One-Stage-Object-Detection-Models

GPT

Transformers

Autoregressive-Transformers

Spatial Transformer

Image-Model-Blocks

tabular data Prior-data Fitted Network

Deep-Tabular-Learning

WaveGrad

Generative-Audio-Models

BiGG

Graph-Models

wav2vec Unsupervised

Speech-Recognition

MacBERT

Transformers

Autoencoding-Transformers

Natural Gradient Descent

Optimization

NPID++

Self-Supervised-Learning

FuseFormer Block

Video-Model-Blocks

Variational Autoencoder

Generative-Models

Likelihood-Based-Generative-Models

Sarsa Lambda

On-Policy-TD-Control

RelDiff

Graph-Embeddings

ConvMLP

Convolutional-Neural-Networks

Image-Models

NPID

Self-Supervised-Learning

Deep LSTM Reader

Recurrent-Neural-Networks

Reading-Comprehension-Models

Intrinsically Motivated Goal Exploration Processes

Self-Supervised-Learning

Eligibility Trace

Eligibility-Traces

CPC v2

Self-Supervised-Learning

Semi-Supervised-Learning-Methods

HaloNet

Image-Models

CornerNet-Squeeze Hourglass

Convolutional-Neural-Networks

DenseNAS

Neural-Architecture-Search

CP with N3 Regularizer

Graph-Embeddings

Contrastive Language-Image Pre-training

Image-Representations

Vision-and-Language-Pre-Trained-Models

Mixing Adam and SGD

Stochastic-Optimization

Adaptive Softmax

Output-Functions

Graph sampling based inductive learning method

Graph-Representation-Learning

ScanSSD

Object-Detection-Models

Math-Formula-Detection-Models

SENet

Convolutional-Neural-Networks

Adaptive Instance Normalization

Normalization

MLP-Mixer

Image-Models

Residual Attention Network

Attention-Mechanisms

Fast Bi-level Adversarial Training

Adversarial-Training

Logistic Regression

Generalized-Linear-Models

Rotary Position Embedding

Position-Embeddings

Fixup Initialization

Initialization

TrIVD-GAN

Generative-Models

Generative-Adversarial-Networks

Generative-Video-Models

Hierarchical Information Threading

PSANet

Semantic-Segmentation-Models

SkipInit

Initialization

Video Language Graph Matching Network

Video-Text-Retrieval-Models

Adversarial Color Enhancement

Adversarial-Image-Data-Augmentation

Group Decreasing Network

Image-Generation-Models

Alternating Direction Method of Multipliers

Optimization

Galactica

Language-Models

Two-Way Dense Layer

Skip-Connection-Blocks

Image-Model-Blocks

Ape-X DQN

Q-Learning-Networks

online deep learning

Deep-Tabular-Learning

Guided Language to Image Diffusion for Generation and Editing

Multi-Modal-Methods

Image-Generation-Models

Dilated Bottleneck Block

Skip-Connection-Blocks

Image-Model-Blocks

BiFPN

Feature-Extractors

Feature-Pyramid-Blocks

Masked Convolution

Convolutions

Automatic Search for Parsimonious Models

AutoML

Shapley Additive Explanations

Interpretability

Self-adaptive Training

Robust-Training

Position-Sensitive RoI Pooling

RoI-Feature-Extractors

Vision Transformer

Image-Models

Vision-Transformers

BAGUA

Distributed-Methods

Data-Parallel-Methods

Replicated-Data-Parallel

Deep Graph Convolutional Neural Network

Graph-Models

Phish: A Novel Hyper-Optimizable Activation Function

Activation-Functions

Local Importance-based Pooling

Pooling-Operations

MUSIQ

Vision-Transformers

Image-Quality-Models

Darknet-19

Convolutional-Neural-Networks

Fraternal Dropout

Regularization

Random Mix-up

Image-Data-Augmentation

Non-monotonically Triggered ASGD

Stochastic-Optimization

Guided Anchoring

Anchor-Generation-Modules

Xception

Convolutional-Neural-Networks

Spatial Attention Module

Image-Model-Blocks

Attention-Modules

Closed-loop Weighted Empirical Risk Minimization

Robust-Training

Fast-OCR

Convolutional-Neural-Networks

NoisyNet-DQN

Q-Learning-Networks

Feature Pyramid Network

Feature-Extractors

Feature-Pyramid-Blocks

AutoEncoder

Generative-Models

One-Shot Aggregation

Skip-Connection-Blocks

Image-Model-Blocks

Invertible Rescaling Network

Image-Models

Elastic Weight Consolidation

Active-Learning

Self-Cure Network

Regularization

Switch FFN

Feedforward-Networks

Pansharpening by convolutional neural networks in the full resolution framework

Convolutional-Neural-Networks

Inception-ResNet-v2 Reduction-B

Image-Model-Blocks

Residual Network

Convolutional-Neural-Networks

RMSProp

Stochastic-Optimization

Stochastic Weight Averaging

Stochastic-Optimization

Pythia

Language-Models

Growing Cosine Unit

Activation-Functions

Weights Reset

Regularization

YellowFin

Stochastic-Optimization

Global and Sliding Window Attention

Attention-Patterns

WaveRNN

Generative-Audio-Models

Recurrent-Neural-Networks

U-Net

Semantic-Segmentation-Models

Deep Residual Pansharpening Neural Network

Convolutional-Neural-Networks

MobileNetV1

Convolutional-Neural-Networks

Light-weight-neural-networks

Smish

Activation-Functions

SimCLR

Self-Supervised-Learning

Random Horizontal Flip

Image-Data-Augmentation

MetaFormer

Image-Models

Height-driven Attention Network

Image-Segmentation-Models

Balanced Feature Pyramid

Feature-Pyramid-Blocks

Dilated Causal Convolution

Temporal-Convolutions

double-stage parameter tuning

Stochastic-Optimization

AdaShift

Stochastic-Optimization

Scale-wise Feature Aggregation Module

Feature-Extractors

Efficient Recurrent Unit

Recurrent-Neural-Networks

Electric

Language-Models

Transformers

Autoencoding-Transformers

ReZero

Normalization

Sticker Response Selector

Conversational-Models

GPU-Efficient Network

Convolutional-Neural-Networks

Meta Face Recognition

Face-Recognition-Models

Attention Gate

Attention-Mechanisms

AdaDelta

Stochastic-Optimization

CornerNet-Squeeze Hourglass Module

Image-Model-Blocks

OASIS

Conditional-Image-to-Image-Translation-Models

Path Length Regularization

Regularization

Causal Convolution

Temporal-Convolutions

Graph Attention Network v2

Graph-Models

Differentiable Architecture Search

Neural-Architecture-Search

MEUZZ

Hybrid-Optimization

Hybrid-Fuzzing

Distributed Distributional DDPG

Policy-Gradient-Methods

Graph Self-Attention

Attention-Modules

CycleGAN

Generative-Models

Generative-Adversarial-Networks

Unpaired-Image-to-Image-Translation

MobileNetV2

Image-Models

Convolutional-Neural-Networks

Light-weight-neural-networks

Dual Path Network

Convolutional-Neural-Networks

Single Headed Attention RNN

Language-Models

Recurrent-Neural-Networks

Social-STGCNN

Trajectory-Prediction-Models

ACER

Policy-Gradient-Methods

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM

3D-Reconstruction

Meshing

L1 Regularization

Regularization

Parameter-Norm-Penalties

modReLU

Activation-Functions

Adaptive-Activation-Functions

Mode Normalization

Normalization

Recursive Feature Pyramid

Feature-Pyramid-Blocks

VERtex Similarity Embeddings

Graph-Embeddings

ENIGMA

Dialog-System-Evaluation

Voxel RoI Pooling

RoI-Feature-Extractors

ooJpiued

Language-Models

Smooth Step

Activation-Functions

Hit-Detector

Neural-Architecture-Search

PGC-DGCNN

Graph-Models

RealFormer

Transformers

Zero-padded Shortcut Connection

Skip-Connections

ShuffleNet

Convolutional-Neural-Networks

Light-weight-neural-networks

Phase Shuffle

Audio-Artifact-Removal

Cross-View Training

Word-Embeddings

Contextualized-Word-Embeddings

Language-Models

PRNet+

Position-Recovery-Models

Factor Graph Attention

Attention

3D Dynamic Scene Graph

3D-Representations

MaxUp

Adversarial-Image-Data-Augmentation

Pointer Network

Sequence-To-Sequence-Models

Recurrent-Neural-Networks

Spatial and Channel SE Blocks

Attention-Mechanisms

Random Search

Hyperparameter-Search

Self-Supervised Cross View Cross Subject Pose Contrastive Learning

Pose-Estimation-Models

Euclidean Norm Regularization

Regularization

Patch Merger Module

Image-Model-Blocks

Mixed Depthwise Convolution

Convolutions

Tree Ensemble to Rules

Interpretability

Meta Reward Learning

Meta-Learning-Algorithms

TrOCR

OCR-Models

RegionViT

Vision-Transformers

Seq2Edits

Sequence-Editing-Models

Message Passing Neural Network

Graph-Models

Pixel Recurrent Neural Network

Generative-Models

Likelihood-Based-Generative-Models

YOLOv3

Object-Detection-Models

One-Stage-Object-Detection-Models

FBNet Block

Image-Model-Blocks

Skip-Connection-Blocks

RegNetY

Convolutional-Neural-Networks

RevSilo

Reversible-Image-Conversion-Models

Affine Coupling

Bijective-Transformation

Atrous Spatial Pyramid Pooling

Semantic-Segmentation-Modules

Boom Layer

Feedforward-Networks

LSGAN

Generative-Models

Generative-Adversarial-Networks

Self-Adjusting Smooth L1 Loss

Loss-Functions

Detection Transformer

Object-Detection-Models

Vision-Transformers

Firefly algorithm

Heuristic-Search-Algorithms

Residual Normal Distribution

Variational-Optimization

R1 Regularization

Regularization

Grammatical evolution and Q-learning

Optimization

ALBEF

Vision-and-Language-Pre-Trained-Models

Metric Pairwise Constrained KMeans

Clustering

Go-Explore

Behaviour-Policies

Vision-and-Language BERT

Representation-Learning

Transformers

Vision-and-Language-Pre-Trained-Models

BoundaryNet

Layout-Annotation-Models

Gated Linear Network

Gated-Linear-Networks

WordPiece

Subword-Segmentation

Tokenizers

Gradient Sparsification

Distributed-Methods

Optimization

Stochastic-Optimization

PanGu-$α$

Language-Models

Low-Rank Factorization-based Multi-Head Attention

Attention-Modules

CBHG

Speech-Synthesis-Blocks

Sequential-Blocks

Skip-Connection-Blocks

GAN-TTS

Text-to-Speech-Models

Sequence-To-Sequence-Models

Neural adjoint method

Optimization

BiDet

Binary-Neural-Networks

Relative Position Encodings

Position-Embeddings

CodeBERT

Transformers

Code-Generation-Transformers

Peer-attention

Attention-Modules

Strided Attention

Attention-Patterns

PoolFormer

Image-Models

Off-Diagonal Orthogonal Regularization

Regularization

CornerNet-Saccade

Object-Detection-Models

One-Stage-Object-Detection-Models

TridentNet

Object-Detection-Models

Multimodal Fuzzy Fusion Framework

Non-Parametric-Classification

Convolutional GRU

Recurrent-Neural-Networks

Dense Prediction Transformer

Vision-Transformers

Image-Models

ALIGN

Vision-and-Language-Pre-Trained-Models

Large-scale Information Network Embedding

Graph-Embeddings

Unigram Segmentation

Subword-Segmentation

Efficient Channel Attention

Image-Model-Blocks

Skip-Connection-Blocks

Linear Layer

Feedforward-Networks

LocalViT

Vision-Transformers

BERT

Language-Models

Transformers

Autoencoding-Transformers

Prioritized Experience Replay

Replay-Memory

DistilBERT

Transformers

Autoencoding-Transformers

FLIP

Loss-Functions

RFB Net

Object-Detection-Models

One-Stage-Object-Detection-Models

Bi3D

Stereo-Depth-Estimation-Models

nlogistic-sigmoid function

Activation-Functions

Feedforward-Networks

Cross-Attention Module

Attention-Modules

DeepCluster

Self-Supervised-Learning

Meta-augmentation

Meta-Learning-Algorithms

AutoDropout

Regularization

Data augmentation using Polya-Gamma latent variables.

Latent-Variable-Sampling

Neural Attention Fields

Feature-Extractors

Image-Model-Blocks

Semantic-Segmentation-Modules

FMix

Image-Data-Augmentation

Spatio-temporal stability analysis

Feature-Extractors

ByteScheduler

Distributed-Methods

Data-Parallel-Methods

Replicated-Data-Parallel

Parallax

Distributed-Methods

Hybrid-Parallel-Methods

Parameter-Server-Methods

VocGAN

Generative-Audio-Models

Value Imputation and Mask Estimation

Deep-Tabular-Learning

CNN Bidirectional LSTM

Bidirectional-Recurrent-Neural-Networks

Enhanced Sequential Inference Model

Sequence-To-Sequence-Models

YOLOP

One-Stage-Object-Detection-Models

Object-Detection-Models

Semantic-Segmentation-Models

Pseudoinverse Graph Convolutional Network

Semi-Supervised-Learning-Methods

Graph-Models

Demon CM

Stochastic-Optimization

Unified VLP

Vision-and-Language-Pre-Trained-Models

MoCo v2

Self-Supervised-Learning

Semi-Supervised-Learning-Methods

Conditional / Rectified flow matching

Generative-Models

Graph Isomorphism Network

Graph-Embeddings

Graph-Models

PyramidNet

Convolutional-Neural-Networks

DenseNAS-C

Convolutional-Neural-Networks

Fragmentation

Localization-Models

Selective Search

Region-Proposal

Point-wise Spatial Attention

Semantic-Segmentation-Modules

Attention-Modules

Swish

Activation-Functions

Adaptive-Activation-Functions

Schrödinger Network

Graph-Models

Discriminative Fine-Tuning

Fine-Tuning

Relativistic GAN

Generative-Adversarial-Networks

Dynamic Memory Network

Working-Memory-Models

Style-based Recalibration Module

Image-Model-Blocks

Segmentation of patchy areas in biomedical images based on local edge density estimation

Image-Segmentation-Models

Deep Equilibrium Models

Robust-Training

XLNet

Transformers

Autoregressive-Transformers

Blink Communication

Distributed-Methods

Distributed-Communication

An Easier Data Augmentation

Text-Data-Augmentation

Random Resized Crop

Image-Data-Augmentation

Sparse Autoencoder

Generative-Models

Chinese Pre-trained Unbalanced Transformer

Transformers

Symbolic Deep Learning

Interpretability

Graph-Models

Online Normalization

Normalization

Shake-Shake Regularization

Regularization

Multiscale Attention ViT with Late fusion

Multi-Modal-Methods

Global Coupled Adaptive Number of Shots

Quantum-Methods

Stochastic-Optimization

VisualBERT

Vision-and-Language-Pre-Trained-Models

ALDA

Unpaired-Image-to-Image-Translation

Domain-Symmetric Network

Domain-Adaptation

BTmPG

Paraphrase-Generation-Models

TURL: Table Understanding through Representation Learning

Deep-Tabular-Learning

BP-Transformer

Transformers

STAC

Semi-Supervised-Learning-Methods

Convolutional Block Attention Module

Image-Model-Blocks

Attention-Modules

Hierarchical Multi-Task Learning

Deep-Tabular-Learning

Attentional Liquid Warping Block

Image-Model-Blocks

Early Stopping

Regularization

Feature Information Entropy Regularized Cross Entropy

Regularization

Local Patch Interaction

Image-Model-Blocks

Pyramidal Residual Unit

Skip-Connection-Blocks

Image-Model-Blocks

DExTra

Feedforward-Networks

Cascade Corner Pooling

Pooling-Operations

LightAutoML

AutoML

YOLOv4

Object-Detection-Models

One-Stage-Object-Detection-Models

Adaptive Locally Connected Neuron

Feedforward-Networks

Depthwise Dilated Separable Convolution

Convolutions

Demon ADAM

Stochastic-Optimization

Margin Rectified Linear Unit

Activation-Functions

Inception-C

Image-Model-Blocks

Squeeze-and-Excitation Block

Image-Model-Blocks

PolarMask

Instance-Segmentation-Modules

GAN Feature Matching

Regularization

QuantTree histograms

Distribution-Approximation

GPT-4

Language-Models

MADDPG

Policy-Gradient-Methods

High-resolution Deep Convolutional Generative Adversarial Networks

Generative-Models

R-CNN

Object-Detection-Models

Adaptive Early-Learning Correction

Semantic-Segmentation-Models

Chained-Tracker

Multi-Object-Tracking-Models

Gated Convolution Network

Language-Models

BigGAN-deep

Generative-Models

Generative-Adversarial-Networks

Strain Elevation Tension Spring embedding

Graph-Embeddings

Normalizing Flows

Distribution-Approximation

Forward gradient

Stochastic-Optimization

GAN Hinge Loss

Loss-Functions

Context Optimization

Prompt-Engineering

State-Aware Tracker

Semi-Supervised-Learning-Methods

Video-Object-Segmentation-Models

K-Net

Semantic-Segmentation-Models

Instance-Segmentation-Models

Denoising Score Matching

Generative-Training

DropBlock

Regularization

Projection Discriminator

Discriminators

Contour Proposal Network

Object-Detection-Models

Instance-Segmentation-Models

One-Stage-Object-Detection-Models

LLaMA

Language-Models

Spatial Group-wise Enhance

Image-Model-Blocks

Collaborative Distillation

Knowledge-Distillation

PixelCNN

Generative-Models

Likelihood-Based-Generative-Models

SNIP

Multi-Scale-Training

Table Pre-training via Execution

Transformers

Graph Finite-State Automaton

Graph-Representation-Learning

CARAFE

Feature-Upsampling

Routing Attention

Attention-Patterns

AlphaZero

Board-Game-Models

Edge-augmented Graph Transformer

Transformers

Charformer

Transformers

Linear Combination of Activations

Activation-Functions

BinaryBERT

Transformers

Autoencoding-Transformers

Reversible Residual Block

Skip-Connection-Blocks

Pose-Appearance Disentangling

Pose-Estimation-Models

Instruction Pointer Attention Graph Neural Network

Graph-Models

Feature-Aligned Person Search Network

Person-Search-Models

Linear Warmup With Linear Decay

Learning-Rate-Schedules

NeuralRecon: Real-Time Coherent 3D Reconstruction from Monocular Video

3D-Reconstruction

Meshing

Entropy Regularization

Regularization

Vokenization

Multi-Modal-Methods

Random Erasing

Image-Data-Augmentation

Style Transfer Module

Generative-Adversarial-Networks

Compute-Efficient Active Learning

Active-Learning

Double DQN

Q-Learning-Networks

CANINE

Language-Models

ResNet-D

Convolutional-Neural-Networks

AutoTinyBERT

Transformers

Autoencoding-Transformers

Memory Network

Working-Memory-Models

MLP-Mixer Layer

Image-Model-Blocks

Region-based Fully Convolutional Network

Object-Detection-Models

AltCLIP

Vision-and-Language-Pre-Trained-Models

Talking-Heads Attention

Attention-Modules

Stacked Denoising Autoencoder

Generative-Models

Graph Contrastive Coding

Graph-Models

Self-Supervised-Learning

Discrete Cosine Transform

Fourier-related-Transforms

Learnable adjacency matrix GCN

Graph-Representation-Learning

DiCE Unit

Image-Model-Blocks

imGHUM

3D-Representations

Concrete Dropout

Regularization

CodeSLAM

3D-Reconstruction

Asymmetrical Bi-RNN

Bidirectional-Recurrent-Neural-Networks

Single-path NAS

Convolutional-Neural-Networks

XLM-R

Language-Models

GreedyNAS-A

Convolutional-Neural-Networks

CascadePSP

Semantic-Segmentation-Models

Multi-partition Embedding Interaction

Graph-Embeddings

Graph-Representation-Learning

Residual Masking Network

Attention

Involution

Image-Feature-Extractors

Principal Neighbourhood Aggregation

Graph-Models

Sandwich Batch Normalization

Normalization

Early exiting using confidence measures

Loss-Functions

Dilated convolution with learnable spacings

Convolutions

Optimizer Activation Function

Activation-Functions

LV-ViT

Vision-Transformers

Image-Models

SCARLET

Convolutional-Neural-Networks

MotionNet

Motion-Prediction-Models

HRNet

Convolutional-Neural-Networks

Momentumized, adaptive, dual averaged gradient

Stochastic-Optimization

Longformer

Transformers

Autoencoding-Transformers

Packed Levitated Markers

Span-Representations

SimCSE

Sentence-Embeddings

CTAB-GAN

Generative-Adversarial-Networks

Tabular-Data-Generation

Selective Kernel

Image-Model-Blocks

Skip-Connection-Blocks

RPDet

Object-Detection-Models

Linear Discriminant Analysis

Dimensionality-Reduction

Simple Visual Language Model

Vision-and-Language-Pre-Trained-Models

CutBlur

Image-Data-Augmentation

DE-GAN: A Conditional Generative Adversarial Network for Document Enhancement

Generative-Adversarial-Networks

WaveVAE

Generative-Audio-Models

Fawkes

Face-Privacy

VQ-VAE

Generative-Models

Likelihood-Based-Generative-Models

UNiversal Image-TExt Representation Learning

Word-Embeddings

Knowledge Distillation

Knowledge-Distillation

UCNet

RGB-D-Saliency-Detection-Models

OPT-IML

Language-Models

Recurrent Replay Distributed DQN

Offline-Reinforcement-Learning-Methods

Categorical Modularity

Word-Embeddings

Skim and Intensive Reading Model

Textual-Meaning

Capsule Network

Neural-Architecture-Search

Leaky ReLU

Activation-Functions

Proximal Policy Optimization

Policy-Gradient-Methods

ProxylessNet-CPU

Image-Models

Convolutional-Neural-Networks

Parallel Layers

Transformers

Encoder-Attender-Aggregator

Model Soups

Model-Compression

DenseNet

Convolutional-Neural-Networks

Reformer

Transformers

MelGAN

Generative-Audio-Models

Spatial Feature Transform

Image-Model-Blocks

uNetXST

Convolutional-Neural-Networks

Highway Layer

Miscellaneous-Components

KungFu

Distributed-Methods

Auto-Parallel-Methods

Graph Network-based Simulators

Graph-Models

Single-Headed Attention

Attention-Modules

A Framework for Leader Identification in Coordinated Activity

Time-Series-Analysis

Leadership-Inference

PonderNet

Adaptive-Computation

AlexNet

Convolutional-Neural-Networks

DiffAugment

Adversarial-Training

Adversarial-Image-Data-Augmentation

Coordinate attention

Attention-Mechanisms

ELECTRA

Transformers

Autoencoding-Transformers

RPM-Net

Point-Cloud-Models

Differential attention for visual question answering

Attention-Mechanisms

Neural Network Compression Framework

Model-Compression

Step Decay

Learning-Rate-Schedules

PermuteFormer

Transformers

XCiT

Vision-Transformers

CodeGen

Language-Models

SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings

Monocular-Depth-Estimation-Models

Distributed Any-Batch Mirror Descent

Distributed-Methods

Optimization

Data-Parallel-Methods

CSPResNeXt

Convolutional-Neural-Networks

DELU

Activation-Functions

Adaptive-Activation-Functions

Domain Adaptive Ensemble Learning

Domain-Adaptation

Rational Activation Function

Activation-Functions

Adaptive-Activation-Functions

Deep Layer Aggregation

Feature-Extractors

Pyramidal Bottleneck Residual Unit

Skip-Connection-Blocks

Image-Model-Blocks

mBERT

Language-Models

Variational Dropout

Regularization

PAUSE

Sentence-Embeddings

FLAVR

Video-Interpolation-Models

BigBiGAN

Generative-Models

Generative-Adversarial-Networks

Self-Supervised-Learning

Lovasz-Softmax

Loss-Functions

Stochastic Steady-state Embedding

Graph-Models

Recurrent Event Network

Graph-Models

Deformable Convolutional Networks

Attention-Mechanisms

ScaledSoftSign

Activation-Functions

Lipschitz Constant Constraint

Regularization

Temporaral Difference Network

Action-Recognition-Models

DropConnect

Regularization

Asynchronous Interaction Aggregation

Action-Recognition-Models

Fishr

Robustness-Methods

Hyper-parameter optimization

AutoML

rnnDrop

Regularization

PANet

Instance-Segmentation-Models

Object-Detection-Models

Class-activation map

Interpretability

Language-Models

ClariNet

Text-to-Speech-Models

Sequence-To-Sequence-Models

SNIPER

Multi-Scale-Training

Invertible 1x1 Convolution

Convolutions

CRF-RNN

Recurrent-Neural-Networks

VEGA

AutoML

ParamCrop

Generative-Video-Models

Self-Supervised-Learning

RIFE

Video-Frame-Interpolation

Semi-Supervised Knowledge Distillation

Knowledge-Distillation

Disentangled Attention Mechanism

Attention-Mechanisms

Res2Net

Image-Models

HetPipe

Distributed-Methods

Hybrid-Parallel-Methods

Parameter-Server-Methods

TernaryBERT

Transformers

Autoencoding-Transformers

NormFormer

Transformers

ParaNet Convolution Block

Audio-Model-Blocks

Skip-Connection-Blocks

MODNet

Portrait-Matting-Models

Conditional Positional Encoding

Position-Embeddings

DNN2LR

Deep-Tabular-Learning

Color Jitter

Image-Data-Augmentation

Multi-Query Attention

Attention

StarReLU

Activation-Functions

Cutout

Image-Data-Augmentation

Exponential Linear Unit

Activation-Functions

Hourglass Module

Image-Model-Blocks

Accordion

Distributed-Methods

Data-Parallel-Methods

FASFA: A Novel Next-Generation Backpropagation Optimizer

Stochastic-Optimization

PixLoc

6D-Pose-Estimation-Models

Cosine Normalization

Normalization

PReLU-Net

Convolutional-Neural-Networks

Random Gaussian Blur

Image-Data-Augmentation

Concatenated Skip Connection

Skip-Connections

Genetic Algorithms

Heuristic-Search-Algorithms

TorchBeast

Distributed-Methods

Distributed-Reinforcement-Learning

EdgeBoxes

Region-Proposal

Deep Orthogonal Fusion of Local and Global Features

Image-Retrieval-Models

SSD

Object-Detection-Models

One-Stage-Object-Detection-Models

Hierarchical Softmax

Output-Functions

Mirror-BERT

Self-Supervised-Learning

Sentence-Embeddings

Word-Embeddings

Dynamic Algorithm Configuration

Hyperparameter-Search

ResNeXt

Convolutional-Neural-Networks

Discriminative Adversarial Search

Sequence-Decoding-Methods

GhostNet

Convolutional-Neural-Networks

Light-weight-neural-networks

MoViNet

Video-Recognition-Models

Light-weight-neural-networks

Position-Wise Feed-Forward Layer

Feedforward-Networks

CrossViT

Vision-Transformers

Image-Models

Cycle-CenterNet

Table-Parsing-Models

GreedyNAS

Neural-Architecture-Search

VDO-SLAM

SLAM-Methods

Temporal attention

Attention-Mechanisms

Segmentation Transformer

Semantic-Segmentation-Models

Temporal Word Embeddings with a Compass

Word-Embeddings

Conditional Convolutions for Instance Segmentation

Instance-Segmentation-Models

Disp R-CNN

3D-Object-Detection-Models

Meta Pseudo Labels

Semi-Supervised-Learning-Methods

Random Scaling

Image-Data-Augmentation

Pairwise Constrained KMeans

Clustering

AdapTive Meta Optimizer

Stochastic-Optimization

Local Response Normalization

Normalization

Inception-ResNet-v2-C

Image-Model-Blocks

Skip-Connection-Blocks

Pointwise Convolution

Convolutions

AutoML-Zero

AutoML

Long Short-Term Memory

Recurrent-Neural-Networks

Symbolic rule learning

Rule-based-systems

Macaw

Question-Answering-Models

Evolved Sign Momentum

Optimization

You Only Hypothesize Once

Point-Cloud-Models

Effective Squeeze-and-Excitation Block

Image-Model-Blocks

Efficient Spatial Pyramid

Image-Model-Blocks

WideResNet

Image-Models

Convolutional-Neural-Networks

ClusterFit

Self-Supervised-Learning

Hierarchical-Split Block

Image-Model-Blocks

Skip-Connection-Blocks

AdvProp

Adversarial-Training

ASLFeat

Convolutional-Neural-Networks

Random Grayscale

Image-Data-Augmentation

Transformer Decoder

Transformers

Dilated Convolution

Convolutions

Approximating Spatiotemporal Representations Using a 2DCNN

Action-Recognition-Models

TransE

Graph-Embeddings

Continuously Indexed Domain Adaptation

Adversarial-Training

BLOOMZ

Language-Models

Temporal Graph Network

Graph-Models

WaveGlow

Generative-Audio-Models

Sequential Information Threading

Mixup

Image-Data-Augmentation

DenseNet-Elastic

Convolutional-Neural-Networks

IoU-Net

Localization-Models

MoGA-A

Convolutional-Neural-Networks

Light-weight-neural-networks

GFP-GAN

Face-Restoration-Models

Generative-Adversarial-Networks

Adafactor

Stochastic-Optimization

Large-Batch-Optimization

Attentive Walk-Aggregating Graph Neural Network

Graph-Representation-Learning

Pipelined Backpropagation

Distributed-Methods

Model-Parallel-Methods

Asynchronous-Pipeline-Parallel

InfoGAN

Generative-Models

Generative-Adversarial-Networks

Confidence Intervals for Diffusion Models

Image-Restoration-Models

Temporal Distribution Matching

Time-Series-Modules

Transformer

Transformers

Autoregressive-Transformers

InfoNCE

Loss-Functions

Stochastic Depth

Regularization

MixText

Semi-Supervised-Learning-Methods

Text-Classification-Models

Text-Augmentation

RetinaNet-RS

One-Stage-Object-Detection-Models

Object-Detection-Models

Approximate Bayesian Computation

Approximate-Inference

Frequency channel attention networks

Attention-Mechanisms

Criss-Cross Network

Semantic-Segmentation-Models

KNN and IOU based verification

Counting-Methods

AmoebaNet

Convolutional-Neural-Networks

A3C

Policy-Gradient-Methods

PREDATOR

Point-Cloud-Models

Trust Region Policy Optimization

Policy-Gradient-Methods

RetinaMask

Object-Detection-Models

One-Stage-Object-Detection-Models

Self-training Guided Prototypical Cross-domain Self-supervised learning

Domain-Adaptation

ECA-Net

Convolutional-Neural-Networks

End-to-end Adaptive Distributed Training

2D-Parallel-Distributed-Methods

Generalized State-Dependent Exploration

Exploration-Strategies

EdgeFlow

Semantic-Segmentation-Models

Interactive-Semantic-Segmentation-Models

Side-Aware Boundary Localization

Object-Detection-Models

Attention Dropout

Regularization

Tanh Activation

Activation-Functions

MLFPN

Feature-Extractors

Feature-Pyramid-Blocks

SwiGLU

Activation-Functions

RESCAL

Graph-Embeddings

Multiscale Vision Transformer

Vision-Transformers

Root-of-Mean-Squared Pooling

Pooling-Operations

TinaFace

Face-Detection-Models

FastSpeech 2

Text-to-Speech-Models

Sparse R-CNN

Object-Detection-Models

Vision-and-Langauge Transformer

Vision-and-Language-Pre-Trained-Models

Vision-aided GAN

Generative-Models

Hybrid Air-Water Temperature Difference

Non-Parametric-Regression

Octave Convolution

Convolutions

Hyperboloid Embeddings

Graph-Embeddings

CSPPeleeNet

Convolutional-Neural-Networks

Large-scale spectral clustering

Clustering

MagFace

Face-Recognition-Models

BigBird

Transformers

Attention-Patterns

Mask Scoring R-CNN

Instance-Segmentation-Models

Swapping Assignments between Views

Self-Supervised-Learning

MaskFlownet

Feature-Matching

Deformable Position-Sensitive RoI Pooling

RoI-Feature-Extractors

Adversarially Learned Inference

Generative-Models

Dilated Sliding Window Attention

Attention-Patterns

Channel Shuffle

Miscellaneous-Components

Channel-wise Cross Fusion Transformer

Semantic-Segmentation-Modules

SegNet

Semantic-Segmentation-Models

Semantic Clustering by Adopting Nearest Neighbours

Clustering

PnP

Image-Model-Blocks

PatchAugment: Local Neighborhood Augmentation in Point Cloud Classification

Point-Cloud-Augmentation

Generative Adversarial Network

Generative-Models

Generative-Adversarial-Networks

Models Genesis

3D-Representations

Reduction-B

Image-Model-Blocks

Holographic Reduced Representation

Miscellaneous-Components

VisTR

Instance-Segmentation-Models

Video-Instance-Segmentation-Models

Spectral Normalization

Normalization

RandAugment

Image-Data-Augmentation

Locality Sensitive Hashing Attention

Attention-Mechanisms

Shifted Rectified Linear Unit

Activation-Functions

CodeT5

Code-Generation-Transformers

Autoencoding-Transformers

Transformers

Decomposition-Integration Class Activation Map

Interpretability

PAR Transformer

Transformers

Augmented SBERT

Text-Augmentation

LeNet

Convolutional-Neural-Networks

FastSpeech 2s

Text-to-Speech-Models

Adaptive Spline Activation Function

Activation-Functions

Adversarial-Training

Depthwise Separable Convolution

Convolutions

DeepMind AlphaStar

Video-Game-Models

Context Enhancement Module

Feature-Extractors

Regularized Autoencoders

Generative-Models

Neural network for graphs

Graph-Models

Detailed Expression Capture and Animation

3D-Face-Mesh-Models

Recurrent Trend Predictive Neural Network

Recurrent-Neural-Networks

SM3

Stochastic-Optimization

Large-Batch-Optimization

Dueling Network

Q-Learning-Networks

Chinchilla

Language-Models

Attention Feature Filters

Attention-Mechanisms

Ternary Weight Splitting

Ternarization

FCOS

Object-Detection-Models

One-Stage-Object-Detection-Models

LMOT: Efficient Light-Weight Detection and Tracking in Crowds

Multi-Object-Tracking-Models

Sinusoidal Representation Network

Activation-Functions

SimpleNet

Convolutional-Neural-Networks

MelGAN Residual Block

Skip-Connection-Blocks

Audio-Model-Blocks

Neural Radiance Field

3D-Representations

3D-Reconstruction

Hierarchical Feature Fusion

Degridding

Cascade R-CNN

Object-Detection-Models

BasicVSR

Video-Super-Resolution-Models

Inception Module

Image-Model-Blocks

Virtual Batch Normalization

Normalization

Prioritized Sweeping

Efficient-Planning

G3D

Action-Recognition-Blocks

Automatic Structured Variational Inference

Variational-Optimization

Deeper Atrous Spatial Pyramid Pooling

Semantic-Segmentation-Modules

CSPDenseNet-Elastic

Convolutional-Neural-Networks

Channel-wise Cross Attention

Attention-Modules

Semantic-Segmentation-Modules

FixRes

Image-Scaling-Strategies

Model-Agnostic Meta-Learning

Meta-Learning-Algorithms

VoVNetV2

Convolutional-Neural-Networks

Harm-Net

Convolutional-Neural-Networks

Bidirectional GAN

Generative-Models

Generative-Adversarial-Networks

Self-Supervised-Learning

pixel2style2pixel

Unpaired-Image-to-Image-Translation

Learning to Match

Domain-Adaptation

Introspective Adversarial Network

Generative-Models

Fisher-BRC

Policy-Gradient-Methods

Offline-Reinforcement-Learning-Methods

DeepDrug

Graph-Representation-Learning

Convolutional Vision Transformer

Vision-Transformers

Image-Models

ALBERT

Transformers

Autoencoding-Transformers

WaveGrad DBlock

Audio-Model-Blocks

Stochastic Dueling Network

Value-Function-Estimation

TResNet

Convolutional-Neural-Networks

Learnable graph convolutional layer

Graph-Models

CAMoE

Video-Text-Retrieval-Models

Bort

Language-Models

Autoencoding-Transformers

Gradient Normalization

Normalization

ControlVAE

Generative-Models

Mish

Activation-Functions

ZCA Whitening

Whitening

Shifted Softplus

Activation-Functions

Latent Diffusion Model

Dimensionality-Reduction

Synthetic Minority Over-sampling Technique.

Downsampling

Gated Linear Unit

Activation-Functions

PowerSGD

Stochastic-Optimization

Optimization

Distributed-Methods

Average Pooling

Pooling-Operations

RetinaNet

Object-Detection-Models

One-Stage-Object-Detection-Models

Lambda Layer

Long-Range-Interaction-Layers

GraphSAGE

Graph-Models

StyleSwin: Transformer-based GAN for High-resolution Image Generation

Generative-Adversarial-Networks

Spatial-Channel Token Distillation

Knowledge-Distillation

ERNIE

Transformers

Residual Shuffle-Exchange Network

Music-Transcription

Neural Additive Model

Generalized-Additive-Models

Interpretability

Soft Actor Critic

Policy-Gradient-Methods

FastPitch

Text-to-Speech-Models

Quick Attention

Attention

PIRL

Self-Supervised-Learning

Sparsemax

Output-Functions

Drafting Network

Style-Transfer-Modules

AdaMax

Stochastic-Optimization

Primal Wasserstein Imitation Learning

Imitation-Learning-Methods

Spatial and Channel-wise Attention-based Convolutional Neural Network

Attention-Mechanisms

Spectral Dropout

Regularization

DeltaConv

3D-Representations

Axial Attention

Image-Model-Blocks

Attention-Mechanisms

AdaBound

Stochastic-Optimization

Implicit Subspace Prior Learning

Face-Restoration-Models

Recurrent Dropout

Regularization

ShuffleNet V2 Block

Image-Model-Blocks

TSRUs

Recurrent-Neural-Networks

Dual Softmax Loss

Loss-Functions

CutMix

Image-Data-Augmentation

Field Embedded Factorization Machine

Factorization-Machines

MCKERNEL

Convolutional-Neural-Networks

Feature-Extractors

Fourier-related-Transforms

ASGD Weight-Dropped LSTM

Recurrent-Neural-Networks

Siamese Network

Twin-Networks

Heterogeneous Molecular Graph Neural Network

Graph-Models

Spectrally Normalised GAN

Generative-Models

Generative-Adversarial-Networks

Adaptive Robust Loss

Loss-Functions

ResNeSt

Image-Models

Convolutional-Neural-Networks

DVD-GAN DBlock

Image-Model-Blocks

Skip-Connection-Blocks

Cosine Linear Unit

Activation-Functions

Adaptive-Activation-Functions

FLAVA

Vision-and-Language-Pre-Trained-Models

efficient channel attention

Attention-Mechanisms

DeepViT

Vision-Transformers

Image-Models

BatchChannel Normalization

Normalization

SESAME Discriminator

Discriminators

MuZero

Board-Game-Models

Hierarchical Entity Graph Convolutional Network

Graph-Models

Relation-Extraction-Models

squeeze-and-excitation networks

Attention-Mechanisms

ESPNet

Semantic-Segmentation-Models

Light-weight-neural-networks

Wide&Deep

Deep-Tabular-Learning

Differentiable Neural Architecture Search

Neural-Architecture-Search

Smooth ReLU

Recommendation-Systems

TuckER

Graph-Embeddings

HyperNetwork

Feedforward-Networks

Random Synthesized Attention

Attention-Mechanisms

Synthesized-Attention-Mechanisms

Adam

Stochastic-Optimization

Optimization

Large-Batch-Optimization

DeBERTa

Transformers

Autoencoding-Transformers

Kalman Optimization for Value Approximation

Policy-Evaluation

Highway Network

Feedforward-Networks

Deformable DETR

Object-Detection-Models

Vision-Transformers

Huber loss

Loss-Functions

Short-Term Dense Concatenate

Semantic-Segmentation-Modules

Magnification Prior Contrastive Similarity

Self-Supervised-Learning

RepVGG

Convolutional-Neural-Networks

Replica exchange stochastic gradient Langevin Dynamics

Markov-Chain-Monte-Carlo

BS-Net

Medical-Image-Models

GPT-NeoX

Language-Models

Neural Probabilistic Language Model

Language-Models

SOHO

Vision-and-Language-Pre-Trained-Models

Feature Selection

AutoML

LiteSeg

Semantic-Segmentation-Models

MoBY

Self-Supervised-Learning

AdaMod

Stochastic-Optimization

Cross-Covariance Attention

Attention-Mechanisms

VoiceFilter-Lite

Speech-Separation-Models

GLOW

Generative-Models

Likelihood-Based-Generative-Models

One Representation

Vision-and-Language-Pre-Trained-Models

Channel & Spatial attention

Attention-Mechanisms

End-to-End Neural Diarization

Speaker-Diarization

Center Pooling

Pooling-Operations

NADAM

Stochastic-Optimization

Large-Batch-Optimization

Hydra

Knowledge-Distillation

Variational Trace Distance Estimation

Quantum-Methods

XLSR

Speech-Recognition

Universal Language Model Fine-tuning

Language-Models

Experience Replay

Replay-Memory

DeeBERT

Transformers

Autoencoding-Transformers

GBlock

Audio-Model-Blocks

Skip-Connection-Blocks

Fast Focal Detection Network

Object-Detection-Models

DenseNAS-A

Convolutional-Neural-Networks

LR-Net

Image-Models

Graph Neural Networks with Continual Learning

Graph-Models

Test-time Local Converter

Image-Restoration-Models

Weight Demodulation

Normalization

Deep-MAC

Instance-Segmentation-Models

RegNetX

Convolutional-Neural-Networks

Clipped Double Q-learning

Off-Policy-TD-Control

Spiking Neural Networks

CTAL

Generative-Audio-Models

Multi-Modal-Methods

Animatable Reconstruction of Clothed Humans

3D-Reconstruction

Adversarial Latent Autoencoder

Generative-Models

RotatE

Graph-Embeddings

Review-guided Answer Helpfulness Prediction

Textual-Inference-Models

Dense Block

Image-Model-Blocks

Skip-Connection-Blocks

Lookahead

Stochastic-Optimization

Grid R-CNN

Object-Detection-Models

EsViT

Vision-Transformers

Re-Attention Module

Attention-Modules

CrossTransformers

Vision-Transformers

In-Place Activated Batch Normalization

Normalization

Independent Component Analysis

Dimensionality-Reduction

BART

Transformers

Sequence-To-Sequence-Models

Confidence Calibration with an Auxiliary Class)

Confidence-Calibration

1x1 Convolution

Convolutions

RandomRotate

Image-Data-Augmentation

AMSGrad

Stochastic-Optimization

Network On Network

Deep-Tabular-Learning

WaveGAN

Generative-Audio-Models

Scaled Exponential Linear Unit

Activation-Functions

Diffusion-Convolutional Neural Networks

Graph-Models

Generalizable Node Injection Attack

Adversarial-Attacks

DiffPool

Graph-Models

Residual Connection

Skip-Connections

Feedback Transformer

Transformers

Autoregressive-Transformers

Language-Models

Shuffle Transformer

Vision-Transformers

Temporal Activation Regularization

Regularization

Soft-NMS

Proposal-Filtering

CoordConv

Convolutions

Hybrid Task Cascade

Instance-Segmentation-Models

Object-Detection-Models

Hard Sigmoid

Activation-Functions

Variational Entanglement Detection

Quantum-Methods

Linear Warmup

Learning-Rate-Schedules

Class Attention

Attention

Attention-Mechanisms

Parametric Exponential Linear Unit

Activation-Functions

Adaptive-Activation-Functions

Capsule Network

Convolutional-Neural-Networks

Adaptive Hybrid Activation Function

Activation-Functions

Adaptive-Activation-Functions

ZeRO-Infinity

Distributed-Methods

Data-Parallel-Methods

Sharded-Data-Parallel-Methods

Partition Filter Network

Relation-Extraction-Models

Entity-Recognition-Models

Assemble-ResNet

Convolutional-Neural-Networks

Recurrent Entity Network

Working-Memory-Models

Maxout

Activation-Functions

Progressively Growing GAN

Generative-Models

Generative-Adversarial-Networks

Attention with Linear Biases

Inference-Extrapolation

Position-Embeddings

Decorrelated Batch Normalization

Normalization

Feature Fusion Module v1

Feature-Extractors

Squared ReLU

Activation-Functions

Gated Recurrent Unit

Recurrent-Neural-Networks

Transformer-XL

Transformers

Autoregressive-Transformers

MultiGrain

Convolutional-Neural-Networks

ReLU6

Activation-Functions

Unsupervised Feature Loss

Loss-Functions

PIoU Loss

Loss-Functions

Grab

Cashier-Free-Shopping

Multiple Random Window Discriminator

Discriminators

Encoder-Decoder model with local and pairwise loss along with shared encoder and discriminator network (EDLPS)

Document-Embeddings

Herring

Distributed-Methods

Hybrid-Parallel-Methods

Parameter-Server-Methods

Greedy Policy Search

Image-Data-Augmentation

Prediction-aware One-To-One

Detection-Assignment-Rules

TaBERT

Deep-Tabular-Learning

Scatter Connection

Miscellaneous-Components

Sigmoid Activation

Activation-Functions

Two Time-scale Update Rule

Optimization

Snapshot Ensembles: Train 1, get M for free

Active-Learning

Attention Free Transformer

Attention-Modules

Self-Adversarial Negative Sampling

Negative-Sampling

Supporting Clustering with Contrastive Learning

Clustering

Global Sub-Sampled Attention

Attention-Mechanisms

1-Dimensional Convolutional Neural Networks

Convolutions

Inception-A

Image-Model-Blocks

Squeeze aggregated excitation network

Convolutional-Neural-Networks

Lightweight Convolution

Convolutions

Temporal-Convolutions

Graph Echo State Network

Graph-Models

Embedding Dropout

Regularization

Multiplicative LSTM

Recurrent-Neural-Networks

Concatenation Affinity

Affinity-Functions

Grid Sensitive

Object-Detection-Modules

DSelect-k

Mixture-of-Experts

Gravity

Stochastic-Optimization

Multi-source Sentiment Generative Adversarial Network

Generative-Adversarial-Networks

Domain-Adaptation

Dense Synthesized Attention

Attention-Mechanisms

Synthesized-Attention-Mechanisms

CondConv

Convolutions

Deep Deterministic Policy Gradient

Policy-Gradient-Methods

Commute Times Layer

Graph-Embeddings

Graph-Models

Syntax Heat Parse Tree

Interpretability

Blended Diffusion

Image-Generation-Models

Spatially Separable Convolution

Convolutions

Adaptive Graph Convolutional Neural Networks

Graph-Models

Fast Minimum-Norm Attack

Adversarial-Attacks

Mesh-TensorFlow

Distributed-Methods

Model-Parallel-Methods

Intra-Layer-Parallel

Soft Split and Soft Composition

Video-Model-Blocks

ResNeXt-Elastic

Convolutional-Neural-Networks

Fixed Factorized Attention

Attention-Patterns

Minibatch Discrimination

Generative-Discrimination

GloVe Embeddings

Word-Embeddings

Static-Word-Embeddings

COCO-FUNIT

Unpaired-Image-to-Image-Translation

Few-Shot-Image-to-Image-Translation

CARLA: An Open Urban Driving Simulator

Video-Game-Models

Softplus

Activation-Functions

Composite Backbone Network

Backbone-Architectures

Self-Supervised Temporal Domain Adaptation

Domain-Adaptation

Deep Boltzmann Machine

Generative-Models

Co-Scale Conv-attentional Image Transformer

Vision-Transformers

style-based recalibration module

Attention-Mechanisms

Object Dropout

Image-Data-Augmentation

Linear Regression

Generalized-Linear-Models

Stand-Alone Self Attention

Object-Detection-Models

Semantic Cross Attention

Attention-Modules

ResNeXt Block

Skip-Connection-Blocks

Image-Model-Blocks

Grouped-query attention

Attention

Sparse Layer-wise Adaptive Moments optimizer for large Batch training

Large-Batch-Optimization

Polynomial Rate Decay

Learning-Rate-Schedules

Scale Aggregation Block

Image-Model-Blocks

Targeted Dropout

Regularization

AggMo

Stochastic-Optimization

True Online TD Lambda

On-Policy-TD-Control

Dual Contrastive Learning

Text-Classification-Models

Adaptive Bezier-Curve Network

Scene-Text-Models

ZFNet

Convolutional-Neural-Networks

Global Average Pooling

Pooling-Operations

Supervised Contrastive Loss

Loss-Functions

Mogrifier LSTM

Recurrent-Neural-Networks

CuBERT

Language-Models

Autoencoding-Transformers

Code-Generation-Transformers

Multi Loss ( BCE Loss + Focal Loss ) + Dice Loss

Loss-Functions

Extended Transformer Construction

Transformers

Focal Loss

Loss-Functions

Good Feature Matching

Feature-Matching

Singular Value Clipping

Adversarial-Training

Adaptive Richard's Curve Weighted Activation

Activation-Functions

DeCLUTR

Self-Supervised-Learning

Sentence-Embeddings

WaveTTS

Text-to-Speech-Models

Sequence-To-Sequence-Models

TSRUp

Recurrent-Neural-Networks

Panoptic-PolarNet

Point-Cloud-Models

GPT-2

Transformers

Autoregressive-Transformers

VirTex

Image-Representations

OODformer

Vision-Transformers

Stacked Hourglass Network

Pose-Estimation-Models

Residual GRU

Recurrent-Neural-Networks

Circular Smooth Label

Arbitrary-Object-Detectors

SCNet

Instance-Segmentation-Models

PipeDream

Distributed-Methods

Model-Parallel-Methods

Asynchronous-Pipeline-Parallel

Branch attention

Attention-Mechanisms

Residual SRM

Skip-Connection-Blocks

Image-Model-Blocks

Gated Convolution

Temporal-Convolutions

Viewmaker Network

Generative-Models

Slime Mould Algorithm

Optimization

Stochastic-Optimization

Contextual Decomposition Explanation Penalization

Interpretability

Early Dropout

Regularization

Network Dissection

Interpretability

FBNet

Convolutional-Neural-Networks

Light-weight-neural-networks

DouZero

Card-Game-Models

Gaussian Error Linear Units

Activation-Functions

Exponential Decay

Learning-Rate-Schedules

Funnel Transformer

Transformers

Distance to Modelled Embedding

Out-of-Distribution-Example-Detection

QHM

Stochastic-Optimization

Swin Transformer

Vision-Transformers

Image-Models

CenterNet

Object-Detection-Models

One-Stage-Object-Detection-Models

CubeRE

Relation-Extraction-Models

ReLIC

Self-Supervised-Learning

Strided EESP

Skip-Connection-Blocks

Image-Model-Blocks

Instances-Pixels Balance Index

Image-Semantic-Segmentation-Metric

FCPose

Pose-Estimation-Models

MixNet

Convolutional-Neural-Networks

Light-weight-neural-networks

Conditional Batch Normalization

Normalization

GradientDICE

Density-Ratio-Learning

Temporal Pyramid Network

Action-Recognition-Blocks

RAdam

Stochastic-Optimization

InterBERT

Vision-and-Language-Pre-Trained-Models

Convolutional time-domain audio separation network

Temporal-Convolutions

Speech-Separation-Models

Music-source-separation

Composite Fields

Image-Representations

Spatial-Reduction Attention

Attention-Modules

Xavier Initialization

Initialization

Region Proposal Network

Region-Proposal

Fast Voxel Query

Attention-Mechanisms

Fast R-CNN

Object-Detection-Models

SpreadsheetCoder

Spreadsheet-Formula-Prediction-Models

Object-Aware Mix

Image-Data-Augmentation

Inverted Residual Block

Skip-Connection-Blocks

Pathology Language and Image Pre-Training

Vision-and-Language-Pre-Trained-Models

Semantic Reasoning Network

Scene-Text-Models

Siamese Multi-depth Transformer-based Hierarchical Encoder

Transformers

Autoencoding-Transformers

AccoMontage

Generative-Audio-Models

Spatially-Adaptive Normalization

Normalization

DIoU-NMS

Proposal-Filtering

Compressive Transformer

Transformers

All-Attention Layer

Attention-Modules

Learning From Multiple Experts

Knowledge-Distillation

Phase Gradient Heap Integration

Phase-Reconstruction

ENet Initial Block

Image-Model-Blocks

MDETR

Object-Detection-Models

U-Net Generative Adversarial Network

Generative-Adversarial-Networks

LeVIT

Vision-Transformers

Inpainting

Self-Supervised-Learning

Linformer

Transformers

Autoregressive-Transformers

Dorylus

Distributed-Methods

Modulated Residual Network

VQA-Models

N-step Returns

Value-Function-Estimation

Point-GNN

3D-Object-Detection-Models

Graph-Models

Point-Cloud-Models

R(2+1)D

Convolutional-Neural-Networks

NVAE Encoder Residual Cell

Image-Model-Blocks

Skip-Connection-Blocks

Contrastive Multiview Coding

Self-Supervised-Learning

Normalized Temperature-scaled Cross Entropy Loss

Loss-Functions

MatrixNet

Feature-Extractors

DropPath

Regularization

Compact Convolutional Transformers

Vision-Transformers

Flow Alignment Module

Semantic-Segmentation-Modules

CharacterBERT

Language-Models

DeepLabv2

Semantic-Segmentation-Models

Self-Calibrated Convolutions

Attention-Mechanisms

TD Lambda

On-Policy-TD-Control

Deformable Convolution

Convolutions

Group-Aware Neural Network

Graph-Models

Air-Quality-Forecasting

DeepIR

Thermal-Image-Processing-Models

Demon

Momentum-Rules

Aggregated Learning

Information-Bottleneck

PrIme Sample Attention

Prioritized-Sampling

Grouped Convolution

Convolutions

Deep Voice 3

Text-to-Speech-Models

Sequence-To-Sequence-Models

BigGAN

Generative-Models

Generative-Adversarial-Networks

ENet Dilated Bottleneck

Image-Model-Blocks

Adaptive Input Representations

Input-Embedding-Factorization

Self-Training with Task Augmentation

Semi-Supervised-Learning-Methods

Self-Training-Methods

FairMOT

Multi-Object-Tracking-Models

OSA (identity mapping + eSE)

Skip-Connection-Blocks

Image-Model-Blocks

Fast Feedforward Networks

Backbone-Architectures

Deformable Kernel

Convolutions

Gated Attention Networks

Graph-Models

Ghost Bottleneck

Skip-Connection-Blocks

Image-Model-Blocks

Residual Block

Skip-Connection-Blocks

Image-Model-Blocks

TridentNet Block

Feature-Extractors

Local Augmentation

Graph-Data-Augmentation

Adaptive Masking

Attention-Mechanisms

ESPNetv2

Convolutional-Neural-Networks

Light-weight-neural-networks

G-GLN Neuron

Gated-Linear-Networks

Big-Little Module

Skip-Connection-Blocks

Image-Model-Blocks

StoGCN

Graph-Models

Gated Channel Transformation

Attention-Mechanisms

M2Det

Object-Detection-Models

One-Stage-Object-Detection-Models

Quasi-Recurrent Neural Network

Recurrent-Neural-Networks

DiCENet

Convolutional-Neural-Networks

DeepSIM

Image-Models

Image-Manipulation-Models

Local Relation Layer

Image-Feature-Extractors

DeepWalk

Graph-Embeddings

Bidirectional GRU

Bidirectional-Recurrent-Neural-Networks

Monte-Carlo Tree Search

Heuristic-Search-Algorithms

SNet

Convolutional-Neural-Networks

VGG

Convolutional-Neural-Networks

PointNet

3D-Representations

Baidu Dependency Parser

Dependency-Parsers

GA-PID/NN-PID

Control-and-Decision-Systems

ResNet-RS

Convolutional-Neural-Networks

Area Under the ROC Curve for Clustering

Clustering

Dimension-wise Fusion

Image-Model-Blocks

Random Convolutional Kernel Transform

Time-Series-Analysis

Crossmodal Contrastive Learning

Self-Supervised-Learning

Temporal Jittering

Video-Sampling

CentripetalNet

Object-Detection-Models

MobileViT

Vision-Transformers

Light-weight-neural-networks

(2+1)D Convolution

Convolutions

Cycle Consistency Loss

Loss-Functions

Stein Variational Policy Gradient

Policy-Gradient-Methods

XGrad-CAM

Explainable-CNNs

EfficientNetV2

Convolutional-Neural-Networks

Spatial Attention-Guided Mask

Mask-Branches

Attention-Modules

Source Hypothesis Transfer

Domain-Adaptation

Spectral Gap Rewiring Layer

Graph-Embeddings

Graph-Models

Active Convolution

Convolutions

Mechanism Transfer

Domain-Adaptation

FT-Transformer

Deep-Tabular-Learning

Deterministic Policy Gradient

Policy-Gradient-Methods

Dual Multimodal Attention

Attention-Mechanisms

ConvLSTM

Recurrent-Neural-Networks

Layer Normalization

Normalization

PP-YOLO

Object-Detection-Models

One-Stage-Object-Detection-Models

DCN-V2

Learning-to-Rank-Models

Deep-Tabular-Learning

Multi-Head Attention

Attention-Modules

RepPoints

Object-Detection-Models

Generalized Mean Pooling

Pooling-Operations

Location Sensitive Attention

Attention-Mechanisms

MyGym: Modular Toolkit for Visuomotor Robotic Tasks

Robotic-Manipulation-Models

Reinforcement-Learning-Frameworks

Policy-Gradient-Methods

Filter Response Normalization

Normalization

ORB-Simultaneous localization and mapping

Localization-Models

TabTransformer

Deep-Tabular-Learning

Deep Extreme Cut

Image-Segmentation-Models

3D ResNet-RS

Video-Recognition-Models

LayerDrop

Regularization

Crossbow

Distributed-Methods

Data-Parallel-Methods

Asynchronous-Data-Parallel

Nesterov Accelerated Gradient

Stochastic-Optimization

Large-Batch-Optimization

Dutch Eligibility Trace

Eligibility-Traces

PCA Whitening

Whitening

Contrastive Cross-View Mutual Information Maximization

Representation-Learning

Spatially Separable Self-Attention

Attention-Modules

Sparse Switchable Normalization

Normalization

Support Vector Machine

Non-Parametric-Classification

Non-Parametric-Regression

Mixture model network

Graph-Models

NesT

Vision-Transformers

Associative LSTM

Recurrent-Neural-Networks

Group Normalization

Normalization

Ape-X

Distributed-Reinforcement-Learning

ZeRO

Distributed-Methods

Data-Parallel-Methods

Sharded-Data-Parallel-Methods

ARShoe

6D-Pose-Estimation-Models

Augmented-Reality-Methods

Single-Shot Multi-Object Tracker

Multi-Object-Tracking-Models

Generalizable SAM

Semantic-Segmentation-Models

Base Boosting

Generalized-Additive-Models

S-shaped ReLU

Activation-Functions

Metropolis Hastings

Markov-Chain-Monte-Carlo

Tacotron2

Text-to-Speech-Models

Probabilistically Masked Language Model

Language-Models

Spatio-Temporal Attention LSTM

Attention-Mechanisms

Robust Predictable Control

Policy-Gradient-Methods

Ape-X DPG

Policy-Gradient-Methods

BezierAlign

RoI-Feature-Extractors

PointQuad-Transformer

Point-Cloud-Models

Fastformer

Transformers

Rainbow DQN

Q-Learning-Networks

Dual Attention Network

Attention-Mechanisms

Mask R-CNN

Instance-Segmentation-Models

Object-Detection-Models

Graph Convolutional Network

Graph-Models

Denoising Autoencoder

Generative-Models

Ghost Module

Image-Model-Blocks

VQSVD

Quantum-Methods

Bottleneck Transformer

Image-Models

Vision-Transformers

Co-Correcting

Medical-Image-Models

Principal Components Analysis

Dimensionality-Reduction

Image-Denoising-Models

Enhanced Seq2Seq Autoencoder via Contrastive Learning

Transformers

Noisy Student

Semi-Supervised-Learning-Methods

Beta-VAE

Generative-Models

Likelihood-Based-Generative-Models

ThunderNet

Object-Detection-Models

Iterative Latent Variable Refinement

Generative-Training

Negative Face Recognition

Face-Recognition-Models

Feature Fusion Module v2

Feature-Extractors

Selective Kernel Convolution

Convolutions

Class-Attention in Image Transformers

Vision-Transformers

PSPNet

Semantic-Segmentation-Models

mBART

Language-Models

Autoencoding-Transformers

Sequence-To-Sequence-Models

Tanh Exponential Activation Function

Activation-Functions

Auxiliary Batch Normalization

Regularization

Dynamic Time Warping

Time-Series-Analysis

Fast Sample Re-Weighting

Sample-Re-Weighting

StyleGAN2

Generative-Models

Generative-Adversarial-Networks

Random Ensemble Mixture

Q-Learning-Networks

Off-Policy-TD-Control

Randomized-Value-Functions

CSPDarknet53

Convolutional-Neural-Networks

Local Contrast Normalization

Normalization

Bootstrap Your Own Latent

Self-Supervised-Learning

Online Hard Example Mining

Prioritized-Sampling

Spatial Broadcast Decoder

Backbone-Architectures

Topographic VAE

Generative-Models

V-trace

Value-Function-Estimation

Byte Pair Encoding

Subword-Segmentation

GPT-3

Transformers

Language-Models

Autoregressive-Transformers

Fractal Block

Image-Model-Blocks

Time-homogenuous Top-K Ranking

Time-Series-Analysis

SimCLRv2

Semi-Supervised-Learning-Methods

TransferQA

Question-Answering-Models

Blind Image Decomposition Network

Image-Decomposition-Models

Switchable Normalization

Normalization

Multi-Head Linear Attention

Attention-Modules

SqueezeBERT

Transformers

Autoencoding-Transformers

Bottleneck Residual Block

Skip-Connection-Blocks

Image-Model-Blocks

LeViT Attention Block

Attention-Modules

Adaptive Training Sample Selection

Prioritized-Sampling

Discriminative Regularization

Regularization

StyleMapGAN

Generative-Adversarial-Networks

DV3 Convolution Block

Audio-Model-Blocks

Skip-Connection-Blocks

Convolutional Hough Matching

Geometric-Matching

Glow-TTS

Text-to-Speech-Models

CT3D

3D-Object-Detection-Models

DeLighT Block

Attention-Modules

Embedded Gaussian Affinity

Affinity-Functions

DVD-GAN GBlock

Image-Model-Blocks

Skip-Connection-Blocks

Dialogue-Adaptive Pre-training Objective

Dialog-Adaptation

BRepNet

CAD-Design-Models

BLOOM

Language-Models

CP with N3 Regularizer and Relation Prediction

Graph-Embeddings

Non Maximum Suppression

Proposal-Filtering

AMSBound

Stochastic-Optimization

MoGA-B

Convolutional-Neural-Networks

Light-weight-neural-networks

LOGAN

Generative-Models

Generative-Adversarial-Networks

Bilinear Attention

Attention-Mechanisms

Differentiable Hyperparameter Search

Hyperparameter-Search

Neural-Architecture-Search

TD-Gammon

Board-Game-Models

Recurrent models of visual attention

Attention-Mechanisms

Composed Video Retrieval

Video-Text-Retrieval-Models

Multi-head of Mixed Attention

Attention-Mechanisms

ProphetNet

Transformers

Language-Models

SpecGAN

Generative-Audio-Models

Differential Diffusion

Image-Generation-Models

Contractive Autoencoder

Generative-Models

ShuffleNet v2

Convolutional-Neural-Networks

Progressive Neural Architecture Search

Neural-Architecture-Search

StreaMRAK

Kernel-Methods

Track objects as points

Multi-Object-Tracking-Models

Spectral-Normalized Identity Priors

Pruning

Informative Sample Mining Network

Generative-Models

Generative-Training

NoisyNet-A3C

Policy-Gradient-Methods

Soft Actor-Critic (Autotuned Temperature)

Policy-Gradient-Methods

Surrogate Lagrangian Relaxation

Optimization

Hierarchical Style Disentanglement

Generative-Models

Image Scale Augmentation

Image-Data-Augmentation

MuVER

Entity-Retrieval-Models

Bridge-net

Audio-Model-Blocks

Expected Sarsa

On-Policy-TD-Control

Off-Policy-TD-Control

AdamW

Stochastic-Optimization

Graph Transformer

Graph-Models

YOLOv2

Object-Detection-Models

One-Stage-Object-Detection-Models

Pyramid Vision Transformer

Vision-Transformers

CheXNet

Convolutional-Neural-Networks

Estimation Statistics

Statistical-Inference

Sharpness-Aware Minimization

Optimization

Fast-YOLOv4-SmallObj

Convolutional-Neural-Networks

SqueezeNeXt Block

Skip-Connection-Blocks

Image-Model-Blocks

Position-Sensitive RoIAlign

RoI-Feature-Extractors

Gumbel Cross Entropy

Activation-Functions

QHAdam

Stochastic-Optimization

Perceiver IO

Bilateral Grid

Image-Representations

Wasserstein Embedding for Graph Learning

Graph-Embeddings

MeshGraphNet

Graph-Models

Mesh-Based-Simulation-Models

K3M

Language-Model-Pre-Training

PatchGAN

Discriminators

ReInfoSelect

Information-Bottleneck

Information-Retrieval-Methods

Fire Module

Image-Model-Blocks

Wasserstein GAN (Gradient Penalty)

Generative-Adversarial-Networks

SCARLET-NAS

Neural-Architecture-Search

Conditional Random Field

Structured-Prediction

FeatureNMS

Proposal-Filtering

Inception-ResNet-v2

Convolutional-Neural-Networks

Kaleido-BERT

Vision-and-Language-Pre-Trained-Models

RESCAL with Relation Prediction

Graph-Embeddings

Deep Belief Network

Generative-Models

Virtual Data Augmentation

Fine-Tuning

Contour Stochastic Gradient Langevin Dynamics

Markov-Chain-Monte-Carlo

Multiplicative Attention

Attention-Mechanisms

Neural Oblivious Decision Ensembles

Deep-Tabular-Learning

Deformable RoI Pooling

RoI-Feature-Extractors

WenLan

Vision-and-Language-Pre-Trained-Models

Matrix Non-Maximum Suppression

Proposal-Filtering

CornerNet

Object-Detection-Models

One-Stage-Object-Detection-Models

Fourier Contour Embedding

Text-Instance-Representations

IFBlock

Video-Model-Blocks

AutoSync

Distributed-Methods

Auto-Parallel-Methods

DropAttack

Adversarial-Training

RTMDet: An Empirical Study of Designing Real-Time Object Detectors

Object-Detection-Models

Padé Activation Units

Activation-Functions

Adaptive-Activation-Functions

SC-GPT

Transformers

Instance-Level Meta Normalization

Normalization

Barlow Twins

Self-Supervised-Learning

Thinned U-shape Module

Feature-Extractors

Legendre Memory Unit

Recurrent-Neural-Networks

AutoSmart

AutoML

Attribute2Font

Generative-Models

Font-Generation-Models

Colorization

Self-Supervised-Learning

Multiplicative RNN

Recurrent-Neural-Networks

UL2

Language-Models

GCNII

Graph-Models

Geometric Manifold Component Estimator

Manifold-Disentangling

ARM-Net

Deep-Tabular-Learning

Pansharpening Network

Convolutional-Neural-Networks

Gradient-based optimization

Optimization

ScaleNet

Convolutional-Neural-Networks

BytePS

Distributed-Methods

Hybrid-Parallel-Methods

Parameter-Server-Methods

Adaptively Spatial Feature Fusion

Feature-Pyramid-Blocks

Conditional Position Encoding Vision Transformer

Vision-Transformers

NoisyNet-Dueling

Q-Learning-Networks

GShard

Distributed-Methods

Model-Parallel-Methods

Intra-Layer-Parallel

TAPAS

Table-Question-Answering-Models

Deep-Tabular-Learning

FiLM Module

Audio-Model-Blocks

TabNN

Deep-Tabular-Learning

MobileDet

Object-Detection-Models

Light-weight-neural-networks

AutoInt

Deep-Tabular-Learning

AdaGPR

Graph-Models

3-dimensional interaction space

3D-Representations

MATE

Transformers

Table-Question-Answering-Models

Deep-Tabular-Learning

Primer

Transformers

Autoregressive-Transformers

ComiRec

Recommendation-Systems

Weight Decay

Regularization

Parameter-Norm-Penalties

Hardtanh Activation

Activation-Functions

Spatial Gating Unit

Feedforward-Networks

Distributional Generalization

Generalization

Softsign Activation

Activation-Functions

Non-Local Operation

Image-Feature-Extractors

SRGAN

Generative-Adversarial-Networks

Super-Resolution-Models

Gradient Quantization with Adaptive Levels/Multiplier

Data-Parallel-Methods

CPM-2

Language-Models

Laplacian Pyramid

Image-Representations

RandWire

Convolutional-Neural-Networks

Contrastive BERT

RL-Transformers

Global-and-Local attention

Attention-Mechanisms

Global Context Block

Image-Model-Blocks

Attention-Modules

Skip-Connection-Blocks

Softmax

Output-Functions

Pattern-Exploiting Training

Semi-Supervised-Learning-Methods

Temporally Consistent Spatial Augmentation

Video-Data-Augmentation

DELG

Convolutional-Neural-Networks

Image-Retrieval-Models

Pixel-BERT

Vision-and-Language-Pre-Trained-Models

Mixture of Softmaxes

Output-Functions

Non-linear Independent Component Estimation

Generative-Models

Likelihood-Based-Generative-Models

MnasNet

Convolutional-Neural-Networks

Light-weight-neural-networks

Factorized Random Synthesized Attention

Attention-Mechanisms

Synthesized-Attention-Mechanisms

Sinkhorn Transformer

Transformers

Autoregressive-Transformers

PixelShuffle

Miscellaneous-Components

UNIMO

Multi-Modal-Methods

Vision-and-Language-Pre-Trained-Models

Accuracy-Robustness Area

Adversarial-Training

Tofu

Distributed-Methods

Model-Parallel-Methods

Intra-Layer-Parallel

Slanted Triangular Learning Rates

Learning-Rate-Schedules

RoIAlign

RoI-Feature-Extractors

EfficientDet

Object-Detection-Models

One-Stage-Object-Detection-Models

Semantic-Segmentation-Models

HITNet

Stereo-Depth-Estimation-Models

Beneš Block with Residual Switch Units

Audio-Model-Blocks

RAG

Transformers

Agglomerative Contextual Decomposition

Interpretability

PyTorch DDP

Distributed-Methods

Data-Parallel-Methods

Replicated-Data-Parallel

Zoneout

Regularization

SAGA

Optimization

Relation-aware Global Attention

Attention-Mechanisms

ExtremeNet

Object-Detection-Models

One-Stage-Object-Detection-Models

Wavelet Distributed Training

Distributed-Methods

Data-Parallel-Methods

Asynchronous-Data-Parallel

Dynamic Convolution

Convolutions

Temporal-Convolutions

Chimera

Model-Parallel-Methods

Synchronous-Pipeline-Parallel

Distributed-Methods

SpatialDropout

Regularization

Aligning Latent and Image Spaces

Generative-Adversarial-Networks

Universal Transformer

Transformers

Autoregressive-Transformers

SongNet

Transformers

Sandwich Transformer

Transformers

Autoregressive-Transformers

Language-Models

Class activation guide

Region-Proposal

Automated Graph Learning

Graph-Models

Ensemble Clustering

Clustering

Noise2Fast

Image-Denoising-Models

Parameterized ReLU

Activation-Functions

Deep Q-Network

Q-Learning-Networks

Strip Pooling

Pooling-Operations

Adaptive Smooth Optimizer

Stochastic-Optimization

Griffin-Lim Algorithm

Phase-Reconstruction

Compressed Memory

Miscellaneous-Components

FastSGT

Dialogue-State-Trackers

TopK Copy

Copy-Mechanisms

SlowMo

Distributed-Methods

Optimization

Data-Parallel-Methods

Characterizable Invertible 3x3 Convolution

Normalization

Deep Graph Infomax

Graph-Models

CTRL

Transformers

Span-Based Dynamic Convolution

Convolutions

Temporal-Convolutions

PEGASUS

Transformers

Causal inference

Q-Learning

Off-Policy-TD-Control

3DSSD

3D-Object-Detection-Models

TraDeS

Multi-Object-Tracking-Models

Population Based Training

Optimization

Hyperparameter-Search

Hierarchical BiLSTM Max Pooling

Sequence-To-Sequence-Models

SAFRAN - Scalable and fast non-redundant rule application

Rule-based-systems

spatial transformer networks

Attention-Mechanisms

Procrustes

Generalized-Linear-Models

Make-A-Scene

Image-Generation-Models

Entropy Minimized Ensemble of Adapters

Ensembling

Weight Tying

Parameter-Sharing

Collaborative Preference Embedding

Recommendation-Systems

k-Means Clustering

Clustering

Convolution

Convolutions

NAS-FPN

Feature-Extractors

Feature-Pyramid-Blocks

FastMoE

Distributed-Methods

Hybrid-Parallel-Methods

2D-Parallel-Distributed-Methods

Twin Delayed Deep Deterministic

Policy-Gradient-Methods

Neural Tangent Transfer

Sparsity

Prescribed Generative Adversarial Network

Generative-Models

Generative-Adversarial-Networks

Spectral Clustering

Clustering

Online Multi-granularity Distillation

Knowledge-Distillation

Auditory Cortex ResNet

Audio-Model-Blocks

Gradual Self-Training

Semi-Supervised-Learning-Methods

IICNet

Image-Models

Reversible-Image-Conversion-Models

Relational Graph Convolution Network

Graph-Models

MinCut Pooling

Graph-Models

Balanced L1 Loss

Loss-Functions

GoogLeNet

Convolutional-Neural-Networks

Activation Normalization

Normalization

T5

Transformers

Sequence-To-Sequence-Models

Autoencoding-Transformers

Absolute Learning Progress and Gaussian Mixture Models for Automatic Curriculum Learning

Self-Supervised-Learning

PeleeNet

Convolutional-Neural-Networks

Light-weight-neural-networks

DPN Block

Skip-Connection-Blocks

Image-Model-Blocks

TGAN

Generative-Models

Generative-Adversarial-Networks

Generative-Video-Models

self-mem + new data

Self-Training-Methods

Gated Transformer-XL

RL-Transformers

Neighborhood Attention

Attention-Patterns

Attention-Modules

Attention-Mechanisms

Harmonic Block

Image-Model-Blocks

Canonical Partition

Graph-Data-Augmentation

Fast AutoAugment

Image-Data-Augmentation

PolarNet

Point-Cloud-Representations

Neural Image Assessment

Discriminators

SimAdapter

Attention-Modules

AutoGAN

Neural-Architecture-Search

Simple Neural Attention Meta-Learner

Recurrent-Neural-Networks

Jigsaw

Self-Supervised-Learning

Support-set Based Cross-Supervision

Video-Model-Blocks

VideoBERT

Transformers

Representation-Learning

H3DNet

Object-Detection-Models

VisuoSpatial Foresight

Robotic-Manipulation-Models

Absolute Position Encodings

Position-Embeddings

Time-aware Large Kernel Convolution

Temporal-Convolutions

Adaptive Content Generating and Preserving Network

Generative-Adversarial-Networks

Augmented-Reality-Methods

Dynamic Keypoint Head

Output-Heads

Bayesian Reward Extrapolation

Bayesian-Reinforcement-Learning

ERNIE-GEN

Language-Models

Language-Model-Pre-Training

Fine-Tuning

Probabilistic Continuously Indexed Domain Adaptation

Adversarial-Training

Adversarial Model Perturbation

Optimization

Res2Net Block

Skip-Connection-Blocks

Image-Model-Blocks

Affine Operator

Feedforward-Networks

Shape Adaptor

AutoML

Pooling-Operations

GPipe

Distributed-Methods

Model-Parallel-Methods

Synchronous-Pipeline-Parallel

AutoAugment

Image-Data-Augmentation

Chain-of-thought prompting

Prompt-Engineering

AdaGrad

Stochastic-Optimization

Large-Batch-Optimization

Split Attention

Image-Model-Blocks

IoU-guided NMS

Proposal-Filtering

Synergistic Image and Feature Alignment

Domain-Adaptation

Label Quality Model

Label-Correction

DAFNe

Object-Detection-Models

Oriented-Object-Detection-Models

Gradient Checkpointing

Stochastic-Optimization

SRGAN Residual Block

Skip-Connection-Blocks

Image-Model-Blocks

REINFORCE

Policy-Gradient-Methods

GrowNet

Deep-Tabular-Learning

Rectified Linear Unit N

Activation-Functions

Adaptive-Activation-Functions

ComplEx with N3 Regularizer

Graph-Embeddings

HiFi-GAN

Generative-Audio-Models

Generative-Adversarial-Networks

ShuffleNet V2 Downsampling Block

Image-Model-Blocks

GPT-Neo

Transformers

Visual-Linguistic BERT

Vision-and-Language-Pre-Trained-Models

Model-based Subsampling

Negative-Sampling

Hard Swish

Activation-Functions

Feedforward Network

Feedforward-Networks

Multi-Heads of Mixed Attention

Attention-Modules

Attention

Attention-Mechanisms

3D Convolution

Convolutions

LAMB

Large-Batch-Optimization

Dual Graph Convolutional Networks

Graph-Models

Gaussian Gated Linear Network

Gated-Linear-Networks

UCTransNet

Semantic-Segmentation-Models

Powerpropagation

Stochastic-Optimization

Triplet Attention

Attention-Modules

Sigmoid Linear Unit

Activation-Functions

FreeAnchor

Anchor-Supervision

Displaced Aggregation Units

Convolutions

Dynamic Convolution

Attention-Mechanisms

ChebNet

Graph-Models

Nyströmformer

Transformers

Truncation Trick

Latent-Variable-Sampling

Factorization machines with cubic splines for numerical features

Factorization-Machines

Recommendation-Systems

TSRUc

Recurrent-Neural-Networks

Cyclical Learning Rate Policy

Learning-Rate-Schedules

XCiT Layer

Image-Model-Blocks

Adaptive Dropout

Regularization

ACTKR

Policy-Gradient-Methods

Exact Fusion Model

Feature-Pyramid-Blocks

Laplacian Pyramid Network

Generative-Models

Style-Transfer-Models

CornerNet-Squeeze

Object-Detection-Models

One-Stage-Object-Detection-Models

Neo-fuzzy-neuron

Adaptive-Activation-Functions

Fuzzy-Logic

Flan-T5

Language-Models

BLIP: Bootstrapping Language-Image Pre-training

Vision-and-Language-Pre-Trained-Models

Restricted Boltzmann Machine

Generative-Models

DistanceNet

Domain-Adaptation

HyperDenseNet

Semantic-Segmentation-Models

Simulation as Augmentation

Adversarial-Training

Trajectory-Data-Augmentation

1-bit LAMB

Stochastic-Optimization

Large-Batch-Optimization

Parts, Poses, and Occlusions in 3D Visual Question Answering

Multi-Modal-Methods

6D-Pose-Estimation-Models

SGD with Momentum

Stochastic-Optimization

TimeSformer

Generative-Video-Models

SKNet

Convolutional-Neural-Networks

Self-Organizing Map

Clustering

ProxylessNAS

Neural-Architecture-Search

Graphic Mutual Information

Graph-Representation-Learning

3D-Representations

Adaptively Sparse Transformer

Transformers

Handwritten OCR augmentation

Image-Data-Augmentation

Adaptive Feature Pooling

Pooling-Operations

Attention-augmented Convolution

Convolutions

Attention-Modules

Multi-band MelGAN

Generative-Audio-Models

Mixture Normalization

Normalization

Visual Commonsense Region-based Convolutional Neural Network

Self-Supervised-Learning

Point Gathering Network

Scene-Text-Models

Elastic Dense Block

Skip-Connection-Blocks

Image-Model-Blocks

RotNet

Self-Supervised-Learning

Class-MLP

Pooling-Operations

Context Aggregated Bi-lateral Network for Semantic Segmentation

Semantic-Segmentation-Models

Memory-Associated Differential Learning

Semi-Supervised-Learning-Methods

TD-VAE

Generative-Sequence-Models

SortCut Sinkhorn Attention

Attention-Mechanisms

Exponential Linear Squashing Activation

Activation-Functions

SAINT

Deep-Tabular-Learning

Inverted Bottleneck BERT

Transformers

Autoencoding-Transformers

Iterative Pseudo-Labeling

Semi-Supervised-Learning-Methods

Speech-Recognition

Distributed Shampoo

Stochastic-Optimization

Large-Batch-Optimization

Attention Sinks

Attention

Disentangled Attribution Curves

Interpretability

SEER

Self-Supervised-Learning

Varifocal Loss

Loss-Functions

lda2vec

Word-Embeddings

Static-Word-Embeddings

Document-Embeddings

Adaptive Parameter-wise Diagonal Quasi-Newton Method

Stochastic-Optimization

ProxylessNet-Mobile

Image-Models

Convolutional-Neural-Networks

Light-weight-neural-networks

Adaptive Span Transformer

Transformers

Autoregressive-Transformers

MACEst

Confidence-Estimators

EfficientNet

Image-Models

Convolutional-Neural-Networks

End-To-End Memory Network

Working-Memory-Models

VarifocalNet

Object-Detection-Models

Global Convolutional Network

Semantic-Segmentation-Modules

EMQAP

Question-Answering-Models

Metric mixup

Loss-Functions

Anycost GAN

Generative-Adversarial-Networks

DeepMask

Region-Proposal

Focal Transformers

Vision-Transformers

Momentum Contrast

Self-Supervised-Learning

Semi-Supervised-Learning-Methods

Teacher-Tutor-Student Knowledge Distillation

Knowledge-Distillation

Additive Attention

Attention-Mechanisms

AugMix

Image-Data-Augmentation

Activation Regularization

Regularization

SepFormer

Speech-Separation-Models

ClipBERT

Generative-Video-Models

Transformers

Cosine Power Annealing

Learning-Rate-Schedules

Crystal Graph Neural Network

Graph-Models

Laplacian Positional Encodings

Graph-Embeddings

Decentralized Distributed Proximal Policy Optimization

Distributed-Reinforcement-Learning

NetAdapt

Network-Shrinking

HyperGraph Self-Attention

Attention-Mechanisms

Channel Attention Module

Image-Model-Blocks

Attention-Modules

SRU

Recurrent-Neural-Networks

Blue River Controls

Reinforcement-Learning-Frameworks

TILDEv2

Passage-Re-Ranking-Models

Information-Retrieval-Methods

Four-dimensional A-star

Heuristic-Search-Algorithms

Multiplex Molecular Graph Neural Network

Graph-Models

Self-supervised Equivariant Attention Mechanism

Attention-Mechanisms

LayoutLMv2

Document-Understanding-Models

Colorization Transformer

Vision-Transformers

Image-Colorization-Models

Synthesizer

Language-Models

Cascade Mask R-CNN

Instance-Segmentation-Models

VL-T5

Vision-and-Language-Pre-Trained-Models

AdaRNN

Recurrent-Neural-Networks

Model Rubik's Cube: Twisting Resolution, Depth and Width for TinyNets

Network-Shrinking

Orientation Regularized Network

Pose-Estimation-Blocks

PointASNL

Point-Cloud-Models

MobileBERT

Transformers

Autoencoding-Transformers

Panoptic FPN

Feature-Extractors

HyperTree MetaModel

Neural-Architecture-Search

Gated Positional Self-Attention

Attention-Modules

Polyak Averaging

Stochastic-Optimization

Transformer in Transformer

Transformers

Backbone-Architectures

Image-Models

Meena

Conversational-Models

Open-Domain-Chatbots

Mixture of Logistic Distributions

Output-Functions

Inception-v3

Convolutional-Neural-Networks

GreedyNAS-B

Convolutional-Neural-Networks

Switch Transformer

Transformers

Autoencoding-Transformers

CR-NET

Convolutional-Neural-Networks

Random elastic image morphing

Image-Data-Augmentation

Local Interpretable Model-Agnostic Explanations

Interpretability

Cosine Annealing

Learning-Rate-Schedules

Set Transformer

Attention-Mechanisms

Visual Parsing

Vision-and-Language-Pre-Trained-Models

Batchboost

Image-Data-Augmentation

Spatial Pyramid Pooling

Pooling-Operations

Latent Optimisation

Latent-Variable-Sampling

k-Nearest Neighbors

Non-Parametric-Classification

Non-Parametric-Regression

Implicit Graph Contrastive Learning

Graph-Representation-Learning

WaveGrad UBlock

Audio-Model-Blocks

Batch Normalization

Normalization

GLM

Language-Models

Stochastically Scaling Features and Gradients Regularization

Activation-Functions

VQ-VAE-2

Generative-Models

Likelihood-Based-Generative-Models

3D-Face-Mesh-Models

ConViT

Vision-Transformers

Image-Models

DeLighT

Transformers

Language-Models

Autoregressive-Transformers

Sliced Iterative Generator

Generative-Models

Dense Contrastive Learning

Self-Supervised-Learning

Inception-v3 Module

Image-Model-Blocks

Weighted Recurrent Quality Enhancement

Video-Model-Blocks

FractalNet

Convolutional-Neural-Networks

CRISS

Self-Supervised-Learning

Patch AutoAugment

Image-Data-Augmentation

Diffusion

Image-Generation-Models

Language-Models

Tokens-To-Token Vision Transformer

Vision-Transformers

Image-Models

CReLU

Activation-Functions

Weight Normalization

Normalization

Attentional Liquid Warping GAN

Generative-Adversarial-Networks

Residual Multi-Layer Perceptrons

Image-Models

Internet Explorer

Self-Supervised-Learning

BIMAN

Bot-Detection

nnFormer

Semantic-Segmentation-Models

Vision-Transformers

Locally-Grouped Self-Attention

Attention-Mechanisms

PAFPN

Feature-Extractors

Feature-Pyramid-Blocks

Dynamic R-CNN

Object-Detection-Models

CayleyNet

Graph-Models

FFB6D

6D-Pose-Estimation-Models

HRank

Pruning

Synaptic Neural Network

Neural-Architecture-Search

Minimum Description Length

AutoML

Gaussian Process

Non-Parametric-Classification

Non-Parametric-Regression

self-DIstillation with NO labels

Self-Supervised-Learning

Vision-Transformers

SAGAN Self-Attention Module

Attention-Modules

SCARF

Deep-Tabular-Learning

Gather-Excite Networks

Attention-Mechanisms

Deformable Attention Module

Attention-Modules

MoGA-C

Convolutional-Neural-Networks

Light-weight-neural-networks

Sparse Transformer

Transformers

Autoregressive-Transformers

Gaussian Affinity

Affinity-Functions

RealNVP

Generative-Models

Likelihood-Based-Generative-Models

Canvas Method

Inference-Attack

Deflation

Miscellaneous-Components

Forward-Looking Actor

Actor-Critic-Algorithms

Kaiming Initialization

Initialization

AltDiffusion

Image-Generation-Models

Batch Nuclear-norm Maximization

Regularization

FRILL

Speech-Embeddings

Differentiable Architecture Search Max-W

Neural-Architecture-Search

Cross-encoder Reranking

Language-Models

DistDGL

Distributed-Methods

Canonical Tensor Decomposition with N3 Regularizer

Graph-Embeddings

Log-time and Log-space Extreme Classification

Structured-Prediction

Highway networks

Attention-Mechanisms

Reduction-A

Image-Model-Blocks

Stable Rank Normalization

Regularization

Normalization

COLA

Generative-Audio-Models

Self-Supervised-Learning

DeepLabv3

Semantic-Segmentation-Models

MobileNetV3

Convolutional-Neural-Networks

Light-weight-neural-networks

Content-Conditioned Style Encoder

Image-Model-Blocks

Hierarchical Network Dissection

Interpretability

Siamese U-Net

Convolutional-Neural-Networks

Trans-Encoder

Sentence-Embeddings

Self-Supervised-Learning

Local Relation Network

Image-Model-Blocks

GreedyNAS-C

Convolutional-Neural-Networks

Florence

Vision-and-Language-Pre-Trained-Models

PointRend

Semantic-Segmentation-Modules

Instance-Segmentation-Modules

Inception-ResNet-v2-A

Image-Model-Blocks

Accumulating Eligibility Trace

Eligibility-Traces

Unsupervised Deep Manifold Attributed Graph Embedding

Clustering

Dynamic SmoothL1 Loss

Loss-Functions

Collapsing Linear Unit

Activation-Functions

PSFR-GAN

Generative-Adversarial-Networks

Face-Restoration-Models

ShapeConv

Convolutions

building to building transfer learning

Imitation-Learning-Methods

Knowledge-Distillation

HardELiSH

Activation-Functions

SegFormer

Semantic-Segmentation-Models

Fast Attention Via Positive Orthogonal Random Features

Attention-Mechanisms

U2-Net

Object-Detection-Models

Conditional DBlock

Audio-Model-Blocks

Skip-Connection-Blocks

Uncertainty Class Activation Map (U-CAM) Using Gradient Certainty Method

VQA-Models

Dilated Bottleneck with Projection Block

Skip-Connection-Blocks

Image-Model-Blocks

SqueezeNeXt

Convolutional-Neural-Networks

DetNASNet

Convolutional-Neural-Networks

Manifold Mixup

Regularization

Learning Cross-Modality Encoder Representations from Transformers

Vision-and-Language-Pre-Trained-Models

Wavelet-integrated Identity Preserving Adversarial Network for face super-resolution

Face-Restoration-Models

Contextualized Topic Models

Topic-Embeddings

Contextualized-Word-Embeddings

Clustering

Neural Turing Machine

Working-Memory-Models

Recurrent-Neural-Networks

Submanifold Convolution

Convolutions

FuseFormer

Generative-Video-Models

Video-Inpainting-Models

Positional Encoding Generator

Miscellaneous-Components

Boundary-Aware Segmentation Network

Semantic-Segmentation-Models

Probabilistic Anchor Assignment

Anchor-Generation-Modules

FlexFlow

Distributed-Methods

Auto-Parallel-Methods

Kernel Activation Function

Activation-Functions

Deep Stereo Geometry Network

3D-Object-Detection-Models

Temporal Distribution Characterization

Time-Series-Modules

Label Smoothing

Regularization

SuperpixelGridCut, SuperpixelGridMean, SuperpixelGridMix

Image-Data-Augmentation

Max Pooling

Pooling-Operations

PocketNet

Convolutional-Neural-Networks

Face-Recognition-Models

Spatial Attention Module (ThunderNet)

Feature-Extractors

ParaNet

Text-to-Speech-Models

Sequence-To-Sequence-Models

FastGCN

Graph-Models

Gaussian Mixture Variational Autoencoder

Regularization

ENet

Semantic-Segmentation-Models

MPRNet

Image-Restoration-Models

Slot Attention

Attention-Modules

CSPDenseNet

Convolutional-Neural-Networks

DFDNet

Face-Restoration-Models

ScheduledDropPath

Regularization

Routing Transformer

Transformers

Autoregressive-Transformers

Playstyle Distance

State-Similarity-Metrics

Playstyle

Representation-Learning

Sarsa

On-Policy-TD-Control

Retrace

Value-Function-Estimation

Context-aware Visual Attention-based (CoVA) webpage object detection pipeline

Object-Detection-Models

Webpage-Object-Detection-Pipeline

Dimension-wise Convolution

Convolutions

Synchronized Batch Normalization

Normalization

Libra R-CNN

Object-Detection-Models

Harris Hawks optimization

Optimization

Normalized Linear Combination of Activations

Activation-Functions

Adaptive-Activation-Functions

AdaSqrt

Stochastic-Optimization

IoU-Balanced Sampling

Prioritized-Sampling

NVAE Generative Residual Cell

Image-Model-Blocks

Skip-Connection-Blocks

IFNet

Video-Frame-Interpolation

FoveaBox

Object-Detection-Models

One-Stage-Object-Detection-Models

Continuous Bag-of-Words Word2Vec

Word-Embeddings

Static-Word-Embeddings

ViP-DeepLab

Video-Panoptic-Segmentation-Models

Monocular-Depth-Estimation-Models

Anti-Alias Downsampling

Downsampling

GBST

Subword-Segmentation

Spatial CNN with UNet based Encoder-decoder and ConvLSTM

Image-Segmentation-Models

EfficientUNet++

Semantic-Segmentation-Models

AlphaFold

Jukebox

Generative-Audio-Models

IMPALA

Policy-Gradient-Methods

Distributed-Reinforcement-Learning

Distributed-Methods

Batch Transformer

Vision-Transformers

ComplEx with N3 Regularizer and Relation Prediction Objective

Graph-Embeddings

Graph Attention Network

Graph-Models

VGG Loss

Loss-Functions

Local SGD

Stochastic-Optimization

Optimization

Distributed-Methods

Attention Mesh

3D-Face-Mesh-Models

Mirror Descent Policy Optimization

Policy-Gradient-Methods

Gumbel Softmax

Distributions

Local Prior Matching

Semi-Supervised-Learning-Methods

Spatial & Temporal Attention

Attention-Mechanisms

PP-YOLOv2

Object-Detection-Models

Energy Based Process

Non-Parametric-Regression

Dense Connections

Feedforward-Networks

Self-Attention Network

Image-Models

CurricularFace

Face-Recognition-Models

Pyramid Vision Transformer v2

Vision-Transformers

Image-Models

Vision-Language pretrained Model

Vision-and-Language-Pre-Trained-Models

GCNet

Object-Detection-Models

Instance-Segmentation-Models

MoCo v3

Vision-Transformers

Poincaré Embeddings

Word-Embeddings

Static-Word-Embeddings

Channel-wise Soft Attention

Attention-Mechanisms

DetNet

Convolutional-Neural-Networks

DenseNAS-B

Convolutional-Neural-Networks

StyleALAE

Generative-Models