InternVideo: General Video Foundation Models via Generative and Discriminative LearningVision-and-Language-Pre-Trained-Models
Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal AlignmentMulti-Modal-Methods
Language-driven Scene Synthesis using Multi-conditional Diffusion Model3D-RepresentationsDiffusion-Models
LipGANGenerative-Adversarial-NetworksConditional-Image-to-Image-Translation-ModelsFace-to-Face-Translation
Extremely Efficient Spatial Pyramid of Depth-wise Dilated Separable ConvolutionsImage-Model-BlocksSkip-Connection-Blocks
Amplifying Sine Unit: An Oscillatory Activation Function for Deep Neural Networks to Recover Nonlinear Oscillations EfficientlyActivation-Functions
Hybrid Firefly and Particle Swarm OptimizationOptimizationHybrid-OptimizationHeuristic-Search-Algorithms
Protagonist Antagonist Induced Regret Environment DesignAdversarial-TrainingEnvironment-Design-Methods
Distribution-induced Bidirectional Generative Adversarial Network for Graph Representation LearningGraph-Embeddings
Guided Language to Image Diffusion for Generation and EditingMulti-Modal-MethodsImage-Generation-Models
Pansharpening by convolutional neural networks in the full resolution frameworkConvolutional-Neural-Networks
Segmentation of patchy areas in biomedical images based on local edge density estimationImage-Segmentation-Models
Contour Proposal NetworkObject-Detection-ModelsInstance-Segmentation-ModelsOne-Stage-Object-Detection-Models
DE-GAN: A Conditional Generative Adversarial Network for Document EnhancementGenerative-Adversarial-Networks
A Framework for Leader Identification in Coordinated ActivityTime-Series-AnalysisLeadership-Inference
SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource SettingsMonocular-Depth-Estimation-Models
Encoder-Decoder model with local and pairwise loss along with shared encoder and discriminator network (EDLPS)Document-Embeddings
Multi-source Sentiment Generative Adversarial NetworkGenerative-Adversarial-NetworksDomain-Adaptation
Convolutional time-domain audio separation networkTemporal-ConvolutionsSpeech-Separation-ModelsMusic-source-separation+1
MyGym: Modular Toolkit for Visuomotor Robotic TasksRobotic-Manipulation-ModelsReinforcement-Learning-FrameworksPolicy-Gradient-Methods
Absolute Learning Progress and Gaussian Mixture Models for Automatic Curriculum LearningSelf-Supervised-Learning
Adaptive Content Generating and Preserving NetworkGenerative-Adversarial-NetworksAugmented-Reality-Methods
Factorization machines with cubic splines for numerical featuresFactorization-MachinesRecommendation-Systems
Parts, Poses, and Occlusions in 3D Visual Question AnsweringMulti-Modal-Methods6D-Pose-Estimation-Models
Learning Cross-Modality Encoder Representations from TransformersVision-and-Language-Pre-Trained-Models
Wavelet-integrated Identity Preserving Adversarial Network for face super-resolutionFace-Restoration-Models
Context-aware Visual Attention-based (CoVA) webpage object detection pipelineObject-Detection-ModelsWebpage-Object-Detection-Pipeline