Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-03-22 | LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models | Yuzhang Shang et.al. | 2403.15388v1 | null |
2024-03-22 | Time-efficient, high-resolution 3T whole-brain relaxometry using Cartesian 3D MR-STAT with CSF suppression | Hongyan Liu et.al. | 2403.15379v1 | null |
2024-03-22 | Long-CLIP: Unlocking the Long-Text Capability of CLIP | Beichen Zhang et.al. | 2403.15378v1 | null |
2024-03-22 | InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding | Yi Wang et.al. | 2403.15377v1 | null |
2024-03-22 | Cascading Blackout Severity Prediction with Statistically-Augmented Graph Neural Networks | Joe Gorka et.al. | 2403.15363v1 | null |
2024-03-22 | SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series | Badri N. Patro et.al. | 2403.15360v1 | null |
2024-03-22 | Ultrasound Imaging based on the Variance of a Diffusion Restoration Model | Yuxin Zhang et.al. | 2403.15316v1 | null |
2024-03-22 | Global Control for Local SO(3)-Equivariant Scale-Invariant Vessel Segmentation | Patryk Rygiel et.al. | 2403.15314v1 | null |
2024-03-22 | Quantum-inspired classification via efficient simulation of Helstrom measurement | Wooseop Hwang et.al. | 2403.15308v1 | null |
2024-03-22 | Reconnaissance ultracool spectra in the Euclid Deep Fields | Jerry Jun-Yan Zhang et.al. | 2403.15288v1 | null |
2024-03-21 | Language Repository for Long Video Understanding | Kumara Kahatapitiya et.al. | 2403.14622v1 | link |
2024-03-22 | Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion | Xiang Fan et.al. | 2403.14617v2 | null |
2024-03-21 | Explorative Inbetweening of Time and Space | Haiwen Feng et.al. | 2403.14611v1 | null |
2024-03-21 | ReNoise: Real Image Inversion Through Iterative Noising | Daniel Garibi et.al. | 2403.14602v1 | null |
2024-03-21 | PSALM: Pixelwise SegmentAtion with Large Multi-Modal Model | Zheng Zhang et.al. | 2403.14598v1 | link |
2024-03-21 | Large Language Models for Multi-Choice Question Classification of Medical Subjects | Víctor Ponce-López et.al. | 2403.14582v1 | null |
2024-03-21 | DINO-Tracker: Taming DINO for Self-Supervised Point Tracking in a Single Video | Narek Tumanyan et.al. | 2403.14548v1 | null |
2024-03-21 | Estimating Physical Information Consistency of Channel Data Augmentation for Remote Sensing Images | Tom Burgert et.al. | 2403.14547v1 | null |
2024-03-21 | Transfer Learning for Cross-dataset Isolated Sign Language Recognition in Under-Resourced Datasets | Ahmet Alp Kindiroglu et.al. | 2403.14534v1 | link |
2024-03-21 | Invisible Needle Detection in Ultrasound: Leveraging Mechanism-Induced Vibration | Chenyang Li et.al. | 2403.14523v1 | null |
2024-03-21 | Denoising Diffusion Models for 3D Healthy Brain Tissue Inpainting | Alicia Durrer et.al. | 2403.14499v1 | link |
2024-03-20 | TimeRewind: Rewinding Time with Image-and-Events Video Diffusion | Jingxi Chen et.al. | 2403.13800v1 | null |
2024-03-20 | Hierarchical NeuroSymbolic Approach for Action Quality Assessment | Lauren Okamoto et.al. | 2403.13798v1 | null |
2024-03-20 | Bridge the Modality and Capacity Gaps in Vision-Language Model Selection | Chao Yi et.al. | 2403.13797v1 | null |
2024-03-20 | The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency and Usability in AI | Matt White et.al. | 2403.13784v1 | null |
2024-03-20 | Gradings on associative triple systems of the second kind | Alberto Daza-Garcia et.al. | 2403.13775v1 | null |
2024-03-20 | Towards Principled Representation Learning from Videos for Reinforcement Learning | Dipendra Misra et.al. | 2403.13765v1 | null |
2024-03-20 | Enhancing Gait Video Analysis in Neurodegenerative Diseases by Knowledge Augmentation in Vision Language Model | Diwei Wang et.al. | 2403.13756v1 | null |
2024-03-20 | Be-Your-Outpainter: Mastering Video Outpainting through Input-Specific Adaptation | Fu-Yun Wang et.al. | 2403.13745v1 | null |
2024-03-20 | Probabilistic Forecasting with Stochastic Interpolants and Föllmer Processes | Yifan Chen et.al. | 2403.13724v1 | null |
2024-03-20 | Improving the Adaptive Moment Estimation (ADAM) stochastic optimizer through an Implicit-Explicit (IMEX) time-stepping approach | Abhinab Bhattacharjee et.al. | 2403.13704v1 | null |
2024-03-19 | LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression | Zhuoshi Pan et.al. | 2403.12968v1 | null |
2024-03-19 | FRESCO: Spatial-Temporal Correspondence for Zero-Shot Video Translation | Shuai Yang et.al. | 2403.12962v1 | link |
2024-03-19 | WHAC: World-grounded Humans and Cameras | Wanqi Yin et.al. | 2403.12959v1 | null |
2024-03-19 | FutureDepth: Learning to Predict the Future Improves Video Depth Estimation | Rajeev Yasarla et.al. | 2403.12953v1 | null |
2024-03-19 | Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Elaine Sui et.al. | 2403.12952v1 | link |
2024-03-19 | Legendrian loops and cluster modular groups | James Hughes et.al. | 2403.12951v1 | null |
2024-03-19 | Vid2Robot: End-to-end Video-conditioned Policy Learning with Cross-Attention Transformers | Vidhi Jain et.al. | 2403.12943v1 | null |
2024-03-19 | Contextual AD Narration with Interleaved Multimodal Sequence | Hanlin Wang et.al. | 2403.12922v1 | null |
2024-03-19 | Semantic Layering in Room Segmentation via LLMs | Taehyeon Kim et.al. | 2403.12920v1 | null |
2024-03-19 | Yell At Your Robot: Improving On-the-Fly from Language Corrections | Lucy Xiaoyang Shi et.al. | 2403.12910v1 | null |
2024-03-18 | Time Series Compression using Quaternion Valued Neural Networks and Quaternion Backpropagation | Johannes Pöppelbaum et.al. | 2403.11722v1 | null |
2024-03-18 | Virbo: Multimodal Multilingual Avatar Video Generation in Digital Marketing | Juan Zhang et.al. | 2403.11700v1 | null |
2024-03-18 | A Spatial-Temporal Progressive Fusion Network for Breast Lesion Segmentation in Ultrasound Videos | Zhengzheng Tu et.al. | 2403.11699v1 | null |
2024-03-18 | Object Segmentation-Assisted Inter Prediction for Versatile Video Coding | Zhuoyuan Li et.al. | 2403.11694v1 | null |
2024-03-19 | MoreStyle: Relax Low-frequency Constraint of Fourier-based Image Reconstruction in Generalizable Medical Image Segmentation | Haoyu Zhao et.al. | 2403.11689v2 | null |
2024-03-18 | Better (pseudo-)labels for semi-supervised instance segmentation | François Porcher et.al. | 2403.11675v1 | null |
2024-03-19 | WIA-LD2ND: Wavelet-based Image Alignment for Self-supervised Low-Dose CT Denoising | Haoyu Zhao et.al. | 2403.11672v2 | null |
2024-03-18 | Binary Noise for Binary Tasks: Masked Bernoulli Diffusion for Unsupervised Anomaly Detection | Julia Wolleb et.al. | 2403.11667v1 | null |
2024-03-18 | Combining Local and Global Perception for Autonomous Navigation on Nano-UAVs | Lorenzo Lamberti et.al. | 2403.11661v1 | null |
2024-03-18 | LocalStyleFool: Regional Video Style Transfer Attack Using Segment Anything Model | Yuxin Cao et.al. | 2403.11656v1 | null |
2024-03-15 | Strong and Controllable Blind Image Decomposition | Zeyu Zhang et.al. | 2403.10520v1 | link |
2024-03-15 | Frozen Feature Augmentation for Few-Shot Image Classification | Andreas Bär et.al. | 2403.10519v1 | null |
2024-03-15 | VideoAgent: Long-form Video Understanding with Large Language Model as Agent | Xiaohan Wang et.al. | 2403.10517v1 | null |
2024-03-15 | Surveyor: Facilitating Discovery Within Video Games for Blind and Low Vision Players | Vishnu Nair et.al. | 2403.10512v1 | null |
2024-03-15 | Benchmarking Zero-Shot Robustness of Multimodal Foundation Models: A Pilot Study | Chenguang Wang et.al. | 2403.10499v1 | link |
2024-03-15 | Joint Multimodal Transformer for Dimensional Emotional Recognition in the Wild | Paul Waligora et.al. | 2403.10488v1 | null |
2024-03-15 | Tensor Star Decomposition | Wuyang Zhou et.al. | 2403.10481v1 | null |
2024-03-15 | Using an LLM to Turn Sign Spottings into Spoken Language Sentences | Ozge Mercanoglu Sincan et.al. | 2403.10434v1 | null |
2024-03-15 | Neural Networks Hear You Loud And Clear: Hearing Loss Compensation Using Deep Neural Networks | Peter Leer et.al. | 2403.10420v1 | null |
2024-03-15 | A comparative study on machine learning approaches for rock mass classification using drilling data | Tom F. Hansen et.al. | 2403.10404v1 | null |
2024-03-14 | Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models | Akhil Kedia et.al. | 2403.09635v1 | link |
2024-03-14 | Generalized Predictive Model for Autonomous Driving | Jiazhi Yang et.al. | 2403.09630v1 | link |
2024-03-14 | From the Conformal Anomaly to the Virasoro Algebra | Sid Maibach et.al. | 2403.09628v1 | null |
2024-03-14 | Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding | Guo Chen et.al. | 2403.09626v1 | link |
2024-03-14 | Score-Guided Diffusion for 3D Human Recovery | Anastasis Stathopoulos et.al. | 2403.09623v1 | link |
2024-03-14 | PosSAM: Panoptic Open-vocabulary Segment Anything | Vibashan VS et.al. | 2403.09620v1 | null |
2024-03-14 | Explore In-Context Segmentation via Latent Diffusion Models | Chaoyang Wang et.al. | 2403.09616v1 | null |
2024-03-14 | Compute-first optical detection for noise-resilient visual perception | Jungmin Kim et.al. | 2403.09612v1 | null |
2024-03-14 | Mixture of Mixups for Multi-label Classification of Rare Anuran Sounds | Ilyass Moummad et.al. | 2403.09598v1 | link |
2024-03-14 | DungeonMaker: Embedding Tangible Creation and Destruction in Hybrid Board Games through Personal Fabrication Technology | Evgeny Stemasov et.al. | 2403.09592v1 | null |
2024-03-13 | VLOGGER: Multimodal Diffusion for Embodied Avatar Synthesis | Enric Corona et.al. | 2403.08764v1 | null |
2024-03-13 | Segmentation of Knee Bones for Osteoarthritis Assessment: A Comparative Analysis of Supervised, Few-Shot, and Zero-Shot Learning Approaches | Yun Xin Teoh et.al. | 2403.08761v1 | null |
2024-03-13 | MIM4D: Masked Modeling with Multi-View Video for Autonomous Driving Representation Learning | Jialv Zou et.al. | 2403.08760v1 | link |
2024-03-13 | Spatiotemporal Diffusion Model with Paired Sampling for Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08758v1 | null |
2024-03-13 | DAM: Dynamic Adapter Merging for Continual Video QA Learning | Feng Cheng et.al. | 2403.08755v1 | link |
2024-03-13 | Clinically Feasible Diffusion Reconstruction for Highly-Accelerated Cardiac Cine MRI | Shihan Qiu et.al. | 2403.08749v1 | null |
2024-03-13 | Torsion pairs, t-structures, and co-t-structures for completions of discrete cluster categories | Sofia Franchini et.al. | 2403.08735v1 | null |
2024-03-13 | Euclid: Testing photometric selection of emission-line galaxy targets | M. S. Cagliari et.al. | 2403.08726v1 | null |
2024-03-13 | Diffusion-based Iterative Counterfactual Explanations for Fetal Ultrasound Image Quality Assessment | Paraskevas Pegios et.al. | 2403.08700v1 | null |
2024-03-13 | Implicit Regularization of Gradient Flow on One-Layer Softmax Attention | Heejune Sheen et.al. | 2403.08699v1 | null |
2024-03-12 | OPEN TEACH: A Versatile Teleoperation System for Robotic Manipulation | Aadhithya Iyer et.al. | 2403.07870v1 | null |
2024-03-12 | TeleMoMa: A Modular and Versatile Teleoperation System for Mobile Manipulation | Shivin Dass et.al. | 2403.07869v1 | null |
2024-03-12 | Iterative Graph Neural Network Enhancement via Frequent Subgraph Mining of Explanations | Harish G. Naik et.al. | 2403.07849v1 | null |
2024-03-12 | When Eye-Tracking Meets Machine Learning: A Systematic Review on Applications in Medical Image Analysis | Sahar Moradizeyveh et.al. | 2403.07834v1 | null |
2024-03-12 | DeliGrasp: Inferring Object Mass, Friction, and Compliance with LLMs for Adaptive and Minimally Deforming Grasp Policies | William Xie et.al. | 2403.07832v1 | null |
2024-03-12 | A geometric model for the module category of a string algebra | Karin Baur et.al. | 2403.07810v1 | null |
2024-03-12 | BraSyn 2023 challenge: Missing MRI synthesis and the effect of different learning objectives | Ivo M. Baltruschat et.al. | 2403.07800v1 | null |
2024-03-12 | A robust SVM-based approach with feature selection and outliers detection for classification problems | Marta Baldomero-Naranjo et.al. | 2403.07753v1 | null |
2024-03-12 | Vision-based Vehicle Re-identification in Bridge Scenario using Flock Similarity | Chunfeng Zhang et.al. | 2403.07752v1 | null |
2024-03-12 | Harnessing two-photon dissipation for enhanced quantum measurement and control | Antoine Marquet et.al. | 2403.07744v1 | null |
2024-03-11 | Attention Prompt Tuning: Parameter-efficient Adaptation of Pre-trained Models for Spatiotemporal Modeling | Wele Gedara Chaminda Bandara et.al. | 2403.06978v1 | link |
2024-03-12 | VideoMamba: State Space Model for Efficient Video Understanding | Kunchang Li et.al. | 2403.06977v2 | link |
2024-03-11 | Memory-based Adapters for Online 3D Scene Perception | Xiuwei Xu et.al. | 2403.06974v1 | null |
2024-03-11 | Explainable Transformer Prototypes for Medical Diagnoses | Ugur Demir et.al. | 2403.06961v1 | link |
2024-03-11 | Quadruped-Frog: Rapid Online Optimization of Continuous Quadruped Jumping | Guillaume Bellegarda et.al. | 2403.06954v1 | null |
2024-03-11 | Optimizing Latent Graph Representations of Surgical Scenes for Zero-Shot Domain Transfer | Siddhant Satyanaik et.al. | 2403.06953v1 | null |
2024-03-11 | Advancing Generalizable Remote Physiological Measurement through the Integration of Explicit and Implicit Prior Knowledge | Yuting Zhang et.al. | 2403.06947v1 | link |
2024-03-11 | Conditional Score-Based Diffusion Model for Cortical Thickness Trajectory Prediction | Qing Xiao et.al. | 2403.06940v1 | null |
2024-03-11 | FocusCLIP: Multimodal Subject-Level Guidance for Zero-Shot Transfer in Human-Centric Tasks | Muhammad Saif Ullah Khan et.al. | 2403.06904v1 | null |
2024-03-11 | Benign overfitting in leaky ReLU networks with moderate input dimension | Kedar Karhadkar et.al. | 2403.06903v1 | null |
2024-03-08 | Tell, Don't Show!: Language Guidance Eases Transfer Across Domains in Images and Videos | Tarun Kalluri et.al. | 2403.05535v1 | null |
2024-03-08 | Tune without Validation: Searching for Learning Rate and Weight Decay on Training Sets | Lorenzo Brigato et.al. | 2403.05532v1 | null |
2024-03-08 | Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context | Machel Reid et.al. | 2403.05530v1 | null |
2024-03-08 | Take Your Best Shot: Sampling-Based Next-Best-View Planning for Autonomous Photography & Inspection | Shijie Gao et.al. | 2403.05477v1 | null |
2024-03-08 | Will GPT-4 Run DOOM? | Adrian de Wynter et.al. | 2403.05468v1 | null |
2024-03-08 | Evaluating AI and Human Authorship Quality in Academic Writing through Physics Essays | Will Yeadon et.al. | 2403.05458v1 | null |
2024-03-08 | VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models | Yabo Zhang et.al. | 2403.05438v1 | link |
2024-03-08 | OmniCount: Multi-label Object Counting with Semantic-Geometric Priors | Anindya Mondal et.al. | 2403.05435v1 | null |
2024-03-08 | Infinite Translation Surfaces in the Wild | Vincent Delecroix et.al. | 2403.05424v1 | null |
2024-03-08 | Rethinking Transformers Pre-training for Multi-Spectral Satellite Imagery | Mubashir Noman et.al. | 2403.05419v1 | link |
2024-03-07 | DeepSee: Multidimensional Visualizations of Seabed Ecosystems | Adam Coscia et.al. | 2403.04761v1 | link |
2024-03-07 | iScore: Visual Analytics for Interpreting How Language Models Automatically Score Summaries | Adam Coscia et.al. | 2403.04760v1 | link |
2024-03-07 | KnowledgeVIS: Interpreting Language Models by Comparing Fill-in-the-Blank Prompts | Adam Coscia et.al. | 2403.04758v1 | link |
2024-03-07 | Preliminary Guidelines For Combining Data Integration and Visual Data Analysis | Adam Coscia et.al. | 2403.04757v1 | link |
2024-03-07 | Photonic probabilistic machine learning using quantum vacuum noise | Seou Choi et.al. | 2403.04731v1 | null |
2024-03-07 | Analysis of Systems' Performance in Natural Language Processing Competitions | Sergio Nava-Muñoz et.al. | 2403.04693v1 | null |
2024-03-07 | CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios | Qilang Ye et.al. | 2403.04640v1 | link |
2024-03-07 | Scalable, Simulation-Guided Compliant Tactile Finger Design | Yuxiang Ma et.al. | 2403.04638v1 | null |
2024-03-08 | Pix2Gif: Motion-Guided Diffusion for GIF Generation | Hitesh Kandala et.al. | 2403.04634v2 | null |
2024-03-07 | MedFLIP: Medical Vision-and-Language Self-supervised Fast Pre-Training with Masked Autoencoder | Lei Li et.al. | 2403.04626v1 | null |
2024-03-06 | 3D Diffusion Policy | Yanjie Ze et.al. | 2403.03954v1 | link |
2024-03-06 | Stop Regressing: Training Value Functions via Classification for Scalable Deep RL | Jesse Farebrother et.al. | 2403.03950v1 | null |
2024-03-06 | Reconciling Reality through Simulation: A Real-to-Sim-to-Real Approach for Robust Manipulation | Marcel Torne et.al. | 2403.03949v1 | null |
2024-03-06 | DART: Implicit Doppler Tomography for Radar Novel View Synthesis | Tianshu Huang et.al. | 2403.03896v1 | null |
2024-03-06 | Joint multi-task learning improves weakly-supervised biomarker prediction in computational pathology | Omar S. M. El Nahhas et.al. | 2403.03891v1 | link |
2024-03-06 | Hierarchical Diffusion Policy for Kinematics-Aware Multi-Task Robotic Manipulation | Xiao Ma et.al. | 2403.03890v1 | null |
2024-03-06 | Decoupled Vertical Federated Learning for Practical Training on Vertically Partitioned Data | Avi Amalanshu et.al. | 2403.03871v1 | null |
2024-03-06 | X-Shot: A Unified System to Handle Frequent, Few-shot and Zero-shot Learning Simultaneously in Classification | Hanzi Xu et.al. | 2403.03863v1 | link |
2024-03-06 | ProxNF: Neural Field Proximal Training for High-Resolution 4D Dynamic Image Reconstruction | Luke Lozenski et.al. | 2403.03860v1 | null |
2024-03-06 | MedMamba: Vision Mamba for Medical Image Classification | Yubiao Yue et.al. | 2403.03849v1 | link |
2024-03-05 | Extension Theory and Fermionic Strongly Fusion 2-Categories | Thibault D. Décoppet et.al. | 2403.03211v1 | null |
2024-03-05 | Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Patrick Esser et.al. | 2403.03206v1 | null |
2024-03-05 | Behavior Generation with Latent Actions | Seungjae Lee et.al. | 2403.03181v1 | link |
2024-03-05 | Deep-Learned Compression for Radio-Frequency Signal Classification | Armani Rodriguez et.al. | 2403.03150v1 | null |
2024-03-05 | Dual Mean-Teacher: An Unbiased Semi-Supervised Framework for Audio-Visual Source Localization | Yuxin Guo et.al. | 2403.03145v1 | link |
2024-03-05 | Motion-Corrected Moving Average: Including Post-Hoc Temporal Information for Improved Video Segmentation | Robert Mendel et.al. | 2403.03120v1 | null |
2024-03-05 | Equilibria in Two-Stage Facility Location with Atomic Clients | Simon Krogmann et.al. | 2403.03114v1 | null |
2024-03-05 | Galaxies in the Zone of Avoidance: Misclassifications using machine learning tools | P. Marchant Cortés et.al. | 2403.03098v1 | null |
2024-03-05 | Collective self-caging of active filaments in virtual confinement | Maximilian Kurjahn et.al. | 2403.03093v1 | null |
2024-03-05 | A Backpack Full of Skills: Egocentric Video Understanding with Diverse Task Perspectives | Simone Alberto Peirone et.al. | 2403.03037v1 | null |
2024-03-03 | Enhancing Retinal Vascular Structure Segmentation in Images With a Novel Design Two-Path Interactive Fusion Module Model | Rui Yang et.al. | 2403.01362v1 | null |
2024-03-02 | Improve Cost Efficiency of Active Learning over Noisy Dataset | Zan-Kai Chong et.al. | 2403.01346v1 | null |
2024-03-02 | An eternal hypersurface flow arising in centro-affine geometry | Xinjie Jiang et.al. | 2403.01340v1 | null |
2024-03-02 | Image-Based Dietary Assessment: A Healthy Eating Plate Estimation System | Assylzhan Izbassar et.al. | 2403.01310v1 | null |
2024-03-02 | VNLP: Turkish NLP Package | Meliksah Turker et.al. | 2403.01309v1 | null |
2024-03-02 | Towards a classification of |
Alyson Deines et.al. | 2403.01287v1 | null |
2024-03-02 | Irfan Habib et.al. | 2403.01285v1 | null | |
2024-03-02 | Fast Low-parameter Video Activity Localization in Collaborative Learning Environments | Venkatesh Jatla et.al. | 2403.01281v1 | null |
2024-03-02 | Rigidity results for group von Neumann algebras with diffuse center | Ionuţ Chifan et.al. | 2403.01280v1 | null |
2024-03-02 | Can a Confident Prior Replace a Cold Posterior? | Martin Marek et.al. | 2403.01272v1 | link |
2024-02-29 | Panda-70M: Captioning 70M Videos with Multiple Cross-Modality Teachers | Tsai-Shien Chen et.al. | 2402.19479v1 | null |
2024-02-29 | Towards Generalizable Tumor Synthesis | Qi Chen et.al. | 2402.19470v1 | null |
2024-02-29 | Humanoid Locomotion as Next Token Prediction | Ilija Radosavovic et.al. | 2402.19469v1 | null |
2024-03-01 | TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning | Kate Sanders et.al. | 2402.19467v2 | null |
2024-02-29 | Heavy-Tailed Class Imbalance and Why Adam Outperforms Gradient Descent on Language Models | Frederik Kunstner et.al. | 2402.19449v1 | null |
2024-02-29 | Probing the Information Encoded in Neural-based Acoustic Models of Automatic Speech Recognition Systems | Quentin Raymondaud et.al. | 2402.19443v1 | null |
2024-02-29 | Pushing the Limits of Cross-Embodiment Learning for Manipulation and Navigation | Jonathan Yang et.al. | 2402.19432v1 | null |
2024-02-29 | PaECTER: Patent-level Representation Learning using Citation-informed Transformers | Mainak Ghosh et.al. | 2402.19411v1 | null |
2024-02-29 | Navigating Hallucinations for Reasoning of Unintentional Activities | Shresth Grover et.al. | 2402.19405v1 | null |
2024-02-29 | A Newborn AGN in a Starforming Galaxy | P. Arévalo et.al. | 2402.19403v1 | null |
2024-02-28 | Time-efficient filtering of polarimetric data by checking physical realizability of experimental Mueller matrices | Tatiana Novikova et.al. | 2402.18555v1 | null |
2024-02-28 | Selection of appropriate multispectral camera exposure settings and radiometric calibration methods for applications in phenotyping and precision agriculture | Vaishali Swaminathan et.al. | 2402.18553v1 | null |
2024-02-28 | Implicit Bias of Next-Token Prediction | Christos Thrampoulidis et.al. | 2402.18551v1 | null |
2024-02-28 | Defect Detection in Tire X-Ray Images: Conventional Methods Meet Deep Structures | Andrei Cozma et.al. | 2402.18527v1 | null |
2024-02-28 | Do galaxy mergers prefer under-dense environments? | U. Sureshkumar et.al. | 2402.18520v1 | null |
2024-02-28 | Log Neural Controlled Differential Equations: The Lie Brackets Make a Difference | Benjamin Walker et.al. | 2402.18512v1 | null |
2024-02-28 | Orchid: Flexible and Data-Dependent Convolution for Sequence Modeling | Mahdi Karami et.al. | 2402.18508v1 | null |
2024-02-28 | Detection of Micromobility Vehicles in Urban Traffic Videos | Khalil Sabri et.al. | 2402.18503v1 | link |
2024-02-28 | Few-Shot Fairness: Unveiling LLM's Potential for Fairness-Aware Classification | Garima Chhikara et.al. | 2402.18502v1 | null |
2024-02-28 | ROG$_{PL}$: Robust Open-Set Graph Learning via Region-Based Prototype Learning | Qin Zhang et.al. | 2402.18495v1 | null |
2024-02-27 | Diffusion Meets DAgger: Supercharging Eye-in-hand Imitation Learning | Xiaoyu Zhang et.al. | 2402.17768v1 | null |
2024-02-27 | Towards Optimal Learning of Language Models | Yuxian Gu et.al. | 2402.17759v1 | null |
2024-02-27 | An Eye Gaze Heatmap Analysis of Uncertainty Head-Up Display Designs for Conditional Automated Driving | Michael A. Gerber et.al. | 2402.17751v1 | null |
2024-02-27 | Scaling on-chip photonic neural processors using arbitrarily programmable wave propagation | Tatsuhiro Onodera et.al. | 2402.17750v1 | link |
2024-02-27 | Linking Order to Strength in Metals | Nicolas Argibay et.al. | 2402.17728v1 | null |
2024-02-27 | MedContext: Learning Contextual Cues for Efficient Volumetric Medical Segmentation | Hanan Gani et.al. | 2402.17725v1 | link |
2024-02-27 | Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners | Yazhou Xing et.al. | 2402.17723v1 | null |
2024-02-27 | Understanding Neural Network Binarization with Forward and Backward Proximal Quantizers | Yiwei Lu et.al. | 2402.17710v1 | null |
2024-02-27 | NextLevelBERT: Investigating Masked Language Modeling with Higher-Level Representations for Long Documents | Tamara Czinczoll et.al. | 2402.17682v1 | null |
2024-02-27 | MCF-VC: Mitigate Catastrophic Forgetting in Class-Incremental Learning for Multimodal Video Captioning | Huiyu Xiong et.al. | 2402.17680v1 | null |
2024-02-26 | Open Your Ears to Take a Look: A State-of-the-Art Report on the Integration of Sonification and Visualization | Kajetan Enge et.al. | 2402.16558v1 | null |
2024-02-26 | LLM-based Privacy Data Augmentation Guided by Knowledge Distillation with a Distribution Tutor for Medical Text Classification | Yiping Song et.al. | 2402.16515v1 | null |
2024-02-26 | Photonic Neural Network Fabricated on Thin Film Lithium Niobate for High-Fidelity and Power-Efficient Matrix Computation | Yong Zheng et.al. | 2402.16513v1 | null |
2024-02-26 | Intelligent Known and Novel Aircraft Recognition -- A Shift from Classification to Similarity Learning for Combat Identification | Ahmad Saeed et.al. | 2402.16486v1 | null |
2024-02-26 | Edge Detectors Can Make Deep Convolutional Neural Networks More Robust | Jin Ding et.al. | 2402.16479v1 | null |
2024-02-26 | Autonomous Integration of TSN-unaware Applications with QoS Requirements in TSN Networks | Moritz Fluechter et.al. | 2402.16454v1 | null |
2024-02-26 | Retrouver l'inventeur-auteur : la lev{é}e d'homonymies d'autorat entre les brevets et les publications scientifiques | David Reymond et.al. | 2402.16440v1 | null |
2024-02-26 | Improving behavior based authentication against adversarial attack using XAI | Dong Qin et.al. | 2402.16430v1 | null |
2024-02-26 | Adaptive Online Learning of Separable Path Graph Transforms for Intra-prediction | Wen-Yang Lu et.al. | 2402.16371v1 | null |
2024-02-26 | DEYO: DETR with YOLO for End-to-End Object Detection | Haodong Ouyang et.al. | 2402.16370v1 | null |
2024-02-26 | SPINEPS -- Automatic Whole Spine Segmentation of T2-weighted MR images using a Two-Phase Approach to Multi-class Semantic and Instance Segmentation | Hendrik Möller et.al. | 2402.16368v1 | link |
2024-02-26 | An Integrated Data Processing Framework for Pretraining Foundation Models | Yiding Sun et.al. | 2402.16358v1 | link |
2024-02-26 | What Text Design Characterizes Book Genres? | Daichi Haraguchi et.al. | 2402.16356v1 | null |
2024-02-23 | A Comprehensive Survey of Convolutions in Deep Learning: Applications, Challenges, and Future Trends | Abolfazl Younesi et.al. | 2402.15490v1 | null |
2024-02-23 | Retinotopic Mapping Enhances the Robustness of Convolutional Neural Networks | Jean-Nicolas Jérémie et.al. | 2402.15480v1 | null |
2024-02-23 | FAIR: Filtering of Automatically Induced Rules | Divya Jyoti Bajpai et.al. | 2402.15472v1 | null |
2024-02-23 | GROS: A General Robust Aggregation Strategy | Alejandro Cholaquidis et.al. | 2402.15442v1 | null |
2024-02-23 | Hierarchical Invariance for Robust and Interpretable Vision Tasks at Larger Scales | Shuren Qi et.al. | 2402.15430v1 | link |
2024-02-23 | ProTIP: Probabilistic Robustness Verification on Text-to-Image Diffusion Models against Stochastic Perturbation | Yi Zhang et.al. | 2402.15429v1 | link |
2024-02-23 | Understanding Entrainment in Human Groups: Optimising Human-Robot Collaboration from Lessons Learned during Human-Human Collaboration | Eike Schneiders et.al. | 2402.15427v1 | null |
2024-02-23 | PREDILECT: Preferences Delineated with Zero-Shot Language-based Reasoning in Reinforcement Learning | Simon Holk et.al. | 2402.15420v1 | null |
2024-02-23 | G-RepsNet: A Fast and General Construction of Equivariant Networks for Arbitrary Matrix Groups | Sourya Basu et.al. | 2402.15413v1 | null |
2024-02-23 | A Universal Method for Solar Filament Detection from H-alpha Observations using Semi-supervised Deep Learning | Andrea Diercke et.al. | 2402.15407v1 | null |
2024-02-22 | Link Prediction under Heterophily: A Physics-Inspired Graph Neural Network Approach | Andrea Giuseppe Di Francesco et.al. | 2402.14802v1 | null |
2024-02-22 | Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis | Willi Menapace et.al. | 2402.14797v1 | null |
2024-02-22 | Customize-A-Video: One-Shot Motion Customization of Text-to-Video Diffusion Models | Yixuan Ren et.al. | 2402.14780v1 | null |
2024-02-22 | Zero-Shot Pediatric Tuberculosis Detection in Chest X-Rays using Self-Supervised Learning | Daniel Capellán-Martín et.al. | 2402.14741v1 | null |
2024-02-22 | Solitons of the mean curvature flow in |
Rafael López et.al. | 2402.14727v1 | null |
2024-02-22 | A Transformer Model for Boundary Detection in Continuous Sign Language | Razieh Rastgoo et.al. | 2402.14720v1 | null |
2024-02-22 | InfFeed: Influence Functions as a Feedback to Improve the Performance of Subjective Tasks | Somnath Banerjee et.al. | 2402.14702v1 | null |
2024-02-22 | Big data analytics to classify earthwork-related locations: A Chengdu study | Lei Yu et.al. | 2402.14698v1 | null |
2024-02-22 | Rethinking Invariance Regularization in Adversarial Training to Improve Robustness-Accuracy Trade-off | Futa Waseda et.al. | 2402.14648v1 | null |
2024-02-22 | Distributed Radiance Fields for Edge Video Compression and Metaverse Integration in Autonomous Driving | Eugen Šlapak et.al. | 2402.14642v1 | null |
2024-02-21 | A Simple and Yet Fairly Effective Defense for Graph Neural Networks | Sofiane Ennadir et.al. | 2402.13987v1 | link |
2024-02-21 | On modular representations of inner forms of |
Johannes Droschl et.al. | 2402.13969v1 | null |
2024-02-21 | New directions in algebraic statistics: Three challenges from 2023 | Yulia Alexandr et.al. | 2402.13961v1 | null |
2024-02-21 | On the topological classification of complex plane curve singularities | Alberto Fernández-Hernández et.al. | 2402.13941v1 | null |
2024-02-21 | Verifying message-passing neural networks via topology-based bounds tightening | Christopher Hojny et.al. | 2402.13937v1 | null |
2024-02-21 | Tumor segmentation on whole slide images: training or prompting? | Huaqian Wu et.al. | 2402.13932v1 | null |
2024-02-21 | BenchCloudVision: A Benchmark Analysis of Deep Learning Approaches for Cloud Detection and Segmentation in Remote Sensing Imagery | Loddo Fabio et.al. | 2402.13918v1 | link |
2024-02-21 | An Explainable Transformer-based Model for Phishing Email Detection: A Large Language Model Approach | Mohammad Amaz Uddin et.al. | 2402.13871v1 | null |
2024-02-21 | RFI-DRUnet: Restoring dynamic spectra corrupted by radio frequency interference -- Application to pulsar observations | Xiao Zhang et.al. | 2402.13867v1 | null |
2024-02-21 | What we can learn from TikTok through its Research API | Francesco Corso et.al. | 2402.13855v1 | null |
2024-02-20 | Video ReCap: Recursive Captioning of Hour-Long Videos | Md Mohaiminul Islam et.al. | 2402.13250v1 | null |
2024-02-20 | SMORE: Similarity-based Hyperdimensional Domain Adaptation for Multi-Sensor Time Series Classification | Junyao Wang et.al. | 2402.13233v1 | null |
2024-02-20 | A Touch, Vision, and Language Dataset for Multimodal Alignment | Letian Fu et.al. | 2402.13232v1 | null |
2024-02-20 | NeRF Solves Undersampled MRI Reconstruction | Tae Jun Jang et.al. | 2402.13226v1 | null |
2024-02-20 | VideoPrism: A Foundational Visual Encoder for Video Understanding | Long Zhao et.al. | 2402.13217v1 | null |
2024-02-20 | How do Hyenas deal with Human Speech? Speech Recognition and Translation with ConfHyena | Marco Gaido et.al. | 2402.13208v1 | null |
2024-02-20 | A novel image correction method for cloud-affected observations with Imaging Atmospheric Cherenkov Telescopes | Natalia Żywucka et.al. | 2402.13190v1 | null |
2024-02-20 | UniEdit: A Unified Tuning-Free Framework for Video Motion and Appearance Editing | Jianhong Bai et.al. | 2402.13185v1 | null |
2024-02-20 | DINOBot: Robot Manipulation via Retrieval and Alignment with Vision Foundation Models | Norman Di Palo et.al. | 2402.13181v1 | null |
2024-02-20 | 3D Kinematics Estimation from Video with a Biomechanical Model and Synthetic Training Data | Zhi-Yi Lin et.al. | 2402.13172v1 | null |
2024-02-19 | Short-Period Variables in TESS Full-Frame Image Light Curves Identified via Convolutional Neural Networks | Greg Olmschenk et.al. | 2402.12369v1 | null |
2024-02-19 | The first all-sky survey of star-forming galaxies with eROSITA: Scaling relations and a population of X-ray luminous starbursts | E. Kyritsis et.al. | 2402.12367v1 | null |
2024-02-19 | An Adversarial Approach to Evaluating the Robustness of Event Identification Models | Obai Bahwal et.al. | 2402.12338v1 | null |
2024-02-19 | Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Christian Schlarmann et.al. | 2402.12336v1 | link |
2024-02-19 | Generating Survival Interpretable Trajectories and Data | Andrei V. Konstantinov et.al. | 2402.12331v1 | null |
2024-02-19 | Asymptotic Gaussian Fluctuations of Eigenvectors in Spectral Clustering | Hugo Lebeau et.al. | 2402.12302v1 | null |
2024-02-19 | Time-periodic behaviour in one- and two-dimensional interacting particle systems | Jonas Köppl et.al. | 2402.12300v1 | null |
2024-02-19 | Is Open-Source There Yet? A Comparative Study on Commercial and Open-Source LLMs in Their Ability to Label Chest X-Ray Reports | Felix J. Dorfner et.al. | 2402.12298v1 | null |
2024-02-19 | Revisiting registration-based synthesis: A focus on unsupervised MR image synthesis | Savannah P. Hays et.al. | 2402.12288v1 | null |
2024-02-19 | Significance of Chirp MFCC as a Feature in Speech and Audio Applications | S. Johanan Joysingh et.al. | 2402.12239v1 | null |
2024-02-16 | PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter | Junfei Xiao et.al. | 2402.10896v1 | null |
2024-02-16 | Fusion of Diffusion Weighted MRI and Clinical Data for Predicting Functional Outcome after Acute Ischemic Stroke with Deep Contrastive Learning | Chia-Ling Tsai et.al. | 2402.10894v1 | null |
2024-02-16 | Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation | Ziyang Wang et.al. | 2402.10887v1 | link |
2024-02-16 | Control Color: Multimodal Diffusion-based Interactive Image Colorization | Zhexin Liang et.al. | 2402.10855v1 | null |
2024-02-16 | HistoSegCap: Capsules for Weakly-Supervised Semantic Segmentation of Histological Tissue Type in Whole Slide Images | Mobina Mansoori et.al. | 2402.10851v1 | null |
2024-02-16 | FedD2S: Personalized Data-Free Federated Knowledge Distillation | Kawa Atapour et.al. | 2402.10846v1 | null |
2024-02-16 | Pedipulate: Enabling Manipulation Skills using a Quadruped Robot's Leg | Philip Arm et.al. | 2402.10837v1 | null |
2024-02-16 | GAN-driven Electromagnetic Imaging of 2-D Dielectric Scatterers | Ehtasham Naseer et.al. | 2402.10831v1 | null |
2024-02-16 | Structure results for torus fixed loci | Jarod Alper et.al. | 2402.10823v1 | null |
2024-02-16 | Training Class-Imbalanced Diffusion Model Via Overlap Optimization | Divin Yan et.al. | 2402.10821v1 | link |
2024-02-15 | Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling | Raunaq Bhirangi et.al. | 2402.10211v1 | null |
2024-02-15 | FedAnchor: Enhancing Federated Semi-Supervised Learning with Label Contrastive Loss for Unlabeled Clients | Xinchi Qiu et.al. | 2402.10191v1 | null |
2024-02-15 | Euclid preparation. Measuring detailed galaxy morphologies for Euclid with Machine Learning | Euclid Collaboration et.al. | 2402.10187v1 | link |
2024-02-15 | DeepSRGM -- Sequence Classification and Ranking in Indian Classical Music with Deep Learning | Sathwik Tejaswi Madhusudhan et.al. | 2402.10168v1 | null |
2024-02-15 | Holographic covering and the fortuity of black holes | Chi-Ming Chang et.al. | 2402.10129v1 | null |
2024-02-15 | Classification Diffusion Models | Shahar Yadin et.al. | 2402.10095v1 | null |
2024-02-15 | MIM-Refiner: A Contrastive Learning Boost from Intermediate Pre-Trained Representations | Benedikt Alkin et.al. | 2402.10093v1 | link |
2024-02-15 | GraphCBAL: Class-Balanced Active Learning for Graph Neural Networks via Reinforcement Learning | Chengcheng Yu et.al. | 2402.10074v1 | null |
2024-02-15 | Both Matter: Enhancing the Emotional Intelligence of Large Language Models without Compromising the General Intelligence | Weixiang Zhao et.al. | 2402.10073v1 | null |
2024-02-15 | NYCTALE: Neuro-Evidence Transformer for Adaptive and Personalized Lung Nodule Invasiveness Prediction | Sadaf Khademi et.al. | 2402.10066v1 | null |
2024-02-14 | LL-GABR: Energy Efficient Live Video Streaming Using Reinforcement Learning | Adithya Raman et.al. | 2402.09392v1 | null |
2024-02-14 | GraSSRep: Graph-Based Self-Supervised Learning for Repeat Detection in Metagenomic Assembly | Ali Azizpour et.al. | 2402.09381v1 | link |
2024-02-14 | Deep Rib Fracture Instance Segmentation and Classification from CT on the RibFrac Challenge | Jiancheng Yang et.al. | 2402.09372v1 | null |
2024-02-14 | Magic-Me: Identity-Specific Video Customized Diffusion | Ze Ma et.al. | 2402.09368v1 | null |
2024-02-14 | Small instanton-induced flavor invariants and the axion potential | Ravneet Bedi et.al. | 2402.09361v1 | null |
2024-02-14 | Pruning Sparse Tensor Neural Networks Enables Deep Learning for 3D Ultrasound Localization Microscopy | Brice Rauby et.al. | 2402.09359v1 | null |
2024-02-14 | DoRA: Weight-Decomposed Low-Rank Adaptation | Shih-Yang Liu et.al. | 2402.09353v1 | null |
2024-02-14 | Irreducible representations of the crystallisation of the |
Manabendra Giri et.al. | 2402.09347v1 | null |
2024-02-14 | Registration of Longitudinal Spine CTs for Monitoring Lesion Growth | Malika Sanhinova et.al. | 2402.09341v1 | null |
2024-02-14 | Stability and Multigroup Fairness in Ranking with Uncertain Predictions | Siddartha Devic et.al. | 2402.09326v1 | null |
2024-02-13 | IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation | Luke Melas-Kyriazi et.al. | 2402.08682v1 | null |
2024-02-13 | A Convergence Analysis of Approximate Message Passing with Non-Separable Functions and Applications to Multi-Class Classification | Burak Çakmak et.al. | 2402.08676v1 | null |
2024-02-13 | Learning Emergent Gaits with Decentralized Phase Oscillators: on the role of Observations, Rewards, and Feedback | Jenny Zhang et.al. | 2402.08662v1 | null |
2024-02-13 | BdSLW60: A Word-Level Bangla Sign Language Dataset | Husne Ara Rubaiyeat et.al. | 2402.08635v1 | link |
2024-02-13 | Convolutional Neural Networks Towards Facial Skin Lesions Detection | Reza Sarshar et.al. | 2402.08592v1 | null |
2024-02-13 | Totally geodesic submanifolds and polar actions on Stiefel manifolds | Claudio Gorodski et.al. | 2402.08585v1 | null |
2024-02-13 | Motion-Adaptive Inference for Flexible Learned B-Frame Compression | M. Akin Yilmaz et.al. | 2402.08550v1 | null |
2024-02-13 | Approximately Piecewise E(3) Equivariant Point Networks | Matan Atzmon et.al. | 2402.08529v1 | null |
2024-02-13 | Reduced-order modeling of the dynamics of an inverted flag from experimental data | Zhenwei Xu et.al. | 2402.08504v1 | null |
2024-02-13 | Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models | Shaeke Salman et.al. | 2402.08473v1 | null |
2024-02-13 | Wavefront Randomization Improves Deconvolution | Amit Kohli et.al. | 2402.07900v2 | null |
2024-02-12 | Detection of Spider Mites on Labrador Beans through Machine Learning Approaches Using Custom Datasets | Violet Liu et.al. | 2402.07895v1 | null |
2024-02-12 | Perfect stable regularity lemma and slice-wise stable hypergraphs | Artem Chernikov et.al. | 2402.07870v1 | null |
2024-02-12 | On Computationally Efficient Multi-Class Calibration | Parikshit Gopalan et.al. | 2402.07821v1 | null |
2024-02-12 | A Benchmark Grocery Dataset of Realworld Point Clouds From Single View | Shivanand Venkanna Sheshappanavar et.al. | 2402.07819v1 | null |
2024-02-12 | Fixation for |
Laure Marêché et.al. | 2402.07807v1 | null |
2024-02-12 | Estimation of non-uniform blur using a patch-based regression convolutional neural network (CNN) | Luis G. Varela et.al. | 2402.07796v1 | null |
2024-02-12 | "Layer-by-layer" Unsupervised Clustering of Statistically Relevant Fluctuations in Noisy Time-series Data of Complex Dynamical Systems | Matteo Becchi et.al. | 2402.07786v1 | null |
2024-02-12 | Solving parameter-dependent semi-algebraic systems | Louis Gaillard et.al. | 2402.07782v1 | null |
2024-02-12 | Observations of the new meteor shower from comet 46P/Wirtanen | D. Vida et.al. | 2402.07769v1 | null |
2024-02-09 | A two-stage algorithm in evolutionary product unit neural networks for classification | Antonio J. Tallón-Ballesteros et.al. | 2402.06622v1 | null |
2024-02-09 | Image-based Deep Learning for the time-dependent prediction of fresh concrete properties | Max Meyer et.al. | 2402.06611v1 | null |
2024-02-09 | SAE: Single Architecture Ensemble Neural Networks | Martin Ferianc et.al. | 2402.06580v1 | null |
2024-02-09 | Video Annotator: A framework for efficiently building video classifiers using vision-language models and active learning | Amir Ziai et.al. | 2402.06560v1 | link |
2024-02-09 | Self Supervised Learning for Improved Calibrationless Radial MRI with NLINV-Net | Moritz Blumenthal et.al. | 2402.06550v1 | null |
2024-02-09 | Bryndza at ClimateActivism 2024: Stance, Target and Hate Event Detection via Retrieval-Augmented GPT-4 and LLaMA | Marek Šuppa et.al. | 2402.06549v1 | null |
2024-02-09 | Feature Density Estimation for Out-of-Distribution Detection via Normalizing Flows | Evan D. Cook et.al. | 2402.06537v1 | null |
2024-02-09 | Refining Myocardial Infarction Detection: A Novel Multi-Modal Composite Kernel Strategy in One-Class Classification | Muhammad Uzair Zahid et.al. | 2402.06530v1 | null |
2024-02-09 | Flexible infinite-width graph convolutional networks and the importance of representation learning | Ben Anson et.al. | 2402.06525v1 | null |
2024-02-09 | Dynamic swarms regulate the morphology and distribution of soft membrane domains | Aakanksha Gubbala et.al. | 2402.06518v1 | null |
2024-02-08 | Classifying Nodes in Graphs without GNNs | Daniel Winter et.al. | 2402.05934v1 | link |
2024-02-08 | An Interactive Agent Foundation Model | Zane Durante et.al. | 2402.05929v1 | null |
2024-02-08 | Point-VOS: Pointing Up Video Object Segmentation | Idil Esen Zulfikar et.al. | 2402.05917v1 | null |
2024-02-08 | A Survey on Detection, Classification, and Tracking of Aerial Threats using Radar and Communications Systems | Wahab Khawaja et.al. | 2402.05909v1 | null |
2024-02-09 | Large Language Model Meets Graph Neural Network in Knowledge Distillation | Shengxiang Hu et.al. | 2402.05894v2 | null |
2024-02-08 | Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data | Shufan Li et.al. | 2402.05892v1 | null |
2024-02-08 | CREMA: Multimodal Compositional Video Reasoning via Efficient Modular Adaptation and Fusion | Shoubin Yu et.al. | 2402.05889v1 | null |
2024-02-08 | Sandwiched Compression: Repurposing Standard Codecs with Neural Network Wrappers | Onur G. Guleryuz et.al. | 2402.05887v1 | link |
2024-02-08 | GET-Tok: A GenAI-Enriched Multimodal TikTok Dataset Documenting the 2022 Attempted Coup in Peru | Gabriela Pinto et.al. | 2402.05882v1 | link |
2024-02-08 | You've Got to Feel It To Believe It: Multi-Modal Bayesian Inference for Semantic and Property Prediction | Parker Ewen et.al. | 2402.05872v1 | null |
2024-02-07 | Edu-ConvoKit: An Open-Source Library for Education Conversation Data | Rose E. Wang et.al. | 2402.05111v1 | link |
2024-02-07 | Moduli Parameters of Complex Singularities with Non-Degenerate Newton Boundary | Janko Boehm et.al. | 2402.05093v1 | null |
2024-02-07 | Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation | Ziyang Wang et.al. | 2402.05079v1 | link |
2024-02-07 | Arbitrary Scale Super-Resolution Assisted Lunar Crater Detection in Satellite Images | Atal Tewari et.al. | 2402.05068v1 | null |
2024-02-07 | Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty | Hersh Vakharia et.al. | 2402.05045v1 | link |
2024-02-07 | PAC Learnability under Explanation-Preserving Graph Perturbations | Xu Zheng et.al. | 2402.05039v1 | null |
2024-02-07 | Strong convexity-guided hyper-parameter optimization for flatter losses | Rahul Yedida et.al. | 2402.05025v1 | null |
2024-02-07 | Example-based Explanations for Random Forests using Machine Unlearning | Tanmay Surve et.al. | 2402.05007v1 | null |
2024-02-07 | Randomized Confidence Bounds for Stochastic Partial Monitoring | Maxime Heuillet et.al. | 2402.05002v1 | null |
2024-02-07 | Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction | Tobias Clement et.al. | 2402.04982v1 | null |
2024-02-06 | EVA-CLIP-18B: Scaling CLIP to 18 Billion Parameters | Quan Sun et.al. | 2402.04252v1 | link |
2024-02-06 | The spectrum of excisive functors | Gregory Arone et.al. | 2402.04244v1 | null |
2024-02-06 | A classification of nonzero skew immaculate functions | Sarah Mason et.al. | 2402.04219v1 | null |
2024-02-06 | Resource-Aware Hierarchical Federated Learning in Wireless Video Caching Networks | Md Ferdous Pervej et.al. | 2402.04216v1 | null |
2024-02-06 | "Task Success" is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviors | Lin Guan et.al. | 2402.04210v1 | null |
2024-02-06 | 3D Volumetric Super-Resolution in Radiology Using 3D RRDB-GAN | Juhyung Ha et.al. | 2402.04171v1 | null |
2024-02-06 | Human Emotions Analysis and Recognition Using EEG Signals in Response to 360$^\circ$ Videos | Haseeb ur Rahman Abbasi et.al. | 2402.04142v1 | null |
2024-02-06 | Hierarchical Delay Attribution Classification using Unstructured Text in Train Management Systems | Anton Borg et.al. | 2402.04108v1 | null |
2024-02-06 | Analysis of Deep Image Prior and Exploiting Self-Guidance for Image Reconstruction | Shijun Liang et.al. | 2402.04097v1 | null |
2024-02-06 | A Hard-to-Beat Baseline for Training-free CLIP-based Adaptation | Zhengbo Wang et.al. | 2402.04087v1 | link |
2024-02-05 | Multiclass Classification Procedure for Detecting Attacks on MQTT-IoT Protocol | Hector Alaiz-Moreton et.al. | 2402.03270v1 | null |
2024-02-05 | Security Advice for Parents and Children About Content Filtering and Circumvention as Found on YouTube and TikTok | Ran Elgedawy et.al. | 2402.03255v1 | null |
2024-02-05 | JOBSKAPE: A Framework for Generating Synthetic Job Postings to Enhance Skill Matching | Antoine Magron et.al. | 2402.03242v1 | link |
2024-02-05 | FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition | Xiaohu Huang et.al. | 2402.03241v1 | null |
2024-02-05 | IGUANe: a 3D generalizable CycleGAN for multicenter harmonization of brain MR images | Vincent Roca et.al. | 2402.03227v1 | null |
2024-02-05 | English Prompts are Better for NLI-based Zero-Shot Emotion Classification than Target-Language Prompts | Patrick Barreiß et.al. | 2402.03223v1 | null |
2024-02-05 | "Define Your Terms" : Enhancing Efficient Offensive Speech Classification with Definition | Huy Nghiem et.al. | 2402.03221v1 | link |
2024-02-05 | Isotropy, Clusters, and Classifiers | Timothee Mickus et.al. | 2402.03191v1 | null |
2024-02-06 | Cool-chic video: Learned video coding with 800 parameters | Thomas Leguay et.al. | 2402.03179v2 | null |
2024-02-05 | Accurate and Well-Calibrated ICD Code Assignment Through Attention Over Diverse Label Embeddings | Gonçalo Gomes et.al. | 2402.03172v1 | link |
2024-02-02 | From gas to stars: MUSEings on the internal evolution of IC 1613 | S. Taibi et.al. | 2402.01631v1 | null |
2024-02-02 | Truncation technique for variational quantum eigensolver for Molecular Hamiltonians | Qidong Xu et.al. | 2402.01630v1 | null |
2024-02-02 | L2G2G: a Scalable Local-to-Global Network Embedding with Graph Autoencoders | Ruikang Ouyang et.al. | 2402.01614v1 | link |
2024-02-02 | Immersive Video Compression using Implicit Neural Representations | Ho Man Kwan et.al. | 2402.01596v1 | link |
2024-02-02 | NeuroCine: Decoding Vivid Video Sequences from Human Brain Activties | Jingyuan Sun et.al. | 2402.01590v1 | null |
2024-02-02 | Boximator: Generating Rich and Controllable Motions for Video Synthesis | Jiawei Wang et.al. | 2402.01566v1 | null |
2024-02-02 | Deep Continuous Networks | Nergis Tomen et.al. | 2402.01557v1 | link |
2024-02-02 | SLYKLatent, a Learning Framework for Facial Features Estimation | Samuel Adebayo et.al. | 2402.01555v1 | null |
2024-02-02 | Advancing Brain Tumor Inpainting with Generative Models | Ruizhi Zhu et.al. | 2402.01509v1 | null |
2024-02-02 | Di-NeRF: Distributed NeRF for Collaborative Learning with Unknown Relative Poses | Mahboubeh Asadi et.al. | 2402.01485v1 | null |
2024-02-01 | We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline | Simar Kareer et.al. | 2402.00868v1 | link |
2024-02-01 | Deep Room Impulse Response Completion | Jackie Lin et.al. | 2402.00859v1 | null |
2024-02-01 | Early Time Classification with Accumulated Accuracy Gap Control | Liran Ringel et.al. | 2402.00857v1 | link |
2024-02-01 | BootsTAP: Bootstrapped Training for Tracking-Any-Point | Carl Doersch et.al. | 2402.00847v1 | link |
2024-02-01 | Emo-Avatar: Efficient Monocular Video Style Avatar through Texture Rendering | Pinxin Liu et.al. | 2402.00827v1 | null |
2024-02-01 | Examining the Influence of Digital Phantom Models in Virtual Imaging Trials for Tomographic Breast Imaging | Amar Kavuri et.al. | 2402.00812v1 | null |
2024-02-01 | ReAGent: Towards A Model-agnostic Feature Attribution Method for Generative Language Models | Zhixue Zhao et.al. | 2402.00794v1 | link |
2024-02-01 | Distinguishing the Indistinguishable: Human Expertise in Algorithmic Prediction | Rohan Alur et.al. | 2402.00793v1 | link |
2024-02-02 | CroissantLLM: A Truly Bilingual French-English Language Model | Manuel Faysse et.al. | 2402.00786v2 | link |
2024-02-01 | Hybrid Quantum Vision Transformers for Event Classification in High Energy Physics | Eyup B. Unlu et.al. | 2402.00776v1 | null |
2024-01-31 | Classification-Oriented Semantic Wireless Communications | Emrecan Kutay et.al. | 2401.18069v1 | null |
2024-01-31 | Rank Supervised Contrastive Learning for Time Series Classification | Qianying Ren et.al. | 2401.18057v1 | null |
2024-01-31 | Variable selection for Naïve Bayes classification | Rafael Blanquero et.al. | 2401.18039v1 | null |
2024-01-31 | Optimizing contrastive learning for cortical folding pattern detection | Aymeric Gaudin et.al. | 2401.18035v1 | null |
2024-01-31 | A Neural Enhancement Post-Processor with a Dynamic AV1 Encoder Configuration Strategy for CLIC 2024 | Darren Ramsook et.al. | 2401.18021v1 | null |
2024-01-31 | EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation | Jonathan W. Kim et.al. | 2401.18006v1 | null |
2024-01-31 | Unsupervised Learning of Topological Non-Abelian Braiding in Non-Hermitian Bands | Yang Long et.al. | 2401.17968v1 | null |
2024-01-31 | Error-Tolerant E-Discovery Protocols | Jinshuo Dong et.al. | 2401.17952v1 | null |
2024-01-31 | HyperZ$\cdot$Z$\cdot$W Operator Connects Slow-Fast Networks for Full Context Interaction | Harvie Zhang et.al. | 2401.17948v1 | null |
2024-01-31 | Probabilistic Photonic Computing with Chaotic Light | Frank Brückerhoff-Plückelmann et.al. | 2401.17915v1 | null |
2024-01-30 | The SRG/eROSITA all-sky survey: Hard X-ray selected Active Galactic Nuclei | Sophia G. H. Waddell et.al. | 2401.17306v1 | null |
2024-01-30 | Compact white-dwarf binaries in the combined SRG/eROSITA/SDSS eFEDS survey | A. Schwope et.al. | 2401.17304v1 | null |
2024-01-30 | Searching for X-ray counterparts of unassociated Fermi-LAT sources and rotation-powered pulsars with SRG/eROSITA | Martin G. F. Mayer et.al. | 2401.17295v1 | null |
2024-01-30 | X-ray AGNs with SRG/eROSITA: Multi-wavelength observations reveal merger triggering and post-coalescence circumnuclear blowout | Robert W. Bickley et.al. | 2401.17277v1 | null |
2024-01-30 | ReacLLaMA: Merging chemical and textual information in chemical reactivity AI models | Aline Hartgers et.al. | 2401.17267v1 | null |
2024-01-30 | SLIC: A Learned Image Codec Using Structure and Color | Srivatsa Prativadibhayankaram et.al. | 2401.17246v1 | link |
2024-01-31 | Faster coloring and embedding in dense hypergraphs via stability | Jianfeng Hou et.al. | 2401.17219v2 | null |
2024-01-31 | GazeGPT: Augmenting Human Capabilities using Gaze-contingent Contextual AI for Smart Eyewear | Robert Konrad et.al. | 2401.17217v2 | null |
2024-01-30 | Single Word Change is All You Need: Designing Attacks and Defenses for Text Classifiers | Lei Xu et.al. | 2401.17196v1 | null |
2024-01-30 | GraphViz2Vec: A Structure-aware Feature Generation Model to Improve Classification in GNNs | Shraban Kumar Chatterjee et.al. | 2401.17178v1 | null |
2024-01-29 | Computer Vision for Primate Behavior Analysis in the Wild | Richard Vogg et.al. | 2401.16424v1 | null |
2024-01-29 | Synchformer: Efficient Synchronization from Sparse Cues | Vladimir Iashin et.al. | 2401.16423v1 | null |
2024-01-29 | Strategic Usage in a Multi-Learner Setting | Eliot Shekhtman et.al. | 2401.16422v1 | null |
2024-01-29 | ReTaSA: A Nonparametric Functional Estimation Approach for Addressing Continuous Target Shift | Hwanwoo Kim et.al. | 2401.16410v1 | null |
2024-01-29 | Is K-fold cross validation the best model selection method for Machine Learning? | Juan M Gorriz et.al. | 2401.16407v1 | null |
2024-01-29 | Zero-shot Imitation Policy via Search in Demonstration Dataset | Federco Malato et.al. | 2401.16398v1 | null |
2024-01-29 | Ovarian Cancer Diagnostics using Wavelet Packet Scaling Descriptors | Raymond J. Hinton Jr. et.al. | 2401.16396v1 | null |
2024-01-29 | Evaluation of pseudo-healthy image reconstruction for anomaly detection with deep generative models: Application to brain FDG PET | Ravi Hassanaly et.al. | 2401.16363v1 | link |
2024-01-29 | Curriculum-Based Reinforcement Learning for Quadrupedal Jumping: A Reference-free Design | Vassil Atanassov et.al. | 2401.16337v1 | null |
2024-01-29 | Making the unmodulated Pyramid wavefront sensor smart. Closed-loop demonstration of neural network wavefront reconstruction with MagAO-X | Rico Landman et.al. | 2401.16325v1 | null |
2024-01-26 | From GPT-4 to Gemini and Beyond: Assessing the Landscape of MLLMs on Generalizability, Trustworthiness and Causality through Four Modalities | Chaochao Lu et.al. | 2401.15071v1 | null |
2024-01-26 | Deep learning-based approach for tomato classification in complex scenes | Mikael A. Mousse et.al. | 2401.15055v1 | null |
2024-01-26 | Non-Unitary |
Pedro M. F. Pereira et.al. | 2401.15049v1 | null |
2024-01-26 | Machine learning-based analysis of glioma tissue sections: a review | Jan-Philipp Redlich et.al. | 2401.15022v1 | null |
2024-01-26 | Enhancement of a Text-Independent Speaker Verification System by using Feature Combination and Parallel-Structure Classifiers | Kerlos Atia Abdalmalak et.al. | 2401.15018v1 | null |
2024-01-26 | Graph-based Active Learning for Entity Cluster Repair | Victor Christen et.al. | 2401.14992v1 | null |
2024-01-26 | Stokes graphs of the Rabi problem with real parameters | René Langøen et.al. | 2401.14991v1 | null |
2024-01-26 | Minimum-dissipation principle for synchronised stochastic oscillators far from equilibrium | Jan Meibohm et.al. | 2401.14982v1 | null |
2024-01-26 | Microwave lymphedema assessment using deep learning with contour assisted backprojection | Yuyi Chang et.al. | 2401.14970v1 | null |
2024-01-26 | Hold Tight: Identifying Behavioral Patterns During Prolonged Work in VR through Video Analysis | Verena Biener et.al. | 2401.14920v1 | null |
2024-01-25 | Multimodal Pathway: Improve Transformers with Irrelevant Data from Other Modalities | Yiyuan Zhang et.al. | 2401.14405v1 | link |
2024-01-25 | Adaptive Mobile Manipulation for Articulated Objects In the Open World | Haoyu Xiong et.al. | 2401.14403v1 | null |
2024-01-25 | Range-Agnostic Multi-View Depth Estimation With Keyframe Selection | Andrea Conti et.al. | 2401.14401v1 | link |
2024-01-25 | Rethinking Patch Dependence for Masked Autoencoders | Letian Fu et.al. | 2401.14391v1 | null |
2024-01-25 | Smooth Ranking SVM via Cutting-Plane Method | Erhan Can Ozcan et.al. | 2401.14388v1 | link |
2024-01-25 | Inconsistency Masks: Removing the Uncertainty from Input-Pseudo-Label Pairs | Michael R. H. Vorndran et.al. | 2401.14387v1 | link |
2024-01-25 | A Comparative Analysis of Noise Reduction Methods in Sentiment Analysis on Noisy Bengali Texts | Kazi Toufique Elahi et.al. | 2401.14360v1 | link |
2024-01-25 | Computing Derivations on Nilpotent Quadratic Lie Algebras | Pilar Benito et.al. | 2401.14348v1 | null |
2024-01-25 | Class-attribute Priors: Adapting Optimization to Heterogeneity and Fairness Objective | Xuechen Zhang et.al. | 2401.14343v1 | null |
2024-01-25 | Progressive Multi-task Anti-Noise Learning and Distilling Frameworks for Fine-grained Vehicle Recognition | Dichao Liu et.al. | 2401.14336v1 | link |
2024-01-24 | Tyche: Stochastic In-Context Learning for Medical Image Segmentation | Marianne Rakic et.al. | 2401.13650v1 | null |
2024-01-24 | Quantifying the Impact of Frame Preemption on Combined TSN Shapers | Rubi Debnath et.al. | 2401.13631v1 | null |
2024-01-24 | Can overfitted deep neural networks in adversarial training generalize? -- An approximation viewpoint | Zhongjie Shi et.al. | 2401.13624v1 | null |
2024-01-24 | FLLIC: Functionally Lossless Image Compression | Xi Zhang et.al. | 2401.13616v1 | null |
2024-01-24 | Enhancing Image Retrieval : A Comprehensive Study on Photo Search using the CLIP Mode | Naresh Kumar Lahajal et.al. | 2401.13613v1 | null |
2024-01-24 | Prompt Weight Experiments for LLM Instruction Fine-Tuning | Mathew Huerta-Enochian et.al. | 2401.13586v1 | null |
2024-01-24 | WPDA: Frequency-based Backdoor Attack with Wavelet Packet Decomposition | Zhengyao Song et.al. | 2401.13578v1 | null |
2024-01-24 | CNN architecture extraction on edge GPU | Peter Horvath et.al. | 2401.13575v1 | null |
2024-01-24 | Benchmarking the Fairness of Image Upsampling Methods | Mike Laszkiewicz et.al. | 2401.13555v1 | null |
2024-01-24 | PanAf20K: A Large Video Dataset for Wild Ape Detection and Behaviour Recognition | Otto Brookes et.al. | 2401.13554v1 | null |
2024-01-23 | SegmentAnyBone: A Universal Model that Segments Any Bone at Any Location on MRI | Hanxue Gu et.al. | 2401.12974v1 | null |
2024-01-23 | On the Efficacy of Text-Based Input Modalities for Action Anticipation | Apoorva Beedu et.al. | 2401.12972v1 | null |
2024-01-23 | The role of environment and AGN feedback in quenching local galaxies: Comparing cosmological hydrodynamical simulations to the SDSS | Paul H. Goubert et.al. | 2401.12953v1 | null |
2024-01-23 | Lumiere: A Space-Time Diffusion Model for Video Generation | Omer Bar-Tal et.al. | 2401.12945v1 | null |
2024-01-23 | Long-range three-dimensional tracking of nanoparticles using interferometric scattering (iSCAT) microscopy | Kiarash Kasaian et.al. | 2401.12939v1 | null |
2024-01-23 | Neural deformation fields for template-based reconstruction of cortical surfaces from MRI | Fabian Bongratz et.al. | 2401.12938v1 | null |
2024-01-23 | Segmentation of tibiofemoral joint tissues from knee MRI using MtRA-Unet and incorporating shape information: Data from the Osteoarthritis Initiative | Akshay Daydar et.al. | 2401.12932v1 | null |
2024-01-23 | pyAKI - An Open Source Solution to Automated KDIGO classification | Christian Porschen et.al. | 2401.12930v1 | null |
2024-01-23 | Performance Analysis of Support Vector Machine (SVM) on Challenging Datasets for Forest Fire Detection | Ankan Kar et.al. | 2401.12924v1 | null |
2024-01-23 | Advancing Glitch Classification in Gravity Spy: Multi-view Fusion with Attention-based Machine Learning for Advanced LIGO's Fourth Observing Run | Yunan Wu et.al. | 2401.12913v1 | null |
2024-01-22 | Connecting the Dots: Leveraging Spatio-Temporal Graph Neural Networks for Accurate Bangla Sign Language Recognition | Haz Sameen Shahgir et.al. | 2401.12210v1 | null |
2024-01-22 | Unsupervised Machine Learning for the Classification of Astrophysical X-ray Sources | Víctor Samuel Pérez-Díaz et.al. | 2401.12203v1 | link |
2024-01-22 | OK-Robot: What Really Matters in Integrating Open-Knowledge Models for Robotics | Peiqi Liu et.al. | 2401.12202v1 | null |
2024-01-22 | In-Context Learning for Extreme Multi-Label Classification | Karel D'Oosterlinck et.al. | 2401.12178v1 | null |
2024-01-22 | Broiler-Net: A Deep Convolutional Framework for Broiler Behavior Analysis in Poultry Houses | Tahereh Zarrat Ehsan et.al. | 2401.12176v1 | link |
2024-01-22 | VRMN-bD: A Multi-modal Natural Behavior Dataset of Immersive Human Fear Responses in VR Stand-up Interactive Games | He Zhang et.al. | 2401.12133v1 | link |
2024-01-22 | Evaluation of QCNN-LSTM for Disability Forecasting in Multiple Sclerosis Using Sequential Multisequence MRI | John D. Mayfield et.al. | 2401.12132v1 | null |
2024-01-22 | Out-of-Distribution Detection & Applications With Ablated Learned Temperature Energy | Will LeVine et.al. | 2401.12129v1 | link |
2024-01-22 | Measures of the Capital Network of the U.S. Economy | Ben Klemens et.al. | 2401.12118v1 | null |
2024-01-22 | A quantitative version of the Steinhaus theorem | Alex Iosevich et.al. | 2401.12112v1 | null |
2024-01-19 | Classifying affine structures with focus-focus singularities | Xiudi Tang et.al. | 2401.10881v1 | null |
2024-01-19 | Motion Consistency Loss for Monocular Visual Odometry with Attention-Based Deep Learning | André O. Françani et.al. | 2401.10857v1 | null |
2024-01-19 | Emotion Classification In Software Engineering Texts: A Comparative Analysis of Pre-trained Transformers Language Models | Mia Mohammad Imran et.al. | 2401.10845v1 | null |
2024-01-19 | Understanding Video Transformers via Universal Concept Discovery | Matthew Kowal et.al. | 2401.10831v1 | null |
2024-01-19 | Long-Term Monitoring of the Oe Star VES 735: Ope! Not So Quiet After All | Brandon Marshall et.al. | 2401.10829v1 | null |
2024-01-19 | ActAnywhere: Subject-Aware Video Background Generation | Boxiao Pan et.al. | 2401.10822v1 | null |
2024-01-19 | RAD-DINO: Exploring Scalable Medical Image Encoders Beyond Text Supervision | Fernando Pérez-García et.al. | 2401.10815v1 | null |
2024-01-19 | Learning to Visually Connect Actions and their Effects | Eric Peh et.al. | 2401.10805v1 | null |
2024-01-19 | Endovascular Detection of Catheter-Thrombus Contact by Vacuum Excitation | Jared Lawson et.al. | 2401.10804v1 | null |
2024-01-19 | TDC-less Direct Time-of-Flight Imaging Using Spiking Neural Networks | Jack MacLean et.al. | 2401.10793v1 | null |
2024-01-18 | Simultaneous Tactile Estimation and Control for Extrinsic Dexterity | Antonia Bronars et.al. | 2401.10230v1 | null |
2024-01-18 | OMG-Seg: Is One Model Good Enough For All Segmentation? | Xiangtai Li et.al. | 2401.10229v1 | link |
2024-01-18 | RAP-SAM: Towards Real-Time All-Purpose Segment Anything | Shilin Xu et.al. | 2401.10228v1 | link |
2024-01-18 | Towards Language-Driven Video Inpainting via Multimodal Large Language Models | Jianzong Wu et.al. | 2401.10226v1 | null |
2024-01-18 | Explaining the Implicit Neural Canvas: Connecting Pixels to Neurons by Tracing their Contributions | Namitha Padmanabhan et.al. | 2401.10217v1 | null |
2024-01-18 | Transfer Learning in Human Activity Recognition: A Survey | Sourish Gunesh Dhekane et.al. | 2401.10185v1 | null |
2024-01-18 | SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild | Andreas Engelhardt et.al. | 2401.10171v1 | null |
2024-01-19 | Motion-Zero: Zero-Shot Moving Object Control Framework for Diffusion-Based Video Generation | Changgu Chen et.al. | 2401.10150v2 | null |
2024-01-18 | Few-shot learning for COVID-19 Chest X-Ray Classification with Imbalanced Data: An Inter vs. Intra Domain Study | Alejandro Galán-Cuenca et.al. | 2401.10129v1 | null |
2024-01-18 | Sub2Full: split spectrum to boost OCT despeckling without clean data | Lingyun Wang et.al. | 2401.10128v1 | link |
2024-01-17 | Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Lianghui Zhu et.al. | 2401.09417v1 | link |
2024-01-17 | Vlogger: Make Your Dream A Vlog | Shaobin Zhuang et.al. | 2401.09414v1 | link |
2024-01-17 | Deciphering Textual Authenticity: A Generalized Strategy through the Lens of Large Language Semantics for Detecting Human vs. Machine-Generated Text | Mazal Bethany et.al. | 2401.09407v1 | null |
2024-01-17 | Élivágar: Efficient Quantum Circuit Search for Classification | Sashwat Anagolum et.al. | 2401.09393v1 | null |
2024-01-17 | Tri$^{2}$-plane: Volumetric Avatar Reconstruction with Feature Pyramid | Luchuan Song et.al. | 2401.09386v1 | link |
2024-01-17 | New relations of pod partition and its connection with other partition functions | Hemjyoti Nath et.al. | 2401.09374v1 | null |
2024-01-17 | To deform or not: treatment-aware longitudinal registration for breast DCE-MRI during neoadjuvant chemotherapy via unsupervised keypoints detection | Luyi Han et.al. | 2401.09336v1 | link |
2024-01-17 | Machines Do See Color: A Guideline to Classify Different Forms of Racist Discourse in Large Corpora | Diana Davila Gordillo et.al. | 2401.09333v1 | null |
2024-01-17 | Spectral Distribution Complexity of the Surface Fibrillatory Waves Predicts Post-Catheter Ablation Relapse in Persistent Atrial Fibrillation | Pilar Escribano et.al. | 2401.09297v1 | null |
2024-01-17 | T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis | Yoonjin Chung et.al. | 2401.09294v1 | null |
2024-01-16 | From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers | Jiu Feng et.al. | 2401.08415v1 | null |
2024-01-16 | Faster ISNet for Background Bias Mitigation on Deep Neural Networks | Pedro R. A. S. Bassi et.al. | 2401.08409v1 | null |
2024-01-16 | Training and Comparison of nnU-Net and DeepMedic Methods for Autosegmentation of Pediatric Brain Tumors | Arastoo Vossough et.al. | 2401.08404v1 | null |
2024-01-16 | High-Quality Mesh Blendshape Generation from Face Videos via Neural Inverse Rendering | Xin Ming et.al. | 2401.08398v1 | null |
2024-01-16 | DoraemonGPT: Toward Understanding Dynamic Scenes with Large Language Models | Zongxin Yang et.al. | 2401.08392v1 | link |
2024-01-16 | We don't need no labels: Estimating post-deployment model performance under covariate shift without ground truth | Jakub Białek et.al. | 2401.08348v1 | null |
2024-01-16 | Learn What You Need in Personalized Federated Learning | Kexin Lv et.al. | 2401.08327v1 | link |
2024-01-16 | Application of LLM Agents in Recruitment: A Novel Framework for Resume Screening | Chengguang Gan et.al. | 2401.08315v1 | null |
2024-01-16 | Central extensions of restricted Lie superalgebras and classification of |
Sofiane Bouarroudj et.al. | 2401.08313v1 | null |
2024-01-16 | Evaluating online elasticity estimation of soft objects using standard robot grippers | Shubhan P. Patni et.al. | 2401.08298v1 | null |
2024-01-16 | Multitask Learning in Minimally Invasive Surgical Vision: A Review | Oluwatosin Alabi et.al. | 2401.08256v1 | null |
2024-01-16 | Multi-scale 2D Temporal Map Diffusion Models for Natural Language Video Localization | Chongzhi Zhang et.al. | 2401.08232v1 | null |
2024-01-16 | Towards Causal Relationship in Indefinite Data: Baseline Model and New Datasets | Hang Chen et.al. | 2401.08221v1 | link |
2024-01-16 | Ship Detection in SAR Images with Human-in-the-Loop | Hecheng Jia et.al. | 2401.08213v1 | null |
2024-01-16 | ModelNet-O: A Large-Scale Synthetic Dataset for Occlusion-Aware Point Cloud Classification | Zhongbin Fang et.al. | 2401.08210v1 | link |
2024-01-12 | Mind Your Format: Towards Consistent Evaluation of In-Context Learning Improvements | Anton Voronov et.al. | 2401.06766v1 | null |
2024-01-12 | Classification of singularities of cluster algebras of finite type II: coefficients | Angélica Benito et.al. | 2401.06758v1 | null |
2024-01-12 | Synthetic Data Generation Framework, Dataset, and Efficient Deep Model for Pedestrian Intention Prediction | Muhammad Naveed Riaz et.al. | 2401.06757v1 | null |
2024-01-12 | Stylometry Analysis of Multi-authored Documents for Authorship and Author Style Change Detection | Muhammad Tayyab Zamir et.al. | 2401.06752v1 | null |
2024-01-12 | Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part II: Spatial and Tonal Data Optimization | Niklas Kämper et.al. | 2401.06747v1 | null |
2024-01-12 | Efficient Parallel Algorithms for Inpainting-Based Representations of 4K Images -- Part I: Homogeneous Diffusion Inpainting | Niklas Kämper et.al. | 2401.06744v1 | null |
2024-01-12 | Complexity Classification of Product State Problems for Local Hamiltonians | John Kallaugher et.al. | 2401.06725v1 | null |
2024-01-12 | Obstacle-Aware Positioning of a Mobile Robotic Platform for 6G Networks | Alexandre Costa et.al. | 2401.06717v1 | null |
2024-01-12 | Reliability Analysis of Psychological Concept Extraction and Classification in User-penned Text | Muskan Garg et.al. | 2401.06709v1 | null |
2024-01-12 | On the existence of charged electrostatic black holes in arbitrary topology | Martin Reiris et.al. | 2401.06702v1 | null |
2024-01-11 | Distilling Vision-Language Models on Millions of Videos | Yue Zhao et.al. | 2401.06129v1 | null |
2024-01-11 | Dubbing for Everyone: Data-Efficient Visual Dubbing using Neural Rendering Priors | Jack Saunders et.al. | 2401.06126v1 | null |
2024-01-11 | Gaussian Shadow Casting for Neural Characters | Luis Bolanos et.al. | 2401.06116v1 | null |
2024-01-11 | A Closer Look at AUROC and AUPRC under Class Imbalance | Matthew B. A. McDermott et.al. | 2401.06091v1 | link |
2024-01-12 | LEGO:Language Enhanced Multi-modal Grounding Model | Zhaowei Li et.al. | 2401.06071v2 | link |
2024-01-11 | On the Power of Graph Neural Networks and Feature Augmentation Strategies to Classify Social Networks | Walid Guettala et.al. | 2401.06048v1 | null |
2024-01-11 | RAVEN: Rethinking Adversarial Video Generation with Efficient Tri-plane Networks | Partha Ghosh et.al. | 2401.06035v1 | null |
2024-01-11 | Attention to detail: inter-resolution knowledge distillation | Rocío del Amor et.al. | 2401.06010v1 | link |
2024-01-11 | Sea ice detection using concurrent multispectral and synthetic aperture radar imagery | Martin S J Rogers et.al. | 2401.06009v1 | null |
2024-01-11 | Boosting Mixed-Initiative Co-Creativity in Game Design: A Tutorial | Solange Margarido et.al. | 2401.05999v1 | null |
2024-01-10 | Towards Online Sign Language Recognition and Translation | Ronglai Zuo et.al. | 2401.05336v1 | link |
2024-01-10 | ANIM-400K: A Large-Scale Dataset for Automated End-To-End Dubbing of Video | Kevin Cai et.al. | 2401.05314v1 | link |
2024-01-10 | Strategic Client Selection to Address Non-IIDness in HAPS-enabled FL Networks | Amin Farajzadeh et.al. | 2401.05308v1 | null |
2024-01-10 | Frame-like Fourier expansions for finite Borel measures on |
Chad Berner et.al. | 2401.05243v1 | null |
2024-01-10 | Learning effective good variables from physical data | Giulio Barletta et.al. | 2401.05226v1 | link |
2024-01-10 | TOVAC: Tele-operated Vehicle Admission Control and Routing | Jorge Martín-Pérez et.al. | 2401.05225v1 | null |
2024-01-10 | Do Vision and Language Encoders Represent the World Similarly? | Mayug Maniparambil et.al. | 2401.05224v1 | null |
2024-01-10 | Exploring Vulnerabilities of No-Reference Image Quality Assessment Models: A Query-Based Black-Box Method | Chenxi Yang et.al. | 2401.05217v1 | null |
2024-01-10 | Pre-trained Large Language Models for Financial Sentiment Analysis | Wei Luo et.al. | 2401.05215v1 | link |
2024-01-10 | A Novel Prompt-tuning Method: Incorporating Scenario-specific Concepts into a Verbalizer | Yong Ma et.al. | 2401.05204v1 | null |
2024-01-09 | A Simple Baseline for Spoken Language to Sign Language Translation with 3D Avatars | Ronglai Zuo et.al. | 2401.04730v1 | link |
2024-01-09 | U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation | Jun Ma et.al. | 2401.04722v1 | null |
2024-01-09 | Helicoidal surfaces of prescribed mean curvature in |
Aires Eduardo Menani Barbieri et.al. | 2401.04721v1 | null |
2024-01-09 | Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Benedikt Roth et.al. | 2401.04720v1 | null |
2024-01-09 | Jump Cut Smoothing for Talking Heads | Xiaojuan Wang et.al. | 2401.04718v1 | null |
2024-01-09 | NIPn CHIPS | Blaise Boissonneau et.al. | 2401.04697v1 | null |
2024-01-09 | CoordGate: Efficiently Computing Spatially-Varying Convolutions in Convolutional Neural Networks | Sunny Howard et.al. | 2401.04680v1 | null |
2024-01-09 | Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset | Galib Muhammad Shahriar Himel et.al. | 2401.04666v1 | null |
2024-01-09 | DepressionEmo: A novel dataset for multilabel classification of depression emotions | Abu Bakar Siddiqur Rahman et.al. | 2401.04655v1 | link |
2024-01-09 | Hold 'em and Fold 'em: Towards Human-scale, Feedback-Controlled Soft Origami Robots | Immanuel Ampomah Mensah et.al. | 2401.04650v1 | null |
2024-01-08 | Dr$^2$Net: Dynamic Reversible Dual-Residual Networks for Memory-Efficient Finetuning | Chen Zhao et.al. | 2401.04105v1 | null |
2024-01-08 | RudolfV: A Foundation Model by Pathologists for Pathologists | Jonas Dippel et.al. | 2401.04079v1 | null |
2024-01-08 | Variance Reduction in Ratio Metrics for Efficient Online Experiments | Shubham Baweja et.al. | 2401.04062v1 | null |
2024-01-08 | Bjøntegaard Delta (BD): A Tutorial Overview of the Metric, Evolution, Challenges, and Recommendations | Nabajeet Barman et.al. | 2401.04039v1 | null |
2024-01-08 | Blocks whose defect groups are Suzuki |
Charles W. Eaton et.al. | 2401.04028v1 | null |
2024-01-08 | IDoFew: Intermediate Training Using Dual-Clustering in Language Models for Few Labels Text Classification | Abdullah Alsuhaibani et.al. | 2401.04025v1 | null |
2024-01-08 | Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification | Wentao Zhu et.al. | 2401.04023v1 | null |
2024-01-08 | Resident space object detection method based on the connection between Fourier spectrum of the video data difference frame and the linear velocity projection | V. S. Baranova et.al. | 2401.04021v1 | null |
2024-01-09 | Recognizing Blazars Using Radio Morphology from the VLA Sky Survey | Zhang-Liang Xie et.al. | 2401.04009v2 | null |
2024-01-08 | Calabi-Yau Varieties via Cyclic Covers, and Complex Hyperbolic Structures for their Moduli Spaces | Chenglong Yu et.al. | 2401.04006v1 | null |
2024-01-05 | Open-Vocabulary SAM: Segment and Recognize Twenty-thousand Classes Interactively | Haobo Yuan et.al. | 2401.02955v1 | link |
2024-01-05 | The Dark Energy Survey Supernova Program: Cosmological Analysis and Systematic Uncertainties | M. Vincenzi et.al. | 2401.02945v1 | null |
2024-01-05 | Digital-analog quantum learning on Rydberg atom arrays | Jonathan Z. Lu et.al. | 2401.02940v1 | null |
2024-01-05 | Mixing Magnetic and Electric Ehlers-Harrison transformations: The Electromagnetic Swirling Spacetime and Novel Type I Backgrounds | José Barrientos et.al. | 2401.02924v1 | null |
2024-01-05 | Towards ASR Robust Spoken Language Understanding Through In-Context Learning With Word Confusion Networks | Kevin Everson et.al. | 2401.02921v1 | null |
2024-01-05 | Analytically-Driven Resource Management for Cloud-Native Microservices | Yanqi Zhang et.al. | 2401.02920v1 | null |
2024-01-05 | Introducing Bode: A Fine-Tuned Large Language Model for Portuguese Prompt-Based Task | Gabriel Lino Garcia et.al. | 2401.02909v1 | null |
2024-01-05 | Robust Bichromatic Classification using Two Lines | Erwin Glazenburg et.al. | 2401.02897v1 | null |
2024-01-05 | Particle-Wise Higher-Order SPH Field Approximation for DVR | Jonathan Fischer et.al. | 2401.02896v1 | null |
2024-01-05 | Nonlinear functional regression by functional deep neural network with kernel embedding | Zhongjie Shi et.al. | 2401.02890v1 | null |
2024-01-04 | asimulation: Domain formation and impact on observables in resolved cosmological simulations of the (a)symmetron | Øyvind Christiansen et.al. | 2401.02410v1 | link |
2024-01-04 | Gravitational waves from dark domain walls | Øyvind Christiansen et.al. | 2401.02409v1 | link |
2024-01-05 | Correctness Comparison of ChatGPT-4, Bard, Claude-2, and Copilot for Spatial Tasks | Hartwig H. Hochmair et.al. | 2401.02404v2 | null |
2024-01-04 | 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation | Zihao Xiao et.al. | 2401.02402v1 | null |
2024-01-04 | Analyzing Misinformation Claims During the 2022 Brazilian General Election on WhatsApp, Twitter, and Kwai | Scott A. Hale et.al. | 2401.02395v1 | null |
2024-01-04 | Image denoising and model-independent parameterization for improving IVIM MRI | Caleb Sample et.al. | 2401.02394v1 | null |
2024-01-04 | Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications | Darshan Venkatrayappa et.al. | 2401.02383v1 | null |
2024-01-04 | A novel method to enhance pneumonia detection via a model-level ensembling of CNN and vision transformer | Sandeep Angara et.al. | 2401.02358v1 | null |
2024-01-04 | ClassWise-SAM-Adapter: Parameter Efficient Fine-tuning Adapts Segment Anything to SAR Domain for Semantic Segmentation | Xinyang Pu et.al. | 2401.02326v1 | link |
2024-01-04 | Reflection physics in X-ray-emitting Symbiotic Stars | Jesús A. Toalá et.al. | 2401.02318v1 | null |
2024-01-03 | Profinite equivariant spectra and their tensor-triangular geometry | Scott Balchin et.al. | 2401.01878v1 | null |
2024-01-03 | A spatial mixture model for spaceborne lidar observations over mixed forest and non-forest land types | Paul B. May et.al. | 2401.01848v1 | null |
2024-01-03 | Teaching with a companion: the case of gravity | Iuliia Zhurakovskaia et.al. | 2401.01832v1 | null |
2024-01-03 | Iterative Mask Filling: An Effective Text Augmentation Method Using Masked Language Modeling | Himmet Toprak Kesgin et.al. | 2401.01830v1 | null |
2024-01-03 | Moonshot: Towards Controllable Video Generation and Editing with Multimodal Conditions | David Junhao Zhang et.al. | 2401.01827v1 | link |
2024-01-03 | Detours for Navigating Instructional Videos | Kumar Ashutosh et.al. | 2401.01823v1 | null |
2024-01-03 | SENS3: Multisensory Database of Finger-Surface Interactions and Corresponding Sensations | Jagan K. Balasubramanian et.al. | 2401.01818v1 | null |
2024-01-03 | Signal Processing in the Retina: Interpretable Graph Classifier to Predict Ganglion Cell Responses | Yasaman Parhizkar et.al. | 2401.01813v1 | null |
2024-01-03 | Efficient Computation of Confidence Sets Using Classification on Equidistributed Grids | Lujie Zhou et.al. | 2401.01804v1 | null |
2024-01-03 | An experimental sorting method for improving metagenomic data encoding | Diogo Pratas et.al. | 2401.01786v1 | null |
2024-01-02 | Street Gaussians for Modeling Dynamic Urban Scenes | Yunzhi Yan et.al. | 2401.01339v1 | null |
2024-01-02 | Classifying Words with 3-sort Automata | Tomasz Jastrząb et.al. | 2401.01314v1 | null |
2024-01-03 | A Comprehensive Survey of Hallucination Mitigation Techniques in Large Language Models | S. M Towhidul Islam Tonmoy et.al. | 2401.01313v2 | null |
2024-01-02 | Integrating Edges into U-Net Models with Explainable Activation Maps for Brain Tumor Segmentation using MR Images | Subin Sahayam et.al. | 2401.01303v1 | null |
2024-01-02 | Nicola Novello et.al. | 2401.01268v1 | link | |
2024-01-02 | VideoDrafter: Content-Consistent Multi-Scene Video Generation with LLM | Fuchen Long et.al. | 2401.01256v1 | null |
2024-01-02 | An operational approach to classifying measurement incompatibility | Arun Kumar Das et.al. | 2401.01236v1 | null |
2024-01-03 | Distribution Matching for Multi-Task Learning of Classification Tasks: a Large-Scale Study on Faces & Beyond | Dimitrios Kollias et.al. | 2401.01219v2 | null |
2024-01-02 | FGENet: Fine-Grained Extraction Network for Congested Crowd Counting | Hao-Yuan Ma et.al. | 2401.01208v1 | null |
2024-01-02 | Whole-examination AI estimation of fetal biometrics from 20-week ultrasound scans | Lorenzo Venturini et.al. | 2401.01201v1 | null |
2023-12-29 | Computational Tools for Trees in Gauge Theory and Gravity | Jacob L. Bourjaily et.al. | 2312.17745v1 | null |
2023-12-29 | Multiscale Vision Transformers meet Bipartite Matching for efficient single-stage Action Localization | Ioanna Ntinou et.al. | 2312.17686v1 | null |
2023-12-29 | Malware Detection in IOT Systems Using Machine Learning Techniques | Ali Mehrban et.al. | 2312.17683v1 | null |
2023-12-29 | FlowVid: Taming Imperfect Optical Flows for Consistent Video-to-Video Synthesis | Feng Liang et.al. | 2312.17681v1 | null |
2023-12-29 | Grasping, Part Identification, and Pose Refinement in One Shot with a Tactile Gripper | Joyce Xin-Yan Lim et.al. | 2312.17650v1 | null |
2023-12-29 | MoD2T:Model-Data-Driven Motion-Static Object Tracking Method | Yang Feng et.al. | 2312.17641v1 | null |
2023-12-29 | A New Explanation of the Mechanism of Hadley Circulation | Wei Huang et.al. | 2312.17637v1 | null |
2023-12-29 | Towards Faithful Explanations for Text Classification with Robustness Improvement and Explanation Guided Training | Dongfang Li et.al. | 2312.17591v1 | null |
2023-12-29 | A Tool for the Procedural Generation of Shaders using Interactive Evolutionary Algorithms | Elio Sasso et.al. | 2312.17587v1 | link |
2023-12-29 | Distribution-based Low-rank Embedding | Bardia Yousefi et.al. | 2312.17579v1 | null |
2023-12-28 | A Simple LLM Framework for Long-Range Video Question-Answering | Ce Zhang et.al. | 2312.17235v1 | null |
2023-12-28 | 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency | Yuyang Yin et.al. | 2312.17225v1 | null |
2023-12-28 | EFHQ: Multi-purpose ExtremePose-Face-HQ dataset | Trung Tuan Dao et.al. | 2312.17205v1 | null |
2023-12-28 | One Model to Rule them All: Towards Universal Segmentation for Medical Images with Text Prompts | Ziheng Zhao et.al. | 2312.17183v1 | null |
2023-12-28 | Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action | Jiasen Lu et.al. | 2312.17172v1 | null |
2023-12-28 | Classification of multiplication modules over multiplication rings with finitely many minimal primes | Volodymyr Bavula et.al. | 2312.17170v1 | null |
2023-12-28 | Securing NextG Systems against Poisoning Attacks on Federated Learning: A Game-Theoretic Solution | Yalin E. Sagduyu et.al. | 2312.17164v1 | null |
2023-12-28 | Replica Tree-based Federated Learning using Limited Data | Ramona Ghilea et.al. | 2312.17159v1 | null |
2023-12-29 | ARTrackV2: Prompting Autoregressive Tracker Where to Look and How to Describe | Yifan Bai et.al. | 2312.17133v2 | null |
2023-12-28 | Grounding-Prompter: Prompting LLM with Multimodal Information for Temporal Sentence Grounding in Long Videos | Houlun Chen et.al. | 2312.17117v1 | null |
2023-12-26 | Microwave signal processing using an analog quantum reservoir computer | Alen Senanian et.al. | 2312.16166v1 | null |
2023-12-26 | Large-scale Long-tailed Disease Diagnosis on Radiology Images | Qiaoyu Zheng et.al. | 2312.16151v1 | null |
2023-12-27 | The Media Bias Taxonomy: A Systematic Literature Review on the Forms and Automated Detection of Media Bias | Timo Spinde et.al. | 2312.16148v2 | link |
2023-12-26 | The non-Abelian Aharonov-Bohm effect | P. A. Horvathy et.al. | 2312.16133v1 | null |
2023-12-26 | LangSplat: 3D Language Gaussian Splatting | Minghan Qin et.al. | 2312.16084v1 | null |
2023-12-26 | AdaNAS: Adaptively Post-processing with Self-supervised Neural Architecture Search for Ensemble Rainfall Forecasts | Yingpeng Wen et.al. | 2312.16046v1 | null |
2023-12-26 | An extended asymmetric sigmoid with Perceptron (SIGTRON) for imbalanced linear classification | Hyenkyun Woo et.al. | 2312.16043v1 | null |
2023-12-26 | Multi-scale Progressive Feature Embedding for Accurate NIR-to-RGB Spectral Domain Translation | Xingxing Yang et.al. | 2312.16040v1 | null |
2023-12-26 | Plug-and-Play Regularization on Magnitude with Deep Priors for 3D Near-Field MIMO Imaging | Okyanus Oral et.al. | 2312.16024v1 | null |
2023-12-26 | Classification of positive solutions of Hardy-Sobolev equation without the finite volume constraints | Lu Chen et.al. | 2312.16017v1 | null |
2023-12-25 | Training Convolutional Neural Networks with the Forward-Forward algorithm | Riccardo Scodellaro et.al. | 2312.14924v2 | null |
2023-12-22 | DRStageNet: Deep Learning for Diabetic Retinopathy Staging from Fundus Images | Yevgeniy Men et.al. | 2312.14891v1 | null |
2023-12-22 | On rate-optimal classification from non-private and from private data | Balázs Csanád Csáji et.al. | 2312.14889v1 | null |
2023-12-22 | Classification of cubic tricirculant nut graphs | Ivan Damnjanović et.al. | 2312.14884v1 | null |
2023-12-22 | Neural-network-based regularization methods for inverse problems in imaging | Andreas Habring et.al. | 2312.14849v1 | null |
2023-12-22 | Classification of 3-GNDB Graphs | Amir Hosseini et.al. | 2312.14835v1 | null |
2023-12-22 | Dreaming of Electrical Waves: Generative Modeling of Cardiac Excitation Waves using Diffusion Models | Tanish Baranwal et.al. | 2312.14830v1 | null |
2023-12-22 | Classification of generalised higher-order Einstein-Maxwell Lagrangians | Aimeric Colléaux et.al. | 2312.14814v1 | null |
2023-12-22 | On support vector machines under a multiple-cost scenario | Sandra Benítez-Peña et.al. | 2312.14795v1 | null |
2023-12-22 | The Rate-Distortion-Perception-Classification Tradeoff: Joint Source Coding and Modulation via Inverse-Domain GANs | Junli Fang et.al. | 2312.14792v1 | null |
2023-12-21 | 3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera | Christen Millerdurai et.al. | 2312.14157v1 | null |
2023-12-21 | Virtual Pets: Animatable Animal Generation in 3D Scenes | Yen-Chi Cheng et.al. | 2312.14154v1 | null |
2023-12-21 | TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification | Qinying Liu et.al. | 2312.14149v1 | link |
2023-12-21 | HeadCraft: Modeling High-Detail Shape Variations for Animated 3DMMs | Artem Sevastopolsky et.al. | 2312.14140v1 | null |
2023-12-21 | Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach | Qinying Liu et.al. | 2312.14138v1 | link |
2023-12-21 | Diffusion Reward: Learning Rewards via Conditional Video Diffusion | Tao Huang et.al. | 2312.14134v1 | null |
2023-12-21 | WellFactor: Patient Profiling using Integrative Embedding of Healthcare Data | Dongjin Choi et.al. | 2312.14129v1 | null |
2023-12-21 | VideoPoet: A Large Language Model for Zero-Shot Video Generation | Dan Kondratyuk et.al. | 2312.14125v1 | null |
2023-12-21 | LingoQA: Video Question Answering for Autonomous Driving | Ana-Maria Marcu et.al. | 2312.14115v1 | link |
2023-12-21 | LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding | Senqiao Yang et.al. | 2312.14074v1 | null |
2023-12-20 | Deep Learning on 3D Neural Fields | Pierluigi Zama Ramirez et.al. | 2312.13277v1 | null |
2023-12-20 | The 1/4-BPS building blocks of brane interactions | Ben Eckardt et.al. | 2312.13269v1 | null |
2023-12-20 | ClassLIE: Structure- and Illumination-Adaptive Classification for Low-Light Image Enhancement | Zixiang Wei et.al. | 2312.13265v1 | null |
2023-12-20 | Putting the p back in Prym | Jeff Achter et.al. | 2312.13263v1 | null |
2023-12-20 | The role of data embedding in equivariant quantum convolutional neural networks | Sreetama Das et.al. | 2312.13250v1 | null |
2023-12-20 | Enhancing Neural Training via a Correlated Dynamics Model | Jonathan Brokman et.al. | 2312.13247v1 | null |
2023-12-20 | SISMIK for brain MRI: Deep-learning-based motion estimation and model-based motion correction in k-space | Oscar Dabrowski et.al. | 2312.13220v1 | null |
2023-12-20 | Boost recall in QSO selection from highly imbalanced photometric datasets | Giorgio Calderone et.al. | 2312.13194v1 | null |
2023-12-20 | Ergodic measures for periodic type |
Yuriy Tumarkin et.al. | 2312.13165v1 | null |
2023-12-20 | Underwater Acoustic Signal Recognition Based on Salient Features | Minghao Chen et.al. | 2312.13143v1 | null |
2023-12-19 | Tracking Any Object Amodally | Cheng-Yen Hsieh et.al. | 2312.12433v1 | null |
2023-12-19 | The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark | Aditya Murali et.al. | 2312.12429v1 | null |
2023-12-19 | Chasing Fairness in Graphs: A GNN Architecture Perspective | Zhimeng Jiang et.al. | 2312.12369v1 | link |
2023-12-19 | Easy quantum groups | Teo Banica et.al. | 2312.12368v1 | null |
2023-12-19 | SMC-NCA: Semantic-guided Multi-level Contrast for Semi-supervised Action Segmentation | Feixiang Zhou et.al. | 2312.12347v1 | null |
2023-12-19 | On the Effectiveness of Retrieval, Alignment, and Replay in Manipulation | Norman Di Palo et.al. | 2312.12345v1 | null |
2023-12-19 | Full-reference Video Quality Assessment for User Generated Content Transcoding | Zihao Qi et.al. | 2312.12317v1 | null |
2023-12-19 | First qualitative observations on deep learning vision model YOLO and DETR for automated driving in Austria | Stefan Schoder et.al. | 2312.12314v1 | null |
2023-12-19 | Holography of New Conformal Higher Spin Gravities in 3d | I. Lovrekovic et.al. | 2312.12301v1 | null |
2023-12-19 | Prompt-based Domain Discrimination for Multi-source Time Series Domain Adaptation | Junxiang Wang et.al. | 2312.12276v1 | null |
2023-12-18 | Development and Evaluation of Ensemble Learning-based Environmental Methane Detection and Intensity Prediction Models | Reek Majumder et.al. | 2312.10879v1 | null |
2023-12-18 | Mimic: Speaking Style Disentanglement for Speech-Driven 3D Facial Animation | Hui Fu et.al. | 2312.10877v1 | null |
2023-12-17 | Global relaxation-based LP-Newton method for multiple hyperparameter selection in support vector classification with feature selection | Qingna Li et.al. | 2312.10848v1 | null |
2023-12-17 | Online Boosting Adaptive Learning under Concept Drift for Multistream Classification | En Yu et.al. | 2312.10841v1 | null |
2023-12-17 | Learning to Act without Actions | Dominik Schmidt et.al. | 2312.10812v1 | null |
2023-12-17 | Land use/land cover classification of fused Sentinel-1 and Sentinel-2 imageries using ensembles of Random Forests | Shivam Pande et.al. | 2312.10798v1 | null |
2023-12-17 | Learning to Learn in Interactive Constraint Acquisition | Dimos Tsouros et.al. | 2312.10795v1 | null |
2023-12-17 | Identification of Knowledge Neurons in Protein Language Models | Divya Nori et.al. | 2312.10770v1 | null |
2023-12-17 | Multi-Label Classification of COVID-Tweets Using Large Language Models | Aniket Deroy et.al. | 2312.10748v1 | link |
2023-12-17 | Unmasking Deepfake Faces from Videos Using An Explainable Cost-Sensitive Deep Learning Approach | Faysal Mahmud et.al. | 2312.10740v1 | link |
2023-12-15 | Understanding Probe Behaviors through Variational Bounds of Mutual Information | Kwanghee Choi et.al. | 2312.10019v1 | link |
2023-12-15 | Wearable Coaxially-shielded Metamaterial for Magnetic Resonance Imaging | Xia Zhu et.al. | 2312.10018v1 | null |
2023-12-15 | On the Invertibility of Euler Integral Transforms with Hyperplanes and Quadric Hypersurfaces | Mattie Ji et.al. | 2312.10002v1 | null |
2023-12-15 | Towards Architecture-Insensitive Untrained Network Priors for Accelerated MRI Reconstruction | Yilin Liu et.al. | 2312.09988v1 | null |
2023-12-15 | DHFormer: A Vision Transformer-Based Attention Module for Image Dehazing | Abdul Wasi et.al. | 2312.09955v1 | null |
2023-12-15 | Multi-level graph learning for audio event classification and human-perceived annoyance rating prediction | Yuanbo Hou et.al. | 2312.09952v1 | null |
2023-12-15 | LogoStyleFool: Vitiating Video Recognition Systems via Logo Style Transfer | Yuxin Cao et.al. | 2312.09935v1 | link |
2023-12-15 | RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding | Yuxin Zi et.al. | 2312.09932v1 | null |
2023-12-15 | Reliable Probabilistic Classification with Neural Networks | Harris Papadopoulos et.al. | 2312.09912v1 | null |
2023-12-15 | TMP: Temporal Motion Propagation for Online Video Super-Resolution | Zhengqiang Zhang et.al. | 2312.09909v1 | null |
2023-12-14 | 3DGS-Avatar: Animatable Avatars via Deformable 3D Gaussian Splatting | Zhiyin Qian et.al. | 2312.09228v1 | null |
2023-12-14 | Efficient Online Learning of Contact Force Models for Connector Insertion | Kevin Tracy et.al. | 2312.09190v1 | null |
2023-12-14 | General Object Foundation Model for Images and Videos at Scale | Junfeng Wu et.al. | 2312.09158v1 | null |
2023-12-14 | Evaluating Augmented Reality Communication: How Can We Teach Procedural Skill in AR? | Manuel Rebol et.al. | 2312.09152v1 | null |
2023-12-14 | Split-Ensemble: Efficient OOD-aware Ensemble via Task and Model Splitting | Anthony Chen et.al. | 2312.09148v1 | null |
2023-12-14 | Class-Wise Buffer Management for Incremental Object Detection: An Effective Buffer Training Strategy | Junsu Kim et.al. | 2312.09139v1 | null |
2023-12-14 | Less is more -- the Dispatcher/ Executor principle for multi-task Reinforcement Learning | Martin Riedmiller et.al. | 2312.09120v1 | null |
2023-12-14 | VideoLCM: Video Latent Consistency Model | Xiang Wang et.al. | 2312.09109v1 | null |
2023-12-14 | FastInject: Injecting Unpaired Text Data into CTC-based ASR training | Keqi Deng et.al. | 2312.09100v1 | null |
2023-12-14 | Agent Attention: On the Integration of Softmax and Linear Attention | Dongchen Han et.al. | 2312.08874v1 | link |
2023-12-13 | VLAP: Efficient Video-Language Alignment via Frame Prompting and Distilling for Video Question Answering | Xijun Wang et.al. | 2312.08367v1 | null |
2023-12-13 | Challenges and Opportunities in Implementing Negative Differential Resistance Mode Reconfigurable Field Effect Transistors | Lephe S et.al. | 2312.08351v1 | null |
2023-12-13 | Ehancing CT Image synthesis from multi-modal MRI data based on a multi-task neural network framework | Zhuoyao Xin et.al. | 2312.08343v1 | null |
2023-12-13 | Preparing VVC for Streaming: A Fast Multi-Rate Encoding Approach | Yiqun Liu et.al. | 2312.08330v1 | null |
2023-12-13 | Affine monoids of corank one | Yulia Zaitseva et.al. | 2312.08316v1 | null |
2023-12-13 | VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space | Guénolé Fiche et.al. | 2312.08291v1 | null |
2023-12-13 | PhenDiff: Revealing Invisible Phenotypes with Conditional Diffusion Models | Anis Bourou et.al. | 2312.08290v1 | link |
2023-12-13 | On the verification of Embeddings using Hybrid Markov Logic | Anup Shakya et.al. | 2312.08287v1 | null |
2023-12-14 | High-throughput Biomedical Relation Extraction for Semi-Structured Web Articles Empowered by Large Language Models | Songchi Zhou et.al. | 2312.08274v2 | null |
2023-12-13 | Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation | Arul Selvam Periyasamy et.al. | 2312.08268v1 | null |
2023-12-12 | diff History for Long-Context Language Agents | Ulyana Piterbarg et.al. | 2312.07540v1 | null |
2023-12-12 | FreeInit: Bridging Initialization Gap in Video Diffusion Models | Tianxing Wu et.al. | 2312.07537v1 | link |
2023-12-12 | WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Soyong Shin et.al. | 2312.07531v1 | null |
2023-12-12 | RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation | Peng Lu et.al. | 2312.07526v1 | link |
2023-12-12 | PEEKABOO: Interactive Video Generation via Masked-Diffusion | Yash Jain et.al. | 2312.07509v1 | null |
2023-12-12 | NAC-TCN: Temporal Convolutional Networks with Causal Dilated Neighborhood Attention for Emotion Understanding | Alexander Mehta et.al. | 2312.07507v1 | link |
2023-12-12 | COLMAP-Free 3D Gaussian Splatting | Yang Fu et.al. | 2312.07504v1 | null |
2023-12-12 | NearbyPatchCL: Leveraging Nearby Patches for Self-Supervised Patch-Level Multi-Class Classification in Whole-Slide Images | Gia-Bao Le et.al. | 2312.07489v1 | null |
2023-12-12 | MinD-3D: Reconstruct High-quality 3D objects in Human Brain | Jianxiong Gao et.al. | 2312.07485v1 | null |
2023-12-12 | Classification of retail products: From probabilistic ranking to neural networks | Manar Mohamed Hafez et.al. | 2312.07482v1 | null |
2023-12-11 | Photorealistic Video Generation with Diffusion Models | Agrim Gupta et.al. | 2312.06662v1 | null |
2023-12-11 | LightSim: Neural Lighting Simulation for Urban Scenes | Ava Pun et.al. | 2312.06654v1 | null |
2023-12-11 | Beyond Classification: Definition and Density-based Estimation of Calibration in Object Detection | Teodora Popordanoska et.al. | 2312.06645v1 | null |
2023-12-11 | Upscale-A-Video: Temporal-Consistent Diffusion Model for Real-World Video Super-Resolution | Shangchen Zhou et.al. | 2312.06640v1 | null |
2023-12-12 | TMT-VIS: Taxonomy-aware Multi-dataset Joint Training for Video Instance Segmentation | Rongkun Zheng et.al. | 2312.06630v2 | link |
2023-12-11 | Neural Text to Articulate Talk: Deep Text to Audiovisual Speech Synthesis achieving both Auditory and Photo-realism | Georgios Milis et.al. | 2312.06613v1 | link |
2023-12-11 | Early Action Recognition with Action Prototypes | Guglielmo Camporese et.al. | 2312.06598v1 | null |
2023-12-11 | Flexible visual prompts for in-context learning in computer vision | Thomas Foster et.al. | 2312.06592v1 | link |
2023-12-11 | QuickQuakeBuildings: Post-earthquake SAR-Optical Dataset for Quick Damaged-building Detection | Yao Sun et.al. | 2312.06587v1 | null |
2023-12-12 | ESO/HARPS Radial Velocities Catalog | Mauro Barbieri et.al. | 2312.06586v2 | null |
2023-12-08 | The Long Secondary Period (LSP) Variables: Overview and Some Analysis | John R. Percy et.al. | 2312.05255v1 | null |
2023-12-08 | Few-Shot Class-Incremental Learning via Training-Free Prototype Calibration | Qi-Wei Wang et.al. | 2312.05229v1 | null |
2023-12-08 | Shape Matters: Detecting Vertebral Fractures Using Differentiable Point-Based Shape Decoding | Hellena Hempe et.al. | 2312.05220v1 | link |
2023-12-08 | Enhancing Facial Classification and Recognition using 3D Facial Models and Deep Learning | Houting Li et.al. | 2312.05219v1 | null |
2023-12-08 | IntrinsicAvatar: Physically Based Inverse Rendering of Dynamic Humans from Monocular Videos via Explicit Ray Tracing | Shaofei Wang et.al. | 2312.05210v1 | null |
2023-12-08 | Embedding theory in ML toward real-time tracking of structural dynamics through hyperspectral datasets | Jonathan D Hollenbach et.al. | 2312.05201v1 | null |
2023-12-08 | Video-Based Rendering Techniques: A Survey | Rafael Kuffner dos Anjos et.al. | 2312.05179v1 | null |
2023-12-08 | Enhancing Single-Frame Supervision for Better Temporal Action Localization | Changjian Chen et.al. | 2312.05178v1 | null |
2023-12-08 | MRI Scan Synthesis Methods based on Clustering and Pix2Pix | Giulia Baldini et.al. | 2312.05176v1 | null |
2023-12-08 | TriHuman : A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis | Heming Zhu et.al. | 2312.05161v1 | null |
2023-12-07 | GenDeF: Learning Generative Deformation Field for Video Generation | Wen Wang et.al. | 2312.04561v1 | null |
2023-12-07 | MonoGaussianAvatar: Monocular Gaussian Point-based Head Avatar | Yufan Chen et.al. | 2312.04558v1 | null |
2023-12-07 | GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation | Shoufa Chen et.al. | 2312.04557v1 | null |
2023-12-07 | SPIDeRS: Structured Polarization for Invisible Depth and Reflectance Sensing | Tomoki Ichikawa et.al. | 2312.04553v1 | null |
2023-12-07 | PlayFusion: Skill Acquisition via Diffusion from Language-Annotated Play | Lili Chen et.al. | 2312.04549v1 | null |
2023-12-07 | Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception? | Aritra Dutta et.al. | 2312.04548v1 | null |
2023-12-07 | Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models | Ivan Kapelyukh et.al. | 2312.04533v1 | null |
2023-12-07 | Camera Height Doesn't Change: Unsupervised Monocular Scale-Aware Road-Scene Depth Estimation | Genki Kinoshita et.al. | 2312.04530v1 | null |
2023-12-07 | RAVE: Randomized Noise Shuffling for Fast and Consistent Video Editing with Diffusion Models | Ozgur Kara et.al. | 2312.04524v1 | link |
2023-12-07 | Hierarchical Spatio-temporal Decoupling for Text-to-Video Generation | Zhiwu Qing et.al. | 2312.04483v1 | null |
2023-12-06 | OneLLM: One Framework to Align All Modalities with Language | Jiaming Han et.al. | 2312.03700v1 | link |
2023-12-07 | Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers | Umberto Cappellazzo et.al. | 2312.03694v2 | null |
2023-12-06 | Direct Exoplanet Detection Using Deep Convolutional Image Reconstruction (ConStruct): A New Algorithm for Post-Processing High-Contrast Images | Trevor N. Wolf et.al. | 2312.03671v1 | null |
2023-12-06 | Annihilating branching Brownian motion | Daniel Ahlberg et.al. | 2312.03669v1 | null |
2023-12-06 | Towards small and accurate convolutional neural networks for acoustic biodiversity monitoring | Serge Zaugg et.al. | 2312.03666v1 | null |
2023-12-06 | Reason2Drive: Towards Interpretable and Chain-based Reasoning for Autonomous Driving | Ming Nie et.al. | 2312.03661v1 | link |
2023-12-06 | Editable Stain Transformation Of Histological Images Using Unpaired GANs | Tibor Sloboda et.al. | 2312.03647v1 | link |
2023-12-06 | MotionCtrl: A Unified and Flexible Motion Controller for Video Generation | Zhouxia Wang et.al. | 2312.03641v1 | null |
2023-12-06 | Training Neural Networks on RAW and HDR Images for Restoration Tasks | Lei Luo et.al. | 2312.03640v1 | link |
2023-12-07 | Evaluation of Active Feature Acquisition Methods for Static Feature Settings | Henrik von Kleist et.al. | 2312.03619v2 | null |
2023-12-05 | Dexterous Functional Grasping | Ananye Agarwal et.al. | 2312.02975v1 | null |
2023-12-05 | Describing Differences in Image Sets with Natural Language | Lisa Dunlap et.al. | 2312.02974v1 | link |
2023-12-05 | GauHuman: Articulated Gaussian Splatting from Monocular Human Videos | Shoukang Hu et.al. | 2312.02973v1 | link |
2023-12-05 | Detecting algorithmic bias in medical AI-models | Jeffrey Smith et.al. | 2312.02959v1 | null |
2023-12-05 | Classification for everyone : Building geography agnostic models for fairer recognition | Akshat Jindal et.al. | 2312.02957v1 | null |
2023-12-05 | Choroidalyzer: An open-source, end-to-end pipeline for choroidal analysis in optical coherence tomography | Justin Engelmann et.al. | 2312.02956v1 | null |
2023-12-05 | An alternating peak-optimization method for optimal trajectory generation of quadrotor drones | Wytze A. B. de Vries et.al. | 2312.02944v1 | null |
2023-12-05 | Fast CT anatomic localization algorithm | Amit Oved et.al. | 2312.02941v1 | null |
2023-12-05 | Drag-A-Video: Non-rigid Video Editing with Point-based Interaction | Yao Teng et.al. | 2312.02936v1 | null |
2023-12-06 | WoVoGen: World Volume-aware Diffusion for Controllable Multi-camera Driving Scene Generation | Jiachen Lu et.al. | 2312.02934v2 | link |
2023-12-04 | iMatching: Imperative Correspondence Learning | Zitong Zhan et.al. | 2312.02141v1 | null |
2023-12-04 | Fast View Synthesis of Casual Videos | Yao-Chih Lee et.al. | 2312.02135v1 | null |
2023-12-04 | GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians | Liangxiao Hu et.al. | 2312.02134v1 | null |
2023-12-04 | Hot PATE: Private Aggregation of Distributions for Diverse Task | Edith Cohen et.al. | 2312.02132v1 | null |
2023-12-04 | Can we truly transfer an actor's genuine happiness to avatars? An investigation into virtual, real, posed and spontaneous faces | Vitor Miguel Xavier Peres et.al. | 2312.02128v1 | null |
2023-12-04 | Cosmic star-formation history and black hole accretion history inferred from the JWST mid-infrared source counts | Seong Jin Kim et.al. | 2312.02090v1 | null |
2023-12-05 | VideoSwap: Customized Video Subject Swapping with Interactive Semantic Point Correspondence | Yuchao Gu et.al. | 2312.02087v2 | null |
2023-12-04 | Integrating AI into CCTV Systems: A Comprehensive Evaluation of Smart Video Surveillance in Community Space | Shanle Yao et.al. | 2312.02078v1 | null |
2023-12-04 | GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians | Shenhan Qian et.al. | 2312.02069v1 | null |
2023-12-04 | TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding | Shuhuai Ren et.al. | 2312.02051v1 | null |
2023-12-01 | Dense Optical Tracking: Connecting the Dots | Guillaume Le Moing et.al. | 2312.00786v1 | null |
2023-12-01 | Sequential Modeling Enables Scalable Learning for Large Vision Models | Yutong Bai et.al. | 2312.00785v1 | null |
2023-12-01 | MorpheuS: Neural Dynamic 360° Surface Reconstruction from Monocular RGB-D Video | Hengyi Wang et.al. | 2312.00778v1 | null |
2023-12-01 | VideoBooth: Diffusion-based Video Generation with Image Prompts | Yuming Jiang et.al. | 2312.00777v1 | null |
2023-12-01 | Towards Generalizable Zero-Shot Manipulation via Translating Human Interaction Plans | Homanga Bharadhwaj et.al. | 2312.00775v1 | null |
2023-12-01 | Explaining Knock-on Effects of Bias Mitigation | Svetoslav Nizhnichenkov et.al. | 2312.00765v1 | null |
2023-12-04 | Deep Unlearning: Fast and Efficient Training-free Approach to Controlled Forgetting | Sangamesh Kodge et.al. | 2312.00761v2 | null |
2023-12-01 | Mitigating Over-smoothing in Transformers via Regularized Nonlocal Functionals | Tam Nguyen et.al. | 2312.00751v1 | null |
2023-12-01 | Tight-minimal dichotomies in Banach spaces | Alejandra C. Cáceres-Rigo et.al. | 2312.00721v1 | null |
2023-12-01 | GIFT: Generative Interpretable Fine-Tuning Transformers | Chinmay Savadikar et.al. | 2312.00700v1 | link |
2023-11-30 | Just Add |
Dominick Reilly et.al. | 2311.18840v1 | null |
2023-11-30 | TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios | Lihao Liu et.al. | 2311.18839v1 | null |
2023-11-30 | VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | Zhen Xing et.al. | 2311.18837v1 | null |
2023-11-30 | ART$\boldsymbol{\cdot}$V: Auto-Regressive Text-to-Video Generation with Diffusion Models | Wenming Weng et.al. | 2311.18834v1 | null |
2023-11-30 | MotionEditor: Editing Video Motion via Content-Aware Diffusion | Shuyuan Tu et.al. | 2311.18830v1 | link |
2023-11-30 | MicroCinema: A Divide-and-Conquer Approach for Text-to-Video Generation | Yanhui Wang et.al. | 2311.18829v1 | null |
2023-11-30 | Motion-Conditioned Image Animation for Video Editing | Wilson Yan et.al. | 2311.18827v1 | null |
2023-11-30 | CAST: Cross-Attention in Space and Time for Video Action Recognition | Dongho Lee et.al. | 2311.18825v1 | link |
2023-11-30 | Dichotomy of Early and Late Phase Implicit Biases Can Provably Induce Grokking | Kaifeng Lyu et.al. | 2311.18817v1 | link |
2023-11-30 | BIOCLIP: A Vision Foundation Model for the Tree of Life | Samuel Stevens et.al. | 2311.18803v1 | null |
2023-11-30 | Do text-free diffusion models learn discriminative visual representations? | Soumik Mukhopadhyay et.al. | 2311.17921v2 | null |
2023-11-29 | Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving | Yuqi Wang et.al. | 2311.17918v1 | link |
2023-11-29 | HUGS: Human Gaussian Splats | Muhammed Kocabas et.al. | 2311.17910v1 | null |
2023-11-29 | SODA: Bottleneck Diffusion Models for Representation Learning | Drew A. Hudson et.al. | 2311.17901v1 | null |
2023-11-30 | Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis | Jinqi Luo et.al. | 2311.17898v2 | null |
2023-11-29 | On the geometry of tensor products over finite fields | Stefano Lia et.al. | 2311.17896v1 | null |
2023-11-29 | Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Shuangrui Ding et.al. | 2311.17893v1 | null |
2023-11-29 | TSDF-Sampling: Efficient Sampling for Neural Surface Field using Truncated Signed Distance Field | Chaerin Min et.al. | 2311.17878v1 | null |
2023-11-29 | Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification | Tristan Gomez et.al. | 2311.17876v1 | null |
2023-11-29 | On the Adversarial Robustness of Graph Contrastive Learning Methods | Filippo Guerranti et.al. | 2311.17853v1 | null |
2023-11-28 | Panoptic Video Scene Graph Generation | Jingkang Yang et.al. | 2311.17058v1 | link |
2023-11-28 | Self-Supervised Motion Magnification by Backpropagating Through Optical Flow | Zhaoying Pan et.al. | 2311.17056v1 | null |
2023-11-28 | MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training | Pavan Kumar Anasosalu Vasu et.al. | 2311.17049v1 | null |
2023-11-28 | Jets of foliations and |
Francis Bischoff et.al. | 2311.17045v1 | null |
2023-11-28 | LLaMA-VID: An Image is Worth 2 Tokens in Large Language Models | Yanwei Li et.al. | 2311.17043v1 | link |
2023-11-29 | Efficient In-Context Learning in Vision-Language Models for Egocentric Videos | Keunwoo Peter Yu et.al. | 2311.17041v2 | null |
2023-11-28 | Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer | Danah Yatim et.al. | 2311.17009v1 | null |
2023-11-28 | MVBench: A Comprehensive Multi-modal Video Understanding Benchmark | Kunchang Li et.al. | 2311.17005v1 | link |
2023-11-28 | Mirković-Vilonen Polytopes from Combinatorics | Mario Sanchez et.al. | 2311.16979v1 | null |
2023-11-28 | Natural Language Processing Through Transfer Learning: A Case Study on Sentiment Analysis | Aman Yadav et.al. | 2311.16965v1 | null |
2023-11-28 | Video-Bench: A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models | Munan Ning et.al. | 2311.16103v2 | link |
2023-11-27 | GART: Gaussian Articulated Template Models | Jiahui Lei et.al. | 2311.16099v1 | null |
2023-11-27 | On Bringing Robots Home | Nur Muhammad Mahi Shafiullah et.al. | 2311.16098v1 | link |
2023-11-27 | CG-HOI: Contact-Guided 3D Human-Object Interaction Generation | Christian Diller et.al. | 2311.16097v1 | null |
2023-11-27 | Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling | Zhe Li et.al. | 2311.16096v1 | link |
2023-11-27 | Three-dimensional |
Alexander C. Tyner et.al. | 2311.16092v1 | null |
2023-11-27 | BERT Goes Off-Topic: Investigating the Domain Transfer Challenge using Genre Classification | Dmitri Roussinov et.al. | 2311.16083v1 | link |
2023-11-27 | ViT-Lens-2: Gateway to Omni-modal Intelligence | Weixian Lei et.al. | 2311.16081v1 | link |
2023-11-27 | Correlated Spectral and Recurrence Variations of Cygnus X-1 | E. M. Broadbent et.al. | 2311.16070v1 | null |
2023-11-27 | DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization | Zhaoyang Xia et.al. | 2311.16060v1 | link |
2023-11-24 | SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation | Lingchen Meng et.al. | 2311.14671v1 | link |
2023-11-24 | JetLOV: Enhancing Jet Tree Tagging through Neural Network Learning of Optimal LundNet Variables | Mauricio A. Diaz et.al. | 2311.14654v1 | link |
2023-11-24 | Learning in Deep Factor Graphs with Gaussian Belief Propagation | Seth Nabarro et.al. | 2311.14649v1 | null |
2023-11-24 | Continuous football player tracking from discrete broadcast data | Matthew J. Penn et.al. | 2311.14642v1 | null |
2023-11-24 | Emergent Topology in Many-Body Dissipative Quantum Chaos | Antonio M. García-García et.al. | 2311.14640v1 | null |
2023-11-24 | Unsupervised high-throughput segmentation of cells and cell nuclei in quantitative phase images | Julia Sistermanns et.al. | 2311.14639v1 | null |
2023-11-24 | ARIA: On the interaction between Architectures, Aggregation methods and Initializations in federated visual classification | Vasilis Siomos et.al. | 2311.14625v1 | null |
2023-11-24 | Neural Style Transfer for Computer Games | Eleftherios Ioannou et.al. | 2311.14617v1 | null |
2023-11-24 | Animate124: Animating One Image to 4D Dynamic Scene | Yuyang Zhao et.al. | 2311.14603v1 | null |
2023-11-24 | A Metalearned Neural Circuit for Nonparametric Bayesian Inference | Jake C. Snell et.al. | 2311.14601v1 | link |
2023-11-22 | WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space | Katja Schwarz et.al. | 2311.13570v1 | null |
2023-11-22 | Belted sum decompositions of fully augmented links | Porter Morgan et.al. | 2311.13540v1 | null |
2023-11-22 | Learned Nonlinear Predictor for Critically Sampled 3D Point Cloud Attribute Compression | Tam Thuc Do et.al. | 2311.13539v1 | null |
2023-11-22 | Leveraging CNNs and Ensemble Learning for Automated Disaster Image Classification | Archit Rathod et.al. | 2311.13531v1 | null |
2023-11-22 | Applying Dimensionality Reduction as Precursor to LSTM-CNN Models for Classifying Imagery and Motor Signals in ECoG-Based BCIs | Soham Bafana et.al. | 2311.13507v1 | link |
2023-11-22 | Current Topological and Machine Learning Applications for Bias Detection in Text | Colleen Farrelly et.al. | 2311.13495v1 | null |
2023-11-22 | Benchmarking Toxic Molecule Classification using Graph Neural Networks and Few Shot Learning | Bhavya Mehta et.al. | 2311.13490v1 | null |
2023-11-22 | Deep-learning-based acceleration of MRI for radiotherapy planning of pediatric patients with brain tumors | Shahinur Alam et.al. | 2311.13485v1 | link |
2023-11-22 | Solution discovery via reconfiguration for problems in P | Mario Grobler et.al. | 2311.13478v1 | null |
2023-11-22 | Experimentation in Early-Stage Video Game Startups: Practices and Challenges | Henry Edison et.al. | 2311.13462v1 | null |
2023-11-21 | Physics-guided Shape-from-Template: Monocular Video Perception through Neural Surrogate Models | David Stotko et.al. | 2311.12796v1 | null |
2023-11-21 | Quantifying Impairment and Disease Severity Using AI Models Trained on Healthy Subjects | Boyang Yu et.al. | 2311.12781v1 | link |
2023-11-21 | Swift Parameter-free Attention Network for Efficient Super-Resolution | Cheng Wan et.al. | 2311.12770v1 | link |
2023-11-22 | Investigating Weight-Perturbed Deep Neural Networks With Application in Iris Presentation Attack Detection | Renu Sharma et.al. | 2311.12764v2 | link |
2023-11-21 | High-resolution Image-based Malware Classification using Multiple Instance Learning | Tim Peters et.al. | 2311.12760v1 | link |
2023-11-21 | SelfOcc: Self-Supervised Vision-Based 3D Occupancy Prediction | Yuanhui Huang et.al. | 2311.12754v1 | link |
2023-11-21 | Image Transformation for IoT Time-Series Data: A Review | Duygu Altunkaya et.al. | 2311.12742v1 | null |
2023-11-21 | Exploring Graph Classification Techniques Under Low Data Constraints: A Comprehensive Study | Kush Kothari et.al. | 2311.12737v1 | null |
2023-11-21 | Not Just Training, Also Testing: High School Youths' Perspective-Taking through Peer Testing Machine Learning-Powered Applications | L. Morales-Navarro et.al. | 2311.12733v1 | null |
2023-11-21 | Cascade Learning Localises Discriminant Features in Visual Scene Classification | Junwen Wang et.al. | 2311.12704v1 | null |
2023-11-20 | Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation | Wenhao Li et.al. | 2311.12028v1 | null |
2023-11-20 | GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration | Naoki Wake et.al. | 2311.12015v1 | null |
2023-11-20 | Evaluating Supervision Levels Trade-Offs for Infrared-Based People Counting | David Latortue et.al. | 2311.11974v1 | null |
2023-11-20 | SA-Med2D-20M Dataset: Segment Anything in 2D Medical Imaging with 20 Million masks | Jin Ye et.al. | 2311.11969v1 | link |
2023-11-20 | Correlated Attention in Transformers for Multivariate Time Series | Quang Minh Nguyen et.al. | 2311.11959v1 | null |
2023-11-20 | Tubular Curvature Filter: Implicit Pointwise Curvature Calculation Method for Tubular Objects | Elifnur Sunger et.al. | 2311.11931v1 | null |
2023-11-20 | LLMs as Visual Explainers: Advancing Image Classification with Evolving Visual Descriptions | Songhao Han et.al. | 2311.11904v1 | null |
2023-11-20 | Multimodal Characterization of Emotion within Multimedia Space | Dayo Samuel Banjo et.al. | 2311.11892v1 | null |
2023-11-20 | SniffyArt: The Dataset of Smelling Persons | Mathias Zinnen et.al. | 2311.11888v1 | null |
2023-11-20 | Multi-Task Faces (MTF) Data Set: A Legally and Ethically Compliant Collection of Face Images for Various Classification Tasks | Rami Haffar et.al. | 2311.11882v1 | link |
2023-11-17 | Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning | Rohit Girdhar et.al. | 2311.10709v1 | null |
2023-11-17 | SpACNN-LDVAE: Spatial Attention Convolutional Latent Dirichlet Variational Autoencoder for Hyperspectral Pixel Unmixing | Soham Chitnis et.al. | 2311.10701v1 | null |
2023-11-17 | A note on the convergence of the Bayesian entropy estimator for exchangeable partitions | Servet Martinez et.al. | 2311.10698v1 | null |
2023-11-17 | Distilling and Retrieving Generalizable Knowledge for Robot Manipulation via Language Corrections | Lihan Zha et.al. | 2311.10678v1 | link |
2023-11-17 | 3D-TexSeg: Unsupervised Segmentation of 3D Texture using Mutual Transformer Learning | Iyyakutti Iyappan Ganapathi et.al. | 2311.10651v1 | null |
2023-11-17 | User Dynamics-Aware Edge Caching and Computing for Mobile Virtual Reality | Mushu Li et.al. | 2311.10645v1 | null |
2023-11-17 | Image-Domain Material Decomposition for Dual-energy CT using Unsupervised Learning with Data-fidelity Loss | Junbo Peng et.al. | 2311.10641v1 | null |
2023-11-17 | Scaling TabPFN: Sketching and Feature Selection for Tabular Prior-Data Fitted Networks | Benjamin Feuer et.al. | 2311.10609v1 | null |
2023-11-17 | Designing Reconfigurable Intelligent Systems with Markov Blankets | Boris Sedlak et.al. | 2311.10597v1 | null |
2023-11-17 | FOCAL: A Cost-Aware Video Dataset for Active Learning | Kiran Kokilepersaud et.al. | 2311.10591v1 | link |
2023-11-16 | Traffic Video Object Detection using Motion Prior | Lihao Liu et.al. | 2311.10092v1 | null |
2023-11-16 | Moduli space of rank three logarithmic connections on the projective line with three poles | Takafumi Matsumoto et.al. | 2311.10071v1 | null |
2023-11-16 | Inherently Interpretable Time Series Classification via Multiple Instance Learning | Joseph Early et.al. | 2311.10049v1 | link |
2023-11-16 | On the potential of Carbon-Enhanced Metal-Poor stars for Galactic Archaeology | Aruna Goswami et.al. | 2311.10043v1 | null |
2023-11-16 | Match and Locate: low-frequency monocular odometry based on deep feature matching | Stepan Konev et.al. | 2311.10034v1 | null |
2023-11-16 | Revolutionizing Customer Interactions: Insights and Challenges in Deploying ChatGPT and Generative Chatbots for FAQs | Feriel Khennouche et.al. | 2311.09976v1 | null |
2023-11-16 | From Pretext to Purpose: Batch-Adaptive Self-Supervised Learning | Jiansong Zhang et.al. | 2311.09974v1 | null |
2023-11-16 | VertDetect: Fully End-to-End 3D Vertebral Instance Segmentation Model | Geoff Klein et.al. | 2311.09958v1 | null |
2023-11-16 | Harnessing Transformers: A Leap Forward in Lung Cancer Image Detection | Amine Bechar et.al. | 2311.09942v1 | null |
2023-11-17 | A Framework for Monitoring and Retraining Language Models in Real-World Applications | Jaykumar Kasundra et.al. | 2311.09930v2 | null |
2023-11-15 | Single-Image 3D Human Digitization with Shape-Guided Diffusion | Badour AlBahar et.al. | 2311.09221v1 | null |
2023-11-15 | ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy | Kirill Vishniakov et.al. | 2311.09215v1 | link |
2023-11-15 | Topology of Pulsar Profiles (ToPP). I. Graph theory method and classification of the EPN | D. Vohl et.al. | 2311.09201v1 | null |
2023-11-15 | ExpM+NF: Differentially Private Machine Learning that Surpasses DPSGD | Robert A. Bridges et.al. | 2311.09200v1 | null |
2023-11-15 | Domain Aligned CLIP for Few-shot Classification | Muhammad Waleed Gondal et.al. | 2311.09191v1 | null |
2023-11-15 | ContraDoc: Understanding Self-Contradictions in Documents with Large Language Models | Jierui Li et.al. | 2311.09182v1 | null |
2023-11-15 | RBPGAN: Recurrent Back-Projection GAN for Video Super Resolution | Dareen Hussein et.al. | 2311.09178v1 | null |
2023-11-15 | Model Agnostic Explainable Selective Regression via Uncertainty Estimation | Andrea Pugnana et.al. | 2311.09145v1 | null |
2023-11-15 | Explainable Text Classification Techniques in Legal Document Review: Locating Rationales without Using Human Annotated Training Text Snippets | Christian Mahoney et.al. | 2311.09133v1 | null |
2023-11-15 | Cross-view and Cross-pose Completion for 3D Human Understanding | Matthieu Armando et.al. | 2311.09104v1 | null |
2023-11-14 | MVSA-Net: Multi-View State-Action Recognition for Robust and Deployable Trajectory Generation | Ehsan Asali et.al. | 2311.08393v1 | null |
2023-11-14 | USLR: an open-source tool for unbiased and smooth longitudinal registration of brain MR | Adrià Casamitjana et.al. | 2311.08371v1 | link |
2023-11-14 | Inverse Learning with Extremely Sparse Feedback for Recommendation | Guanyu Lin et.al. | 2311.08302v1 | null |
2023-11-14 | Level Set KSVD | Omer Sapir et.al. | 2311.08284v1 | null |
2023-11-14 | TENT: Connect Language Models with IoT Sensors for Zero-Shot Activity Recognition | Yunjiao Zhou et.al. | 2311.08245v1 | null |
2023-11-14 | MCMC to address model misspecification in Deep Learning classification of Radio Galaxies | Devina Mohan et.al. | 2311.08243v1 | null |
2023-11-14 | Learning Physics-Inspired Regularization for Medical Image Registration with Hypernetworks | Anna Reithmeir et.al. | 2311.08239v1 | link |
2023-11-14 | Counterfactual Explanation for Regression via Disentanglement in Latent Space | Xuan Zhao et.al. | 2311.08228v1 | null |
2023-11-14 | Uni-COAL: A Unified Framework for Cross-Modality Synthesis and Super-Resolution of MR Images | Zhiyun Song et.al. | 2311.08225v1 | null |
2023-11-14 | Eval-GCSC: A New Metric for Evaluating ChatGPT's Performance in Chinese Spelling Correction | Kunting Li et.al. | 2311.08219v1 | link |
2023-11-13 | GPT-4V(ision) as A Social Media Analysis Engine | Hanjia Lyu et.al. | 2311.07547v1 | link |
2023-11-13 | mlscorecheck: Testing the consistency of reported performance scores and experiments in machine learning | György Kovács et.al. | 2311.07541v1 | null |
2023-11-13 | FEMDA: a unified framework for discriminant analysis | Pierre Houdouin et.al. | 2311.07518v1 | null |
2023-11-13 | Reducing the Need for Backpropagation and Discovering Better Optima With Explicit Optimizations of Neural Networks | Jake Ryland Williams et.al. | 2311.07498v1 | null |
2023-11-13 | Towards Robotic Tree Manipulation: Leveraging Graph Representations | Chung Hee Kim et.al. | 2311.07479v1 | null |
2023-11-13 | Temporal Performance Prediction for Deep Convolutional Long Short-Term Memory Networks | Laura Fieback et.al. | 2311.07477v1 | null |
2023-11-13 | Masked Face Dataset Generation and Masked Face Recognition | Rui Cai et.al. | 2311.07475v1 | link |
2023-11-13 | A Bayesian Approach to Strong Lens Finding in the Era of Wide-area Surveys | Philip Holloway et.al. | 2311.07455v1 | null |
2023-11-13 | On the Robustness of Neural Collapse and the Neural Collapse of Robustness | Jingtong Su et.al. | 2311.07444v1 | null |
2023-11-13 | Optimising Human-AI Collaboration by Learning Convincing Explanations | Alex J. Chan et.al. | 2311.07426v1 | null |
2023-11-10 | Learning Human Action Recognition Representations Without Real Humans | Howard Zhong et.al. | 2311.06231v1 | link |
2023-11-10 | Semantic-aware Video Representation for Few-shot Action Recognition | Yutao Tang et.al. | 2311.06218v1 | null |
2023-11-10 | MultiIoT: Towards Large-scale Multisensory Learning for the Internet of Things | Shentong Mo et.al. | 2311.06217v1 | null |
2023-11-10 | Deep learning segmentation of fibrous cap in intravascular optical coherence tomography images | Juhwan Lee et.al. | 2311.06202v1 | null |
2023-11-10 | An Automated Pipeline for Tumour-Infiltrating Lymphocyte Scoring in Breast Cancer | Adam J Shephard et.al. | 2311.06185v1 | link |
2023-11-10 | Automatic Report Generation for Histopathology images using pre-trained Vision Transformers | Saurav Sengupta et.al. | 2311.06176v1 | null |
2023-11-10 | Two vertex geometrically irreducible algebras | Grzegorz Bobinski et.al. | 2311.06173v1 | null |
2023-11-10 | Time Scale Network: A Shallow Neural Network For Time Series Data | Trevor Meyer et.al. | 2311.06170v1 | null |
2023-11-10 | Deep Fast Vision: A Python Library for Accelerated Deep Transfer Learning Vision Prototyping | Fabi Prezja et.al. | 2311.06169v1 | link |
2023-11-10 | Going beyond persistent homology using persistent homology | Johanna Immonen et.al. | 2311.06152v1 | null |
2023-11-09 | FogROS2-Sky: Optimizing Latency and Cost for Multi-Cloud Robot Applications | Kaiyuan Chen et.al. | 2311.05600v1 | null |
2023-11-09 | A Coefficient Makes SVRG Effective | Yida Yin et.al. | 2311.05589v1 | link |
2023-11-09 | Outlier-Robust Wasserstein DRO | Sloan Nietert et.al. | 2311.05573v1 | link |
2023-11-09 | Exploring Emotion Expression Recognition in Older Adults Interacting with a Virtual Coach | Cristina Palmero et.al. | 2311.05567v1 | null |
2023-11-09 | Disentangling Quantum and Classical Contributions in Hybrid Quantum Machine Learning Architectures | Michael Kölle et.al. | 2311.05559v1 | null |
2023-11-09 | L-WaveBlock: A Novel Feature Extractor Leveraging Wavelets for Generative Adversarial Networks | Mirat Shah et.al. | 2311.05548v1 | null |
2023-11-09 | BakedAvatar: Baking Neural Fields for Real-Time Head Avatar Synthesis | Hao-Bin Duan et.al. | 2311.05521v1 | null |
2023-11-09 | Dirichlet Active Learning | Kevin Miller et.al. | 2311.05501v1 | null |
2023-11-09 | Retinal OCT Synthesis with Denoising Diffusion Probabilistic Models for Layer Segmentation | Yuli Wu et.al. | 2311.05479v1 | null |
2023-11-09 | Robust Retraining-free GAN Fingerprinting via Personalized Normalization | Jianwei Fei et.al. | 2311.05478v1 | null |
2023-11-08 | Towards Few-Annotation Learning in Computer Vision: Application to Image Classification and Object Detection tasks | Quentin Bouniot et.al. | 2311.04888v1 | null |
2023-11-08 | Are foundation models efficient for medical image segmentation? | Danielle Ferreira et.al. | 2311.04847v1 | null |
2023-11-08 | Bayesian multi-band fitting of alerts for kilonovae detection | Biswajit Biswas et.al. | 2311.04845v1 | null |
2023-11-08 | Hierarchically Gated Recurrent Neural Network for Sequence Modeling | Zhen Qin et.al. | 2311.04823v1 | link |
2023-11-08 | A Lightweight Architecture for Real-Time Neuronal-Spike Classification | Muhammad Ali Siddiqi et.al. | 2311.04808v1 | null |
2023-11-08 | Determination of toxic comments and unintended model bias minimization using Deep learning approach | Md Azim Khan et.al. | 2311.04789v1 | null |
2023-11-08 | VioLA: Aligning Videos to 2D LiDAR Scans | Jun-Jee Chao et.al. | 2311.04783v1 | null |
2023-11-08 | FetMRQC: an open-source machine learning framework for multi-centric fetal brain MRI quality control | Thomas Sanchez et.al. | 2311.04780v1 | link |
2023-11-08 | GCS-ICHNet: Assessment of Intracerebral Hemorrhage Prognosis using Self-Attention with Domain Knowledge Integration | Xuhao Shan et.al. | 2311.04772v1 | link |
2023-11-08 | An attention-based deep learning network for predicting Platinum resistance in ovarian cancer | Haoming Zhuang et.al. | 2311.04769v1 | null |
2023-11-08 | Video Instance Matting | Jiachen Li et.al. | 2311.04212v2 | link |
2023-11-07 | JPAVE: A Generation and Classification-based Model for Joint Product Attribute Prediction and Value Extraction | Zhongfen Deng et.al. | 2311.04196v1 | link |
2023-11-07 | Linear to circular conversion in the polarized radio emission of a magnetar | Marcus E. Lower et.al. | 2311.04195v1 | null |
2023-11-07 | SpaDeLeF: A Dataset for Hierarchical Classification of Lexical Functions for Collocations in Spanish | Yevhen Kostiuk et.al. | 2311.04189v1 | null |
2023-11-07 | A Simple Interpretable Transformer for Fine-Grained Image Classification and Analysis | Dipanjyoti Paul et.al. | 2311.04157v1 | link |
2023-11-07 | Galaxy Spectra neural Network (GaSNet). II. Using Deep Learning for Spectral Classification and Redshift Predictions | Fucheng Zhong et.al. | 2311.04146v1 | null |
2023-11-07 | I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models | Shiwei Zhang et.al. | 2311.04145v1 | null |
2023-11-07 | Modelling Sentiment Analysis: LLMs and data augmentation techniques | Guillem Senabre Prades et.al. | 2311.04139v1 | null |
2023-11-07 | Improved Topological Preservation in 3D Axon Segmentation and Centerline Detection using Geometric Assessment-driven Topological Smoothing (GATS) | Nina I. Shamsi et.al. | 2311.04116v1 | null |
2023-11-07 | Joint modelling of recurrent and terminal events with discretely-distributed non-parametric frailty: application on re-hospitalizations and death in heart failure patients | Chiara Masci et.al. | 2311.04103v1 | null |
2023-11-06 | A Classification of Graphs through Quadratic Embedding Constants and Clique Graph Insights | Edy Tri Baskoro et.al. | 2311.03342v1 | null |
2023-11-06 | Tackling Concept Shift in Text Classification using Entailment-style Modeling | Sumegh Roychowdhury et.al. | 2311.03320v1 | null |
2023-11-06 | A Foundation Model for Music Informatics | Minz Won et.al. | 2311.03318v1 | link |
2023-11-06 | FATE: Feature-Agnostic Transformer-based Encoder for learning generalized embedding spaces in flow cytometry data | Lisa Weijler et.al. | 2311.03314v1 | link |
2023-11-06 | A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation | Qitao Zhao et.al. | 2311.03312v1 | null |
2023-11-06 | Advancing Post Hoc Case Based Explanation with Feature Highlighting | Eoin Kenny et.al. | 2311.03246v1 | null |
2023-11-06 | Machine Learning-Based Tea Leaf Disease Detection: A Comprehensive Review | Faruk Ahmed et.al. | 2311.03240v1 | null |
2023-11-06 | Out-of-distribution Detection Learning with Unreliable Out-of-distribution Sources | Haotian Zheng et.al. | 2311.03236v1 | null |
2023-11-06 | Segmentation of Drone Collision Hazards in Airborne RADAR Point Clouds Using PointNet | Hector Arroyo et.al. | 2311.03221v1 | null |
2023-11-06 | Leveraging Transformers to Improve Breast Cancer Classification and Risk Assessment with Multi-modal and Longitudinal Data | Yiqiu Shen et.al. | 2311.03217v1 | null |
2023-11-03 | LOTUS: Continual Imitation Learning for Robot Manipulation Through Unsupervised Skill Discovery | Weikang Wan et.al. | 2311.02058v1 | null |
2023-11-03 | MetaFast: Enabling Fast Metagenomic Classification via Seed Counting and Edit Distance Approximation | Arvid E. Gollwitzer et.al. | 2311.02029v1 | null |
2023-11-03 | A Structured Pruning Algorithm for Model-based Deep Learning | Chicago Park et.al. | 2311.02003v1 | null |
2023-11-03 | Detection of keratoconus Diseases using deep Learning | AKM Enzam-Ul Haque et.al. | 2311.01996v1 | null |
2023-11-03 | Obtaining Explainable Classification Models using Distributionally Robust Optimization | Sanjeeb Dash et.al. | 2311.01994v1 | null |
2023-11-03 | Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation | Shichao Dong et.al. | 2311.01989v1 | null |
2023-11-06 | RT-Trajectory: Robotic Task Generalization via Hindsight Trajectory Sketches | Jiayuan Gu et.al. | 2311.01977v2 | null |
2023-11-03 | Welded graphs, Wirtinger groups and knotted punctured spheres | Benjamin Audoux et.al. | 2311.01922v1 | null |
2023-11-03 | Contrast-Agnostic Groupwise Registration by Robust PCA for Quantitative Cardiac MRI | Xinqi Li et.al. | 2311.01916v1 | null |
2023-11-03 | VQPy: An Object-Oriented Approach to Modern Video Analytics | Shan Yu et.al. | 2311.01623v1 | null |
2023-11-02 | Tailoring Mixup to Data using Kernel Warping functions | Quentin Bouniot et.al. | 2311.01434v1 | link |
2023-11-02 | Identifying Alzheimer Disease Dementia Levels Using Machine Learning Methods | Md Gulzar Hussain et.al. | 2311.01428v1 | null |
2023-11-02 | Exploring Deep Learning Techniques for Glaucoma Detection: A Comprehensive Review | Aized Amin Soofi et.al. | 2311.01425v1 | null |
2023-11-02 | Holistic Transfer: Towards Non-Disruptive Fine-Tuning with Partial Target Data | Cheng-Hao Tu et.al. | 2311.01420v1 | null |
2023-11-02 | Learning to See Physical Properties with Active Sensing Motor Policies | Gabriel B. Margolis et.al. | 2311.01405v1 | null |
2023-11-02 | Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors | Gabriele M. Caddeo et.al. | 2311.01380v1 | link |
2023-11-02 | Deep learning based Image Compression for Microscopy Images: An Empirical Study | Yu Zhou et.al. | 2311.01352v1 | null |
2023-11-02 | Unreading Race: Purging Protected Features from Chest X-ray Embeddings | Tobias Weber et.al. | 2311.01349v1 | null |
2023-11-02 | Scattering Vision Transformer: Spectral Mixing Matters | Badri N. Patro et.al. | 2311.01310v1 | null |
2023-11-02 | Hybrid-Fusion Transformer for Multisequence MRI | Jihoon Cho et.al. | 2311.01308v1 | null |
2023-11-01 | Software Repositories and Machine Learning Research in Cyber Security | Mounika Vanamala et.al. | 2311.00691v1 | null |
2023-11-01 | What User Behaviors Make the Differences During the Process of Visual Analytics? | Shahin Doroudian et.al. | 2311.00690v1 | null |
2023-11-01 | Deep Learning-Based Classification of Gamma Photon Interactions in Room-Temperature Semiconductor Radiation Detectors | Sandeep K. Chaudhuri et.al. | 2311.00682v1 | null |
2023-11-01 | Latent Space Translation via Semantic Alignment | Valentino Maiorca et.al. | 2311.00664v1 | link |
2023-11-01 | Rediscussion of eclipsing binaries. Paper XV. The B-type supergiant system V1765 Cygni | John Southworth et.al. | 2311.00655v1 | null |
2023-11-02 | Emergence of Collective Open-Ended Exploration from Decentralized Meta-Reinforcement Learning | Richard Bornemann et.al. | 2311.00651v2 | null |
2023-11-01 | Understanding the Issues and Causes in WebAssembly Application Development: A Mining-based Study | Muhammad Waseem et.al. | 2311.00646v1 | null |
2023-11-01 | A Bi-level Framework for Traffic Accident Duration Prediction: Leveraging Weather and Road Condition Data within a Practical Optimum Pipeline | Rafat Tabassum Sukonna et.al. | 2311.00634v1 | null |
2023-11-01 | Controllable Music Production with Diffusion Models and Guidance Gradients | Mark Levy et.al. | 2311.00613v1 | null |
2023-11-01 | A Robust Deep Learning Method with Uncertainty Estimation for the Pathological Classification of Renal Cell Carcinoma based on CT Images | Ni Yao et.al. | 2311.00567v1 | null |
2023-10-31 | Limited Data, Unlimited Potential: A Study on ViTs Augmented by Masked Autoencoders | Srijan Das et.al. | 2310.20704v1 | null |
2023-10-31 | SEINE: Short-to-Long Video Diffusion Model for Generative Transition and Prediction | Xinyuan Chen et.al. | 2310.20700v1 | null |
2023-10-31 | StairNet: Visual Recognition of Stairs for Human-Robot Locomotion | Andrew Garrett Kurbis et.al. | 2310.20666v1 | null |
2023-10-31 | Performance Improvement in Multi-class Classification via Automated Hierarchy Generation and Exploitation through Extended LCPN Schemes | Celal Alagoz et.al. | 2310.20641v1 | null |
2023-10-31 | Deepfake detection by exploiting surface anomalies: the SurFake approach | Andrea Ciamarra et.al. | 2310.20621v1 | null |
2023-10-31 | Enhanced Synthetic MRI Generation from CT Scans Using CycleGAN with Feature Extraction | Saba Nikbakhsh et.al. | 2310.20604v1 | null |
2023-10-31 | Finiteness properties for Shimura curves and modified diagonal cycles | Congling Qiu et.al. | 2310.20600v1 | null |
2023-10-31 | Brain-like Flexible Visual Inference by Harnessing Feedback-Feedforward Alignment | Tahereh Toosi et.al. | 2310.20599v1 | link |
2023-10-31 | Tracially Complete C-Algebras* | José R. Carrión et.al. | 2310.20594v1 | null |
2023-10-31 | Strongly Magnetized Tidal Disruption Event Disks via Stream Injection in GRMHD | Brandon Curd et.al. | 2310.20592v1 | null |
2023-10-29 | Improved Motor Imagery Classification Using Adaptive Spatial Filters Based on Particle Swarm Optimization Algorithm | Xiong Xiong et.al. | 2310.19202v1 | null |
2023-10-29 | Enhancing Motor Imagery Decoding in Brain Computer Interfaces using Riemann Tangent Space Mapping and Cross Frequency Coupling | Xiong Xiong et.al. | 2310.19198v1 | null |
2023-10-29 | A Survey on Watching Social Issue Videos among YouTube and TikTok Users | Shuo Niu et.al. | 2310.19193v1 | null |
2023-10-29 | Subjective Quality Evaluation of Point Clouds Using a Head Mounted Display | Joao Prazeres et.al. | 2310.19179v1 | null |
2023-10-29 | Robustifying Language Models with Test-Time Adaptation | Noah Thomas McDermott et.al. | 2310.19177v1 | null |
2023-10-29 | Predicting recovery following stroke: deep learning, multimodal data and feature selection using explainable AI | Adam White et.al. | 2310.19174v1 | null |
2023-10-29 | BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping | Srikumar Sastry et.al. | 2310.19168v1 | link |
2023-10-29 | Unified Representation for Non-compositional and Compositional Expressions | Ziheng Zeng et.al. | 2310.19127v1 | null |
2023-10-29 | Efficient IoT Inference via Context-Awareness | Mohammad Mehdi Rastikerdar et.al. | 2310.19112v1 | null |
2023-10-29 | Pushdown Layers: Encoding Recursive Structure in Transformer Language Models | Shikhar Murty et.al. | 2310.19089v1 | null |
2023-10-27 | Addressing GAN Training Instabilities via Tunable Classification Losses | Monica Welfert et.al. | 2310.18291v1 | null |
2023-10-27 | PlantPlotGAN: A Physics-Informed Generative Adversarial Network for Plant Disease Prediction | Felipe A. Lopes et.al. | 2310.18268v1 | null |
2023-10-27 | MalFake: A Multimodal Fake News Identification for Malayalam using Recurrent Neural Networks and VGG-16 | Adhish S. Sujan et.al. | 2310.18263v1 | null |
2023-10-27 | Edge AI-Based Vein Detector for Efficient Venipuncture in the Antecubital Fossa | Edwin Salcedo et.al. | 2310.18234v1 | null |
2023-10-27 | TBDLNet: a network for classifying multidrug-resistant and drug-sensitive tuberculosis | Ziquan Zhu et.al. | 2310.18222v1 | null |
2023-10-27 | ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Benjamin Feuer et.al. | 2310.18208v1 | link |
2023-10-27 | Artifact-Robust Graph-Based Learning in Digital Pathology | Saba Heidari Gheshlaghi et.al. | 2310.18192v1 | null |
2023-10-27 | Globular clusters and bar: captured or not captured? | Anton A. Smirnov et.al. | 2310.18172v1 | null |
2023-10-27 | Style Description based Text-to-Speech with Conditional Prosodic Layer Normalization based Diffusion GAN | Neeraj Kumar et.al. | 2310.18169v1 | null |
2023-10-27 | DESiRED -- Dynamic, Enhanced, and Smart iRED: A P4-AQM with Deep Reinforcement Learning and In-band Network Telemetry | Leandro C. de Almeida et.al. | 2310.18159v1 | null |
2023-10-26 | A Coarse-to-Fine Pseudo-Labeling (C2FPL) Framework for Unsupervised Video Anomaly Detection | Anas Al-lahham et.al. | 2310.17650v1 | null |
2023-10-26 | torchdistill Meets Hugging Face Libraries for Reproducible, Coding-Free Deep Learning Studies: A Case Study on NLP | Yoshitomo Matsubara et.al. | 2310.17644v1 | link |
2023-10-26 | Drive Anywhere: Generalizable End-to-end Autonomous Driving with Multi-modal Foundation Models | Tsun-Hsuan Wang et.al. | 2310.17642v1 | null |
2023-10-26 | Skew Products on the Berkovich Projective Line | Richard A. P. Birkett et.al. | 2310.17628v1 | null |
2023-10-26 | A Survey on Transferability of Adversarial Examples across Deep Neural Networks | Jindong Gu et.al. | 2310.17626v1 | link |
2023-10-26 | MimicGen: A Data Generation System for Scalable Robot Learning using Human Demonstrations | Ajay Mandlekar et.al. | 2310.17596v1 | null |
2023-10-26 | Linear |
Jerson Caro et.al. | 2310.17592v1 | null |
2023-10-26 | A minimax optimal control approach for robust neural ODEs | Cristina Cipriani et.al. | 2310.17584v1 | null |
2023-10-26 | BLIS-Net: Classifying and Analyzing Signals on Graphs | Charles Xu et.al. | 2310.17579v1 | null |
2023-10-26 | Knots bounding non-isotopic ribbon disks | Jeffrey Meier et.al. | 2310.17564v1 | null |
2023-10-25 | RDBench: ML Benchmark for Relational Databases | Zizhao Zhang et.al. | 2310.16837v1 | link |
2023-10-25 | TD-MPC2: Scalable, Robust World Models for Continuous Control | Nicklas Hansen et.al. | 2310.16828v1 | null |
2023-10-26 | Deep machine learning for meteor monitoring: advances with transfer learning and gradient-weighted class activation mapping | Eloy Peña-Asensio et.al. | 2310.16826v2 | null |
2023-10-25 | Uncovering a new group of T Tauri stars in the Taurus-Auriga molecular complex from Gaia and GALEX data | Ana Inés Gómez de Castro et.al. | 2310.16820v1 | null |
2023-10-25 | Using Diffusion Models to Generate Synthetic Labelled Data for Medical Image Segmentation | Daniel Saragih et.al. | 2310.16794v1 | null |
2023-10-25 | Navigating Socio-Emotional Risk through Comfort-Building in a Physics Teaching Community of Practice: A Case Study | Maggie Mahmood et.al. | 2310.16778v1 | null |
2023-10-25 | IntenDD: A Unified Contrastive Learning Approach for Intent Detection and Discovery | Bhavuk Singhal et.al. | 2310.16761v1 | null |
2023-10-25 | Interferometric Neural Networks | Arun Sehrawat et.al. | 2310.16742v1 | link |
2023-10-25 | A No-Reference Quality Assessment Method for Digital Human Head | Yingjie Zhou et.al. | 2310.16732v1 | null |
2023-10-25 | Spherical Wavefront Near-Field DoA Estimation in THz Automotive Radar | Ahmet M. Elbir et.al. | 2310.16724v1 | null |
2023-10-24 | From Posterior Sampling to Meaningful Diversity in Image Restoration | Noa Cohen et.al. | 2310.16047v1 | null |
2023-10-24 | Finetuning Offline World Models in the Real World | Yunhai Feng et.al. | 2310.16029v1 | null |
2023-10-24 | Human-in-the-Loop Task and Motion Planning for Imitation Learning | Ajay Mandlekar et.al. | 2310.16014v1 | null |
2023-10-24 | CVPR 2023 Text Guided Video Editing Competition | Jay Zhangjie Wu et.al. | 2310.16003v1 | null |
2023-10-24 | Vision-Language Pseudo-Labels for Single-Positive Multi-Label Learning | Xin Xing et.al. | 2310.15985v1 | link |
2023-10-24 | Geometry-Aware Video Quality Assessment for Dynamic Digital Human | Zicheng Zhang et.al. | 2310.15984v1 | null |
2023-10-24 | Minimax Forward and Backward Learning of Evolving Tasks with Performance Guarantees | Verónica Álvarez et.al. | 2310.15974v1 | link |
2023-10-24 | Decoupled DETR: Spatially Disentangling Localization and Classification for Improved End-to-End Object Detection | Manyuan Zhang et.al. | 2310.15955v1 | null |
2023-10-25 | Improving Robustness and Reliability in Medical Image Classification with Latent-Guided Diffusion and Nested-Ensembles | Xing Shen et.al. | 2310.15952v2 | null |
2023-10-24 | ShARc: Shape and Appearance Recognition for Person Identification In-the-wild | Haidong Zhu et.al. | 2310.15946v1 | null |
2023-10-23 | FreeNoise: Tuning-Free Longer Video Diffusion Via Noise Rescheduling | Haonan Qiu et.al. | 2310.15169v1 | null |
2023-10-23 | Bitrate Ladder Prediction Methods for Adaptive Video Streaming: A Review and Benchmark | Ahmed Telili et.al. | 2310.15163v1 | null |
2023-10-23 | Linear Representations of Sentiment in Large Language Models | Curt Tigges et.al. | 2310.15154v1 | null |
2023-10-23 | Unlocking the Transferability of Tokens in Deep Models for Tabular Data | Qi-Le Zhou et.al. | 2310.15149v1 | null |
2023-10-23 | When Should the FDA Inspect Pharmaceutical Manufacturing Facilities to Better Mitigate Drug Shortages? | Daniel Kosmas et.al. | 2310.15146v1 | null |
2023-10-23 | Novel-View Acoustic Synthesis from 3D Reconstructed Rooms | Byeongjoo Ahn et.al. | 2310.15130v1 | link |
2023-10-23 | Open-Ended Instructable Embodied Agents with Memory-Augmented Large Language Models | Gabriel Sarch et.al. | 2310.15127v1 | null |
2023-10-23 | SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution | Weihao Lin et.al. | 2310.15115v1 | null |
2023-10-23 | The Self 2.0: How AI-Enhanced Self-Clones Transform Self-Perception and Improve Presentation Skills | Qingxiao Zheng et.al. | 2310.15112v1 | null |
2023-10-23 | Matryoshka Diffusion Models | Jiatao Gu et.al. | 2310.15111v1 | null |
2023-10-20 | Using Human-like Mechanism to Weaken Effect of Pre-training Weight Bias in Face-Recognition Convolutional Neural Network | Haojiang Ying et.al. | 2310.13674v1 | null |
2023-10-23 | Explainable Depression Symptom Detection in Social Media | Eliseo Bao Souto et.al. | 2310.13664v2 | null |
2023-10-20 | Arabic Dialect Identification under Scrutiny: Limitations of Single-label Classification | Amr Keleg et.al. | 2310.13661v1 | link |
2023-10-20 | Optimal Transport for Measures with Noisy Tree Metric | Tam Le et.al. | 2310.13653v1 | null |
2023-10-20 | Principal |
Shigeo Koshitani et.al. | 2310.13621v1 | null |
2023-10-20 | Skin Lesion Segmentation Improved by Transformer-based Networks with Inter-scale Dependency Modeling | Sania Eskandari et.al. | 2310.13604v1 | link |
2023-10-20 | Classification of quantum states of light using random measurements through a multimode fiber | Saroch Leedumrongwatthanakun et.al. | 2310.13599v1 | null |
2023-10-20 | Longer-range Contextualized Masked Autoencoder | Taekyung Kim et.al. | 2310.13593v1 | null |
2023-10-20 | POTLoc: Pseudo-Label Oriented Transformer for Point-Supervised Temporal Action Localization | Elahe Vahdani et.al. | 2310.13585v1 | null |
2023-10-20 | Progressive Dual Priori Network for Generalized Breast Tumor Segmentation | Li Wang et.al. | 2310.13574v1 | null |
2023-10-19 | Putting the Object Back into Video Object Segmentation | Ho Kei Cheng et.al. | 2310.12982v1 | link |
2023-10-19 | Variational Inference for SDEs Driven by Fractional Noise | Rembert Daems et.al. | 2310.12975v1 | null |
2023-10-19 | Frozen Transformers in Language Models Are Effective Visual Encoder Layers | Ziqi Pang et.al. | 2310.12973v1 | link |
2023-10-19 | Bialgebra structures on flat Lie algebras | Amine Bahayou et.al. | 2310.12966v1 | null |
2023-10-19 | End-to-End Delay Minimization based on Joint Optimization of DNN Partitioning and Resource Allocation for Cooperative Edge Inference | Xinrui Ye et.al. | 2310.12937v1 | null |
2023-10-19 | Digital Twin-Enabled Intelligent DDoS Detection Mechanism for Autonomous Core Networks | Yagmur Yigit et.al. | 2310.12924v1 | null |
2023-10-19 | Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning | Juan Rocamonde et.al. | 2310.12921v1 | null |
2023-10-19 | Unsupervised Object Localization in the Era of Self-Supervised ViTs: A Survey | Oriane Siméoni et.al. | 2310.12904v1 | link |
2023-10-19 | A Markovian dynamics for |
Antonio C. Costa et.al. | 2310.12883v1 | link |
2023-10-19 | Perceptual Assessment and Optimization of High Dynamic Range Image Rendering | Peibei Cao et.al. | 2310.12877v1 | null |
2023-10-18 | SHARCS: Efficient Transformers through Routing with Dynamic Width Sub-networks | Mohammadreza Salehi et.al. | 2310.12126v1 | null |
2023-10-18 | Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture | Daniel Y. Fu et.al. | 2310.12109v1 | null |
2023-10-18 | HSTR-Net: Reference Based Video Super-resolution for Aerial Surveillance with Dual Cameras | H. Umut Suluhan et.al. | 2310.12092v1 | null |
2023-10-18 | Chemical Analysis of the Brightest Star of the Cetus II Ultra-Faint Dwarf Galaxy Candidate | K. B. Webber et.al. | 2310.12090v1 | null |
2023-10-18 | One-Shot Imitation Learning: A Pose Estimation Perspective | Pietro Vitiello et.al. | 2310.12077v1 | null |
2023-10-18 | Exploring Fairness in Pre-trained Visual Transformer based Natural and GAN Generated Image Detection Systems and Understanding the Impact of Image Compression in Fairness | Manjary P. Gangan et.al. | 2310.12076v1 | null |
2023-10-18 | Black-Box Training Data Identification in GANs via Detector Networks | Lukman Olagoke et.al. | 2310.12063v1 | null |
2023-10-19 | Robust Class-Conditional Distribution Alignment for Partial Domain Adaptation | Sandipan Choudhuri et.al. | 2310.12060v2 | null |
2023-10-18 | Exact and efficient solutions of the LMC Multitask Gaussian Process model | Olivier Truffinet et.al. | 2310.12032v1 | link |
2023-10-18 | CORE: A Few-Shot Company Relation Classification Dataset for Robust Domain Adaptation | Philipp Borchert et.al. | 2310.12024v1 | link |
2023-10-17 | DELIFFAS: Deformable Light Fields for Fast Avatar Synthesis | Youngjoong Kwon et.al. | 2310.11449v1 | null |
2023-10-18 | 4K4D: Real-Time 4D View Synthesis at 4K Resolution | Zhen Xu et.al. | 2310.11448v2 | null |
2023-10-18 | EvalCrafter: Benchmarking and Evaluating Large Video Generation Models | Yaofang Liu et.al. | 2310.11440v2 | null |
2023-10-17 | Transitive generalized toggle groups containing a cycle | Jonathan S. Bloom et.al. | 2310.11387v1 | null |
2023-10-17 | DialogueLLM: Context and Emotion Knowledge-Tuned LLaMA Models for Emotion Recognition in Conversations | Yazhou Zhang et.al. | 2310.11374v1 | null |
2023-10-17 | VECHR: A Dataset for Explainable and Robust Classification of Vulnerability Type in the European Court of Human Rights | Shanshan Xu et.al. | 2310.11368v1 | null |
2023-10-17 | Lie Group Decompositions for Equivariant Neural Networks | Mircea Mironenco et.al. | 2310.11366v1 | null |
2023-10-17 | Hybrid quantum-classical graph neural networks for tumor classification in digital pathology | Anupama Ray et.al. | 2310.11353v1 | null |
2023-10-17 | The effect of stemming and lemmatization on Portuguese fake news text classification | Lucca de Freitas Santos et.al. | 2310.11344v1 | null |
2023-10-17 | Influencing factors on false positive rates when classifying tumor cell line response to drug treatment | Priyanka Vasanthakumari et.al. | 2310.11329v1 | null |
2023-10-16 | A Survey on Video Diffusion Models | Zhen Xing et.al. | 2310.10647v1 | link |
2023-10-16 | Real-time Photorealistic Dynamic Scene Representation and Rendering with 4D Gaussian Splatting | Zeyu Yang et.al. | 2310.10642v1 | link |
2023-10-16 | Zero-Shot Robotic Manipulation with Pretrained Image-Editing Diffusion Models | Kevin Black et.al. | 2310.10639v1 | null |
2023-10-16 | Efficacy of Dual-Encoders for Extreme Multi-Label Classification | Nilesh Gupta et.al. | 2310.10636v1 | null |
2023-10-16 | Overcoming the Rayleigh limit in extremely low SNR | Hyunsoo Choi et.al. | 2310.10633v1 | null |
2023-10-16 | Video Language Planning | Yilun Du et.al. | 2310.10625v1 | null |
2023-10-16 | DynVideo-E: Harnessing Dynamic NeRF for Large-Scale Motion- and View-Change Human-Centric Video Editing | Jia-Wei Liu et.al. | 2310.10624v1 | null |
2023-10-16 | BiLL-VTG: Bridging Large Language Models and Lightweight Visual Tools for Video-based Texts Generation | Ji Qi et.al. | 2310.10586v1 | null |
2023-10-16 | RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets | Zhicheng Cai et.al. | 2310.10563v1 | link |
2023-10-16 | Deep learning applied to EEG data with different montages using spatial attention | Dung Truong et.al. | 2310.10550v1 | null |
2023-10-13 | An Unbiased Look at Datasets for Visuo-Motor Pre-Training | Sudeep Dasari et.al. | 2310.09289v1 | null |
2023-10-13 | Disentangled Latent Spaces Facilitate Data-Driven Auxiliary Learning | Geri Skenderi et.al. | 2310.09278v1 | null |
2023-10-13 | A Hybrid Approach for Depression Classification: Random Forest-ANN Ensemble on Motor Activity Signals | Anket Patil et.al. | 2310.09277v1 | null |
2023-10-13 | PromptRE: Weakly-Supervised Document-Level Relation Extraction via Prompting-Based Data Programming | Chufan Gao et.al. | 2310.09265v1 | null |
2023-10-13 | Political claim identification and categorization in a multilingual setting: First experiments | Urs Zaberer et.al. | 2310.09256v1 | null |
2023-10-13 | It's an Alignment, Not a Trade-off: Revisiting Bias and Variance in Deep Models | Lin Chen et.al. | 2310.09250v1 | null |
2023-10-13 | A Multifaceted Look at Starlink Performance | Nitinder Mohan et.al. | 2310.09242v1 | null |
2023-10-13 | Time CNN and Graph Convolution Network for Epileptic Spike Detection in MEG Data | Pauline Mouches et.al. | 2310.09236v1 | null |
2023-10-13 | Ultrasound Image Segmentation of Thyroid Nodule via Latent Semantic Feature Co-Registration | Xuewei Li et.al. | 2310.09221v1 | null |
2023-10-13 | PaLI-3 Vision Language Models: Smaller, Faster, Stronger | Xi Chen et.al. | 2310.09199v1 | null |
2023-10-12 | Octopus: Embodied Vision-Language Programmer from Environmental Feedback | Jingkang Yang et.al. | 2310.08588v1 | link |
2023-10-12 | Is Generalized Dynamic Novel View Synthesis from Monocular Videos Possible Today? | Xiaoming Zhao et.al. | 2310.08587v1 | null |
2023-10-12 | Im4D: High-Fidelity and Real-Time Novel View Synthesis for Dynamic Scenes | Haotong Lin et.al. | 2310.08585v1 | null |
2023-10-12 | Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video | Shashanka Venkataramanan et.al. | 2310.08584v1 | null |
2023-10-12 | Universal Visual Decomposer: Long-Horizon Manipulation Made Easy | Zichen Zhang et.al. | 2310.08581v1 | null |
2023-10-12 | Learning to Act from Actionless Videos through Dense Correspondences | Po-Chen Ko et.al. | 2310.08576v1 | null |
2023-10-12 | Effective isometries of periodic shells | Hussein Nassar et.al. | 2310.08531v1 | null |
2023-10-12 | LLM-augmented Preference Learning from Natural Language | Inwon Kang et.al. | 2310.08523v1 | null |
2023-10-12 | Impact of time and note duration tokenizations on deep learning symbolic music modeling | Nathan Fradet et.al. | 2310.08497v1 | link |
2023-10-12 | GraphextQA: A Benchmark for Evaluating Graph-Enhanced Large Language Models | Yuanchun Shen et.al. | 2310.08487v1 | link |
2023-10-11 | ScaleCrafter: Tuning-free Higher-Resolution Visual Generation with Diffusion Models | Yingqing He et.al. | 2310.07702v1 | link |
2023-10-11 | ConditionVideo: Training-Free Condition-Guided Text-to-Video Generation | Bo Peng et.al. | 2310.07697v1 | null |
2023-10-11 | Large-scale photonic computing with nonlinear disordered media | Hao Wang et.al. | 2310.07690v1 | null |
2023-10-11 | Deep Video Inpainting Guided by Audio-Visual Self-Supervision | Kyuyeon Kim et.al. | 2310.07663v1 | null |
2023-10-11 | Hypercomplex Multimodal Emotion Recognition from EEG and Peripheral Physiological Signals | Eleonora Lopez et.al. | 2310.07648v1 | null |
2023-10-11 | Attention-Map Augmentation for Hypercomplex Breast Cancer Classification | Eleonora Lopez et.al. | 2310.07633v1 | null |
2023-10-11 | Differentiable Euler Characteristic Transforms for Shape Classification | Ernst Roell et.al. | 2310.07630v1 | link |
2023-10-11 | Time-Resolved Reconstruction of Motion, Force, and Stiffness using Spectro-Dynamic MRI | Max H. C. van Riel et.al. | 2310.07622v1 | null |
2023-10-11 | Reinforcement Learning-based Knowledge Graph Reasoning for Explainable Fact-checking | Gustav Nikopensius et.al. | 2310.07613v1 | null |
2023-10-11 | QACHECK: A Demonstration System for Question-Guided Multi-Hop Fact-Checking | Liangming Pan et.al. | 2310.07609v1 | link |
2023-10-10 | Convivial Solipsism as a maximally perspectival interpretation | Herve Zwirn et.al. | 2310.06815v1 | null |
2023-10-10 | A Supervised Embedding and Clustering Anomaly Detection method for classification of Mobile Network Faults | R. Mosayebi et.al. | 2310.06779v1 | null |
2023-10-10 | Optical assembly of nanostructures mediated by surface roughness | Robert G. Felsted et.al. | 2310.06774v1 | null |
2023-10-10 | Uni3D: Exploring Unified 3D Representation at Scale | Junsheng Zhou et.al. | 2310.06773v1 | link |
2023-10-10 | Improved convergence rates for some kernel random forest algorithms | Isidoros Iakovidis et.al. | 2310.06760v1 | null |
2023-10-10 | Geographic Location Encoding with Spherical Harmonics and Sinusoidal Representation Networks | Marc Rußwurm et.al. | 2310.06743v1 | link |
2023-10-10 | Multi-domain improves out-of-distribution and data-limited scenarios for medical image analysis | Ece Ozkan et.al. | 2310.06737v1 | null |
2023-10-10 | S4Sleep: Elucidating the design space of deep-learning-based sleep stage classification models | Tiezhi Wang et.al. | 2310.06715v1 | link |
2023-10-10 | Tertiary Lymphoid Structures Generation through Graph-based Diffusion | Manuel Madeira et.al. | 2310.06661v1 | null |
2023-10-10 | Assessing the Impact of a Supervised Classification Filter on Flow-based Hybrid Network Anomaly Detection | Dominik Macko et.al. | 2310.06656v1 | link |
2023-10-09 | FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing | Yuren Cong et.al. | 2310.05922v1 | null |
2023-10-09 | Enumerating Calabi-Yau Manifolds: Placing bounds on the number of diffeomorphism classes in the Kreuzer-Skarke list | Aditi Chandra et.al. | 2310.05909v1 | null |
2023-10-09 | ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models | Kaiwen Zhou et.al. | 2310.05872v1 | null |
2023-10-10 | Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models | Guangzhi Sun et.al. | 2310.05863v2 | link |
2023-10-09 | Latent Wander: an Alternative Interface for Interactive and Serendipitous Discovery of Large AV Archives | Yuchen Yang et.al. | 2310.05835v1 | null |
2023-10-09 | Write What You Want: Applying Text-to-video Retrieval to Audiovisual Archives | Yuchen Yang et.al. | 2310.05825v1 | null |
2023-10-09 | Dipole-Spread Function Engineering for 6D Super-Resolution Microscopy | Tingting Wu et.al. | 2310.05810v1 | null |
2023-10-09 | A Simple Open-Loop Baseline for Reinforcement Learning Locomotion Tasks | Antonin Raffin et.al. | 2310.05808v1 | null |
2023-10-09 | Learning Language-guided Adaptive Hyper-modality Representation for Multimodal Sentiment Analysis | Haoyu Zhang et.al. | 2310.05804v1 | null |
2023-10-10 | Two-timescale Derivative Free Optimization for Performative Prediction with Markovian Data | Haitong Liu et.al. | 2310.05792v2 | null |
2023-10-06 | Exploiting Transformer Activation Sparsity with Dynamic Inference | Mikołaj Piórczyński et.al. | 2310.04361v1 | null |
2023-10-06 | SwimXYZ: A large-scale dataset of synthetic swimming motions and videos | Fiche Guénolé et.al. | 2310.04360v1 | null |
2023-10-06 | Large-Scale Korean Text Dataset for Classifying Biased Speech in Real-World Online Services | Dasol Choi et.al. | 2310.04313v1 | null |
2023-10-06 | Convergent ADMM Plug and Play PET Image Reconstruction | Florent Sureau et.al. | 2310.04299v1 | null |
2023-10-06 | A Plug-and-Play Image Registration Network | Junhao Hu et.al. | 2310.04297v1 | null |
2023-10-06 | Towards Non-contact 3D Ultrasound for Wrist Imaging | Antony Jerald et.al. | 2310.04296v1 | null |
2023-10-06 | Spectroscopic variability of massive pre-main-sequence stars in M17 | A. R. Derkink et.al. | 2310.04287v1 | null |
2023-10-06 | Multi-Industry Simplex : A Probabilistic Extension of GICS | Maksim Papenkov et.al. | 2310.04280v1 | null |
2023-10-06 | Bringing Quantum Algorithms to Automated Machine Learning: A Systematic Review of AutoML Frameworks Regarding Extensibility for QML Algorithms | Dennis Klau et.al. | 2310.04238v1 | null |
2023-10-06 | Written and spoken corpus of real and fake social media postings about COVID-19 | Ng Bee Chin et.al. | 2310.04237v1 | null |
2023-10-05 | The Un-Kidnappable Robot: Acoustic Localization of Sneaking People | Mengyu Yang et.al. | 2310.03743v1 | null |
2023-10-05 | Agent Instructs Large Language Models to be General Zero-Shot Reasoners | Nicholas Crispino et.al. | 2310.03710v1 | link |
2023-10-05 | OMG-ATTACK: Self-Supervised On-Manifold Generation of Transferable Evasion Attacks | Ofir Bar Tal et.al. | 2310.03707v1 | null |
2023-10-05 | Role of Spatial Coherence in Diffractive Optical Neural Networks | Matthew J. Filipovich et.al. | 2310.03679v1 | null |
2023-10-05 | Certification of Deep Learning Models for Medical Image Segmentation | Othmane Laousy et.al. | 2310.03664v1 | null |
2023-10-05 | Autoregressive Coefficients based Intelligent Protection of Transmission Lines Connected to Type-3 Wind Farms | Pallav Kumar Bera et.al. | 2310.03663v1 | null |
2023-10-05 | Robustness-Guided Image Synthesis for Data-Free Quantization | Jianhong Bai et.al. | 2310.03661v1 | null |
2023-10-05 | Balancing Autonomy and Alignment: A Multi-Dimensional Taxonomy for Autonomous LLM-powered Multi-Agent Architectures | Thorsten Händler et.al. | 2310.03659v1 | null |
2023-10-05 | Strategic Evaluation: Subjects, Evaluators, and Society | Benjamin Laufer et.al. | 2310.03655v1 | null |
2023-10-05 | CLEVRER-Humans: Describing Physical and Causal Events the Human Way | Jiayuan Mao et.al. | 2310.03635v1 | null |
2023-10-04 | SemiReward: A General Reward Model for Semi-supervised Learning | Siyuan Li et.al. | 2310.03013v1 | link |
2023-10-04 | High-dimensional SGD aligns with emerging outlier eigenspaces | Gerard Ben Arous et.al. | 2310.03010v1 | null |
2023-10-05 | IBCL: Zero-shot Model Generation for Task Trade-offs in Continual Learning | Pengyuan Lu et.al. | 2310.02995v2 | link |
2023-10-04 | Multiple Physics Pretraining for Physical Surrogate Models | Michael McCabe et.al. | 2310.02994v1 | null |
2023-10-04 | UniverSLU: Universal Spoken Language Understanding for Diverse Classification and Sequence Generation Tasks with a Single Network | Siddhant Arora et.al. | 2310.02973v1 | null |
2023-10-04 | Fully Automatic Segmentation of Gross Target Volume and Organs-at-Risk for Radiotherapy Planning of Nasopharyngeal Carcinoma | Mehdi Astaraki et.al. | 2310.02972v1 | null |
2023-10-04 | Prompting and Adapter Tuning for Self-supervised Encoder-Decoder Speech Model | Kai-Wei Chang et.al. | 2310.02971v1 | null |
2023-10-05 | Co-modeling the Sequential and Graphical Routes for Peptide Representation Learning | Zihan Liu et.al. | 2310.02964v2 | link |
2023-10-04 | CoDA: Collaborative Novel Box Discovery and Cross-modal Alignment for Open-vocabulary 3D Object Detection | Yang Cao et.al. | 2310.02960v1 | link |
2023-10-04 | HappyFeat -- An interactive and efficient BCI framework for clinical applications | Arthur Desbois et.al. | 2310.02948v1 | null |
2023-10-03 | DREAM: Visual Decoding from Reversing Human Visual System | Weihao Xia et.al. | 2310.02265v1 | null |
2023-10-03 | RSRD: A Road Surface Reconstruction Dataset and Benchmark for Safe and Comfortable Autonomous Driving | Tong Zhao et.al. | 2310.02262v1 | null |
2023-10-03 | Harnessing Pre-Trained Sentence Transformers for Offensive Language Detection in Indian Languages | Ananya Joshi et.al. | 2310.02249v1 | null |
2023-10-04 | Tensor Programs VI: Feature Learning in Infinite-Depth Neural Networks | Greg Yang et.al. | 2310.02244v2 | null |
2023-10-03 | MIS-AVioDD: Modality Invariant and Specific Representation for Audio-Visual Deepfake Detection | Vinaya Sree Katamneni et.al. | 2310.02234v1 | null |
2023-10-03 | HoloNets: Spectral Convolutions do extend to Directed Graphs | Christian Koke et.al. | 2310.02232v1 | null |
2023-10-03 | Extraction of Medication and Temporal Relation from Clinical Text by Harnessing Different Deep Learning Models | Hangyu Tu et.al. | 2310.02229v1 | null |
2023-10-03 | Symmetry-based classification of exact flat bands in single and bilayer moiré systems | Siddhartha Sarkar et.al. | 2310.02218v1 | null |
2023-10-03 | Learnable Data Augmentation for One-Shot Unsupervised Domain Adaptation | Julio Ivan Davila Carrazco et.al. | 2310.02201v1 | null |
2023-10-03 | CNN photometric redshifts in the SDSS at |
M. Treyer et.al. | 2310.02173v1 | null |
2023-09-29 | A Large Language Model Approach to Educational Survey Feedback Analysis | Michael J. Parker et.al. | 2309.17447v1 | null |
2023-10-02 | LLM-grounded Video Diffusion Models | Long Lian et.al. | 2309.17444v2 | null |
2023-09-29 | Classification of Potholes Based on Surface Area Using Pre-Trained Models of Convolutional Neural Network | Chauhdary Fazeel Ahmad et.al. | 2309.17426v1 | null |
2023-09-29 | CNN-based automatic segmentation of Lumen & Media boundaries in IVUS images using closed polygonal chains | Pavel Sinha et.al. | 2309.17406v1 | null |
2023-09-29 | AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition | Andrew Rouditchenko et.al. | 2309.17395v1 | null |
2023-09-29 | Tree Cross Attention | Leo Feng et.al. | 2309.17388v1 | null |
2023-09-29 | Adversarial Imitation Learning from Visual Observations using Latent Information | Vittorio Giammarino et.al. | 2309.17371v1 | link |
2023-09-29 | SpinView: General interactive visual analysis tool for multiscale computational magnetism | Qichen Xu et.al. | 2309.17367v1 | null |
2023-09-29 | Asynchronous Graph Generators | Christopher P. Ley et.al. | 2309.17335v1 | null |
2023-09-29 | Multi-Depth Branches Network for Efficient Image Super-Resolution | Huiyuan Tian et.al. | 2309.17334v1 | link |
2023-09-29 | Demystifying CLIP Data | Hu Xu et.al. | 2309.16671v2 | link |
2023-09-28 | Decaf: Monocular Deformation Capture for Face and Hand Interactions | Soshi Shimada et.al. | 2309.16670v1 | null |
2023-09-28 | Training a Large Video Model on a Single Machine in a Day | Yue Zhao et.al. | 2309.16669v1 | link |
2023-09-28 | Novel Deep Learning Pipeline for Automatic Weapon Detection | Haribharathi Sivakumar et.al. | 2309.16654v1 | null |
2023-09-28 | ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning | Qiao Gu et.al. | 2309.16650v1 | null |
2023-09-29 | Mixup Your Own Pairs | Yilei Wu et.al. | 2309.16633v2 | link |
2023-09-28 | Class Activation Map-based Weakly supervised Hemorrhage Segmentation using Resnet-LSTM in Non-Contrast Computed Tomography images | Shreyas H Ramananda et.al. | 2309.16627v1 | null |
2023-09-28 | The twisting index in semitoric systems | Jaume Alonso et.al. | 2309.16614v1 | null |
2023-09-28 | Exploiting Edge Features in Graphs with Fused Network Gromov-Wasserstein Distance | Junjie Yang et.al. | 2309.16604v1 | null |
2023-09-28 | Can LLMs Effectively Leverage Structural Information for Graph Learning: When and Why | Jin Huang et.al. | 2309.16595v1 | null |
2023-09-27 | SHACIRA: Scalable HAsh-grid Compression for Implicit Neural Representations | Sharath Girish et.al. | 2309.15848v1 | null |
2023-09-27 | Cross-Modal Multi-Tasking for Speech-to-Text Translation via Hard Parameter Sharing | Brian Yan et.al. | 2309.15826v1 | null |
2023-09-27 | Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation | David Junhao Zhang et.al. | 2309.15818v1 | link |
2023-09-27 | Convolutional Networks with Oriented 1D Kernels | Alexandre Kirchmeyer et.al. | 2309.15812v1 | link |
2023-09-27 | A Quantum-Classical Hybrid Block-Matching Algorithm in Noisy Environment using Dissimilarity Measure | M. Martínez-Felipe et.al. | 2309.15792v1 | null |
2023-09-27 | Large Language Model Routing with Benchmark Datasets | Tal Shnitzer et.al. | 2309.15789v1 | null |
2023-09-27 | One For All: Video Conversation is Feasible Without Video Instruction Tuning | Ruyang Liu et.al. | 2309.15785v1 | null |
2023-09-27 | Rapid Network Adaptation: Learning to Adapt Neural Networks Using Test-Time Feedback | Teresa Yeo et.al. | 2309.15762v1 | null |
2023-09-27 | Automated CT Lung Cancer Screening Workflow using 3D Camera | Brian Teixeira et.al. | 2309.15750v1 | null |
2023-09-27 | Data-Driven Latent Space Representation for Robust Bipedal Locomotion Learning | Guillermo A. Castillo et.al. | 2309.15740v1 | null |
2023-09-26 | Classification of symmetry-enriched topological quantum spin liquids | Weicheng Ye et.al. | 2309.15118v1 | null |
2023-09-26 | Doduo: Learning Dense Visual Correspondence from Unsupervised Semantic-Aware Flow | Zhenyu Jiang et.al. | 2309.15110v1 | null |
2023-09-27 | LAVIE: High-Quality Video Generation with Cascaded Latent Diffusion Models | Yaohui Wang et.al. | 2309.15103v2 | null |
2023-09-26 | VideoDirectorGPT: Consistent Multi-scene Video Generation via LLM-Guided Planning | Han Lin et.al. | 2309.15091v1 | null |
2023-09-26 | Video-adverb retrieval with compositional adverb-action embeddings | Thomas Hummel et.al. | 2309.15086v1 | null |
2023-09-26 | Challenges of building medical image datasets for development of deep learning software in stroke | Alessandro Fontanella et.al. | 2309.15081v1 | null |
2023-09-26 | On Excess Risk Convergence Rates of Neural Network Classifiers | Hyunouk Ko et.al. | 2309.15075v1 | null |
2023-09-26 | Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding | Christina Kassab et.al. | 2309.15065v1 | null |
2023-09-26 | QUILT: Effective Multi-Class Classification on Quantum Computers Using an Ensemble of Diverse Quantum Classifiers | Daniel Silver et.al. | 2309.15056v1 | null |
2023-09-26 | Thalamic nuclei segmentation from T$_1$-weighted MRI: unifying and benchmarking state-of-the-art methods with young and old cohorts | Brendan Williams et.al. | 2309.15053v1 | null |
2023-09-25 | Extreme Parkour with Legged Robots | Xuxin Cheng et.al. | 2309.14341v1 | null |
2023-09-25 | Chop & Learn: Recognizing and Generating Object-State Compositions | Nirat Saini et.al. | 2309.14339v1 | null |
2023-09-25 | Human-Assisted Continual Robot Learning with Foundation Models | Meenal Parakh et.al. | 2309.14321v1 | null |
2023-09-25 | MUTEX: Learning Unified Policies from Multimodal Task Specifications | Rutav Shah et.al. | 2309.14320v1 | null |
2023-09-25 | DeepMesh: Mesh-based Cardiac Motion Tracking using Deep Learning | Qingjie Meng et.al. | 2309.14306v1 | null |
2023-09-25 | NAS-NeRF: Generative Neural Architecture Search for Neural Radiance Fields | Saeejith Nair et.al. | 2309.14293v1 | null |
2023-09-25 | CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Monika Wysoczańska et.al. | 2309.14289v1 | null |
2023-09-25 | Comparison of One- Two- and Three- Dimensional CNN models for Drawing-Test-Based Diagnostics of the Parkinson's Disease | Xuechao Wang et.al. | 2309.14288v1 | null |
2023-09-26 | Virtual Hyperspectral Images Using Symmetric Autoencoders | Archisman Bhattacharjee et.al. | 2309.14286v2 | null |
2023-09-25 | OmniEvent: A Comprehensive, Fair, and Easy-to-Use Toolkit for Event Understanding | Hao Peng et.al. | 2309.14258v1 | link |
2023-09-22 | Robotic Offline RL from Internet Videos via Value-Function Pre-Training | Chethan Bhateja et.al. | 2309.13041v1 | null |
2023-09-22 | Privacy Assessment on Reconstructed Images: Are Existing Evaluation Metrics Faithful to Human Perception? | Xiaoxiao Sun et.al. | 2309.13038v1 | null |
2023-09-22 | Encoding optimization for quantum machine learning demonstrated on a superconducting transmon qutrit | Shuxiang Cao et.al. | 2309.13036v1 | null |
2023-09-22 | Performance Analysis of UNet and Variants for Medical Image Segmentation | Walid Ehab et.al. | 2309.13013v1 | null |
2023-09-22 | Pursuing Counterfactual Fairness via Sequential Autoencoder Across Domains | Yujie Lin et.al. | 2309.13005v1 | null |
2023-09-22 | Braid groups, elliptic curves, and resolving the quartic | Peter Huxford et.al. | 2309.12999v1 | null |
2023-09-22 | License Plate Recognition Based On Multi-Angle View Model | Dat Tran-Anh et.al. | 2309.12972v1 | null |
2023-09-22 | PI-RADS v2 Compliant Automated Segmentation of Prostate Zones Using co-training Motivated Multi-task Dual-Path CNN | Arnab Das et.al. | 2309.12970v1 | null |
2023-09-22 | Detect Every Thing with Few Examples | Xinyu Zhang et.al. | 2309.12969v1 | link |
2023-09-22 | Massive End-to-end Models for Short Search Queries | Weiran Wang et.al. | 2309.12963v1 | null |
2023-09-21 | ForceSight: Text-Guided Mobile Manipulation with Visual-Force Goals | Jeremy A. Collins et.al. | 2309.12312v1 | null |
2023-09-21 | LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent | Jianing Yang et.al. | 2309.12311v1 | null |
2023-09-21 | TalkNCE: Improving Active Speaker Detection with Talk-Aware Contrastive Learning | Chaeyoung Jung et.al. | 2309.12306v1 | null |
2023-09-22 | PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Shilin Yan et.al. | 2309.12303v2 | link |
2023-09-21 | See to Touch: Learning Tactile Dexterity through Visual Incentives | Irmak Guzey et.al. | 2309.12300v1 | null |
2023-09-21 | The Broad Impact of Feature Imitation: Neural Enhancements Across Financial, Speech, and Physiological Domains | Reza Khanmohammadi et.al. | 2309.12279v1 | null |
2023-09-21 | Enabling Quartile-based Estimated-Mean Gradient Aggregation As Baseline for Federated Image Classifications | Yusen Wu et.al. | 2309.12267v1 | null |
2023-09-21 | Parallelizing non-linear sequential models over the sequence length | Yi Heng Lim et.al. | 2309.12252v1 | null |
2023-09-21 | Adaptive Input-image Normalization for Solving Mode Collapse Problem in GAN-based X-ray Images | Muhammad Muneeb Saad et.al. | 2309.12245v1 | null |
2023-09-21 | Model-based Clustering using Non-parametric Hidden Markov Models | Elisabeth Gassiat et.al. | 2309.12238v1 | null |
2023-09-20 | A Large-scale Dataset for Audio-Language Representation Learning | Luoyi Sun et.al. | 2309.11500v1 | null |
2023-09-20 | FreeU: Free Lunch in Diffusion U-Net | Chenyang Si et.al. | 2309.11497v1 | null |
2023-09-21 | Text2Reward: Automated Dense Reward Function Generation for Reinforcement Learning | Tianbao Xie et.al. | 2309.11489v2 | null |
2023-09-20 | First detection of CO$_2$ emission in a Centaur: JWST NIRSpec observations of 39P/Oterma | O. Harrington Pinto et.al. | 2309.11486v1 | null |
2023-09-20 | Multi-Label Takagi-Sugeno-Kang Fuzzy System | Qiongdan Lou et.al. | 2309.11469v1 | null |
2023-09-20 | Budget-Aware Pruning: Handling Multiple Domains with Less Parameters | Samuel Felipe dos Santos et.al. | 2309.11464v1 | null |
2023-09-20 | AudioFool: Fast, Universal and synchronization-free Cross-Domain Attack on Speech Recognition | Mohamad Fakih et.al. | 2309.11462v1 | null |
2023-09-20 | SkeleTR: Towrads Skeleton-based Action Recognition in the Wild | Haodong Duan et.al. | 2309.11445v1 | null |
2023-09-20 | A Systematic Review of Few-Shot Learning in Medical Imaging | Eva Pachetti et.al. | 2309.11433v1 | null |
2023-09-21 | Video Screens for Hearing Research: Transmittance and Reflectance of Professional and Other Fabrics | Jan Heeren et.al. | 2309.11430v2 | null |
2023-09-19 | Assessing the capacity of a denoising diffusion probabilistic model to reproduce spatial context | Rucha Deshpande et.al. | 2309.10817v1 | null |
2023-09-19 | Multisource Holography | Grace Kuo et.al. | 2309.10816v1 | null |
2023-09-19 | Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning | Tianhua Zhang et.al. | 2309.10814v1 | link |
2023-09-19 | Semantic Text Compression for Classification | Emrecan Kutay et.al. | 2309.10809v1 | null |
2023-09-19 | Multi-Context Dual Hyper-Prior Neural Image Compression | Atefeh Khoshkhahtinat et.al. | 2309.10799v1 | null |
2023-09-19 | Multi-spectral Entropy Constrained Neural Compression of Solar Imagery | Ali Zafari et.al. | 2309.10791v1 | null |
2023-09-19 | Guide Your Agent with Adaptive Multimodal Rewards | Changyeon Kim et.al. | 2309.10790v1 | link |
2023-09-19 | Physics-Informed Machine Learning for Data Anomaly Detection, Classification, Localization, and Mitigation: A Review, Challenges, and Path Forward | Mehdi Jabbari Zideh et.al. | 2309.10788v1 | null |
2023-09-19 | AV-SUPERB: A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models | Yuan Tseng et.al. | 2309.10787v1 | link |
2023-09-19 | Context-Aware Neural Video Compression on Solar Dynamics Observatory | Atefeh Khoshkhahtinat et.al. | 2309.10784v1 | null |
2023-09-19 | Des-q: a quantum algorithm to construct and efficiently retrain decision trees for regression and binary classification | Niraj Kumar et.al. | 2309.09976v2 | null |
2023-09-18 | Empirical Study of Mix-based Data Augmentation Methods in Physiological Time Series Data | Peikun Guo et.al. | 2309.09970v1 | null |
2023-09-18 | vSHARP: variable Splitting Half-quadratic ADMM algorithm for Reconstruction of inverse-Problems | George Yiasemis et.al. | 2309.09954v1 | null |
2023-09-18 | TransientViT: A novel CNN - Vision Transformer hybrid real/bogus transient classifier for the Kilodegree Automatic Transient Survey | Zhuoyang Chen et.al. | 2309.09937v1 | null |
2023-09-18 | Algebra of Self-Replication | Lawrence S. Moss et.al. | 2309.09931v1 | null |
2023-09-18 | Evaluating Adversarial Robustness with Expected Viable Performance | Ryan McCoppin et.al. | 2309.09928v1 | null |
2023-09-18 | Impact of Augmented reality system on elementary school ESL learners in country side of china: Motivations, achievements, behaviors and cognitive attainment | Ijaz Ul Haq et.al. | 2309.09894v1 | null |
2023-09-18 | Not Enough Labeled Data? Just Add Semantics: A Data-Efficient Method for Inferring Online Health Texts | Joseph Gatto et.al. | 2309.09877v1 | null |
2023-09-18 | Domain Generalization with Fourier Transform and Soft Thresholding | Hongyi Pan et.al. | 2309.09866v1 | null |
2023-09-18 | Unsupervised Open-Vocabulary Object Localization in Videos | Ke Fan et.al. | 2309.09858v1 | null |
2023-09-18 | Closing the Loop on Runtime Monitors with Fallback-Safe MPC | Rohan Sinha et.al. | 2309.08603v2 | null |
2023-09-15 | Robust Frame-to-Frame Camera Rotation Estimation in Crowded Scenes | Fabien Delattre et.al. | 2309.08588v1 | null |
2023-09-15 | Compositional Foundation Models for Hierarchical Planning | Anurag Ajay et.al. | 2309.08587v1 | null |
2023-09-15 | HINT: Healthy Influential-Noise based Training to Defend against Data Poisoning Attacks | Minh-Hao Van et.al. | 2309.08549v1 | null |
2023-09-15 | Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal Tokens | Minsu Kim et.al. | 2309.08531v1 | null |
2023-09-15 | Generalised Probabilistic Diffusion Scale-Spaces | Pascal Peter et.al. | 2309.08511v1 | null |
2023-09-15 | Deep-learning-powered data analysis in plankton ecology | Harshith Bachimanchi et.al. | 2309.08500v1 | link |
2023-09-15 | P-ROCKET: Pruning Random Convolution Kernels for Time Series Classification | Shaowu Chen et.al. | 2309.08499v1 | link |
2023-09-15 | YCB-Ev: Event-vision dataset for 6DoF object pose estimation | Pavel Rojtberg et.al. | 2309.08482v1 | link |
2023-09-15 | Current and future directions in network biology | Marinka Zitnik et.al. | 2309.08478v1 | null |
2023-09-14 | Disentangling Spatial and Temporal Learning for Efficient Image-to-Video Transfer Learning | Zhiwu Qing et.al. | 2309.07911v1 | link |
2023-09-14 | Generative Image Dynamics | Zhengqi Li et.al. | 2309.07906v1 | null |
2023-09-14 | Ambiguity-Aware In-Context Learning with Large Language Models | Lingyu Gao et.al. | 2309.07900v1 | null |
2023-09-14 | SMARTFEAT: Efficient Feature Construction through Feature-Level Foundation Model Interactions | Yin Lin et.al. | 2309.07856v1 | null |
2023-09-14 | Two Timin': Repairing Smart Contracts With A Two-Layered Approach | Abhinav Jain et.al. | 2309.07841v1 | null |
2023-09-14 | Text Classification of Cancer Clinical Trial Eligibility Criteria | Yumeng Yang et.al. | 2309.07812v1 | null |
2023-09-14 | What Matters to Enhance Traffic Rule Compliance of Imitation Learning for Automated Driving | Hongkuan Zhou et.al. | 2309.07808v1 | null |
2023-09-14 | Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary tasks | Danae Sánchez Villegas et.al. | 2309.07794v1 | null |
2023-09-14 | A Multi-In and Multi-Out Dendritic Neuron Model and its Optimization | Yu Ding et.al. | 2309.07791v1 | null |
2023-09-15 | Virchow: A Million-Slide Digital Pathology Foundation Model | Eugene Vorontsov et.al. | 2309.07778v2 | null |
2023-09-13 | Contrastive Deep Encoding Enables Uncertainty-aware Machine-learning-assisted Histopathology | Nirhoshan Sivaroopan et.al. | 2309.07113v1 | null |
2023-09-13 | Data Augmentation via Subgroup Mixup for Improving Fairness | Madeline Navarro et.al. | 2309.07110v1 | null |
2023-09-13 | The end sum of surfaces | Liam K. Axon et.al. | 2309.07101v1 | null |
2023-09-13 | Revisiting the classics: On the evolutionary origin of the "Fe II" and "He/N" spectral classes of novae | E. Aydi et.al. | 2309.07097v1 | null |
2023-09-13 | RadarLCD: Learnable Radar-based Loop Closure Detection Pipeline | Mirko Usuelli et.al. | 2309.07094v1 | null |
2023-09-13 | Mitigating Group Bias in Federated Learning for Heterogeneous Devices | Khotso Selialia et.al. | 2309.07085v1 | null |
2023-09-13 | The Boundaries of Verifiable Accuracy, Robustness, and Generalisation in Deep Learning | Alexander Bastounis et.al. | 2309.07072v1 | null |
2023-09-13 | Aggregating Long-term Sharp Features via Hybrid Transformers for Video Deblurring | Dongwei Ren et.al. | 2309.07054v1 | link |
2023-09-13 | Thurston's theorem and the Nielsen-Thurston classification via Teichmüller's theorem | James Belk et.al. | 2309.06993v1 | null |
2023-09-13 | Neural network-based coronary dominance classification of RCA angiograms | Ivan Kruzhilov et.al. | 2309.06958v1 | null |
2023-09-12 | Learning Disentangled Avatars with Hybrid 3D Representations | Yao Feng et.al. | 2309.06441v1 | null |
2023-09-12 | LEAP Hand: Low-Cost, Efficient, and Anthropomorphic Hand for Robot Learning | Kenneth Shaw et.al. | 2309.06440v1 | null |
2023-09-12 | AGMDT: Virtual Staining of Renal Histology Images with Adjacency-Guided Multi-Domain Transfer | Tao Ma et.al. | 2309.06421v1 | null |
2023-09-12 | Style2Fab: Functionality-Aware Segmentation for Fabricating Personalized 3D Models with Generative AI | Faraz Faruqi et.al. | 2309.06379v1 | null |
2023-09-12 | Padding-free Convolution based on Preservation of Differential Characteristics of Kernels | Kuangdai Leng et.al. | 2309.06370v1 | null |
2023-09-12 | Using Reed-Muller Codes for Classification with Rejection and Recovery | Daniel Fentham et.al. | 2309.06359v1 | link |
2023-09-12 | Eccentric graph of trees and their Cartesian products | Anita Arora et.al. | 2309.06338v1 | null |
2023-09-12 | Exploring Flat Minima for Domain Generalization with Large Learning Rates | Jian Zhang et.al. | 2309.06337v1 | null |
2023-09-12 | Grounded Language Acquisition From Object and Action Imagery | James Robert Kubricht et.al. | 2309.06335v1 | null |
2023-09-12 | Visualising Game Engine Subsystem Coupling | Gabriel C. Ullmann et.al. | 2309.06329v1 | null |
2023-09-11 | Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips | Yufei Ye et.al. | 2309.05663v1 | null |
2023-09-11 | From Capture to Display: A Survey on Volumetric Video | Yili Jin et.al. | 2309.05658v1 | null |
2023-09-11 | Potentials of Deterministic Radio Propagation Simulation for AI-Enabled Localization and Sensing | Albrecht Michler et.al. | 2309.05650v1 | null |
2023-09-11 | A Novel Supervised Deep Learning Solution to Detect Distributed Denial of Service (DDoS) attacks on Edge Systems using Convolutional Neural Networks (CNN) | Vedanth Ramanathan et.al. | 2309.05646v1 | null |
2023-09-11 | Boundary Peeling: Outlier Detection Method Using One-Class Peeling | Sheikh Arafat et.al. | 2309.05630v1 | null |
2023-09-11 | Temporal Action Localization with Enhanced Instant Discriminability | Dingfeng Shi et.al. | 2309.05590v1 | link |
2023-09-11 | Anisotropic Diffusion Stencils: From Simple Derivations over Stability Estimates to ResNet Implementations | Karl Schrader et.al. | 2309.05575v1 | null |
2023-09-11 | On the Meromorphic Integrability of the Critical Systems for Optimal Sums of Eigenvalues | Yuzhou Tian et.al. | 2309.05568v1 | null |
2023-09-11 | OpenFashionCLIP: Vision-and-Language Contrastive Learning with Open-Source Fashion Data | Giuseppe Cartella et.al. | 2309.05551v1 | link |
2023-09-11 | Distance-Aware eXplanation Based Learning | Misgina Tsighe Hagos et.al. | 2309.05548v1 | link |
2023-09-08 | Generalized Cross-domain Multi-label Few-shot Learning for Chest X-rays | Aroof Aimen et.al. | 2309.04462v1 | null |
2023-09-08 | Generalized Variable Selection Algorithms for Gaussian Process Models by LASSO-like Penalty | Zhiyong Hu et.al. | 2309.04455v1 | null |
2023-09-08 | Vis-SPLIT: Interactive Hierarchical Modeling for mRNA Expression Classification | Braden Roper et.al. | 2309.04423v1 | null |
2023-09-08 | Video Task Decathlon: Unifying Image and Video Tasks in Autonomous Driving | Thomas E. Huang et.al. | 2309.04422v1 | null |
2023-09-08 | Seeing-Eye Quadruped Navigation with Force Responsive Locomotion Control | David DeFazio et.al. | 2309.04370v1 | null |
2023-09-08 | Active Learning for Classifying 2D Grid-Based Level Completability | Mahsa Bazzaz et.al. | 2309.04367v1 | link |
2023-09-08 | Sparse Codesigned Communication and Radar Systems | Hyeon Seok Rou et.al. | 2309.04362v1 | null |
2023-09-08 | Learning from Power Signals: An Automated Approach to Electrical Disturbance Identification Within a Power Transmission System | Jonathan D. Boyd et.al. | 2309.04361v1 | null |
2023-09-08 | Zero-Shot Robustification of Zero-Shot Models With Foundation Models | Dyah Adila et.al. | 2309.04344v1 | null |
2023-09-08 | Encoding Multi-Domain Scientific Papers by Ensembling Multiple CLS Tokens | Ronald Seoh et.al. | 2309.04333v1 | link |
2023-09-07 | A-Eval: A Benchmark for Cross-Dataset Evaluation of Abdominal Multi-Organ Segmentation | Ziyan Huang et.al. | 2309.03906v1 | link |
2023-09-07 | ImageBind-LLM: Multi-modality Instruction Tuning | Jiaming Han et.al. | 2309.03905v1 | link |
2023-09-07 | Tracking Anything with Decoupled Video Segmentation | Ho Kei Cheng et.al. | 2309.03903v1 | link |
2023-09-07 | Learning Continuous Exposure Value Representations for Single-Image HDR Reconstruction | Su-Kai Chen et.al. | 2309.03900v1 | null |
2023-09-07 | The Making and Breaking of Camouflage | Hala Lamdouar et.al. | 2309.03899v1 | null |
2023-09-07 | ProPainter: Improving Propagation and Transformer for Video Inpainting | Shangchen Zhou et.al. | 2309.03897v1 | null |
2023-09-07 | Zero-Shot Audio Captioning via Audibility Guidance | Tal Shaharabany et.al. | 2309.03884v1 | null |
2023-09-07 | Text-to-feature diffusion for audio-visual few-shot learning | Otniel-Bogdan Mercea et.al. | 2309.03869v1 | null |
2023-09-07 | Classification of Killing Magnetic Curves In H^3 | Özgür Kelekçi et.al. | 2309.03859v1 | null |
2023-09-07 | CenTime: Event-Conditional Modelling of Censoring in Survival Analysis | Ahmed H. Shahin et.al. | 2309.03851v1 | link |
2023-09-07 | Terahertz-Band Direction Finding With Beam-Split and Mutual Coupling Calibration | Ahmet M. Elbir et.al. | 2309.03195v2 | null |
2023-09-06 | Signatures of Bayesian inference emerge from energy efficient synapses | James Malkin et.al. | 2309.03194v1 | null |
2023-09-06 | 3D Transformer based on deformable patch location for differential diagnosis between Alzheimer's disease and Frontotemporal dementia | Huy-Dung Nguyen et.al. | 2309.03183v1 | null |
2023-09-06 | PDiscoNet: Semantically consistent part discovery for fine-grained recognition | Robert van der Klis et.al. | 2309.03173v1 | null |
2023-09-06 | ResFields: Residual Neural Fields for Spatiotemporal Signals | Marko Mihajlovic et.al. | 2309.03160v1 | null |
2023-09-06 | Normal mode decomposition of atomic motion in solids | Jaeyun Moon et.al. | 2309.03140v1 | null |
2023-09-06 | Serving Time: Real-Time, Safe Motion Planning and Control for Manipulation of Unsecured Objects | Zachary Brei et.al. | 2309.03111v1 | null |
2023-09-06 | The Secrets of Non-Blind Poisson Deconvolution | Abhiram Gnanasambandam et.al. | 2309.03105v1 | null |
2023-09-06 | On the |
Marcos Escartín Ferrer et.al. | 2309.03091v1 | null |
2023-09-06 | Hide and Seek (HaS): A Lightweight Framework for Prompt Privacy Protection | Yu Chen et.al. | 2309.03057v1 | null |
2023-09-05 | ReliTalk: Relightable Talking Portrait Generation from a Single Video | Haonan Qiu et.al. | 2309.02434v1 | link |
2023-09-05 | A Likelihood Approach to Incorporating Self-Report Data in HIV Recency Classification | Wenlong Yang et.al. | 2309.02430v1 | null |
2023-09-05 | Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach | Vimal K B et.al. | 2309.02429v1 | null |
2023-09-05 | EgoPCA: A New Framework for Egocentric Hand-Object Interaction Understanding | Yue Xu et.al. | 2309.02423v1 | null |
2023-09-05 | Doppelgangers: Learning to Disambiguate Images of Similar Structures | Ruojin Cai et.al. | 2309.02420v1 | link |
2023-09-05 | Classification of La3+ and Gd3+ rare earth ions using surface-enhanced Raman scattering | Hao Jin et.al. | 2309.02409v1 | null |
2023-09-05 | Semantic Communications Based on Adaptive Generative Models and Information Bottleneck | S. Barbarossa et.al. | 2309.02387v1 | null |
2023-09-05 | On the classification of primitive ideals for complex classical Lie algebras, IV | William McGovern et.al. | 2309.02363v1 | null |
2023-09-05 | Generating Infinite-Resolution Texture using GANs with Patch-by-Patch Paradigm | Alhasan Abdellatif et.al. | 2309.02340v1 | null |
2023-09-05 | DEEPBEAS3D: Deep Learning and B-Spline Explicit Active Surfaces | Helena Williams et.al. | 2309.02335v1 | null |
2023-09-01 | Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following | Ziyu Guo et.al. | 2309.00615v1 | link |
2023-09-01 | Amyloid-Beta Axial Plane PET Synthesis from Structural MRI: An Image Translation Approach for Screening Alzheimer's Disease | Fernando Vega et.al. | 2309.00569v1 | null |
2023-09-01 | Powder-Bot: A Modular Autonomous Multi-Robot Workflow for Powder X-Ray Diffraction | Amy M. Lunt et.al. | 2309.00544v1 | null |
2023-09-01 | A Machine Vision Method for Correction of Eccentric Error: Based on Adaptive Enhancement Algorithm | Fanyi Wang et.al. | 2309.00514v1 | null |
2023-09-01 | Multi-stage Deep Learning Artifact Reduction for Computed Tomography | Jiayang Shi et.al. | 2309.00494v1 | null |
2023-09-01 | Geometry-aware Line Graph Transformer Pre-training for Molecular Property Prediction | Peizhen Bai et.al. | 2309.00483v1 | null |
2023-09-01 | Deep Joint Source-Channel Coding for Adaptive Image Transmission over MIMO Channels | Haotian Wu et.al. | 2309.00470v1 | null |
2023-09-01 | New metrics for analyzing continual learners | Nicolas Michel et.al. | 2309.00462v1 | null |
2023-09-01 | The miniJPAS survey quasar selection IV: Classification and redshift estimation with SQUEzE | Ignasi Pérez-Ràfols et.al. | 2309.00461v1 | null |
2023-09-01 | CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding | Étienne Labbé et.al. | 2309.00454v1 | link |
2023-08-31 | PointLLM: Empowering Large Language Models to Understand Point Clouds | Runsen Xu et.al. | 2308.16911v1 | link |
2023-08-31 | StyleInV: A Temporal Style Modulated Inversion Network for Unconditional Video Generation | Yuhan Wang et.al. | 2308.16909v1 | link |
2023-08-31 | Learning to Taste: A Multimodal Wine Dataset | Thoranna Bender et.al. | 2308.16900v1 | null |
2023-08-31 | EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild | Manuel Kaufmann et.al. | 2308.16894v1 | link |
2023-08-31 | On the Role of Non-Localities in Fundamental Diagram Estimation | Jing Liu et.al. | 2308.16878v1 | null |
2023-08-31 | SportsSloMo: A New Benchmark and Baselines for Human-centric Video Frame Interpolation | Jiaben Chen et.al. | 2308.16876v1 | null |
2023-08-31 | Understanding defects in amorphous silicon with million-atom simulations and machine learning | Joe D. Morrow et.al. | 2308.16868v1 | null |
2023-08-31 | Self-pruning Graph Neural Network for Predicting Inflammatory Disease Activity in Multiple Sclerosis from Brain MR Images | Chinmay Prabhakar et.al. | 2308.16863v1 | link |
2023-08-31 | Facing Unknown: Open-World Encrypted Traffic Classification Based on Contrastive Pre-Training | Xiang Li et.al. | 2308.16861v1 | null |
2023-08-31 | Majorization-Minimization for sparse SVMs | Alessandro Benfenati et.al. | 2308.16858v1 | null |
2023-08-30 | Fully Non-Linear Neuromorphic Computing with Linear Wave Scattering | Clara C. Wanjura et.al. | 2308.16181v1 | null |
2023-08-30 | General Purpose Audio Effect Removal | Matthew Rice et.al. | 2308.16177v1 | null |
2023-08-30 | Algebraic, Topological, and Mereological Foundations of Existential Granules | Mani A et.al. | 2308.16157v1 | null |
2023-08-31 | MMVP: Motion-Matrix-based Video Prediction | Yiqi Zhong et.al. | 2308.16154v2 | link |
2023-08-30 | Modality Cycles with Masked Conditional Diffusion for Unsupervised Anomaly Segmentation in MRI | Ziyun Liang et.al. | 2308.16150v1 | null |
2023-08-30 | Spatial Graph Coarsening: Weather and Weekday Prediction with London's Bike-Sharing Service using GNN | Yuta Sato et.al. | 2308.16122v1 | null |
2023-08-30 | Learned Image Reasoning Prior Penetrates Deep Unfolding Network for Panchromatic and Multi-Spectral Image Fusion | Man Zhou et.al. | 2308.16083v1 | null |
2023-08-30 | A Classification of Observation-Driven State-Space Count Models for Panel Data | Jae Youn Ahn et.al. | 2308.16058v1 | null |
2023-08-30 | Low-Rank Multitask Learning based on Tensorized SVMs and LSSVMs | Jiani Liu et.al. | 2308.16056v1 | null |
2023-08-30 | Telepresence Lantern -- Designing an Immersive Video-Mediated Communication Device for Older Adults | Thomas H. Weisswange et.al. | 2308.16052v1 | null |
2023-08-29 | An Adaptive Tangent Feature Perspective of Neural Networks | Daniel LeJeune et.al. | 2308.15478v1 | null |
2023-08-29 | A General-Purpose Self-Supervised Model for Computational Pathology | Richard J. Chen et.al. | 2308.15474v1 | null |
2023-08-29 | Learning Modulated Transformation in GANs | Ceyuan Yang et.al. | 2308.15472v1 | null |
2023-08-30 | Policy composition in reinforcement learning via multi-objective policy optimization | Shruti Mishra et.al. | 2308.15470v2 | null |
2023-08-29 | Input margins can predict generalization too | Coenraad Mouton et.al. | 2308.15466v1 | null |
2023-08-29 | A Comparative Study of Loss Functions: Traffic Predictions in Regular and Congestion Scenarios | Yangxinyu Xie et.al. | 2308.15464v1 | link |
2023-08-29 | Online Overexposed Pixels Hallucination in Videos with Adaptive Reference Frame Selection | Yazhou Xing et.al. | 2308.15462v1 | null |
2023-08-29 | From SMOTE to Mixup for Deep Imbalanced Classification | Wei-Chao Cheng et.al. | 2308.15457v1 | link |
2023-08-29 | Pseudo-Boolean Polynomials Approach To Edge Detection And Image Segmentation | Tendai Mapungwana Chikake et.al. | 2308.15453v1 | null |
2023-08-29 | WrappingNet: Mesh Autoencoder via Deep Sphere Deformation | Eric Lei et.al. | 2308.15413v1 | null |
2023-08-28 | MagicEdit: High-Fidelity and Temporally Coherent Video Editing | Jun Hao Liew et.al. | 2308.14749v1 | null |
2023-08-28 | MagicAvatar: Multimodal Avatar Generation and Animation | Jianfeng Zhang et.al. | 2308.14748v1 | null |
2023-08-28 | CoVR: Learning Composed Video Retrieval from Web Video Captions | Lucas Ventura et.al. | 2308.14746v1 | link |
2023-08-28 | Total Selfie: Generating Full-Body Selfies | Bowei Chen et.al. | 2308.14740v1 | null |
2023-08-28 | PanoSwin: a Pano-style Swin Transformer for Panorama Understanding | Zhixin Ling et.al. | 2308.14726v1 | null |
2023-08-28 | VideoCutLER: Surprisingly Simple Unsupervised Video Instance Segmentation | Xudong Wang et.al. | 2308.14710v1 | link |
2023-08-28 | Fine-Tuning Llama 2 Large Language Models for Detecting Online Sexual Predatory Chats and Abusive Texts | Thanh Thi Nguyen et.al. | 2308.14683v1 | null |
2023-08-28 | Video-Based Hand Pose Estimation for Remote Assessment of Bradykinesia in Parkinson's Disease | Gabriela T. Acevedo Trebbau et.al. | 2308.14679v1 | null |
2023-08-28 | Noncommutative tensor triangular geometry: classification via noetherian spectra | James Rowe et.al. | 2308.14661v1 | null |
2023-08-28 | Towards Standardized Disturbance Rejection Testing of Legged Robot Locomotion with Linear Impactor: A Preliminary Study, Observations, and Implications | Bowen Weng et.al. | 2308.14636v1 | null |
2023-08-25 | Unveiling the Role of Message Passing in Dual-Privacy Preservation on GNNs | Tianyi Zhao et.al. | 2308.13513v1 | null |
2023-08-25 | Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation | Jiaming Zhang et.al. | 2308.13505v1 | null |
2023-08-25 | Attending Generalizability in Course of Deep Fake Detection by Exploring Multi-task Learning | Pranav Balaji et.al. | 2308.13503v1 | null |
2023-08-25 | Eventful Transformers: Leveraging Temporal Redundancy in Vision Transformers | Matthew Dutson et.al. | 2308.13494v1 | link |
2023-08-25 | Temporal Uncertainty Localization to Enable Human-in-the-loop Analysis of Dynamic Contrast-enhanced Cardiac MRI Datasets | Dilek M. Yalcinkaya et.al. | 2308.13488v1 | null |
2023-08-25 | QKSAN: A Quantum Kernel Self-Attention Network | Ren-Xin Zhao et.al. | 2308.13422v1 | null |
2023-08-25 | An investigation into the impact of deep learning model choice on sex and race bias in cardiac MR segmentation | Tiarna Lee et.al. | 2308.13415v1 | null |
2023-08-25 | Self-Supervised Representation Learning with Cross-Context Learning between Global and Hypercolumn Features | Zheng Gao et.al. | 2308.13392v1 | null |
2023-08-25 | Direction-aware Video Demoireing with Temporal-guided Bilateral Learning | Shuning Xu et.al. | 2308.13388v1 | null |
2023-08-25 | On flags of holomorphic foliations associated with singular second-order ordinary differential equations | Fernando Lourenço et.al. | 2308.13370v1 | null |
2023-08-24 | POCO: 3D Pose and Shape Estimation with Confidence | Sai Kumar Dwivedi et.al. | 2308.12965v1 | null |
2023-08-24 | Motion-Guided Masking for Spatiotemporal Representation Learning | David Fan et.al. | 2308.12962v1 | null |
2023-08-24 | Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment | Sheng Zhang et.al. | 2308.12960v1 | link |
2023-08-24 | Beyond Document Page Classification: Design, Datasets, and Challenges | Jordy Van Landeghem et.al. | 2308.12896v1 | null |
2023-08-24 | Large Language Models Vote: Prompting for Rare Disease Identification | David Oniani et.al. | 2308.12890v1 | link |
2023-08-24 | Multi-stage feature decorrelation constraints for improving CNN classification performance | Qiuyu Zhu et.al. | 2308.12880v1 | null |
2023-08-24 | ToonTalker: Cross-Domain Face Reenactment | Yuan Gong et.al. | 2308.12866v1 | null |
2023-08-24 | Learned Local Attention Maps for Synthesising Vessel Segmentations | Yash Deo et.al. | 2308.12861v1 | null |
2023-08-24 | Algebraicity of hypergeometric functions with arbitrary parameters | Florian Fürnsinn et.al. | 2308.12855v1 | null |
2023-08-24 | Eric Bergshoeff et.al. | 2308.12852v1 | null | |
2023-08-23 | Simple is Better and Large is Not Enough: Towards Ensembling of Foundational Language Models | Nancy Tyagi et.al. | 2308.12272v1 | null |
2023-08-23 | Bugsplainer: Leveraging Code Structures to Explain Software Bugs with Neural Machine Translation | Parvez Mahbub et.al. | 2308.12267v1 | null |
2023-08-23 | SPPNet: A Single-Point Prompt Network for Nuclei Image Segmentation | Qing Xu et.al. | 2308.12231v1 | link |
2023-08-23 | Towards Real-Time Analysis of Broadcast Badminton Videos | Nitin Nilesh et.al. | 2308.12199v1 | null |
2023-08-23 | Sign Language Translation with Iterative Prototype | Huijie Yao et.al. | 2308.12191v1 | null |
2023-08-23 | Tumor-Centered Patching for Enhanced Medical Image Segmentation | Mutyyba Asghar et.al. | 2308.12168v1 | null |
2023-08-23 | Constant mean curvature hypersurfaces in Anti-de Sitter space | Enrico Trebeschi et.al. | 2308.12167v1 | null |
2023-08-23 | NPF-200: A Multi-Modal Eye Fixation Dataset and Method for Non-Photorealistic Videos | Ziyu Yang et.al. | 2308.12163v1 | null |
2023-08-23 | A Probabilistic Fluctuation based Membership Inference Attack for Generative Models | Wenjie Fu et.al. | 2308.12143v1 | null |
2023-08-23 | Masking Strategies for Background Bias Removal in Computer Vision Models | Ananthu Aniraj et.al. | 2308.12127v1 | link |
2023-08-22 | StoryBench: A Multifaceted Benchmark for Continuous Story Visualization | Emanuele Bugliarello et.al. | 2308.11606v1 | link |
2023-08-22 | Semantic Multi-Resolution Communications | Matin Mortaheb et.al. | 2308.11604v1 | null |
2023-08-22 | EndoNet: model for automatic calculation of H-score on histological slides | Egor Ushakov et.al. | 2308.11562v1 | null |
2023-08-22 | Open Set Synthetic Image Source Attribution | Shengbang Fang et.al. | 2308.11557v1 | null |
2023-08-22 | Multi-event Video-Text Retrieval | Gengyuan Zhang et.al. | 2308.11551v1 | link |
2023-08-22 | Furnishing Sound Event Detection with Language Model Abilities | Hualei Wang et.al. | 2308.11530v1 | null |
2023-08-22 | LCCo: Lending CLIP to Co-Segmentation | Xin Duan et.al. | 2308.11506v1 | null |
2023-08-23 | Learning from Semantic Alignment between Unpaired Multiviews for Egocentric Video Recognition | Qitong Wang et.al. | 2308.11489v2 | link |
2023-08-22 | Opening the Vocabulary of Egocentric Actions | Dibyadip Chatterjee et.al. | 2308.11488v1 | null |
2023-08-22 | Free Lunch for Gait Recognition: A Novel Relation Descriptor | Jilong Wang et.al. | 2308.11487v1 | null |
2023-08-21 | Structured World Models from Human Videos | Russell Mendonca et.al. | 2308.10901v1 | null |
2023-08-21 | Unlocking Accuracy and Fairness in Differentially Private Image Classification | Leonard Berrada et.al. | 2308.10888v1 | null |
2023-08-21 | Evaluating quantum generative models via imbalanced data classification benchmarks | Graham R. Enos et.al. | 2308.10847v1 | null |
2023-08-21 | Pixel Adaptive Deep Unfolding Transformer for Hyperspectral Image Reconstruction | Miaoyu Li et.al. | 2308.10820v1 | null |
2023-08-21 | Improving Continuous Sign Language Recognition with Cross-Lingual Signs | Fangyun Wei et.al. | 2308.10809v1 | null |
2023-08-21 | DynED: Dynamic Ensemble Diversification in Data Stream Classification | Soheil Abadifard et.al. | 2308.10807v1 | link |
2023-08-21 | MGMAE: Motion Guided Masking for Video Masked Autoencoding | Bingkun Huang et.al. | 2308.10794v1 | null |
2023-08-21 | Extraction of Text from Optic Nerve Optical Coherence Tomography Reports | Iyad Majid et.al. | 2308.10790v1 | null |
2023-08-21 | Dense Error Map Estimation for MRI-Ultrasound Registration in Brain Tumor Surgery Using Swin UNETR | Soorena Salari et.al. | 2308.10784v1 | null |
2023-08-21 | Superfluid weight in the isolated band limit within the generalized random phase approximation | Minh Tam et.al. | 2308.10780v1 | null |
2023-08-18 | Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization | Soumik Mukhopadhyay et.al. | 2308.09716v1 | link |
2023-08-18 | Dynamic 3D Gaussians: Tracking by Persistent Dynamic View Synthesis | Jonathon Luiten et.al. | 2308.09713v1 | null |
2023-08-18 | SimDA: Simple Diffusion Adapter for Efficient Video Generation | Zhen Xing et.al. | 2308.09710v1 | null |
2023-08-18 | Invariant Training 2D-3D Joint Hard Samples for Few-Shot Point Cloud Recognition | Xuanyu Yi et.al. | 2308.09694v1 | null |
2023-08-18 | A Lightweight Transformer for Faster and Robust EBSD Data Collection | Harry Dong et.al. | 2308.09693v1 | link |
2023-08-18 | Audiovisual Moments in Time: A Large-Scale Annotated Dataset of Audiovisual Actions | Michael Joannou et.al. | 2308.09685v1 | link |
2023-08-18 | Quantifying Uncertainties of Contact Classifications in a Human-Robot Collaboration with Parallel Robots | Aran Mohammad et.al. | 2308.09675v1 | null |
2023-08-18 | Classification of modular data up to rank 11 | Siu-Hung Ng et.al. | 2308.09670v1 | null |
2023-08-18 | Collision Isolation and Identification Using Proprioceptive Sensing for Parallel Robots to Enable Human-Robot Collaboration | Aran Mohammad et.al. | 2308.09650v1 | null |
2023-08-18 | Robust Uncertainty Quantification using Conformalised Monte Carlo Prediction | Daniel Bethell et.al. | 2308.09647v1 | link |
2023-08-16 | MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Henghui Ding et.al. | 2308.08544v1 | link |
2023-08-16 | Deployment and Analysis of Instance Segmentation Algorithm for In-field Grade Estimation of Sweetpotatoes | Hoang M. Nguyen et.al. | 2308.08534v1 | null |
2023-08-16 | Diagnosing Human-object Interaction Detectors | Fangrui Zhu et.al. | 2308.08529v1 | link |
2023-08-17 | Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction | Yuhao Yang et.al. | 2308.08518v2 | null |
2023-08-17 | Two-and-a-half Order Score-based Model for Solving 3D Ill-posed Inverse Problems | Zirong Li et.al. | 2308.08511v2 | null |
2023-08-16 | ResBuilder: Automated Learning of Depth with Residual Structures | Julian Burghoff et.al. | 2308.08504v1 | null |
2023-08-16 | Galactic Archaeology: Tracing the Milky Way's Formation and Evolution through Stellar Populations | J. Alfredo Collazos et.al. | 2308.08492v1 | null |
2023-08-16 | Label Propagation Techniques for Artifact Detection in Imbalanced Classes using Photoplethysmogram Signals | Clara Macabiau et.al. | 2308.08480v1 | null |
2023-08-16 | DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching | Johan Edstedt et.al. | 2308.08479v1 | link |
2023-08-16 | Classification Committee for Active Deep Object Detection | Lei Zhao et.al. | 2308.08476v1 | null |
2023-08-15 | CoDeF: Content Deformation Fields for Temporally Consistent Video Processing | Hao Ouyang et.al. | 2308.07926v1 | link |
2023-08-15 | Helping Hands: An Object-Aware Ego-Centric Video Recognition Model | Chuhan Zhang et.al. | 2308.07918v1 | link |
2023-08-15 | Relightable and Animatable Neural Avatar from Sparse-View Video | Zhen Xu et.al. | 2308.07903v1 | null |
2023-08-15 | Back to Basics: A Sanity Check on Modern Time Series Classification Algorithms | Bhaskar Dhariyal et.al. | 2308.07886v1 | link |
2023-08-15 | The Challenge of Fetal Cardiac MRI Reconstruction Using Deep Learning | Denis Prokopenko et.al. | 2308.07885v1 | null |
2023-08-15 | Towards Temporal Edge Regression: A Case Study on Agriculture Trade Between Nations | Lekang Jiang et.al. | 2308.07883v1 | link |
2023-08-15 | Synthesizing Political Zero-Shot Relation Classification via Codebook Knowledge, NLI, and ChatGPT | Yibo Hu et.al. | 2308.07876v1 | null |
2023-08-15 | SEDA: Self-Ensembling ViT with Defensive Distillation and Adversarial Training for robust Chest X-rays Classification | Raza Imam et.al. | 2308.07874v1 | link |
2023-08-15 | Sequence Processing with Quantum Tensor Networks | Carys Harvey et.al. | 2308.07865v1 | null |
2023-08-15 | ImbSAM: A Closer Look at Sharpness-Aware Minimization in Class-Imbalanced Recognition | Yixuan Zhou et.al. | 2308.07815v1 | link |
2023-08-14 | Comparison between parameter-efficient techniques and full fine-tuning: A case study on multilingual news article classification | Olesya Razuvayevskaya et.al. | 2308.07282v1 | null |
2023-08-14 | A Robust Approach Towards Distinguishing Natural and Computer Generated Images using Multi-Colorspace fused and Enriched Vision Transformer | Manjary P Gangan et.al. | 2308.07279v1 | null |
2023-08-14 | EasyEdit: An Easy-to-use Knowledge Editing Framework for Large Language Models | Peng Wang et.al. | 2308.07269v1 | link |
2023-08-14 | Diving with Penguins: Detecting Penguins and their Prey in Animal-borne Underwater Videos via Deep Learning | Kejia Zhang et.al. | 2308.07267v1 | null |
2023-08-14 | Large-kernel Attention for Efficient and Robust Brain Lesion Segmentation | Liam Chalcroft et.al. | 2308.07251v1 | link |
2023-08-14 | LCE -- An Augmented Combination of Bagging and Boosting in Python | Kevin Fauvel et.al. | 2308.07250v1 | link |
2023-08-14 | Large-scale environment mapping and immersive human-robot interaction for agricultural mobile robot teleoperation | Tao Liu et.al. | 2308.07231v1 | null |
2023-08-14 | Almost fine gradings on algebras and classification of gradings up to isomorphism | Alberto Elduque et.al. | 2308.07230v1 | null |
2023-08-14 | Distance Matters For Improving Performance Estimation Under Covariate Shift | Mélanie Roschewitz et.al. | 2308.07223v1 | link |
2023-08-15 | AudioFormer: Audio Transformer learns audio feature representations from discrete acoustic codes | Zhaohui Li et.al. | 2308.07221v2 | link |
2023-08-11 | ARGUS: Visualization of AI-Assisted Task Guidance in AR | Sonia Castelo et.al. | 2308.06246v1 | null |
2023-08-11 | Exploring Predicate Visual Context in Detecting of Human-Object Interactions | Frederic Z. Zhang et.al. | 2308.06202v1 | link |
2023-08-11 | Weakly Supervised Text Classification on Free Text Comments in Patient-Reported Outcome Measures | Anna-Grace Linton et.al. | 2308.06199v1 | null |
2023-08-11 | Physical Adversarial Attacks For Camera-based Smart Systems: Current Trends, Categorization, Applications, Research Challenges, and Future Outlook | Amira Guesmi et.al. | 2308.06173v1 | null |
2023-08-11 | Extrinsic geometry and linear differential equations of |
Boris Doubrov et.al. | 2308.06169v1 | null |
2023-08-11 | Rethinking the Localization in Weakly Supervised Object Localization | Rui Xu et.al. | 2308.06161v1 | null |
2023-08-11 | Identification of the Relevance of Comments in Codes Using Bag of Words and Transformer Based Models | Sruthi S et.al. | 2308.06144v1 | link |
2023-08-11 | Lip2Vec: Efficient and Robust Visual Speech Recognition via Latent-to-Latent Visual to Audio Representation Mapping | Yasser Abdelaziz Dahou Djilali et.al. | 2308.06112v1 | null |
2023-08-11 | Diffusion-based Visual Counterfactual Explanations -- Towards Systematic Quantitative Evaluation | Philipp Vaeth et.al. | 2308.06100v1 | link |
2023-08-11 | Automated Construction of Time-Space Diagrams for Traffic Analysis Using Street-View Video Sequence | Tanay Rastogi et.al. | 2308.06098v1 | null |
2023-08-10 | Follow Anything: Open-set detection, tracking, and following in real-time | Alaa Maalouf et.al. | 2308.05737v1 | link |
2023-08-10 | FrozenRecon: Pose-free 3D Scene Reconstruction with Frozen Depth Models | Guangkai Xu et.al. | 2308.05733v1 | null |
2023-08-10 | Optimizing Performance of Feedforward and Convolutional Neural Networks through Dynamic Activation Functions | Chinmay Rane et.al. | 2308.05724v1 | null |
2023-08-10 | Towards the Automorphism Conjecture I: Combinatorial Control and Compensation for Factorials | Bernd S. W. Schröder et.al. | 2308.05715v1 | null |
2023-08-10 | Automatic Extraction of Relevant Road Infrastructure using Connected vehicle data and Deep Learning Model | Adu-Gyamfi Kojo et.al. | 2308.05658v1 | null |
2023-08-10 | Attention-based 3D CNN with Multi-layer Features for Alzheimer's Disease Diagnosis using Brain Images | Yanteng Zhang et.al. | 2308.05655v1 | null |
2023-08-10 | Counterfactual Cross-modality Reasoning for Weakly Supervised Video Moment Localization | Zezhong Lv et.al. | 2308.05648v1 | link |
2023-08-10 | Self-Supervised Monocular Depth Estimation by Direction-aware Cumulative Convolution Network | Wencheng Han et.al. | 2308.05605v1 | link |
2023-08-10 | Object Goal Navigation with Recursive Implicit Maps | Shizhe Chen et.al. | 2308.05602v1 | null |
2023-08-10 | You Only Prompt Once: On the Capabilities of Prompt Learning on Large Language Models to Tackle Toxic Content | Xinlei He et.al. | 2308.05596v1 | null |
2023-08-09 | Improved Multi-Shot Diffusion-Weighted MRI with Zero-Shot Self-Supervised Learning Reconstruction | Jaejin Cho et.al. | 2308.05103v1 | link |
2023-08-09 | DOST -- Domain Obedient Self-supervised Training for Multi Label Classification with Noisy Labels | Soumadeep Saha et.al. | 2308.05101v1 | null |
2023-08-09 | Constructing Holistic Spatio-Temporal Scene Graph for Video Semantic Role Labeling | Yu Zhao et.al. | 2308.05081v1 | null |
2023-08-10 | Geometric Learning-Based Transformer Network for Estimation of Segmentation Errors | Sneha Sree C et.al. | 2308.05068v2 | null |
2023-08-09 | PAT: Position-Aware Transformer for Dense Multi-Label Action Detection | Faegheh Sardari et.al. | 2308.05051v1 | null |
2023-08-09 | Collaborative Wideband Spectrum Sensing and Scheduling for Networked UAVs in UTM Systems | Sravan Reddy Chintareddy et.al. | 2308.05036v1 | null |
2023-08-09 | Expert load matters: operating networks at high accuracy and low manual effort | Sara Sangalli et.al. | 2308.05035v1 | null |
2023-08-09 | MetRoBERTa: Leveraging Traditional Customer Relationship Management Data to Develop a Transit-Topic-Aware Language Model | Michael Leong et.al. | 2308.05012v1 | null |
2023-08-09 | Exploring Multilingual Text Data Distillation | Shivam Sahni et.al. | 2308.04982v1 | link |
2023-08-09 | CasCIFF: A Cross-Domain Information Fusion Framework Tailored for Cascade Prediction in Social Networks | Hongjun Zhu et.al. | 2308.04961v1 | null |
2023-08-08 | A Deep-Learning Method Using Auto-encoder and Generative Adversarial Network for Anomaly Detection on Ancient Stone Stele Surfaces | Yikun Liu et.al. | 2308.04426v1 | null |
2023-08-08 | A Bi-directional Multi-hop Inference Model for Joint Dialog Sentiment Classification and Act Recognition | Li Zheng et.al. | 2308.04424v1 | null |
2023-08-08 | DiffCR: A Fast Conditional Diffusion Framework for Cloud Removal from Optical Satellite Images | Xuechao Zou et.al. | 2308.04417v1 | null |
2023-08-08 | Probabilistic Invariant Learning with Randomized Linear Classifiers | Leonardo Cotta et.al. | 2308.04412v1 | null |
2023-08-08 | Data Augmentation-Based Unsupervised Domain Adaptation In Medical Imaging | Sebastian Nørgaard Llambias et.al. | 2308.04395v1 | null |
2023-08-08 | SSTFormer: Bridging Spiking Neural Network and Memory Support Transformer for Frame-Event based Recognition | Xiao Wang et.al. | 2308.04369v1 | link |
2023-08-08 | Vascular Ageing and Smoking Habit Prediction via a Low-Cost Single-Lead ECG Module | S. Anas Ali et.al. | 2308.04355v1 | null |
2023-08-08 | A Lightweight and Accurate Face Detection Algorithm Based on Retinaface | Baozhu Liu et.al. | 2308.04340v1 | null |
2023-08-08 | Pengembangan Model untuk Mendeteksi Kerusakan pada Terumbu Karang dengan Klasifikasi Citra | Fadhil Muhammad et.al. | 2308.04337v1 | null |
2023-08-08 | Embracing Safe Contacts with Contact-aware Planning and Control | Zhaoting Li et.al. | 2308.04323v1 | null |
2023-08-07 | 3D Motion Magnification: Visualizing Subtle Motions with Time Varying Radiance Fields | Brandon Y. Feng et.al. | 2308.03757v1 | null |
2023-08-07 | What about translation? New coding system for content analysis on the perception of literary translation around the political transformation in 1989 in Hungary as a classification problem on an unbalanced dataset | Dalma Galambos et.al. | 2308.03742v1 | null |
2023-08-07 | Efficient Temporal Sentence Grounding in Videos with Multi-Teacher Knowledge Distillation | Renjie Liang et.al. | 2308.03725v1 | null |
2023-08-07 | Automated Real Time Delineation of Supraclavicular Brachial Plexus in Neck Ultrasonography Videos: A Deep Learning Approach | Abhay Tyagi et.al. | 2308.03717v1 | null |
2023-08-08 | Communication-Efficient Framework for Distributed Image Semantic Wireless Transmission | Bingyan Xie et.al. | 2308.03713v2 | null |
2023-08-07 | Scaling may be all you need for achieving human-level object recognition capacity with human-like visual experience | A. Emin Orhan et.al. | 2308.03712v1 | link |
2023-08-07 | Video-based Person Re-identification with Long Short-Term Representation Learning | Xuehu Liu et.al. | 2308.03703v1 | null |
2023-08-08 | Screen-based 3D Subjective Experiment Software | Songlin Fan et.al. | 2308.03698v2 | null |
2023-08-07 | Learning Concise and Descriptive Attributes for Visual Recognition | An Yan et.al. | 2308.03685v1 | null |
2023-08-07 | Detecting Spells in Fantasy Literature with a Transformer Based Artificial Intelligence | Marcel Moravek et.al. | 2308.03660v1 | null |
2023-08-04 | Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Qihang Yu et.al. | 2308.02487v1 | link |
2023-08-04 | BlindSage: Label Inference Attacks against Node-level Vertical Federated Graph Neural Networks | Marco Arazzi et.al. | 2308.02465v1 | null |
2023-08-04 | Nonprehensile Planar Manipulation through Reinforcement Learning with Multimodal Categorical Exploration | Juan Del Aguila Ferrandis et.al. | 2308.02459v1 | null |
2023-08-04 | Getting the Ball Rolling: Learning a Dexterous Policy for a Biomimetic Tendon-Driven Hand with Rolling Contact Joints | Yasunori Toshimitsu et.al. | 2308.02453v1 | null |
2023-08-04 | Adaptive Preferential Attached kNN Graph With Distribution-Awareness | Shaojie Min et.al. | 2308.02442v1 | link |
2023-08-04 | Scaling Survival Analysis in Healthcare with Federated Survival Forests: A Comparative Study on Heart Failure and Breast Cancer Genomics | Alberto Archetti et.al. | 2308.02382v1 | null |
2023-08-04 | Brain MRI Segmentation using Template-Based Training and Visual Perception Augmentation | Fang-Cheng Yeh et.al. | 2308.02363v1 | null |
2023-08-04 | T-UNet: Triplet UNet for Change Detection in High-Resolution Remote Sensing Images | Huan Zhong et.al. | 2308.02356v1 | link |
2023-08-04 | Adapting to Change: Robust Counterfactual Explanations in Dynamic Data Landscapes | Bardh Prenkaj et.al. | 2308.02353v1 | link |
2023-08-04 | Generative Image Priors for MRI Reconstruction Trained from Magnitude-Only Images | Guanxiong Luo et.al. | 2308.02340v1 | null |
2023-08-03 | FROD: Robust Object Detection for Free | Muhammad et.al. | 2308.01888v1 | null |
2023-08-03 | Similar image retrieval using Autoencoder. I. Automatic morphology classification of galaxies | Eunsuk Seo et.al. | 2308.01871v1 | null |
2023-08-03 | Tag Prediction of Competitive Programming Problems using Deep Learning Techniques | Taha Lokat et.al. | 2308.01863v1 | null |
2023-08-03 | URET: Universal Robustness Evaluation Toolkit (for Evasion) | Kevin Eykholt et.al. | 2308.01840v1 | link |
2023-08-03 | Distribution-Free Inference for the Regression Function of Binary Classification | Ambrus Tamás et.al. | 2308.01835v1 | null |
2023-08-03 | Deep Neural Networks Fused with Textures for Image Classification | Asish Bera et.al. | 2308.01813v1 | null |
2023-08-03 | Deep Learning-based Prediction of Stress and Strain Maps in Arterial Walls for Improved Cardiovascular Risk Assessment | Yasin Shokrollahi1 et.al. | 2308.01771v1 | null |
2023-08-03 | Focus on Content not Noise: Improving Image Generation for Nuclei Segmentation by Suppressing Steganography in CycleGAN | Jonas Utz et.al. | 2308.01769v1 | null |
2023-08-03 | A Novel Tensor Decomposition of arbitrary order based on Block Convolution with Reflective Boundary Conditions for Multi-Dimensional Data Analysis | Mahdi Molavi et.al. | 2308.01768v1 | null |
2023-08-03 | NuInsSeg: A Fully Annotated Dataset for Nuclei Instance Segmentation in H&E-Stained Histological Images | Amirreza Mahbod et.al. | 2308.01760v1 | link |
2023-08-02 | ELIXR: Towards a general purpose X-ray artificial intelligence system through alignment of large language models and radiology vision encoders | Shawn Xu et.al. | 2308.01317v1 | null |
2023-08-02 | More Context, Less Distraction: Visual Classification by Inferring and Conditioning on Contextual Attributes | Bang An et.al. | 2308.01313v1 | link |
2023-08-02 | Revisiting DETR Pre-training for Object Detection | Yan Ma et.al. | 2308.01300v1 | null |
2023-08-02 | A Probabilistic Approach to Self-Supervised Learning using Cyclical Stochastic Gradient MCMC | Masoumeh Javanbakhat et.al. | 2308.01271v1 | null |
2023-08-02 | Incorporating Season and Solar Specificity into Renderings made by a NeRF Architecture using Satellite Images | Michael Gableman et.al. | 2308.01262v1 | link |
2023-08-02 | Quantum Imprint of the Anharmonic Oscillator | Prisco Lo Chiatto et.al. | 2308.01244v1 | null |
2023-08-03 | CMUNeXt: An Efficient Medical Image Segmentation Network based on Large Kernel and Skip Fusion | Fenghe Tang et.al. | 2308.01239v2 | link |
2023-08-02 | LSF-IDM: Lightweight Deep Learning Models for Automotive Intrusion Detection Model Based on Semantic Fusion | Pengzhou Cheng et.al. | 2308.01237v1 | null |
2023-08-02 | JADES. The diverse population of infant Black Holes at 4<z<11: merging, tiny, poor, but mighty | Roberto Maiolino et.al. | 2308.01230v1 | null |
2023-08-02 | TeachCLIP: Multi-Grained Teaching for Efficient Text-to-Video Retrieval | Kaibin Tian et.al. | 2308.01217v1 | null |
2023-08-01 | Tool Documentation Enables Zero-Shot Tool-Usage with Large Language Models | Cheng-Yu Hsieh et.al. | 2308.00675v1 | null |
2023-08-01 | Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes | Bohao Fan et.al. | 2308.00628v1 | link |
2023-08-01 | NeRT: Implicit Neural Representations for General Unsupervised Turbulence Mitigation | Weiyun Jiang et.al. | 2308.00622v1 | null |
2023-08-01 | Beyond One-Hot-Encoding: Injecting Semantics to Drive Image Classifiers | Alan Perotti et.al. | 2308.00607v1 | link |
2023-08-01 | Relation-Aware Distribution Representation Network for Person Clustering with Multiple Modalities | Kaijian Liu et.al. | 2308.00588v1 | null |
2023-08-01 | Gradient Scaling on Deep Spiking Neural Networks with Spike-Dependent Local Information | Seongsik Park et.al. | 2308.00558v1 | null |
2023-08-01 | SF-IDS: An Imbalanced Semi-Supervised Learning Framework for Fine-grained Intrusion Detection | Xinran Zheng et.al. | 2308.00542v1 | null |
2023-08-01 | Compressed Private Aggregation for Scalable and Robust Federated Learning over Massive Networks | Natalie Lang et.al. | 2308.00540v1 | link |
2023-08-01 | Predicting Early Dropouts of an Active and Healthy Ageing App | Vasileios Perifanis et.al. | 2308.00539v1 | null |
2023-08-01 | PressureTransferNet: Human Attribute Guided Dynamic Ground Pressure Profile Transfer using 3D simulated Pressure Maps | Lala Shakti Swarup Ray et.al. | 2308.00538v1 | null |
2023-07-31 | A Quantized Interband Topological Index in Two-Dimensional Systems | Tharindu Fernando et.al. | 2307.16893v1 | null |
2023-07-31 | Foundational Models for Fault Diagnosis of Electrical Motors | Sriram Anbalagan et.al. | 2307.16891v1 | null |
2023-07-31 | Discovering Adaptable Symbolic Algorithms from Scratch | Stephen Kelly et.al. | 2307.16890v1 | null |
2023-07-31 | Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models | Weikang Yu et.al. | 2307.16865v1 | null |
2023-07-31 | Nonlinearity-induced topological phase transition characterized by the nonlinear Chern number | Kazuki Sone et.al. | 2307.16827v1 | null |
2023-07-31 | Defense of Adversarial Ranking Attack in Text Retrieval: Benchmark and Baseline via Detection | Xuanang Chen et.al. | 2307.16816v1 | null |
2023-07-31 | Capturing Co-existing Distortions in User-Generated Content for No-reference Video Quality Assessment | Kun Yuan et.al. | 2307.16813v1 | null |
2023-07-31 | DoDo Learning: DOmain-DemOgraphic Transfer in Language Models for Detecting Abuse Targeted at Public Figures | Hannah Rose Kirk et.al. | 2307.16811v1 | null |
2023-07-31 | DPMix: Mixture of Depth and Point Cloud Video Experts for 4D Action Segmentation | Yue Zhang et.al. | 2307.16803v1 | null |
2023-07-31 | Classification with Deep Neural Networks and Logistic Loss | Zihan Zhang et.al. | 2307.16792v1 | null |
2023-07-28 | Quantum-noise-limited optical neural networks operating at a few quanta per activation | Shi-Yuan Ma et.al. | 2307.15712v1 | null |
2023-07-31 | MeMOTR: Long-Term Memory-Augmented Transformer for Multi-Object Tracking | Ruopeng Gao et.al. | 2307.15700v2 | null |
2023-07-28 | PatchMixer: Rethinking network design to boost generalization for 3D point cloud understanding | Davide Boscaini et.al. | 2307.15692v1 | null |
2023-07-28 | ODTlearn: A Package for Learning Optimal Decision Trees for Prediction and Prescription | Patrick Vossler et.al. | 2307.15691v1 | link |
2023-07-28 | Dynamic Analysis and an Eigen Initializer for Recurrent Neural Networks | Ran Dou et.al. | 2307.15679v1 | null |
2023-07-28 | Bayesian Time-Series Classifier for Decoding Simple Visual Stimuli from Intracranial Neural Activity | Navid Ziaei et.al. | 2307.15672v1 | null |
2023-07-28 | Classifying core collapse supernova remnants by their morphology as shaped by the last exploding jets | Noam Soker et.al. | 2307.15666v1 | null |
2023-07-28 | Multi-layer Aggregation as a key to feature-based OOD detection | Benjamin Lambert et.al. | 2307.15647v1 | null |
2023-07-28 | Scale-aware Test-time Click Adaptation for Pulmonary Nodule and Mass Segmentation | Zhihao Li et.al. | 2307.15645v1 | link |
2023-07-28 | TriadNet: Sampling-free predictive intervals for lesional volume in 3D brain MR images | Benjamin Lambert et.al. | 2307.15638v1 | null |
2023-07-27 | PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point Tracking | Yang Zheng et.al. | 2307.15055v1 | null |
2023-07-27 | A Transformer-based Approach for Arabic Offline Handwritten Text Recognition | Saleh Momeni et.al. | 2307.15045v1 | null |
2023-07-27 | Drive Asymmetry, Convergence and the Origin of Turbulence in ICF Implosions | Vincent A. Thomas et.al. | 2307.15028v1 | null |
2023-07-27 | Self-Supervised Graph Transformer for Deepfake Detection | Aminollah Khormali et.al. | 2307.15019v1 | null |
2023-07-27 | The last patch for classifying shuffle groups | Junyang Zhang et.al. | 2307.15012v1 | null |
2023-07-27 | Gzip versus bag-of-words for text classification with KNN | Juri Opitz et.al. | 2307.15002v1 | null |
2023-07-27 | Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs | Or Sharir et.al. | 2307.14988v1 | null |
2023-07-27 | Take-A-Photo: 3D-to-2D Generative Pre-training of Point Cloud Models | Ziyi Wang et.al. | 2307.14971v1 | link |
2023-07-27 | Federated Model Aggregation via Self-Supervised Priors for Highly Imbalanced Medical Image Classification | Marawan Elbatel et.al. | 2307.14959v1 | link |
2023-07-27 | Multi-Source Domain Adaptation through Dataset Dictionary Learning in Wasserstein Space | Eduardo Fernandes Montesuma et.al. | 2307.14953v1 | null |
2023-07-26 | MAMo: Leveraging Memory and Attention for Monocular Video Depth Estimation | Rajeev Yasarla et.al. | 2307.14336v1 | null |
2023-07-26 | Event-based Vision for Early Prediction of Manipulation Actions | Daniel Deniz et.al. | 2307.14332v1 | null |
2023-07-26 | Waypoint-Based Imitation Learning for Robotic Manipulation | Lucy Xiaoyang Shi et.al. | 2307.14326v1 | null |
2023-07-26 | Unraveling the Complexity of Splitting Sequential Data: Tackling Challenges in Video and Time Series Analysis | Diego Botache et.al. | 2307.14294v1 | null |
2023-07-26 | G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory | Hongxiang Li et.al. | 2307.14277v1 | null |
2023-07-26 | Deepfake Image Generation for Improved Brain Tumor Segmentation | Roa'a Al-Emaryeen et.al. | 2307.14273v1 | null |
2023-07-26 | Sim-to-Real Model-Based and Model-Free Deep Reinforcement Learning for Tactile Pushing | Max Yang et.al. | 2307.14272v1 | null |
2023-07-26 | Artifact Restoration in Histology Images with Diffusion Probabilistic Models | Zhenqi He et.al. | 2307.14262v1 | link |
2023-07-26 | Defending Adversarial Patches via Joint Region Localizing and Inpainting | Junwen Chen et.al. | 2307.14242v1 | null |
2023-07-26 | DisguisOR: Holistic Face Anonymization for the Operating Room | Lennart Bastian et.al. | 2307.14241v1 | link |
2023-07-25 | RED CoMETS: An ensemble classifier for symbolically represented multivariate time series | Luca A. Bennett et.al. | 2307.13679v1 | link |
2023-07-25 | QuickQual: Lightweight, convenient retinal image quality scoring with off-the-shelf pretrained models | Justin Engelmann et.al. | 2307.13646v1 | link |
2023-07-25 | Manifestly Covariant Worldline Actions from Coadjoint Orbits. Part I: Generalities and Vectorial Descriptions | Thomas Basile et.al. | 2307.13644v1 | null |
2023-07-25 | Optical Flow boosts Unsupervised Localization and Segmentation | Xinyu Zhang et.al. | 2307.13640v1 | link |
2023-07-25 | Insights into Cognitive Engagement: Comparing the Effectiveness of Game-Based and Video-Based Learning | Shayla Sharmin et.al. | 2307.13637v1 | null |
2023-07-25 | Contributions to the Improvement of Question Answering Systems in the Biomedical Domain | Mourad Sarrouti et.al. | 2307.13631v1 | null |
2023-07-25 | Chandra X-ray Observatory Observations of 13 Fermi LAT Sources | Blagoy Rangelov et.al. | 2307.13594v1 | null |
2023-07-25 | Reinterpreting survival analysis in the universal approximator age | Sören Dittmer et.al. | 2307.13579v1 | link |
2023-07-25 | PT$\mathrm{L}^{p}$: Partial Transport |
Xinran Liu et.al. | 2307.13571v1 | null |
2023-07-25 | Group Activity Recognition in Computer Vision: A Comprehensive Review, Challenges, and Future Perspectives | Chuanchuan Wang et.al. | 2307.13541v1 | null |
2023-07-24 | Leveraging Label Variation in Large Language Models for Zero-Shot Text Classification | Flor Miriam Plaza-del-Arco et.al. | 2307.12973v1 | null |
2023-07-24 | A Connection between One-Step Regularization and Critic Regularization in Reinforcement Learning | Benjamin Eysenbach et.al. | 2307.12968v1 | link |
2023-07-24 | Audio-Enhanced Text-to-Video Retrieval using Text-Conditioned Feature Alignment | Sarah Ibrahimi et.al. | 2307.12964v1 | null |
2023-07-24 | Rule By Example: Harnessing Logical Rules for Explainable Hate Speech Detection | Christopher Clarke et.al. | 2307.12935v1 | link |
2023-07-25 | Towards a Visual-Language Foundation Model for Computational Pathology | Ming Y. Lu et.al. | 2307.12914v2 | null |
2023-07-24 | Dyn-E: Local Appearance Editing of Dynamic Neural Radiance Fields | Shangzhan Zhang et.al. | 2307.12909v1 | null |
2023-07-24 | Conditional Residual Coding: A Remedy for Bottleneck Problems in Conditional Inter Frame Coding | Fabian Brand et.al. | 2307.12864v1 | null |
2023-07-24 | Multiscale Video Pretraining for Long-Term Activity Forecasting | Reuben Tan et.al. | 2307.12854v1 | null |
2023-07-25 | Spatiotemporal Modeling Encounters 3D Medical Image Analysis: Slice-Shift UNet with Multi-View Fusion | C. I. Ugwu et.al. | 2307.12853v2 | null |
2023-07-24 | Early Neuron Alignment in Two-layer ReLU Networks with Small Initialization | Hancheng Min et.al. | 2307.12851v1 | null |
2023-07-21 | Advanced Monte Carlo simulation techniques to study polymers under equilibrium conditions | Monika Angwani et.al. | 2307.11722v1 | null |
2023-07-21 | Deep Learning Hyperspectral Pansharpening on large scale PRISMA dataset | Simone Zini et.al. | 2307.11666v1 | null |
2023-07-21 | FEDD -- Fair, Efficient, and Diverse Diffusion-based Lesion Segmentation and Malignancy Classification | Héctor Carrión et.al. | 2307.11654v1 | null |
2023-07-21 | Sparse Cholesky factorization by greedy conditional selection | Stephen Huan et.al. | 2307.11648v1 | link |
2023-07-24 | Morphological Image Analysis and Feature Extraction for Reasoning with AI-based Defect Detection and Classification Models | Jiajun Zhang et.al. | 2307.11643v2 | null |
2023-07-21 | Deep Reinforcement Learning Based System for Intraoperative Hyperspectral Video Autofocusing | Charlie Budd et.al. | 2307.11638v1 | null |
2023-07-21 | Computational Image Formation | Stanley H. Chan et.al. | 2307.11635v1 | null |
2023-07-21 | Finding Optimal Diverse Feature Sets with Alternative Feature Selection | Jakob Bach et.al. | 2307.11607v1 | null |
2023-07-21 | Cascaded multitask U-Net using topological loss for vessel segmentation and centerline extraction | Pierre Rougé et.al. | 2307.11603v1 | null |
2023-07-21 | Mixbiotic society measures: Assessment of community well-going as living system | Takeshi Kato et.al. | 2307.11594v1 | null |
2023-07-20 | GLSFormer: Gated - Long, Short Sequence Transformer for Step Recognition in Surgical Videos | Nisarg A. Shah et.al. | 2307.11081v1 | link |
2023-07-20 | Driving Policy Prediction based on Deep Learning Models | Fuxiao Liu et.al. | 2307.11058v1 | null |
2023-07-20 | Cascade-DETR: Delving into High-Quality Universal Object Detection | Mingqiao Ye et.al. | 2307.11035v1 | link |
2023-07-20 | Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification | Neel Guha et.al. | 2307.11031v1 | null |
2023-07-20 | Cluster-aware Semi-supervised Learning: Relational Knowledge Distillation Provably Learns Clustering | Yijun Dong et.al. | 2307.11030v1 | null |
2023-07-20 | Multi-objective point cloud autoencoders for explainable myocardial infarction prediction | Marcel Beetz et.al. | 2307.11017v1 | null |
2023-07-20 | Treatment And Follow-Up Guidelines For Multiple Brain Metastases: A Systematic Review | Ana Sofia Santos et.al. | 2307.11016v1 | null |
2023-07-21 | Dense Sample Deep Learning | Stephen Josè Hanson et.al. | 2307.10991v2 | null |
2023-07-20 | Deep Spiking-UNet for Image Processing | Hebei Li et.al. | 2307.10974v1 | link |
2023-07-20 | Spinal nerve segmentation method and dataset construction in endoscopic surgical scenarios | Shaowu Peng et.al. | 2307.10955v1 | link |
2023-07-19 | DNA-Rendering: A Diverse Neural Actor Repository for High-Fidelity Human-centric Rendering | Wei Cheng et.al. | 2307.10173v1 | link |
2023-07-19 | Adversarial Latent Autoencoder with Self-Attention for Structural Image Synthesis | Jiajie Fan et.al. | 2307.10166v1 | null |
2023-07-19 | Leveraging Visemes for Better Visual Speech Representation and Lip Reading | Javad Peymanfard et.al. | 2307.10157v1 | null |
2023-07-19 | Remarks on a theorem of Pink in presence of bad reduction | Wojciech Gajda et.al. | 2307.10140v1 | null |
2023-07-19 | Gradient Sparsification For Masked Fine-Tuning of Transformers | James O' Neill et.al. | 2307.10098v1 | null |
2023-07-19 | Boundary-Refined Prototype Generation: A General End-to-End Paradigm for Semi-Supervised Semantic Segmentation | Junhao Dong et.al. | 2307.10097v1 | null |
2023-07-19 | Make-A-Volume: Leveraging Latent Diffusion Models for Cross-Modality 3D Brain MRI Synthesis | Lingting Zhu et.al. | 2307.10094v1 | null |
2023-07-19 | Divert More Attention to Vision-Language Object Tracking | Mingzhe Guo et.al. | 2307.10046v1 | link |
2023-07-19 | A non-monotone extra-gradient trust-region method with noisy oracles | Natasa Krejic et.al. | 2307.10038v1 | null |
2023-07-20 | Class Attention to Regions of Lesion for Imbalanced Medical Image Recognition | Jia-Xin Zhuang et.al. | 2307.10036v2 | null |
2023-07-18 | AnyDoor: Zero-shot Object-level Image Customization | Xi Chen et.al. | 2307.09481v1 | null |
2023-07-18 | FACTS: Facial Animation Creation using the Transfer of Styles | Jack Saunders et.al. | 2307.09480v1 | null |
2023-07-18 | GroupLane: End-to-End 3D Lane Detection with Channel-wise Grouping | Zhuoling Li et.al. | 2307.09472v1 | null |
2023-07-18 | Smooth Attention for Deep Multiple Instance Learning: Application to CT Intracranial Hemorrhage Detection | Yunan Wu et.al. | 2307.09457v1 | link |
2023-07-19 | A comparative analysis of SRGAN models | Fatemeh Rezapoor Nikroo et.al. | 2307.09456v2 | null |
2023-07-19 | Pseudo Outlier Exposure for Out-of-Distribution Detection using Pretrained Transformers | Jaeyoung Kim et.al. | 2307.09455v2 | null |
2023-07-18 | Measuring Student Behavioral Engagement using Histogram of Actions | Ahmed Abdelkawy et.al. | 2307.09420v1 | null |
2023-07-18 | Is this Snippet Written by ChatGPT? An Empirical Study with a CodeBERT-Based Classifier | Phuong T. Nguyen et.al. | 2307.09381v1 | null |
2023-07-18 | CertPri: Certifiable Prioritization for Deep Neural Networks via Movement Cost in Feature Space | Haibin Zheng et.al. | 2307.09375v1 | null |
2023-07-18 | Enhancing Pattern Classification in Support Vector Machines through Matrix Formulation | Sambhav Jain Reshma Rastogi et.al. | 2307.09372v1 | null |
2023-07-17 | Diffusion Models Beat GANs on Image Classification | Soumik Mukhopadhyay et.al. | 2307.08702v1 | null |
2023-07-17 | Neural Video Depth Stabilizer | Yiran Wang et.al. | 2307.08695v1 | link |
2023-07-17 | SEMI-DiffusionInst: A Diffusion Model Based Approach for Semiconductor Defect Classification and Segmentation | Vic De Ridder et.al. | 2307.08693v1 | null |
2023-07-17 | FlashAttention-2: Faster Attention with Better Parallelism and Work Partitioning | Tri Dao et.al. | 2307.08691v1 | link |
2023-07-17 | Implementation of a perception system for autonomous vehicles using a detection-segmentation network in SoC FPGA | Maciej Baczmanski et.al. | 2307.08682v1 | null |
2023-07-17 | Neural Image Compression: Generalization, Robustness, and Spectral Biases | Kelsey Lieberman et.al. | 2307.08657v1 | null |
2023-07-17 | PolyGNN: Polyhedron-based Graph Neural Network for 3D Building Reconstruction from Point Clouds | Zhaiyu Chen et.al. | 2307.08636v1 | null |
2023-07-17 | Deficiency-Aware Masked Transformer for Video Inpainting | Yongsheng Yu et.al. | 2307.08629v1 | link |
2023-07-17 | BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs | Yang Zhao et.al. | 2307.08581v1 | null |
2023-07-18 | Deep Learning with Passive Optical Nonlinear Mapping | Fei Xia et.al. | 2307.08558v2 | null |
2023-07-14 | Expressive Monotonic Neural Networks | Ouail Kitouni et.al. | 2307.07512v1 | link |
2023-07-14 | Streaming CTR Prediction: Rethinking Recommendation Task for Real-World Streaming Data | Qi-Wei Wang et.al. | 2307.07509v1 | null |
2023-07-14 | Brain Tumor Detection using Convolutional Neural Networks with Skip Connections | Aupam Hamran et.al. | 2307.07503v1 | null |
2023-07-14 | TALL: Thumbnail Layout for Deepfake Video Detection | Yuting Xu et.al. | 2307.07494v1 | null |
2023-07-14 | DreamTeacher: Pretraining Image Backbones with Deep Generative Models | Daiqing Li et.al. | 2307.07487v1 | null |
2023-07-14 | Multimodal Distillation for Egocentric Action Recognition | Gorjan Radevski et.al. | 2307.07483v1 | null |
2023-07-14 | Dual-Query Multiple Instance Learning for Dynamic Meta-Embedding based Tumor Classification | Simon Holdenried-Krafft et.al. | 2307.07482v1 | null |
2023-07-14 | Passage-times for partially-homogeneous reflected random walks on the quadrant | Conrado da Costa et.al. | 2307.07458v1 | null |
2023-07-14 | An equivariant surgery classification of |
Kelly Pohland et.al. | 2307.07446v1 | null |
2023-07-14 | Can Large Language Models Empower Molecular Property Prediction? | Chen Qian et.al. | 2307.07443v1 | link |
2023-07-13 | Video-FocalNets: Spatio-Temporal Focal Modulation for Video Action Recognition | Syed Talal Wasim et.al. | 2307.06947v1 | link |
2023-07-13 | InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation | Yi Wang et.al. | 2307.06942v1 | link |
2023-07-13 | Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation | Yingqing He et.al. | 2307.06940v1 | link |
2023-07-13 | DRAGON: A Dialogue-Based Robot for Assistive Navigation with Visual Language Grounding | Shuijing Liu et.al. | 2307.06924v1 | null |
2023-07-13 | Provable Multi-Task Representation Learning by Two-Layer ReLU Neural Networks | Liam Collins et.al. | 2307.06887v1 | null |
2023-07-13 | LVLane: Deep Learning for Lane Detection and Classification in Challenging Conditions | Zillur Rahman et.al. | 2307.06853v1 | link |
2023-07-13 | Leveraging Vision-Language Foundation Models for Fine-Grained Downstream Tasks | Denis Coquenet et.al. | 2307.06795v1 | link |
2023-07-13 | Robotic surface exploration with vision and tactile sensing for cracks detection and characterisation | Francesca Palermo et.al. | 2307.06784v1 | null |
2023-07-13 | Generalizing Supervised Deep Learning MRI Reconstruction to Multiple and Unseen Contrasts using Meta-Learning Hypernetworks | Sriprabha Ramanarayanan et.al. | 2307.06771v1 | link |
2023-07-13 | Pairs of inner projections and two applications | Ramlal Debnath et.al. | 2307.06744v1 | null |
2023-07-12 | Deep Learning of Crystalline Defects from TEM images: A Solution for the Problem of "Never Enough Training Data" | Kishan Govind et.al. | 2307.06322v1 | null |
2023-07-12 | A geometric classification of rod complements in the 3-torus | Connie On Yu Hui et.al. | 2307.06317v1 | null |
2023-07-12 | Facial Reenactment Through a Personalized Generator | Ariel Elazary et.al. | 2307.06307v1 | null |
2023-07-12 | Patch n' Pack: NaViT, a Vision Transformer for any Aspect Ratio and Resolution | Mostafa Dehghani et.al. | 2307.06304v1 | null |
2023-07-12 | Feature Embeddings from Large-Scale Acoustic Bird Classifiers Enable Few-Shot Transfer Learning | Burooj Ghani et.al. | 2307.06292v1 | null |
2023-07-12 | Stochastic Light Field Holography | Florian Schiffers et.al. | 2307.06277v1 | null |
2023-07-12 | Machine learning and Topological data analysis identify unique features of human papillae in 3D scans | Rayna Andreeva et.al. | 2307.06255v1 | null |
2023-07-12 | On the Importance of Denoising when Learning to Compress Images | Benoit Brummer et.al. | 2307.06233v1 | link |
2023-07-12 | Ashaar: Automatic Analysis and Generation of Arabic Poetry Using Deep Learning Approaches | Zaid Alyafeai et.al. | 2307.06218v1 | link |
2023-07-12 | Local Conditional Neural Fields for Versatile and Generalizable Large-Scale Reconstructions in Computational Imaging | Hao Wang et.al. | 2307.06207v1 | null |
2023-07-11 | Fractonic Higher-Order Topological Phases in Open Quantum Systems | Jian-Hao Zhang et.al. | 2307.05474v1 | null |
2023-07-11 | Differentiable Blocks World: Qualitative 3D Decomposition by Rendering Primitives | Tom Monnier et.al. | 2307.05473v1 | null |
2023-07-11 | EgoVLPv2: Egocentric Video-Language Pre-training with Fusion in the Backbone | Shraman Pramanick et.al. | 2307.05463v1 | null |
2023-07-11 | Improving the Security of Smartwatch Payment with Deep Learning | George Webber et.al. | 2307.05437v1 | null |
2023-07-11 | One-Versus-Others Attention: Scalable Multimodal Integration | Michal Golovanevsky et.al. | 2307.05435v1 | link |
2023-07-11 | Identifying Acoustic Wave Sources on the Sun. II. Improved Filter Techniques for Source Wavefield Seismology | Shah Mohammad Bahauddin et.al. | 2307.05433v1 | null |
2023-07-11 | Effective Whitney Stratification of Real Algebraic Varieties | Martin Helmer et.al. | 2307.05427v1 | null |
2023-07-11 | Domain-Agnostic Neural Architecture for Class Incremental Continual Learning in Document Processing Platform | Mateusz Wójcik et.al. | 2307.05399v1 | link |
2023-07-11 | ShredGP: Guitarist Style-Conditioned Tablature Generation | Pedro Sarmento et.al. | 2307.05324v1 | null |
2023-07-11 | Class Instance Balanced Learning for Long-Tailed Classification | Marc-Antoine Lavoie et.al. | 2307.05322v1 | null |
2023-07-10 | Semantic-SAM: Segment and Recognize Anything at Any Granularity | Feng Li et.al. | 2307.04767v1 | link |
2023-07-10 | Learning Spatial Features from Audio-Visual Correspondence in Egocentric Videos | Sagnik Majumder et.al. | 2307.04760v1 | null |
2023-07-10 | Shelving, Stacking, Hanging: Relational Pose Diffusion for Multi-modal Rearrangement | Anthony Simeonov et.al. | 2307.04751v1 | null |
2023-07-10 | RoCo: Dialectic Multi-Robot Collaboration with Large Language Models | Zhao Mandi et.al. | 2307.04738v1 | link |
2023-07-10 | AnimateDiff: Animate Your Personalized Text-to-Image Diffusion Models without Specific Tuning | Yuwei Guo et.al. | 2307.04725v1 | null |
2023-07-10 | Quark/Gluon Discrimination and Top Tagging with Dual Attention Transformer | Minxuan He et.al. | 2307.04723v1 | null |
2023-07-10 | CVPR MultiEarth 2023 Deforestation Estimation Challenge:SpaceVision4Amazon | Sunita Arya et.al. | 2307.04715v1 | null |
2023-07-10 | Multimodal brain age estimation using interpretable adaptive population-graph learning | Kyriaki-Margarita Bintsi et.al. | 2307.04639v1 | null |
2023-07-10 | Learning Fine Pinch-Grasp Skills using Tactile Sensing from Real Demonstration Data | Xiaofeng Mao et.al. | 2307.04619v1 | null |
2023-07-10 | Weakly-supervised positional contrastive learning: application to cirrhosis classification | Emma Sarfati et.al. | 2307.04617v1 | null |
2023-07-07 | On the representation theory of cyclic and dihedral quandles | Mohamed Elhamdadi et.al. | 2307.03728v1 | null |
2023-07-07 | Polybot: Training One Policy Across Robots While Embracing Variability | Jonathan Yang et.al. | 2307.03719v1 | null |
2023-07-07 | Motion Magnification in Robotic Sonography: Enabling Pulsation-Aware Artery Segmentation | Dianye Huang et.al. | 2307.03698v1 | null |
2023-07-07 | Detecting the Sensing Area of A Laparoscopic Probe in Minimally Invasive Cancer Surgery | Baoru Huang et.al. | 2307.03662v1 | null |
2023-07-07 | Physical-aware Cross-modal Adversarial Network for Wearable Sensor-based Human Action Recognition | Jianyuan Ni et.al. | 2307.03638v1 | null |
2023-07-07 | VesselVAE: Recursive Variational Autoencoders for 3D Blood Vessel Synthesis | Paula Feldman et.al. | 2307.03592v1 | null |
2023-07-07 | SpawnNet: Learning Generalizable Visuomotor Skills from Pre-trained Networks | Xingyu Lin et.al. | 2307.03567v1 | null |
2023-07-07 | VariGrad: A Novel Feature Vector Architecture for Geometric Deep Learning on Unregistered Data | Emmanuel Hartman et.al. | 2307.03553v1 | null |
2023-07-07 | TBGC: Task-level Backbone-Oriented Gradient Clip for Multi-Task Foundation Model Learning | Zelun Zhang et.al. | 2307.03465v1 | null |
2023-07-07 | A Deep Active Contour Model for Delineating Glacier Calving Fronts | Konrad Heidler et.al. | 2307.03461v1 | null |
2023-07-06 | Synthesizing Artistic Cinemagraphs from Text | Aniruddha Mahapatra et.al. | 2307.03190v1 | null |
2023-07-06 | Long-term follow-up observations of extreme coronal line emitting galaxies | Peter Clark et.al. | 2307.03182v1 | null |
2023-07-06 | Push Past Green: Learning to Look Behind Plant Foliage by Moving It | Xiaoyu Zhang et.al. | 2307.03175v1 | null |
2023-07-06 | VideoGLUE: Video General Understanding Evaluation of Foundation Models | Liangzhe Yuan et.al. | 2307.03166v1 | null |
2023-07-06 | Can Domain Adaptation Improve Accuracy and Fairness of Skin Lesion Classification? | Janet Wang et.al. | 2307.03157v1 | null |
2023-07-06 | MultiVENT: Multilingual Videos of Events with Aligned Natural Text | Kate Sanders et.al. | 2307.03153v1 | null |
2023-07-06 | Topology-Aware Loss for Aorta and Great Vessel Segmentation in Computed Tomography Images | Seher Ozcelik et.al. | 2307.03137v1 | null |
2023-07-06 | Distilling Large Vision-Language Model with Out-of-Distribution Generalizability | Xuanlin Li et.al. | 2307.03135v1 | link |
2023-07-06 | Benchmarking Test-Time Adaptation against Distribution Shifts in Image Classification | Yongcan Yu et.al. | 2307.03133v1 | link |
2023-07-06 | VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering | Zijun Yao et.al. | 2307.03130v1 | null |
2023-07-05 | Building Cooperative Embodied Agents Modularly with Large Language Models | Hongxin Zhang et.al. | 2307.02485v1 | null |
2023-07-05 | Elastic Decision Transformer | Yueh-Hua Wu et.al. | 2307.02484v1 | null |
2023-07-05 | What Matters in Training a GPT4-Style Language Model with Multimodal Inputs? | Yan Zeng et.al. | 2307.02469v1 | null |
2023-07-05 | Supersymmetric asymptotically locally AdS$_5$ gravitational solitons | Turkuler Durgut et.al. | 2307.02466v1 | null |
2023-07-05 | AxonCallosumEM Dataset: Axon Semantic Segmentation of Whole Corpus Callosum cross section from EM Images | Ao Cheng et.al. | 2307.02464v1 | null |
2023-07-05 | Expert-Agnostic Ultrasound Image Quality Assessment using Deep Variational Clustering | Deepak Raina et.al. | 2307.02462v1 | null |
2023-07-05 | LLCaps: Learning to Illuminate Low-Light Capsule Endoscopy with Curved Wavelet Attention and Reverse Diffusion | Long Bai et.al. | 2307.02452v1 | link |
2023-07-05 | On Deep Learning Classification of Digitally Modulated Signals Using Raw I/Q Data | John A. Snoap et.al. | 2307.02450v1 | null |
2023-07-05 | Vulnerable Source Code Detection using SonarCloud Code Analysis | Alifia Puspaningrum et.al. | 2307.02446v1 | null |
2023-07-05 | Base Layer Efficiency in Scalable Human-Machine Coding | Yalda Foroutan et.al. | 2307.02430v1 | null |
2023-07-03 | Real-time Monocular Full-body Capture in World Space via Sequential Proxy-to-Motion Learning | Yuxiang Zhang et.al. | 2307.01200v1 | null |
2023-07-03 | Segment Anything Meets Point Tracking | Frano Rajič et.al. | 2307.01197v1 | link |
2023-07-03 | Online nearest neighbor classification | Sanjoy Dasgupta et.al. | 2307.01170v1 | null |
2023-07-03 | Don't freeze: Finetune encoders for better Self-Supervised HAR | Vitor Fortes Rey et.al. | 2307.01168v1 | null |
2023-07-03 | Characteristic signatures of accreting binary black holes produced by eccentric minidisks | John Ryan Westernacher-Schneider et.al. | 2307.01154v1 | null |
2023-07-03 | Integral cohomology rings of weighted Grassmann orbifolds and Rigidity properties | Koushik Brahma et.al. | 2307.01153v1 | null |
2023-07-03 | Investigating Data Memorization in 3D Latent Diffusion Models for Medical Image Synthesis | Salman Ul Hassan Dar et.al. | 2307.01148v1 | null |
2023-07-05 | AVSegFormer: Audio-Visual Segmentation with Transformer | Shengyi Gao et.al. | 2307.01146v2 | link |
2023-07-03 | Cross-modality Attention Adapter: A Glioma Segmentation Fine-tuning Method for SAM Using Multimodal Brain MR Images | Xiaoyu Shi et.al. | 2307.01124v1 | null |
2023-07-03 | Supervised Manifold Learning via Random Forest Geometry-Preserving Proximities | Jake S. Rhodes et.al. | 2307.01077v1 | null |
2023-07-03 | SPAE: Semantic Pyramid AutoEncoder for Multimodal Generation with Frozen LLMs | Lijun Yu et.al. | 2306.17842v2 | null |
2023-06-30 | Learning Evacuee Models from Robot-Guided Emergency Evacuation Experiments | Mollik Nayyar et.al. | 2306.17824v1 | null |
2023-06-30 | Act3D: Infinite Resolution Action Detection Transformer for Robotic Manipulation | Theophile Gervet et.al. | 2306.17817v1 | null |
2023-06-30 | Topologically Attributed Graphs for Shape Discrimination | Justin Curry et.al. | 2306.17805v1 | null |
2023-06-30 | Vision Through the Veil: Differential Privacy in Federated Learning for Medical Image Classification | Kishore Babu Nampalle et.al. | 2306.17794v1 | null |
2023-06-30 | Precision Anti-Cancer Drug Selection via Neural Ranking | Vishal Dey et.al. | 2306.17771v1 | null |
2023-06-30 | Improved NL2SQL based on Multi-layer Expert Network | Chenduo Hao et.al. | 2306.17727v1 | null |
2023-06-30 | Content-Preserving Diffusion Model for Unsupervised AS-OCT image Despeckling | Li Sanqian et.al. | 2306.17717v1 | null |
2023-06-30 | Evaluation of the Benefits of Zero Velocity Update in Decentralized EKF-Based Cooperative Localization Algorithms for GNSS-Denied Multi-Robot Systems | Cagri Kilic et.al. | 2306.17703v1 | null |
2023-06-30 | Generalized Time Warping Invariant Dictionary Learning for Time Series Classification and Clustering | Ruiyu Xu et.al. | 2306.17690v1 | null |
2023-06-29 | An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training | Zitian Chen et.al. | 2306.17165v1 | null |
2023-06-29 | Can Machines Garden? Systematically Comparing the AlphaGarden vs. Professional Horticulturalists | Simeon Adebola et.al. | 2306.17162v1 | null |
2023-06-29 | FogROS2-SGC: A ROS2 Cloud Robotics Platform for Secure Global Connectivity | Kaiyuan Chen et.al. | 2306.17157v1 | null |
2023-06-29 | Orbit Classification of asteroids using implementation of radial Basis Function on Support Vector Machines | Yashvir Tiberwal et.al. | 2306.17138v1 | null |
2023-06-29 | On separably integrable symmetric convex bodies | Vladyslav Yaskin et.al. | 2306.17127v1 | null |
2023-06-29 | PVP: Personalized Video Prior for Editable Dynamic Portraits using StyleGAN | Kai-En Lin et.al. | 2306.17123v1 | null |
2023-06-29 | Learning Nuclei Representations with Masked Image Modelling | Piotr Wójcik et.al. | 2306.17116v1 | null |
2023-06-29 | Deep Ensemble for Rotorcraft Attitude Prediction | Hikmat Khan et.al. | 2306.17104v1 | null |
2023-06-29 | Twice Binnable Color Filter Arrays | Mritunjay Singh et.al. | 2306.17078v1 | null |
2023-06-29 | Extremal behavior of reduced type of one dimensional rings | Sarasij Maitra et.al. | 2306.17069v1 | null |
2023-06-28 | Class Numbers, Congruent Numbers and Umbral Moonshine | Miranda C. N. Cheng et.al. | 2306.16414v1 | null |
2023-06-28 | Information-Computation Tradeoffs for Learning Margin Halfspaces with Random Classification Noise | Ilias Diakonikolas et.al. | 2306.16352v1 | null |
2023-06-28 | Accurate, uncertainty-aware classification of molecular chemical motifs from multi-modal X-ray absorption spectroscopy | Matthew R. Carbone et.al. | 2306.16349v1 | null |
2023-06-28 | DoseDiff: Distance-aware Diffusion Model for Dose Prediction in Radiotherapy | Yiwen Zhang et.al. | 2306.16324v1 | null |
2023-06-28 | Universal theory of spin-momentum-orbital-site locking | Yuntian Liu et.al. | 2306.16312v1 | null |
2023-06-28 | Generalizing Surgical Instruments Segmentation to Unseen Domains with One-to-Many Synthesis | An Wang et.al. | 2306.16285v1 | link |
2023-06-28 | Emotion Analysis of Tweets Banning Education in Afghanistan | Mohammad Ali Hussiny et.al. | 2306.16268v1 | null |
2023-06-28 | Reconfigurable Robot Control Using Flexible Coupling Mechanisms | Sha Yi et.al. | 2306.16265v1 | null |
2023-06-28 | Latent SDEs on Homogeneous Spaces | Sebastian Zeng et.al. | 2306.16248v1 | null |
2023-06-28 | Investigating the Uncanny Valley Phenomenon Through the Temporal Dynamics of Neural Responses to Virtual Characters | Chiara Gorlini et.al. | 2306.16233v1 | null |
2023-06-27 | Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties | Hsiao-Yu Tung et.al. | 2306.15668v1 | null |
2023-06-27 | Enhancing Representation Learning on High-Dimensional, Small-Size Tabular Data: A Divide and Conquer Method with Ensembled VAEs | Navindu Leelarathna et.al. | 2306.15661v1 | null |
2023-06-27 | Style-transfer based Speech and Audio-visual Scene Understanding for Robot Action Sequence Acquisition from Videos | Chiori Hori et.al. | 2306.15644v1 | null |
2023-06-27 | Biclustering random matrix partitions with an application to classification of forensic body fluids | Chieh-Hsi Wu et.al. | 2306.15622v1 | null |
2023-06-27 | Recurrent Neural Network-coupled SPAD TCSPC System for Real-time Fluorescence Lifetime Imaging | Yang Lin et.al. | 2306.15599v1 | null |
2023-06-27 | Optimizing Credit Limit Adjustments Under Adversarial Goals Using Reinforcement Learning | Sherly Alfonso-Sánchez et.al. | 2306.15585v1 | null |
2023-06-27 | Parity doublet model for baryon octets: diquark classifications and mass hierarchy based on the quark-line diagram | Takuya Minamikawa et.al. | 2306.15564v1 | null |
2023-06-27 | You Can Mask More For Extremely Low-Bitrate Image Compression | Anqi Li et.al. | 2306.15561v1 | link |
2023-06-27 | A Survey on Deep Learning Hardware Accelerators for Heterogeneous HPC Platforms | Cristina Silvano et.al. | 2306.15552v1 | null |
2023-06-27 | Self-supervised Learning of Event-guided Video Frame Interpolation for Rolling Shutter Frames | Yunfan Lu et.al. | 2306.15507v1 | null |
2023-06-26 | FunQA: Towards Surprising Video Comprehension | Binzhu Xie et.al. | 2306.14899v1 | link |
2023-06-26 | Mapping out phase diagrams with generative classifiers | Julian Arnold et.al. | 2306.14894v1 | null |
2023-06-26 | Fuzzy-Conditioned Diffusion and Diffusion Projection Attention Applied to Facial Image Correction | Majed El Helou et.al. | 2306.14891v1 | link |
2023-06-26 | A Fully Unsupervised Instance Segmentation Technique for White Blood Cell Images | Shrijeet Biswas et.al. | 2306.14875v1 | null |
2023-06-26 | ANYmal Parkour: Learning Agile Navigation for Quadrupedal Robots | David Hoeller et.al. | 2306.14874v1 | null |
2023-06-26 | Leveraging Task Structures for Improved Identifiability in Neural Network Representations | Wenlin Chen et.al. | 2306.14861v1 | null |
2023-06-26 | ViNT: A Foundation Model for Visual Navigation | Dhruv Shah et.al. | 2306.14846v1 | null |
2023-06-26 | An open-source robust machine learning platform for real-time detection and classification of 2D material flakes | Jan-Lucas Uslu et.al. | 2306.14845v1 | null |
2023-06-26 | A Flyweight CNN with Adaptive Decoder for Schistosoma mansoni Egg Detection | Leonardo de Melo Joao et.al. | 2306.14840v1 | null |
2023-06-26 | Label-Aware Hyperbolic Embeddings for Fine-grained Emotion Classification | Chih-Yao Chen et.al. | 2306.14822v1 | link |
2023-06-23 | Adversarial Robustness Certification for Bayesian Neural Networks | Matthew Wicker et.al. | 2306.13614v1 | link |
2023-06-23 | TACOformer:Token-channel compounded Cross Attention for Multimodal Emotion Recognition | Xinda Li et.al. | 2306.13592v1 | null |
2023-06-23 | Estimating Residential Solar Potential Using Aerial Data | Ross Goroshin et.al. | 2306.13564v1 | null |
2023-06-23 | Efficient Model Selection for Predictive Pattern Mining Model by Safe Pattern Pruning | Takumi Yoshida et.al. | 2306.13561v1 | null |
2023-06-26 | FPGA Implementation of Convolutional Neural Network for Real-Time Handwriting Recognition | Shichen Qiao et.al. | 2306.13557v2 | link |
2023-06-23 | Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation | Massimiliano Patacchiola et.al. | 2306.13554v1 | link |
2023-06-23 | Manifold Contrastive Learning with Variational Lie Group Operators | Kion Fallah et.al. | 2306.13544v1 | null |
2023-06-23 | Torsion Graph Neural Networks | Cong Shen et.al. | 2306.13541v1 | link |
2023-06-23 | Topological learning for the classification of disorder: an application to the design of metasurfaces | Tristan Madeleine et.al. | 2306.13540v1 | null |
2023-06-23 | WBCAtt: A White Blood Cell Dataset Annotated with Detailed Morphological Attributes | Satoshi Tsutsui et.al. | 2306.13531v1 | link |
2023-06-22 | A Comparison of Time-based Models for Multimodal Emotion Recognition | Ege Kesim et.al. | 2306.13076v1 | null |
2023-06-22 | Auditing Predictive Models for Intersectional Biases | Kate S. Boxer et.al. | 2306.13064v1 | null |
2023-06-22 | Impacts and Risk of Generative AI Technology on Cyber Defense | Subash Neupane et.al. | 2306.13033v1 | null |
2023-06-22 | Toward Automated Detection of Microbleeds with Anatomical Scale Localization: A Complete Clinical Diagnosis Support Using Deep Learning | Jun-Ho Kim et.al. | 2306.13020v1 | null |
2023-06-22 | Minimalist and High-Quality Panoramic Imaging with PSF-aware Transformers | Qi Jiang et.al. | 2306.12992v1 | link |
2023-06-22 | Can a single image processing algorithm work equally well across all phases of DCE-MRI? | Adam G. Tattersall et.al. | 2306.12988v1 | null |
2023-06-22 | Radiation Emission during the Erasure of Magnetic Monopoles | Maximilian Bachmaier et.al. | 2306.12958v1 | null |
2023-06-22 | Robust Semantic Segmentation: Strong Adversarial Attacks and Fast Training of Robust Models | Francesco Croce et.al. | 2306.12941v1 | link |
2023-06-22 | Deficit of Hot Dust in Low-redshift Active Galactic Nuclei | Suyeon Son et.al. | 2306.12927v1 | null |
2023-06-22 | Machine-Learning-Assisted and Real-Time-Feedback-Controlled Growth of InAs/GaAs Quantum Dots | Chao Shen et.al. | 2306.12898v1 | null |
2023-06-21 | Spectroscopy of the Supernova H0pe Host Galaxy at Redshift 1.78 | M. Polletta et.al. | 2306.12385v1 | null |
2023-06-21 | Geometric Algorithms for |
Diego Ihara Centurion et.al. | 2306.12377v1 | null |
2023-06-21 | M-VAAL: Multimodal Variational Adversarial Active Learning for Downstream Medical Image Analysis Tasks | Bidur Khanal et.al. | 2306.12376v1 | link |
2023-06-21 | One Policy to Dress Them All: Learning to Dress People with Diverse Poses and Garments | Yufei Wang et.al. | 2306.12372v1 | null |
2023-06-21 | Attention Hybrid Variational Net for Accelerated MRI Reconstruction | Guoyao Shen et.al. | 2306.12365v1 | null |
2023-06-21 | Linear and Non-Linear Barrier Coverage in Deterministic and Uncertain environment in WSNs: A New Classification | Adda Boualem et.al. | 2306.12355v1 | null |
2023-06-21 | An efficient, provably exact algorithm for the 0-1 loss linear classification problem | Xi He et.al. | 2306.12344v1 | null |
2023-06-21 | Geometric Pooling: maintaining more useful information | Hao Xu et.al. | 2306.12341v1 | null |
2023-06-22 | Do you still need a manual smart contract audit? | Isaac David et.al. | 2306.12338v2 | null |
2023-06-22 | Beyond Deep Ensembles: A Large-Scale Evaluation of Bayesian Deep Learning under Distribution Shift | Florian Seligmann et.al. | 2306.12306v2 | link |
2023-06-20 | Segment Anything Model (SAM) for Radiation Oncology | Lian Zhang et.al. | 2306.11730v1 | null |
2023-06-20 | Dense Video Object Captioning from Disjoint Supervision | Xingyi Zhou et.al. | 2306.11729v1 | link |
2023-06-20 | How can objects help action recognition? | Xingyi Zhou et.al. | 2306.11726v1 | link |
2023-06-20 | Low-complexity Multidimensional DCT Approximations | V. A. Coutinho et.al. | 2306.11724v1 | null |
2023-06-20 | Meta-Analysis of Transfer Learning for Segmentation of Brain Lesions | Sovesh Mohapatra et.al. | 2306.11714v1 | null |
2023-06-20 | Hexagonal circular 3-webs with polar curves of degree three | Sergey I. Agafonov et.al. | 2306.11707v1 | null |
2023-06-20 | SkyGPT: Probabilistic Short-term Solar Forecasting Using Synthetic Sky Videos from Physics-constrained VideoGPT | Yuhao Nie et.al. | 2306.11682v1 | null |
2023-06-20 | The Implicit Bias of Batch Normalization in Linear Models and Two-layer Linear Convolutional Neural Networks | Yuan Cao et.al. | 2306.11680v1 | null |
2023-06-20 | A primal-dual data-driven method for computational optical imaging with a photonic lantern | Carlos Santos Garcia et.al. | 2306.11679v1 | null |
2023-06-20 | Deep Learning Methods for Retinal Blood Vessel Segmentation: Evaluation on Images with Retinopathy of Prematurity | Gorana Gojić et.al. | 2306.11576v1 | null |
2023-06-16 | Variational quantum algorithms for machine learning: theory and applications | Stefano Mangini et.al. | 2306.09984v1 | null |
2023-06-16 | HePCo: Data-Free Heterogeneous Prompt Consolidation for Continual Federated Learning | Shaunak Halbe et.al. | 2306.09970v1 | null |
2023-06-16 | Training shallow ReLU networks on noisy data using hinge loss: when do we overfit and is it benign? | Erin George et.al. | 2306.09955v1 | null |
2023-06-16 | Towards Better Certified Segmentation via Diffusion Models | Othmane Laousy et.al. | 2306.09949v1 | null |
2023-06-16 | Knowledge Distillation for Efficient Audio-Visual Video Captioning | Özkan Çaylı et.al. | 2306.09947v1 | null |
2023-06-16 | RealImpact: A Dataset of Impact Sound Fields for Real Objects | Samuel Clarke et.al. | 2306.09944v1 | null |
2023-06-16 | A classification of supersymmetric Kaluza-Klein black holes with a single axial symmetry | David Katona et.al. | 2306.09933v1 | null |
2023-06-16 | A Metaheuristic-based Machine Learning Approach for Energy Prediction in Mobile App Development | Seyed Jalaleddin Mousavirad et.al. | 2306.09931v1 | null |
2023-06-16 | Learning to Summarize and Answer Questions about a Virtual Robot's Past Actions | Chad DeChant et.al. | 2306.09922v1 | null |
2023-06-16 | No Strong Feelings One Way or Another: Re-operationalizing Neutrality in Natural Language Inference | Animesh Nighojkar et.al. | 2306.09918v1 | null |
2023-06-16 | UrbanIR: Large-Scale Urban Scene Inverse Rendering from a Single Video | Zhi-Hao Lin et.al. | 2306.09349v2 | null |
2023-06-15 | Causal classification of spatiotemporal quantum correlations | Minjeong Song et.al. | 2306.09336v1 | null |
2023-06-15 | Class-Conditional Conformal Prediction With Many Classes | Tiffany Ding et.al. | 2306.09335v1 | link |
2023-06-15 | Personalized Image Enhancement Featuring Masked Style Modeling | Satoshi Kosugi et.al. | 2306.09334v1 | link |
2023-06-15 | Seeing the Pose in the Pixels: Learning Pose-Aware Representations in Vision Transformers | Dominick Reilly et.al. | 2306.09331v1 | link |
2023-06-15 | WizMap: Scalable Interactive Visualization for Exploring Large Machine Learning Embeddings | Zijie J. Wang et.al. | 2306.09328v1 | link |
2023-06-15 | Language-Guided Music Recommendation for Video via Prompt Analogies | Daniel McKee et.al. | 2306.09327v1 | null |
2023-06-15 | Single-Stage Visual Query Localization in Egocentric Videos | Hanwen Jiang et.al. | 2306.09324v1 | null |
2023-06-15 | Crowd-Powered Photo Enhancement Featuring an Active Learning Based Local Filter | Satoshi Kosugi et.al. | 2306.09321v1 | link |
2023-06-15 | Learnable Weight Initialization for Volumetric Medical Image Segmentation | Shahina Kunhimon et.al. | 2306.09320v1 | link |
2023-06-13 | Classification of branched Willmore spheres | Dorian Martino et.al. | 2306.07965v1 | null |
2023-06-13 | Supervised-Contrastive Loss Learns Orthogonal Frames and Batching Matters | Ganesh Ramachandra Kini et.al. | 2306.07960v1 | link |
2023-06-13 | Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation | Shuai Yang et.al. | 2306.07954v1 | null |
2023-06-13 | MOFI: Learning Image Representations from Noisy Entity Annotated Images | Wentao Wu et.al. | 2306.07952v1 | null |
2023-06-13 | Image Captioners Are Scalable Vision Learners Too | Michael Tschannen et.al. | 2306.07915v1 | null |
2023-06-13 | Massively Multilingual Corpus of Sentiment Datasets and Multi-faceted Sentiment Classification Benchmark | Łukasz Augustyniak et.al. | 2306.07902v1 | link |
2023-06-13 | Artificial Artificial Artificial Intelligence: Crowd Workers Widely Use Large Language Models for Text Production Tasks | Veniamin Veselovsky et.al. | 2306.07899v1 | link |
2023-06-13 | CAMEO: A Causal Transfer Learning Approach for Performance Optimization of Configurable Computer Systems | Md Shahriar Iqbal et.al. | 2306.07888v1 | null |
2023-06-13 | Deep Learning-Enabled Zero-Touch Device Identification: Mitigating the Impact of Channel Variability Through MIMO Diversity | Bechir Hamdaoui et.al. | 2306.07878v1 | null |
2023-06-13 | On the flow unsteadiness and operational characteristics of a novel supersonic fluidic oscillator | Spandan Maikap et.al. | 2306.07849v1 | null |
2023-06-12 | Waffling around for Performance: Visual Classification with Random Words and Broad Concepts | Karsten Roth et.al. | 2306.07282v1 | link |
2023-06-12 | The Cheltsov--Rubinstein problem for strongly asymptotically log del Pezzo surfaces | Chenzi Jin et.al. | 2306.07278v1 | null |
2023-06-12 | Gaussian Membership Inference Privacy | Tobias Leemann et.al. | 2306.07273v1 | null |
2023-06-12 | MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images | Junchen Zhu et.al. | 2306.07257v1 | null |
2023-06-12 | On the Expected Size of Conformal Prediction Sets | Guneet S. Dhillon et.al. | 2306.07254v1 | null |
2023-06-12 | RB-Dust -- A Reference-based Dataset for Vision-based Dust Removal | Peter Buckel et.al. | 2306.07244v1 | null |
2023-06-12 | Strokes2Surface: Recovering Curve Networks From 4D Architectural Design Sketches | S. Rasoulzadeh et.al. | 2306.07220v1 | null |
2023-06-12 | Cyclic objects from surfaces | Ivan Bartulović et.al. | 2306.07216v1 | null |
2023-06-12 | Valley: Video Assistant with Large Language model Enhanced abilitY | Ruipu Luo et.al. | 2306.07207v1 | null |
2023-06-12 | A Survey of Vision-Language Pre-training from the Lens of Multimodal Machine Translation | Jeremy Gwinnup et.al. | 2306.07198v1 | null |
2023-06-09 | Shock Cooling and Possible Precursor Emission in the Early Light Curve of the Type II SN 2023ixf | Griffin Hosseinzadeh et.al. | 2306.06097v1 | null |
2023-06-09 | Leveraging Large Language Models for Scalable Vector Graphics-Driven Image Understanding | Mu Cai et.al. | 2306.06094v1 | null |
2023-06-09 | Virtual Node Tuning for Few-shot Node Classification | Zhen Tan et.al. | 2306.06063v1 | null |
2023-06-09 | Ion-Driven Instabilities in the Inner Heliosphere II: Classification and Multi-Dimensional Mapping | Mihailo M. Martinovic et.al. | 2306.06060v1 | null |
2023-06-09 | Exploring the Impact of Image Resolution on Chest X-ray Classification Performance | Alessandro Wollek et.al. | 2306.06051v1 | null |
2023-06-09 | How Does Fine-Tuning Impact Out-of-Distribution Detection for Vision-Language Models? | Yifei Ming et.al. | 2306.06048v1 | null |
2023-06-09 | GANeRF: Leveraging Discriminators to Optimize Neural Radiance Fields | Barbara Roessle et.al. | 2306.06044v1 | null |
2023-06-09 | WindowNet: Learnable Windows for Chest X-ray Classification | Alessandro Wollek et.al. | 2306.06038v1 | null |
2023-06-09 | Benchmarking self-supervised video representation learning | Akash Kumar et.al. | 2306.06010v1 | null |
2023-06-09 | Beyond Detection: Visual Realism Assessment of Deepfakes | Luka Dragar et.al. | 2306.05985v1 | null |
2023-06-08 | MIMIC-IT: Multi-Modal In-Context Instruction Tuning | Bo Li et.al. | 2306.05425v1 | link |
2023-06-08 | Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models | Muhammad Maaz et.al. | 2306.05424v1 | link |
2023-06-08 | ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process | Changyao Tian et.al. | 2306.05423v1 | null |
2023-06-08 | Tracking Everything Everywhere All at Once | Qianqian Wang et.al. | 2306.05422v1 | null |
2023-06-08 | 2D Supervised Monocular 3D Object Detection by Global-to-Local 3D Reconstruction | Jiawei He et.al. | 2306.05418v1 | null |
2023-06-08 | Tracking Objects with 3D Representation from Videos | Jiawei He et.al. | 2306.05416v1 | null |
2023-06-08 | Quantum symmetries in 2+1 dimensions: Carroll, (a)dS-Carroll, Galilei and (a)dS-Galilei | Tomasz Trześniewski et.al. | 2306.05409v1 | null |
2023-06-08 | Deformation theory for prismatic |
Kazuhiro Ito et.al. | 2306.05361v1 | null |
2023-06-08 | Unsupervised Compositional Concepts Discovery with Text-to-Image Generative Models | Nan Liu et.al. | 2306.05357v1 | null |
2023-06-08 | Predictive Modeling of Equine Activity Budgets Using a 3D Skeleton Reconstructed from Surveillance Recordings | Ernest Pokropek et.al. | 2306.05311v1 | null |
2023-06-08 | Integrating Geometric Control into Text-to-Image Diffusion Models for High-Quality Detection Data Generation via Text Prompt | Kai Chen et.al. | 2306.04607v2 | null |
2023-06-07 | MarineVRS: Marine Video Retrieval System with Explainability via Semantic Understanding | Tan-Sang Ha et.al. | 2306.04593v1 | null |
2023-06-07 | A Dataset for Deep Learning-based Bone Structure Analyses in Total Hip Arthroplasty | Kaidong Zhang et.al. | 2306.04579v1 | link |
2023-06-07 | ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models | Sophie Jentzsch et.al. | 2306.04563v1 | link |
2023-06-07 | Contrastive Bootstrapping for Label Refinement | Shudi Hou et.al. | 2306.04544v1 | null |
2023-06-07 | Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications | Paul Pu Liang et.al. | 2306.04539v1 | link |
2023-06-07 | Long-form analogies generated by chatGPT lack human-like psycholinguistic properties | S. M. Seals et.al. | 2306.04537v1 | null |
2023-06-07 | ContriMix: Unsupervised disentanglement of content and attribute for domain generalization in microscopy image analysis | Tan H. Nguyen et.al. | 2306.04527v1 | null |
2023-06-07 | Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT | Haoran Zhang et.al. | 2306.04512v1 | null |
2023-06-07 | Hardness of Deceptive Certificate Selection | Stephan Wäldchen et.al. | 2306.04505v1 | null |
2023-06-06 | CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental Fine-Tuning and Multi-Task Learning with Label Descriptions | Janis Goldzycher et.al. | 2306.03907v1 | null |
2023-06-06 | Utterance Classification with Logical Neural Network: Explainable AI for Mental Disorder Diagnosis | Yeldar Toleubay et.al. | 2306.03902v1 | null |
2023-06-06 | Towards Label-free Scene Understanding by Vision Foundation Models | Runnan Chen et.al. | 2306.03899v1 | null |
2023-06-06 | Multi-Label ECG Classification using Temporal Convolutional Neural Network | Eedara Prabhakararao et.al. | 2306.03844v1 | null |
2023-06-06 | Atrial Septal Defect Detection in Children Based on Ultrasound Video Using Multiple Instances Learning | Yiman Liu et.al. | 2306.03835v1 | null |
2023-06-06 | MTS2Graph: Interpretable Multivariate Time Series Classification with Temporal Evolving Graphs | Raneen Younis et.al. | 2306.03834v1 | null |
2023-06-06 | GEO-Bench: Toward Foundation Models for Earth Monitoring | Alexandre Lacoste et.al. | 2306.03831v1 | link |
2023-06-06 | Quick-Tune: Quickly Learning Which Pretrained Model to Finetune and How | Sebastian Pineda Arango et.al. | 2306.03828v1 | null |
2023-06-06 | Learning to Ground Instructional Articles in Videos through Narrations | Effrosyni Mavroudi et.al. | 2306.03802v1 | null |
2023-06-06 | Matched Pair Calibration for Ranking Fairness | Hannah Korevaar et.al. | 2306.03775v1 | null |
2023-06-05 | Neuralangelo: High-Fidelity Neural Surface Reconstruction | Zhaoshuo Li et.al. | 2306.03092v1 | null |
2023-06-05 | Dismantling Hate: Understanding Hate Speech Trends Against NBA Athletes | Edinam Kofi Klutse et.al. | 2306.03086v1 | null |
2023-06-05 | MotionDiffuser: Controllable Multi-Agent Motion Prediction using Diffusion | Chiyu Max Jiang et.al. | 2306.03083v1 | null |
2023-06-05 | Of Mice and Mates: Automated Classification and Modelling of Mouse Behaviour in Groups using a Single Model across Cages | Michael P. J. Camilleri et.al. | 2306.03066v1 | null |
2023-06-05 | LibAUC: A Deep Learning Library for X-Risk Optimization | Zhuoning Yuan et.al. | 2306.03065v1 | link |
2023-06-05 | Classification of Edge-dependent Labels of Nodes in Hypergraphs | Minyoung Choe et.al. | 2306.03032v1 | link |
2023-06-05 | AI Techniques for Cone Beam Computed Tomography in Dentistry: Trends and Practices | Saba Sarwar et.al. | 2306.03025v1 | null |
2023-06-05 | Interpretable Alzheimer's Disease Classification Via a Contrastive Diffusion Autoencoder | Ayodeji Ijishakin et.al. | 2306.03022v1 | null |
2023-06-05 | Controllable odd-frequency Cooper pairs in multi-superconductor Josephson junctions | Jorge Cayao et.al. | 2306.03017v1 | null |
2023-06-05 | Over-the-Air Federated Learning in Satellite systems | Edward Akito Carlos et.al. | 2306.02996v1 | null |
2023-06-02 | Multilingual Conceptual Coverage in Text-to-Image Models | Michael Saxon et.al. | 2306.01735v1 | link |
2023-06-02 | Video Colorization with Pre-trained Text-to-Image Diffusion Models | Hanyuan Liu et.al. | 2306.01732v1 | null |
2023-06-02 | Streaming algorithms for evaluating noisy judges on unlabeled data -- binary classification | Andrés Corrada-Emmanuel et.al. | 2306.01726v1 | null |
2023-06-02 | A Data-Driven Measure of Relative Uncertainty for Misclassification Detection | Eduardo Dadalto et.al. | 2306.01710v1 | link |
2023-06-02 | Is Generative Modeling-based Stylization Necessary for Domain Adaptation in Regression Tasks? | Jinman Park et.al. | 2306.01706v1 | null |
2023-06-02 | Temporal-controlled Frame Swap for Generating High-Fidelity Stereo Driving Data for Autonomy Analysis | Yedi Luo et.al. | 2306.01704v1 | null |
2023-06-02 | Unique Brain Network Identification Number for Parkinson's Individuals Using Structural MRI | Tanmayee Samantaray et.al. | 2306.01689v1 | null |
2023-06-02 | Enhancing CLIP with CLIP: Exploring Pseudolabeling for Limited-Label Prompt Tuning | Cristina Menghini et.al. | 2306.01669v1 | link |
2023-06-02 | SourceP: Smart Ponzi Schemes Detection on Ethereum Using Pre-training Model with Data Flow | Pengcheng Lu et.al. | 2306.01665v1 | null |
2023-06-02 | An Adaptive Method for Weak Supervision with Drifting Data | Alessio Mazzetto et.al. | 2306.01658v1 | null |
2023-06-01 | Hiera: A Hierarchical Vision Transformer without the Bells-and-Whistles | Chaitanya Ryali et.al. | 2306.00989v1 | link |
2023-06-01 | Continual Learning for Abdominal Multi-Organ and Tumor Segmentation | Yixiao Zhang et.al. | 2306.00988v1 | link |
2023-06-01 | Using generative AI to investigate medical imagery models and datasets | Oran Lang et.al. | 2306.00985v1 | null |
2023-06-01 | Building Rearticulable Models for Arbitrary 3D Objects from 4D Point Clouds | Shaowei Liu et.al. | 2306.00979v1 | null |
2023-06-01 | Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models | Chang Liu et.al. | 2306.00973v1 | link |
2023-06-01 | LIV: Language-Image Representations and Rewards for Robotic Control | Yecheng Jason Ma et.al. | 2306.00958v1 | link |
2023-06-01 | The ObjectFolder Benchmark: Multisensory Learning with Neural and Real Objects | Ruohan Gao et.al. | 2306.00956v1 | null |
2023-06-01 | Make-Your-Video: Customized Video Generation Using Textual and Structural Guidance | Jinbo Xing et.al. | 2306.00943v1 | null |
2023-06-01 | STEVE-1: A Generative Model for Text-to-Behavior in Minecraft | Shalev Lifshitz et.al. | 2306.00937v1 | null |
2023-06-01 | Interpreting GNN-based IDS Detections Using Provenance Graph Structural Features | Kunal Mukherjee et.al. | 2306.00934v1 | null |
2023-05-31 | Humans in 4D: Reconstructing and Tracking Humans with Transformers | Shubham Goel et.al. | 2305.20091v1 | null |
2023-05-31 | MARDELS: A full-sky X-ray selected galaxy cluster catalog | Matthias Klein et.al. | 2305.20066v1 | null |
2023-05-31 | Exploiting Mechanics-Based Priors for Lateral Displacement Estimation in Ultrasound Elastography | Md Ashikuzzaman et.al. | 2305.20059v1 | null |
2023-05-31 | Exploring Regions of Interest: Visualizing Histological Image Classification for Breast Cancer using Deep Learning | Imane Nedjar et.al. | 2305.20058v1 | null |
2023-05-31 | LOWA: Localize Objects in the Wild with Attributes | Xiaoyuan Guo et.al. | 2305.20047v1 | null |
2023-06-01 | Crowdsourcing subjective annotations using pairwise comparisons reduces bias and error compared to the majority-vote method | Hasti Narimanzadeh et.al. | 2305.20042v2 | null |
2023-05-31 | Bias Mitigation Methods for Binary Classification Decision-Making Systems: Survey and Recommendations | Madeleine Waller et.al. | 2305.20020v1 | null |
2023-05-31 | On the faces of unigraphic |
Riccardo W. Maffucci et.al. | 2305.20012v1 | null |
2023-05-31 | Number of Equivalence Classes of Rational Functions over Finite Fields | Xiang-dong Hou et.al. | 2305.20008v1 | null |
2023-05-31 | Physics-Informed Ensemble Representation for Light-Field Image Super-Resolution | Manchang Jin et.al. | 2305.20006v1 | null |
2023-05-30 | Imaginary quadratic fields with |
Olivia Beckwith et.al. | 2305.19272v1 | null |
2023-05-30 | A Stutter Seldom Comes Alone -- Cross-Corpus Stuttering Detection as a Multi-label Problem | Sebastian P. Bayerl et.al. | 2305.19255v1 | null |
2023-05-30 | Preserving Pre-trained Features Helps Calibrate Fine-tuned Language Models | Guande He et.al. | 2305.19249v1 | link |
2023-05-30 | Optimal bounds on surfaces | Jihao Liu et.al. | 2305.19248v1 | null |
2023-05-30 | COVID-19 Detection from Mass Spectra of Exhaled Breath | Nicolò Bellarmino et.al. | 2305.19211v1 | null |
2023-05-30 | Video ControlNet: Towards Temporally Consistent Synthetic-to-Real Video Translation Using Conditional Image Diffusion Models | Ernie Chu et.al. | 2305.19193v1 | null |
2023-05-30 | GMCs and their Type classification in M74: Toward understanding star formation and cloud evolution | F. Demachi et.al. | 2305.19192v1 | null |
2023-05-30 | Classification of Classical Spin Liquids: Detailed Formalism and Suite of Examples | Han Yan et.al. | 2305.19189v1 | null |
2023-05-30 | Reduced Precision Floating-Point Optimization for Deep Neural Network On-Device Learning on MicroControllers | Davide Nadalini et.al. | 2305.19167v1 | link |
2023-05-30 | Recognizing People by Body Shape Using Deep Networks of Images and Words | Blake A. Myers et.al. | 2305.19160v1 | null |
2023-05-29 | Direct Preference Optimization: Your Language Model is Secretly a Reward Model | Rafael Rafailov et.al. | 2305.18290v1 | null |
2023-05-29 | LaFTer: Label-Free Tuning of Zero-shot Classifier using Language and Unlabeled Image Collections | M. Jehanzeb Mirza et.al. | 2305.18287v1 | null |
2023-05-29 | CommonAccent: Exploring Large Acoustic Pretrained Models for Accent Classification Based on Common Voice | Juan Zuluaga-Gomez et.al. | 2305.18283v1 | link |
2023-05-29 | Gen-L-Video: Multi-Text to Long Video Generation via Temporal Co-Denoising | Fu-Yun Wang et.al. | 2305.18264v1 | link |
2023-05-30 | TaleCrafter: Interactive Story Visualization with Multiple Characters | Yuan Gong et.al. | 2305.18247v2 | link |
2023-05-29 | GazeGNN: A Gaze-Guided Graph Neural Network for Disease Classification | Bin Wang et.al. | 2305.18221v1 | null |
2023-05-29 | Improved Probabilistic Image-Text Representations | Sanghyuk Chun et.al. | 2305.18171v1 | link |
2023-05-29 | LM-CPPF: Paraphrasing-Guided Data Augmentation for Contrastive Prompt-Based Few-Shot Fine-Tuning | Amirhossein Abaskohi et.al. | 2305.18169v1 | link |
2023-05-29 | Generative Adversarial Networks based Skin Lesion Segmentation | Shubham Innani et.al. | 2305.18164v1 | link |
2023-05-30 | Out-of-Distributed Semantic Pruning for Robust Semi-Supervised Learning | Yu Wang et.al. | 2305.18158v2 | link |
2023-05-26 | Characterizing and Measuring Linguistic Dataset Drift | Tyler A. Chang et.al. | 2305.17127v1 | null |
2023-05-26 | IndustReal: Transferring Contact-Rich Assembly Tasks from Simulation to Reality | Bingjie Tang et.al. | 2305.17110v1 | null |
2023-05-26 | ControlVideo: Adding Conditional Control for One Shot Text-to-Video Editing | Min Zhao et.al. | 2305.17098v1 | null |
2023-05-26 | GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation | Tanveer Hannan et.al. | 2305.17096v1 | null |
2023-05-26 | Benchmarking state-of-the-art gradient boosting algorithms for classification | Piotr Florek et.al. | 2305.17094v1 | null |
2023-05-26 | NeuroX Library for Neuron Analysis of Deep NLP Models | Fahim Dalvi et.al. | 2305.17073v1 | link |
2023-05-26 | Extremely weakly-supervised blood vessel segmentation with physiologically based synthesis and domain adaptation | Peidi Xu et.al. | 2305.17054v1 | link |
2023-05-26 | The Brain Tumor Segmentation (BraTS) Challenge 2023: Focus on Pediatrics (CBTN-CONNECT-DIPGR-ASNR-MICCAI BraTS-PEDs) | Anahita Fathi Kazerooni et.al. | 2305.17033v1 | null |
2023-05-26 | Are Deep Neural Networks Adequate Behavioural Models of Human Visual Perception? | Felix A. Wichmann et.al. | 2305.17023v1 | null |
2023-05-26 | D-CALM: A Dynamic Clustering-based Active Learning Approach for Mitigating Bias | Sabit Hassan et.al. | 2305.17013v1 | null |
2023-05-25 | Image is First-order Norm+Linear Autoregressive | Yinpeng Chen et.al. | 2305.16319v1 | null |
2023-05-25 | Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | Shilin Yan et.al. | 2305.16318v1 | link |
2023-05-25 | Making Vision Transformers Truly Shift-Equivariant | Renan A. Rojas-Gomez et.al. | 2305.16316v1 | null |
2023-05-25 | Break-A-Scene: Extracting Multiple Concepts from a Single Image | Omri Avrahami et.al. | 2305.16311v1 | null |
2023-05-25 | Imitating Task and Motion Planning with Visuomotor Transformers | Murtaza Dalal et.al. | 2305.16309v1 | null |
2023-05-25 | Look Ma, No Hands! Agent-Environment Factorization of Egocentric Videos | Matthew Chang et.al. | 2305.16301v1 | null |
2023-05-25 | Sharpness-Aware Minimization Leads to Low-Rank Features | Maksym Andriushchenko et.al. | 2305.16292v1 | link |
2023-05-25 | Diversify Your Vision Datasets with Automatic Diffusion-Based Augmentation | Lisa Dunlap et.al. | 2305.16289v1 | link |
2023-05-25 | UDPM: Upsampling Diffusion Probabilistic Models | Shady Abu-Hussein et.al. | 2305.16269v1 | null |
2023-05-25 | Trans-Dimensional Generative Modeling via Jump Diffusion Models | Andrew Campbell et.al. | 2305.16261v1 | link |
2023-05-24 | RoMa: Revisiting Robust Losses for Dense Feature Matching | Johan Edstedt et.al. | 2305.15404v1 | null |
2023-05-24 | Peek Across: Improving Multi-Document Modeling via Cross-Document Question-Answering | Avi Caciularu et.al. | 2305.15387v1 | null |
2023-05-24 | What can generic neural networks learn from a child's visual experience? | A. Emin Orhan et.al. | 2305.15372v1 | null |
2023-05-24 | Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution | Yiyang Ma et.al. | 2305.15357v1 | null |
2023-05-24 | A Tale of Two Features: Stable Diffusion Complements DINO for Zero-Shot Semantic Correspondence | Junyi Zhang et.al. | 2305.15347v1 | null |
2023-05-24 | Is Your Model "MADD"? A Novel Metric to Evaluate Algorithmic Fairness for Predictive Student Models | Mélina Verger et.al. | 2305.15342v1 | null |
2023-05-24 | Statistical post-processing of visibility ensemble forecasts | Sándor Baran et.al. | 2305.15325v1 | null |
2023-05-24 | Training on Thin Air: Improve Image Classification with Generated Data | Yongchao Zhou et.al. | 2305.15316v1 | null |
2023-05-24 | Personalized Dictionary Learning for Heterogeneous Datasets | Geyu Liang et.al. | 2305.15311v1 | null |
2023-05-24 | High Speed Human Action Recognition using a Photonic Reservoir Computer | Enrico Picco et.al. | 2305.15283v1 | null |
2023-05-23 | Siamese Masked Autoencoders | Agrim Gupta et.al. | 2305.14344v1 | null |
2023-05-23 | Video Prediction Models as Rewards for Reinforcement Learning | Alejandro Escontrela et.al. | 2305.14343v1 | null |
2023-05-23 | Effect of speed fluctuations on the collective dynamics of active disks | R. Kailasham et.al. | 2305.14340v1 | null |
2023-05-23 | Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation | Shuting He et.al. | 2305.14335v1 | link |
2023-05-23 | Large Language Models are Frame-level Directors for Zero-shot Text-to-Video Generation | Susung Hong et.al. | 2305.14330v1 | link |
2023-05-23 | ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings | William Brannon et.al. | 2305.14321v1 | link |
2023-05-23 | Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science | Yida Mu et.al. | 2305.14310v1 | null |
2023-05-23 | A Laplacian Pyramid Based Generative H&E Stain Augmentation Network | Fangda Li et.al. | 2305.14301v1 | link |
2023-05-23 | TaDSE: Template-aware Dialogue Sentence Embeddings | Minsik Oh et.al. | 2305.14299v1 | null |
2023-05-23 | Active Learning Principles for In-Context Learning with Large Language Models | Katerina Margatina et.al. | 2305.14264v1 | null |
2023-05-22 | VDT: An Empirical Study on Video Diffusion with Transformers | Haoyu Lu et.al. | 2305.13311v1 | link |
2023-05-23 | VideoLLM: Modeling Video Sequence with Large Language Models | Guo Chen et.al. | 2305.13292v2 | link |
2023-05-22 | Materialistic: Selecting Similar Materials in Images | Prafull Sharma et.al. | 2305.13291v1 | null |
2023-05-22 | Morphological Sampling Theorem and its Extension to Grey-value Images | Vivek Sridhar et.al. | 2305.13279v1 | null |
2023-05-22 | U-TILISE: A Sequence-to-sequence Model for Cloud Removal in Optical Satellite Time Series | Corinne Stucker et.al. | 2305.13277v1 | null |
2023-05-22 | The Geometric Approach to the Classification of Signals via a Maximal Set of Signals | Leon A. Luxemburg et.al. | 2305.13255v1 | null |
2023-05-22 | Copy Recurrent Neural Network Structure Network | Xiaofan Zhou et.al. | 2305.13250v1 | null |
2023-05-22 | Interactive Natural Language Processing | Zekun Wang et.al. | 2305.13246v1 | null |
2023-05-22 | Sequential Transfer Learning to Decode Heard and Imagined Timbre from fMRI Data | Sean Paulsen et.al. | 2305.13226v1 | null |
2023-05-22 | Learning to detect an animal sound from five examples | Inês Nolasco et.al. | 2305.13210v1 | null |
2023-05-19 | North Sámi Dialect Identification with Self-supervised Speech Models | Sofoklis Kakouros et.al. | 2305.11864v1 | link |
2023-05-19 | Recommendations for Verifying HDR Subjective Testing Workflows | Vibhoothi et.al. | 2305.11858v1 | null |
2023-05-19 | Q-malizing flow and infinitesimal density ratio estimation | Chen Xu et.al. | 2305.11857v1 | null |
2023-05-19 | Video Killed the HD-Map: Predicting Driving Behavior Directly From Drone Images | Yunpeng Liu et.al. | 2305.11856v1 | null |
2023-05-19 | Any-to-Any Generation via Composable Diffusion | Zineng Tang et.al. | 2305.11846v1 | link |
2023-05-19 | A One-Class Classifier for the Detection of GAN Manipulated Multi-Spectral Satellite Images | Lydia Abady et.al. | 2305.11795v1 | null |
2023-05-19 | Enhancing Few-shot NER with Prompt Ordering based Data Augmentation | Huiming Wang et.al. | 2305.11791v1 | link |
2023-05-22 | Prompting with Pseudo-Code Instructions | Mayank Mishra et.al. | 2305.11790v2 | null |
2023-05-19 | Neural Foundations of Mental Simulation: Future Prediction of Latent Representations on Dynamic Scenes | Aran Nayebi et.al. | 2305.11772v1 | null |
2023-05-19 | Persian Typographical Error Type Detection using Many-to-Many Deep Neural Networks on Algorithmically-Generated Misspellings | Mohammad Dehghani et.al. | 2305.11731v1 | null |
2023-05-18 | ONE-PEACE: Exploring One General Representation Model Toward Unlimited Modalities | Peng Wang et.al. | 2305.11172v1 | link |
2023-05-18 | Exploring the Carbon Footprint of Hugging Face's ML Models: A Repository Mining Study | Joel Castaño et.al. | 2305.11164v1 | null |
2023-05-18 | Skin Lesion Diagnosis Using Convolutional Neural Networks | Daniel Alonso Villanueva Nunez et.al. | 2305.11125v1 | null |
2023-05-18 | MiraBest: A Dataset of Morphologically Classified Radio Galaxies for Machine Learning | Fiona A. M. Porter et.al. | 2305.11108v1 | link |
2023-05-18 | Cross-modality Data Augmentation for End-to-End Sign Language Translation | Jinhui Ye et.al. | 2305.11096v1 | null |
2023-05-18 | Universal Domain Adaptation from Foundation Models | Bin Deng et.al. | 2305.11092v1 | link |
2023-05-18 | The Heterotic-Ricci flow and its three-dimensional solitons | Andrei Moroianu et.al. | 2305.11069v1 | null |
2023-05-18 | NODE-ImgNet: a PDE-informed effective and robust model for image denoising | Xinheng Xie et.al. | 2305.11049v1 | null |
2023-05-18 | Simulation of a Variational Quantum Perceptron using Grover's Algorithm | Nouhaila Innan et.al. | 2305.11040v1 | null |
2023-05-18 | Sizing multimodal suspensions with differential dynamic microscopy | Joe J Bradley et.al. | 2305.11018v1 | null |
2023-05-17 | Performance of the Quasar Spectral Templates for the Dark Energy Spectroscopic Instrument | Allyson Brodzeller et.al. | 2305.10426v1 | null |
2023-05-17 | Evolving Tsukamoto Neuro Fuzzy Model for Multiclass Covid 19 Classification with Chest X Ray Images | Marziyeh Rezaei et.al. | 2305.10421v1 | null |
2023-05-17 | Variational Classification | Shehzaad Dhuliawala et.al. | 2305.10406v1 | null |
2023-05-17 | What You See is What You Read? Improving Text-Image Alignment Evaluation | Michal Yarom et.al. | 2305.10400v1 | link |
2023-05-17 | Optimality of Message-Passing Architectures for Sparse Graphs | Aseem Baranwal et.al. | 2305.10391v1 | null |
2023-05-17 | Logit-Based Ensemble Distribution Distillation for Robust Autoregressive Sequence Uncertainties | Yassir Fathullah et.al. | 2305.10384v1 | null |
2023-05-18 | Large-Scale Text Analysis Using Generative Language Models: A Case Study in Discovering Public Value Expressions in AI Patents | Sergio Pelaez et.al. | 2305.10383v2 | null |
2023-05-18 | Dislocation correlations and the continuum dynamics of the weak line bundle ensemble | Joseph Pierre Anderson et.al. | 2305.10375v2 | null |
2023-05-17 | A catalog of collected debris disks: properties, classifications and correlations between disks and stars/planets | Peng-cheng Cao et.al. | 2305.10364v1 | null |
2023-05-17 | Confidence-Guided Semi-supervised Learning in Land Cover Classification | Wanli Ma et.al. | 2305.10344v1 | null |
2023-05-16 | Annotating 8,000 Abdominal CT Volumes for Multi-Organ Segmentation in Three Weeks | Chongyu Qu et.al. | 2305.09666v1 | null |
2023-05-16 | Understanding 3D Object Interaction from a Single Image | Shengyi Qian et.al. | 2305.09664v1 | null |
2023-05-16 | Sixfold way of traversable wormholes in the Sachdev-Ye-Kitaev model | Antonio M. García-García et.al. | 2305.09663v1 | null |
2023-05-16 | Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation | Samaneh Azadi et.al. | 2305.09662v1 | null |
2023-05-16 | Osteosarcoma Tumor Detection using Transfer Learning Models | Raisa Fairooz Meem et.al. | 2305.09660v1 | null |
2023-05-16 | RAMario: Experimental Approach to Reptile Algorithm -- Reinforcement Learning for Mario | Sanyam Jain et.al. | 2305.09655v1 | link |
2023-05-16 | The Interpreter Understands Your Meaning: End-to-end Spoken Language Understanding Aided by Speech Translation | Mutian He et.al. | 2305.09652v1 | null |
2023-05-16 | Tailoring Instructions to Student's Learning Levels Boosts Knowledge Distillation | Yuxin Ren et.al. | 2305.09651v1 | link |
2023-05-16 | Wavelet-based Unsupervised Label-to-Image Translation | George Eskandar et.al. | 2305.09647v1 | link |
2023-05-16 | Data Augmentation for Conflict and Duplicate Detection in Software Engineering Sentence Pairs | Garima Malik et.al. | 2305.09608v1 | null |
2023-05-15 | Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models | Antoni Bigata Casademunt et.al. | 2305.08854v1 | null |
2023-05-15 | Make-A-Protagonist: Generic Video Editing with An Ensemble of Experts | Yuyang Zhao et.al. | 2305.08850v1 | null |
2023-05-15 | Straightening Out the Straight-Through Estimator: Overcoming Optimization Challenges in Vector Quantized Networks | Minyoung Huh et.al. | 2305.08842v1 | null |
2023-05-15 | Attacking Perceptual Similarity Metrics | Abhijay Ghildyal et.al. | 2305.08840v1 | null |
2023-05-15 | Learning Better Contrastive View from Radiologist's Gaze | Sheng Wang et.al. | 2305.08826v1 | link |
2023-05-15 | Measuring Cross-Lingual Transferability of Multilingual Transformers on Sentence Classification | Zewen Chi et.al. | 2305.08800v1 | null |
2023-05-15 | Predictive Models from Quantum Computer Benchmarks | Daniel Hothem et.al. | 2305.08796v1 | null |
2023-05-15 | Comparing Variation in Tokenizer Outputs Using a Series of Problematic and Challenging Biomedical Sentences | Christopher Meaney et.al. | 2305.08787v1 | null |
2023-05-15 | TAA-GCN: A Temporally Aware Adaptive Graph Convolutional Network for Age Estimation | Matthew Korban et.al. | 2305.08779v1 | null |
2023-05-15 | Question-Answering System Extracts Information on Injection Drug Use from Clinical Progress Notes | Maria Mahbub et.al. | 2305.08777v1 | null |
2023-05-12 | How do supernova remnants cool? -- I. Morphology, optical emission lines, and shocks | Ekaterina I. Makarenko et.al. | 2305.07652v1 | null |
2023-05-12 | Beware of diffusion models for synthesizing medical images -- A comparison with GANs in terms of memorizing brain tumor images | Muhammad Usman Akbar et.al. | 2305.07644v1 | null |
2023-05-12 | Efficient Neural Network based Classification and Outlier Detection for Image Moderation using Compressed Sensing and Group Testing | Sabyasachi Ghosh et.al. | 2305.07639v1 | null |
2023-05-12 | Agile gesture recognition for capacitive sensing devices: adapting on-the-job | Ying Liu et.al. | 2305.07624v1 | null |
2023-05-12 | Uncertainty Estimation for Deep Learning Image Reconstruction using a Local Lipschitz Metric | Danyal F. Bhutto et.al. | 2305.07618v1 | null |
2023-05-12 | Fisher Information Embedding for Node and Graph Learning | Dexiong Chen et.al. | 2305.07580v1 | link |
2023-05-12 | A Memory Model for Question Answering from Streaming Data Supported by Rehearsal and Anticipation of Coreference Information | Vladimir Araujo et.al. | 2305.07565v1 | null |
2023-05-12 | Dish detection in food platters: A framework for automated diet logging and nutrition management | Mansi Goel et.al. | 2305.07552v1 | null |
2023-05-12 | Content-based jewellery item retrieval using the local region-based histograms | Amin Muhammad Shoib et.al. | 2305.07540v1 | null |
2023-05-12 | Saturated Non-Monotonic Activation Functions | Junjia Chen et.al. | 2305.07537v1 | null |
2023-05-11 | A General-Purpose Multilingual Document Encoder | Onur Galoğlu et.al. | 2305.07016v1 | link |
2023-05-11 | Self-Chained Image-Language Model for Video Localization and Question Answering | Shoubin Yu et.al. | 2305.06988v1 | link |
2023-05-11 | Meta-hallucinator: Towards Few-Shot Cross-Modality Cardiac Image Segmentation | Ziyuan Zhao et.al. | 2305.06978v1 | null |
2023-05-11 | Data quality dimensions for fair AI | Camilla Quaresmini et.al. | 2305.06967v1 | link |
2023-05-11 | Transformers for CT Reconstruction From Monoplanar and Biplanar Radiographs | Firas Khader et.al. | 2305.06965v1 | null |
2023-05-11 | Cascaded Cross-Attention Networks for Data-Efficient Whole-Slide Image Classification Using Transformers | Firas Khader et.al. | 2305.06963v1 | null |
2023-05-11 | EAML: Ensemble Self-Attention-based Mutual Learning Network for Document Image Classification | Souhail Bakkali et.al. | 2305.06923v1 | null |
2023-05-11 | Meta-Learners for Few-Shot Weakly-Supervised Medical Image Segmentation | Hugo Oliveira et.al. | 2305.06912v1 | null |
2023-05-11 | Stochastic Variance-Reduced Majorization-Minimization Algorithms | Duy-Nhat Phan et.al. | 2305.06848v1 | link |
2023-05-11 | Detection and Classification of Pole-like Landmarks for Domain-invariant 3D Point Cloud Map Matching | Sun Yifei et.al. | 2305.06845v1 | null |
2023-05-11 | HumanRF: High-Fidelity Neural Radiance Fields for Humans in Motion | Mustafa Işık et.al. | 2305.06356v2 | link |
2023-05-10 | VideoChat: Chat-Centric Video Understanding | KunChang Li et.al. | 2305.06355v1 | link |
2023-05-10 | Reconstructing Animatable Categories from Videos | Gengshan Yang et.al. | 2305.06351v1 | null |
2023-05-10 | Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception | Hassan Akbari et.al. | 2305.06324v1 | null |
2023-05-10 | SepMark: Deep Separable Watermarking for Unified Source Tracing and Deepfake Detection | Xiaoshuai Wu et.al. | 2305.06321v1 | null |
2023-05-10 | Learning Video-Conditioned Policies for Unseen Manipulation Tasks | Elliot Chane-Sane et.al. | 2305.06289v1 | null |
2023-05-10 | Convolution of periodic multiplicative functions and the divisor problem | Marco Aymone et.al. | 2305.06260v1 | null |
2023-05-11 | Embedded Feature Similarity Optimization with Specific Parameter Initialization for 2D/3D Registration | Minheng Chen et.al. | 2305.06252v2 | null |
2023-05-10 | Eigenmodes of magnetic skyrmion lattices | Louise Desplat et.al. | 2305.06248v1 | null |
2023-05-10 | Explainable Knowledge Distillation for On-device Chest X-Ray Classification | Chakkrit Termritthikun et.al. | 2305.06244v1 | null |
2023-05-10 | InternChat: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language | Zhaoyang Liu et.al. | 2305.05662v2 | link |
2023-05-09 | An Exploration of Encoder-Decoder Approaches to Multi-Label Classification for Legal and Biomedical Text | Yova Kementchedjhieva et.al. | 2305.05627v1 | null |
2023-05-09 | Can point cloud networks learn statistical shape models of anatomies? | Jadie Adams et.al. | 2305.05610v1 | null |
2023-05-09 | Region-based Contrastive Pretraining for Medical Image Retrieval with Anatomic Query | Ho Hin Lee et.al. | 2305.05598v1 | null |
2023-05-09 | Topological classification and black hole thermodynamics | Mohammad Reza Alipour et.al. | 2305.05595v1 | null |
2023-05-09 | Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels | Enric Moreu et.al. | 2305.05580v1 | null |
2023-05-09 | Resource Dimensioning for Single-Cell Edge Video Analytics | Jaume Anguera Peris et.al. | 2305.05568v1 | null |
2023-05-09 | ColonMapper: topological mapping and localization for colonoscopy | Javier Morlana et.al. | 2305.05546v1 | null |
2023-05-09 | Integrating Holistic and Local Information to Estimate Emotional Reaction Intensity | Yini Fang et.al. | 2305.05534v1 | link |
2023-05-09 | RMES: Real-Time Micro-Expression Spotting Using Phase From Riesz Pyramid | Yini Fang et.al. | 2305.05523v1 | null |
2023-05-08 | What Do Patients Say About Their Disease Symptoms? Deep Multilabel Text Classification With Human-in-the-Loop Curation for Automatic Labeling of Patient Self Reports of Problems | Lakshmi Arbatti et.al. | 2305.04905v1 | null |
2023-05-08 | Toward Connecting Speech Acts and Search Actions in Conversational Search Tasks | Souvick Ghosh et.al. | 2305.04858v1 | null |
2023-05-08 | Isotonic subgroup selection | Manuel M. Müller et.al. | 2305.04852v1 | null |
2023-05-08 | Compressed Video Quality Assessment for Super-Resolution: a Benchmark and a Quality Metric | Evgeney Bogatyrev et.al. | 2305.04844v1 | link |
2023-05-08 | Learning Summary-Worthy Visual Representation for Abstractive Summarization in Video | Zenan Xu et.al. | 2305.04824v1 | null |
2023-05-08 | On word complexity and topological entropy of random substitution subshifts | Andrew Mitchell et.al. | 2305.04817v1 | null |
2023-05-08 | FlaPy: Mining Flaky Python Tests at Scale | Martin Gruber et.al. | 2305.04793v1 | null |
2023-05-08 | AvatarReX: Real-time Expressive Full-body Avatars | Zerong Zheng et.al. | 2305.04789v1 | null |
2023-05-08 | Multi-Scale Energy (MuSE) plug and play framework for inverse problems | Jyothi Rikhab Chand et.al. | 2305.04775v1 | null |
2023-05-08 | Understanding Noise-Augmented Training for Randomized Smoothing | Ambar Pal et.al. | 2305.04746v1 | link |
2023-05-05 | Avatar Fingerprinting for Authorized Use of Synthetic Talking-Head Videos | Ekta Prashnani et.al. | 2305.03713v1 | null |
2023-05-05 | Fine-Grained Product Classification on Leaflet Advertisements | Daniel Ladwig et.al. | 2305.03706v1 | link |
2023-05-05 | How Segment Anything Model (SAM) Boost Medical Image Segmentation? | Yichi Zhang et.al. | 2305.03678v1 | link |
2023-05-05 | Stochastic maximum principle for sub-diffusions and its applications | Shuaiqi Zhang et.al. | 2305.03676v1 | null |
2023-05-08 | White-Box Multi-Objective Adversarial Attack on Dialogue Generation | Yufei Li et.al. | 2305.03655v2 | link |
2023-05-05 | ( |
Lauren Snider et.al. | 2305.03651v1 | null |
2023-05-05 | A surface-normal photodetector as nonlinear activation function in diffractive optical neural networks | Farshid Ashtiani et.al. | 2305.03627v1 | null |
2023-05-05 | Segmentation of fundus vascular images based on a dual-attention mechanism | Yuanyuan Peng et.al. | 2305.03617v1 | null |
2023-05-05 | Human Attention-Guided Explainable Artificial Intelligence for Computer Vision Models | Guoyang Liu et.al. | 2305.03601v1 | null |
2023-05-05 | HSCNet++: Hierarchical Scene Coordinate Classification and Regression for Visual Localization with Transformer | Shuzhe Wang et.al. | 2305.03595v1 | null |
2023-05-04 | Tracking through Containers and Occluders in the Wild | Basile Van Hoorick et.al. | 2305.03052v1 | null |
2023-05-04 | NeuralEditor: Editing Neural Radiance Fields via Manipulating Point Clouds | Jun-Kun Chen et.al. | 2305.03049v1 | null |
2023-05-04 | Personalize Segment Anything Model with One Shot | Renrui Zhang et.al. | 2305.03048v1 | link |
2023-05-04 | Learning Hand-Held Object Reconstruction from In-The-Wild Videos | Aditya Prakash et.al. | 2305.03036v1 | null |
2023-05-04 | NeRSemble: Multi-view Radiance Field Reconstruction of Human Heads | Tobias Kirschstein et.al. | 2305.03027v1 | null |
2023-05-04 | The Polynomial Connection between Morphological Dilation and Discrete Convolution | Vivek Sridhar et.al. | 2305.03018v1 | null |
2023-05-04 | Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Recover the Whole Sentence | Haoran Li et.al. | 2305.03010v1 | link |
2023-05-04 | NatCS: Eliciting Natural Customer Support Dialogues | James Gung et.al. | 2305.03007v1 | link |
2023-05-04 | Hybrid quantum learning with data re-uploading on a small-scale superconducting quantum simulator | Aleksei Tolstobrov et.al. | 2305.02956v1 | null |
2023-05-04 | A study on the composition of elementary cellular automata | Alonso Castillo-Ramirez et.al. | 2305.02947v1 | null |
2023-05-03 | Descent in tensor triangular geometry | Tobias Barthel et.al. | 2305.02308v1 | null |
2023-05-03 | New Equivalences Between Interpolation and SVMs: Kernels and Structured Features | Chiraag Kaushik et.al. | 2305.02304v1 | null |
2023-05-03 | Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime | Chuhan Zhang et.al. | 2305.02297v1 | null |
2023-05-03 | DynamicStereo: Consistent Dynamic Depth from Stereo Videos | Nikita Karaev et.al. | 2305.02296v1 | null |
2023-05-03 | Iranian License Plate Recognition Using a Reliable Deep Learning Approach | Soheila Hatami et.al. | 2305.02292v1 | null |
2023-05-03 | Evaluating BERT-based Scientific Relation Classifiers for Scholarly Knowledge Graph Construction on Digital Library Collections | Ming Jiang et.al. | 2305.02291v1 | null |
2023-05-03 | Multi-dimensional Signal Recovery using Low-rank Deconvolution | David Reixach et.al. | 2305.02264v1 | null |
2023-05-03 | Standardized Benchmark Dataset for Localized Exposure to a Realistic Source at 10$-$90 GHz | Ante Kapetanovic et.al. | 2305.02260v1 | null |
2023-05-03 | Thermally-driven Multilevel Non-volatile Memory with Monolayer MoS2 for Neuro-inspired Artificial Learning | Sameer Kumar Mallik et.al. | 2305.02259v1 | null |
2023-05-03 | The Benefits of Label-Description Training for Zero-Shot Text Classification | Lingyu Gao et.al. | 2305.02239v1 | link |
2023-05-02 | Sequence Modeling with Multiresolution Convolutional Memory | Jiaxin Shi et.al. | 2305.01638v1 | null |
2023-05-03 | A Technical Report on Image Classification using AWS | Balakrishna Phani Kommanaboina et.al. | 2305.01634v2 | null |
2023-05-02 | AutoColor: Learned Light Power Control for Multi-Color Holograms | Yicheng Zhan et.al. | 2305.01611v1 | null |
2023-05-02 | On the Impact of Data Quality on Image Classification Fairness | Aki Barry et.al. | 2305.01595v1 | null |
2023-05-02 | On cubic-line arrangements with simple singularities | Przemysław Talar et.al. | 2305.01530v1 | null |
2023-05-02 | ARBEx: Attentive Feature Extraction with Reliability Balancing for Robust Facial Expression Learning | Azmine Toushik Wasi et.al. | 2305.01486v1 | link |
2023-05-02 | Scalable Mask Annotation for Video Text Spotting | Haibin He et.al. | 2305.01443v1 | link |
2023-05-02 | Unsupervised Feature Based Algorithms for Time Series Extrinsic Regression | David Guijo-Rubio et.al. | 2305.01429v1 | null |
2023-05-02 | Are demographically invariant models and representations in medical imaging fair? | Eike Petersen et.al. | 2305.01397v1 | null |
2023-05-02 | Self-supervised arbitrary scale super-resolution framework for anisotropic MRI | Haonan Zhang et.al. | 2305.01360v1 | null |
2023-05-01 | Behavioral Forensics in Social Networks: Identifying Misinformation, Disinformation and Refutation Spreaders Using Machine Learning | Euna Mehnaz Khan et.al. | 2305.00957v1 | null |
2023-05-01 | Probabilistic 3D segmentation for aleatoric uncertainty quantification in full 3D medical data | Christiaan G. A. Viviers et.al. | 2305.00950v1 | null |
2023-05-01 | StyleAvatar: Real-time Photo-realistic Portrait Avatar from a Single Video | Lizhen Wang et.al. | 2305.00942v1 | null |
2023-05-01 | Early Detection of Alzheimer's Disease using Bottleneck Transformers | Arunima Jaiswal et.al. | 2305.00923v1 | null |
2023-05-01 | A Novel Low-Rank Tensor Method for Undersampling Artifact Removal in Respiratory Motion-Resolved Multi-Echo 3D Cones MRI | Seongho Jeong et.al. | 2305.00892v1 | null |
2023-05-01 | Estimating the Density Ratio between Distributions with High Discrepancy using Multinomial Logistic Regression | Akash Srivastava et.al. | 2305.00869v1 | null |
2023-05-01 | Automated Paper Screening for Clinical Reviews Using Large Language Models | Eddie Guo et.al. | 2305.00844v1 | null |
2023-05-01 | LCAUnet: A skin lesion segmentation network with enhanced edge and body fusion | Qisen Ma et.al. | 2305.00837v1 | null |
2023-05-01 | Performance and Energy Consumption of Parallel Machine Learning Algorithms | Xidong Wu et.al. | 2305.00798v1 | null |
2023-05-01 | GeneFace++: Generalized and Stable Real-Time Audio-Driven 3D Talking Face Generation | Zhenhui Ye et.al. | 2305.00787v1 | null |
2023-04-28 | Empirical Analysis of the Strengths and Weaknesses of PEFT Techniques for LLMs | George Pu et.al. | 2304.14999v1 | null |
2023-04-28 | Topology of critical points in boundary matrix duals | Pavan Kumar Yerra et.al. | 2304.14988v1 | null |
2023-04-28 | Quality-Adaptive Split-Federated Learning for Segmenting Medical Images with Inaccurate Annotations | Zahra Hafezi Kafshgari et.al. | 2304.14976v1 | null |
2023-04-28 | Embodiment perception of a smart home assistant | Mariya Kilina et.al. | 2304.14947v1 | null |
2023-04-28 | The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation | Alessio Serra et.al. | 2304.14942v1 | null |
2023-04-28 | Contactless hand tremor amplitude measurement using smartphones: development and pilot evaluation | James Bungay et.al. | 2304.14937v1 | null |
2023-04-28 | Unified Noise-aware Network for Low-count PET Denoising | Huidong Xie et.al. | 2304.14900v1 | null |
2023-04-28 | Making the Invisible Visible: Toward High-Quality Terahertz Tomographic Imaging via Physics-Guided Restoration | Weng-Tai Su et.al. | 2304.14894v1 | null |
2023-04-28 | Dense Hybrid Proposal Modulation for Lane Detection | Yuejian Wu et.al. | 2304.14874v1 | link |
2023-04-28 | Evaluating the Stability of Semantic Concept Representations in CNNs for Robust Explainability | Georgii Mikriukov et.al. | 2304.14864v1 | null |
2023-04-27 | ChatVideo: A Tracklet-centric Multimodal and Versatile Video Understanding System | Junke Wang et.al. | 2304.14407v1 | null |
2023-04-27 | Putting People in Their Place: Affordance-Aware Human Insertion into Scenes | Sumith Kulal et.al. | 2304.14406v1 | link |
2023-04-27 | ViMQ: A Vietnamese Medical Question Dataset for Healthcare Dialogue System Development | Ta Duc Huy et.al. | 2304.14405v1 | link |
2023-04-27 | Motion-Conditioned Diffusion Model for Controllable Video Synthesis | Tsai-Shien Chen et.al. | 2304.14404v1 | null |
2023-04-27 | ActorsNeRF: Animatable Few-shot Human Rendering with Generalizable NeRFs | Jiteng Mu et.al. | 2304.14401v1 | null |
2023-04-27 | SeqTrack: Sequence to Sequence Learning for Visual Object Tracking | Xin Chen et.al. | 2304.14394v1 | link |
2023-04-27 | SLoMo: A General System for Legged Robot Motion Imitation from Casual Videos | John Z. Zhang et.al. | 2304.14389v1 | null |
2023-04-27 | Learning Neural Constitutive Laws From Motion Observations for Generalizable PDE Dynamics | Pingchuan Ma et.al. | 2304.14369v1 | null |
2023-04-27 | CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants | Albert Yu Sun et.al. | 2304.14364v1 | null |
2023-04-27 | Double-Deck Multi-Agent Pickup and Delivery: Multi-Robot Rearrangement in Large-Scale Warehouses | Baiyu Li et.al. | 2304.14309v1 | null |
2023-04-26 | A Control-Centric Benchmark for Video Prediction | Stephen Tian et.al. | 2304.13723v1 | link |
2023-04-26 | Association Rules Mining with Auto-Encoders | Théophile Berteloot et.al. | 2304.13717v1 | null |
2023-04-26 | Random Additive Polynomials | Lior Bary-Soroker et.al. | 2304.13709v1 | null |
2023-04-27 | Rigidity, Generators and Homology of Interval Exchange Groups | Owen Tanner et.al. | 2304.13691v2 | null |
2023-04-26 | Pseudo-periodic map and classification of theories with eight supercharges | Dan Xie et.al. | 2304.13663v1 | null |
2023-04-26 | PVP: Pre-trained Visual Parameter-Efficient Tuning | Zhao Song et.al. | 2304.13639v1 | null |
2023-04-26 | HausaNLP at SemEval-2023 Task 12: Leveraging African Low Resource TweetData for Sentiment Analysis | Saheed Abdullahi Salahudeen et.al. | 2304.13634v1 | link |
2023-04-26 | HDR-VDP-3: A multi-metric for predicting image differences, quality and contrast distortions in high dynamic range and regular content | Rafal K. Mantiuk et.al. | 2304.13625v1 | null |
2023-04-26 | Shades of meaning: Uncovering the geometry of ambiguous word representations through contextualised language models | Benedetta Cevoli et.al. | 2304.13597v1 | null |
2023-04-26 | Video Frame Interpolation with Densely Queried Bilateral Correlation | Chang Zhou et.al. | 2304.13596v1 | link |
2023-04-25 | Bake off redux: a review and experimental evaluation of recent time series classification algorithms | Matthew Middlehurst et.al. | 2304.13029v1 | null |
2023-04-25 | Flickr-PAD: New Face High-Resolution Presentation Attack Detection Database | Diego Pasmino et.al. | 2304.13015v1 | link |
2023-04-25 | Methods and datasets for segmentation of minimally invasive surgical instruments in endoscopic images and videos: A review of the state of the art | Tobias Rueckert et.al. | 2304.13014v1 | null |
2023-04-25 | The Potential of Visual ChatGPT For Remote Sensing | Lucas Prado Osco et.al. | 2304.13009v1 | null |
2023-04-25 | PoseVocab: Learning Joint-structured Pose Embeddings for Human Avatar Modeling | Zhe Li et.al. | 2304.13006v1 | link |
2023-04-25 | Segment anything, from space? | Simiao Ren et.al. | 2304.13000v1 | null |
2023-04-25 | Multi-Scale Feature Fusion using Parallel-Attention Block for COVID-19 Chest X-ray Diagnosis | Xiao Qi et.al. | 2304.12988v1 | null |
2023-04-25 | Quantifying the Effect of Image Similarity on Diabetic Foot Ulcer Classification | Imran Chowdhury Dipto et.al. | 2304.12987v1 | null |
2023-04-25 | How to account for behavioural states in step-selection analysis: a model comparison | Jennifer Pohle et.al. | 2304.12964v1 | null |
2023-04-25 | Latent diffusion models for generative precipitation nowcasting with accurate uncertainty quantification | Jussi Leinonen et.al. | 2304.12891v1 | null |
2023-04-24 | Total-Recon: Deformable Scene Reconstruction for Embodied View Synthesis | Chonghyuk Song et.al. | 2304.12317v1 | null |
2023-04-24 | Segment Anything in Medical Images | Jun Ma et.al. | 2304.12306v1 | link |
2023-04-24 | AssemblyHands: Towards Egocentric Activity Understanding via 3D Hand Pose Estimation | Takehiko Ohkawa et.al. | 2304.12301v1 | null |
2023-04-24 | Large-capacity and Flexible Video Steganography via Invertible Neural Network | Chong Mou et.al. | 2304.12300v1 | link |
2023-04-24 | On local delta invariant of del Pezzo surfaces | Erroxe Etxabarri Alberdi et.al. | 2304.12286v1 | null |
2023-04-24 | HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video | Jia-Wei Liu et.al. | 2304.12281v1 | null |
2023-04-24 | Dynamic generation and attribution of revenues in a video platform | Francisco Lopez-Navarrete et.al. | 2304.12268v1 | null |
2023-04-24 | Pre-Training Strategies Using Contrastive Learning and Playlist Information for Music Classification and Similarity | Pablo Alonso-Jiménez et.al. | 2304.12257v1 | null |
2023-04-24 | Ordinal time series analysis with the R package otsfeatures | Ángel López Oriona et.al. | 2304.12251v1 | null |
2023-04-24 | Classification of regular subalgebras of injective type III factors | Soham Chakraborty et.al. | 2304.12243v1 | null |
2023-04-21 | Implicit Neural Head Synthesis via Controllable Local Deformation Fields | Chuhan Chen et.al. | 2304.11113v1 | null |
2023-04-21 | A Convolutional Spiking Network for Gesture Recognition in Brain-Computer Interfaces | Yiming Ai et.al. | 2304.11106v1 | null |
2023-04-21 | Classification of solutions to Hardy-Sobolev Doubly Critical Systems | Francesco Esposito et.al. | 2304.11066v1 | null |
2023-04-24 | CLaMP: Contrastive Language-Music Pre-training for Cross-Modal Symbolic Music Information Retrieval | Shangda Wu et.al. | 2304.11029v2 | link |
2023-04-21 | 3d mirror symmetry of braided tensor categories | Andrew Ballin et.al. | 2304.11001v1 | null |
2023-04-21 | Information Extraction from Documents: Question Answering vs Token Classification in real-world setups | Laurent Lam et.al. | 2304.10994v1 | null |
2023-04-21 | LEIA: Linguistic Embeddings for the Identification of Affect | Segun Taofeek Aroyehun et.al. | 2304.10973v1 | null |
2023-04-21 | Factored Neural Representation for Scene Understanding | Yu-Shiang Wong et.al. | 2304.10950v1 | null |
2023-04-21 | A combined approach to analyze and classify families of classical spin liquids | Naïmo Davier et.al. | 2304.10906v1 | null |
2023-04-21 | AMP in the wild: Learning robust, agile, natural legged locomotion skills | Yikai Wang et.al. | 2304.10888v1 | null |
2023-04-20 | Learning Sparse and Low-Rank Priors for Image Recovery via Iterative Reweighted Least Squares Minimization | Stamatios Lefkimmiatis et.al. | 2304.10536v1 | null |
2023-04-20 | Farm3D: Learning Articulated 3D Animals by Distilling 2D Diffusion | Tomas Jakab et.al. | 2304.10535v1 | null |
2023-04-20 | Multidimensional Uncertainty Quantification for Deep Neural Networks | Xujiang Zhao et.al. | 2304.10527v1 | null |
2023-04-20 | Contrastive Tuning: A Little Help to Make Masked Autoencoders Forget | Johannes Lehner et.al. | 2304.10520v1 | link |
2023-04-20 | "Can We Detect Substance Use Disorder?": Knowledge and Time Aware Classification on Social Media from Darkweb | Usha Lokala et.al. | 2304.10512v1 | null |
2023-04-20 | Reconstructing Signing Avatars From Video Using Linguistic Priors | Maria-Paola Forte et.al. | 2304.10482v1 | null |
2023-04-20 | Implicit Temporal Modeling with Learnable Alignment for Video Recognition | Shuyuan Tu et.al. | 2304.10465v1 | link |
2023-04-20 | Angle based dynamic learning rate for gradient descent | Neel Mishra et.al. | 2304.10457v1 | link |
2023-04-20 | On the classification of singular cubic threefolds | Sasha Viktorova et.al. | 2304.10452v1 | null |
2023-04-20 | Domain-specific Continued Pretraining of Language Models for Capturing Long Context in Mental Health | Shaoxiong Ji et.al. | 2304.10447v1 | null |
2023-04-19 | Transformer-Based Visual Segmentation: A Survey | Xiangtai Li et.al. | 2304.09854v1 | link |
2023-04-19 | AMT: All-Pairs Multi-Field Transforms for Efficient Frame Interpolation | Zhen Li et.al. | 2304.09790v1 | link |
2023-04-19 | Automatic Interaction and Activity Recognition from Videos of Human Manual Demonstrations with Application to Anomaly Detection | Elena Merlo et.al. | 2304.09789v1 | null |
2023-04-19 | Advances on Concept Drift Detection in Regression Tasks using Social Networks Theory | Jean Paul Barddal et.al. | 2304.09788v1 | null |
2023-04-20 | Application of attention-based Siamese composite neural network in medical image recognition | Zihao Huang et.al. | 2304.09783v2 | null |
2023-04-19 | Equalised Odds is not Equal Individual Odds: Post-processing for Group and Individual Fairness | Edward A. Small et.al. | 2304.09779v1 | null |
2023-04-19 | An End-to-End Vehicle Trajcetory Prediction Framework | Fuad Hasan et.al. | 2304.09764v1 | null |
2023-04-19 | Rehabilitation Exercise Repetition Segmentation and Counting using Skeletal Body Joints | Ali Abedi et.al. | 2304.09735v1 | link |
2023-04-19 | Hyperspectral Image Analysis with Subspace Learning-based One-Class Classification | Sertac Kilickaya et.al. | 2304.09730v1 | null |
2023-04-20 | Any-to-Any Style Transfer: Making Picasso and Da Vinci Collaborate | Songhua Liu et.al. | 2304.09728v2 | link |
2023-04-18 | Hyperbolic Image-Text Representations | Karan Desai et.al. | 2304.09172v1 | null |
2023-04-18 | Optimal PAC Bounds Without Uniform Convergence | Ishaq Aden-Ali et.al. | 2304.09167v1 | null |
2023-04-18 | Structure Preserving Cycle-GAN for Unsupervised Medical Image Domain Adaptation | Paolo Iacono et.al. | 2304.09164v1 | null |
2023-04-18 | Detection and Classification of Glioblastoma Brain Tumor | Utkarsh Maurya et.al. | 2304.09133v1 | null |
2023-04-18 | Variational Relational Point Completion Network for Robust 3D Classification | Liang Pan et.al. | 2304.09131v1 | null |
2023-04-18 | Safety Guaranteed Manipulation Based on Reinforcement Learning Planner and Model Predictive Control Actor | Zhenshan Bing et.al. | 2304.09119v1 | null |
2023-04-18 | Sliced Optimal Transport on the Sphere | Michael Quellmalz et.al. | 2304.09092v1 | null |
2023-04-18 | A general review on the NLS equation with point-concentrated nonlinearity | Lorenzo Tentarelli et.al. | 2304.09086v1 | null |
2023-04-18 | Performance of GAN-based augmentation for deep learning COVID-19 image classification | Oleksandr Fedoruk et.al. | 2304.09067v1 | null |
2023-04-18 | KSB stability is automatic in codimension 3 | János Kollár et.al. | 2304.09009v1 | null |
2023-04-17 | Conditional Generation of Audio from Video via Foley Analogies | Yuexi Du et.al. | 2304.08490v1 | link |
2023-04-17 | Affordances from Human Videos as a Versatile Representation for Robotics | Shikhar Bahl et.al. | 2304.08488v1 | null |
2023-04-17 | Text2Performer: Text-Driven Human Video Generation | Yuming Jiang et.al. | 2304.08483v1 | link |
2023-04-18 | Latent-Shift: Latent Diffusion with Temporal Shift for Efficient Text-to-Video Generation | Jie An et.al. | 2304.08477v2 | null |
2023-04-17 | Synthetic Data from Diffusion Models Improves ImageNet Classification | Shekoofeh Azizi et.al. | 2304.08466v1 | null |
2023-04-17 | Efficient Video Action Detection with Token Dropout and Context Refinement | Lei Chen et.al. | 2304.08451v1 | null |
2023-04-17 | Morph-SSL: Self-Supervision with Longitudinal Morphing to Predict AMD Progression from OCT | Arunava Chakravarty et.al. | 2304.08439v1 | null |
2023-04-17 | CAViaR: Context Aware Video Recommendations | Khushhall Chandra Mahajan et.al. | 2304.08435v1 | null |
2023-04-17 | Tame symmetric algebras of period four | Karin Erdmann et.al. | 2304.08414v1 | null |
2023-04-17 | OVTrack: Open-Vocabulary Multiple Object Tracking | Siyuan Li et.al. | 2304.08408v1 | null |
2023-04-14 | CAD-RADS scoring of coronary CT angiography with Multi-Axis Vision Transformer: a clinically-inspired deep learning pipeline | Alessia Gerbasi et.al. | 2304.07277v1 | null |
2023-04-14 | Genus Comparisons in the Topological Analysis of RNA Structures | Nicolò Cangiotti et.al. | 2304.07273v1 | null |
2023-04-14 | Phantom Embeddings: Using Embedding Space for Model Regularization in Deep Neural Networks | Mofassir ul Islam Arif et.al. | 2304.07262v1 | null |
2023-04-14 | The University of California San Francisco, Brain Metastases Stereotactic Radiosurgery (UCSF-BMSR) MRI Dataset | Jeffrey D. Rudie et.al. | 2304.07248v1 | null |
2023-04-14 | Covidia: COVID-19 Interdisciplinary Academic Knowledge Graph | Cheng Deng et.al. | 2304.07242v1 | null |
2023-04-14 | PARFormer: Transformer-based Multi-Task Network for Pedestrian Attribute Recognition | Xinwen Fan et.al. | 2304.07230v1 | link |
2023-04-14 | Instance-aware Dynamic Prompt Tuning for Pre-trained Point Cloud Models | Yaohua Zha et.al. | 2304.07221v1 | link |
2023-04-14 | Emergency Resource Layout with Multiple Objectives under Complex Disaster Scenarios | Changwei Yuan et.al. | 2304.07216v1 | null |
2023-04-14 | Radio Galaxy Zoo EMU: Towards a Semantic Radio Galaxy Morphology Taxonomy | Micah Bowles et.al. | 2304.07171v1 | null |
2023-04-14 | Robust thalamic nuclei segmentation from T1-weighted MRI | Julie P. Vidal et.al. | 2304.07167v1 | null |
2023-04-13 | Representing Volumetric Videos as Dynamic MLP Maps | Sida Peng et.al. | 2304.06717v1 | null |
2023-04-13 | What does CLIP know about a red circle? Visual prompt engineering for VLMs | Aleksandar Shtedritski et.al. | 2304.06712v1 | null |
2023-04-13 | DiffusionRig: Learning Personalized Priors for Facial Appearance Editing | Zheng Ding et.al. | 2304.06711v1 | null |
2023-04-13 | Remote Sensing Change Detection With Transformers Trained from Scratch | Mubashir Noman et.al. | 2304.06710v1 | link |
2023-04-13 | Verbs in Action: Improving verb understanding in video-language models | Liliane Momeni et.al. | 2304.06708v1 | null |
2023-04-13 | How Will It Drape Like? Capturing Fabric Mechanics from Depth Images | Carlos Rodriguez-Pardo et.al. | 2304.06704v1 | null |
2023-04-13 | Learning Controllable 3D Diffusion Models from Single-view Images | Jiatao Gu et.al. | 2304.06700v1 | null |
2023-04-13 | Improving novelty detection with generative adversarial networks on hand gesture data | Miguel Simão et.al. | 2304.06696v1 | null |
2023-04-13 | LSFSL: Leveraging Shape Information in Few-shot Learning | Deepan Chakravarthi Padmanabhan et.al. | 2304.06672v1 | null |
2023-04-13 | Do deep neural networks have an inbuilt Occam's razor? | Chris Mingard et.al. | 2304.06670v1 | null |
2023-04-12 | Fluctuation based interpretable analysis scheme for quantum many-body snapshots | Henning Schlömer et.al. | 2304.06029v1 | null |
2023-04-12 | RECLIP: Resource-efficient CLIP by Training with Small Images | Runze Li et.al. | 2304.06028v1 | null |
2023-04-12 | Continual Diffusion: Continual Customization of Text-to-Image Diffusion with C-LoRA | James Seale Smith et.al. | 2304.06027v1 | null |
2023-04-12 | DreamPose: Fashion Image-to-Video Synthesis via Stable Diffusion | Johanna Karras et.al. | 2304.06025v1 | null |
2023-04-12 | VidStyleODE: Disentangled Video Editing via StyleGAN and NeuralODEs | Moayed Haji Ali et.al. | 2304.06020v1 | null |
2023-04-12 | Generating Aligned Pseudo-Supervision from Non-Aligned Data for Image Restoration in Under-Display Camera | Ruicheng Feng et.al. | 2304.06019v1 | link |
2023-04-12 | Adaptive Human Matting for Dynamic Videos | Chung-Ching Lin et.al. | 2304.06018v1 | null |
2023-04-12 | Exploiting Logic Locking for a Neural Trojan Attack on Machine Learning Accelerators | Hongye Xu et.al. | 2304.06017v1 | null |
2023-04-12 | An Improved Heart Disease Prediction Using Stacked Ensemble Method | Md. Maidul Islam et.al. | 2304.06015v1 | null |
2023-04-12 | Rigidly-rotating scalar fields: between real divergence and imaginary fractalization | Victor E. Ambruş et.al. | 2304.05998v1 | null |
2023-04-11 | Discrete pre-Tannakian categories | Nate Harman et.al. | 2304.05375v1 | null |
2023-04-11 | A comparative study between paired and unpaired Image Quality Assessment in Low-Dose CT Denoising | Francesco Di Feola et.al. | 2304.05359v1 | link |
2023-04-11 | On Elliott's conjecture and applications | Oleksiy Klurman et.al. | 2304.05344v1 | null |
2023-04-11 | Bayesian Optimization of Catalysts With In-context Learning | Mayk Caldas Ramos et.al. | 2304.05341v1 | link |
2023-04-11 | Unified Multi-Modal Image Synthesis for Missing Modality Imputation | Yue Zhang et.al. | 2304.05340v1 | null |
2023-04-11 | Deep-learning assisted detection and quantification of (oo)cysts of Giardia and Cryptosporidium on smartphone microscopy images | Suprim Nakarmi et.al. | 2304.05339v1 | null |
2023-04-11 | Neural Delay Differential Equations: System Reconstruction and Image Classification | Qunxi Zhu et.al. | 2304.05310v1 | null |
2023-04-11 | A Comprehensive Study on Object Detection Techniques in Unconstrained Environments | Hrishitva Patel et.al. | 2304.05295v1 | null |
2023-04-11 | MC-ViViT: Multi-branch Classifier-ViViT to Detect Mild Cognitive Impairment in Older Adults using Facial Videos | Jian Sun et.al. | 2304.05292v1 | null |
2023-04-11 | YouNICon: YouTube's CommuNIty of Conspiracy Videos | Shaoyi Liaw et.al. | 2304.05274v1 | null |
2023-04-10 | Detection Transformer with Stable Matching | Shilong Liu et.al. | 2304.04742v1 | link |
2023-04-10 | Brain Extraction comparing Segment Anything Model (SAM) and FSL Brain Extraction Tool | Sovesh Mohapatra et.al. | 2304.04738v1 | null |
2023-04-10 | Artifact magnification on deepfake videos increases human detection and subjective confidence | Emilie Josephs et.al. | 2304.04733v1 | null |
2023-04-10 | Prompt Pre-Training with Twenty-Thousand Classes for Open-Vocabulary Visual Recognition | Shuhuai Ren et.al. | 2304.04704v1 | link |
2023-04-10 | Video-kMaX: A Simple Unified Approach for Online and Near-Online Video Panoptic Segmentation | Inkyu Shin et.al. | 2304.04694v1 | null |
2023-04-11 | Interaction-Aware Prompting for Zero-Shot Spatio-Temporal Action Detection | Wei-Jhe Huang et.al. | 2304.04688v2 | null |
2023-04-10 | Learning to Detect Touches on Cluttered Tables | Norberto Adrian Goussies et.al. | 2304.04687v1 | null |
2023-04-10 | ECG-CL: A Comprehensive Electrocardiogram Interpretation Method Based on Continual Learning | Hongxiang Gao et.al. | 2304.04646v1 | null |
2023-04-10 | Improving ABR Performance for Short Video Streaming Using Multi-Agent Reinforcement Learning with Expert Guidance | Yueheng Li et.al. | 2304.04637v1 | null |
2023-04-10 | VARS: Video Assistant Referee System for Automated Soccer Decision Making from Multiple Views | Jan Held et.al. | 2304.04617v1 | null |
2023-04-07 | SparseFormer: Sparse Visual Recognition via Limited Latent Tokens | Ziteng Gao et.al. | 2304.03768v1 | null |
2023-04-07 | Zero-shot CT Field-of-view Completion with Unconditional Generative Diffusion Prior | Kaiwen Xu et.al. | 2304.03760v1 | null |
2023-04-07 | An unsupervised segmentation of vocal breath sounds | Shivani Yadav et.al. | 2304.03758v1 | null |
2023-04-07 | Language Models are Causal Knowledge Extractors for Zero-shot Video Question Answering | Hung-Ting Su et.al. | 2304.03754v1 | null |
2023-04-07 | Three-dimensional morphology of an ultrafine Al-Si eutectic produced via laser rapid solidification | Xinyi Zhou et.al. | 2304.03740v1 | null |
2023-04-07 | Integrating Edge-AI in Structural Health Monitoring domain | Anoop Mishra et.al. | 2304.03718v1 | null |
2023-04-07 | Meta-causal Learning for Single Domain Generalization | Jin Chen et.al. | 2304.03709v1 | null |
2023-04-07 | Efficient automatic segmentation for multi-level pulmonary arteries: The PARSE challenge | Gongning Luo et.al. | 2304.03708v1 | null |
2023-04-07 | Idaho Blacks: Quiet Economic Triumph of Enduring Champions | Rama K. Malladi et.al. | 2304.03676v1 | null |
2023-04-07 | An Accessible Toolkit for 360 VR Studies | Corrie Green et.al. | 2304.03652v1 | null |
2023-04-06 | SegGPT: Segmenting Everything In Context | Xinlong Wang et.al. | 2304.03284v1 | link |
2023-04-06 | Diffusion Models as Masked Autoencoders | Chen Wei et.al. | 2304.03283v1 | null |
2023-04-06 | That's What I Said: Fully-Controllable Talking Face Generation | Youngjoon Jang et.al. | 2304.03275v1 | null |
2023-04-06 | ImageEye: Batch Image Processing Using Program Synthesis | Celeste Barnaby et.al. | 2304.03253v1 | null |
2023-04-06 | Implicit Anatomical Rendering for Medical Image Segmentation with Stochastic Experts | Chenyu You et.al. | 2304.03209v1 | null |
2023-04-06 | Face Animation with an Attribute-Guided Diffusion Model | Bohan Zeng et.al. | 2304.03199v1 | link |
2023-04-06 | RFAConv: Innovating Spatital Attention and Standard Convolutional Operation | Xin Zhang et.al. | 2304.03198v1 | link |
2023-04-06 | Micron-BERT: BERT-based Facial Micro-Expression Recognition | Xuan-Bac Nguyen et.al. | 2304.03195v1 | link |
2023-04-06 | Improving automatic endoscopic stone recognition using a multi-view fusion approach enhanced with two-step transfer learning | Francisco Lopez-Tiro et.al. | 2304.03193v1 | null |
2023-04-06 | The Concept of Forward-Forward Learning Applied to a Multi Output Perceptron | K. Fredrik Karlsson et.al. | 2304.03189v1 | link |
2023-04-05 | Self-Distillation for Gaussian Process Regression and Classification | Kenneth Borup et.al. | 2304.02641v1 | link |
2023-04-05 | HNeRV: A Hybrid Neural Representation for Videos | Hao Chen et.al. | 2304.02633v1 | link |
2023-04-05 | High-fidelity Pseudo-labels for Boosting Weakly-Supervised Segmentation | Arvi Jonnarth et.al. | 2304.02621v1 | null |
2023-04-05 | A note on the classification of positive solutions to the critical p-Laplace equation in |
Jérôme Vétois et.al. | 2304.02600v1 | null |
2023-04-05 | ECG Feature Importance Rankings: Cardiologists vs. Algorithms | Temesgen Mehari et.al. | 2304.02577v1 | null |
2023-04-05 | Deep learning estimation of modified Zernike coefficients for image point spread functions | Abu Bucker Siddik et.al. | 2304.02576v1 | null |
2023-04-05 | VicTR: Video-conditioned Text Representations for Activity Recognition | Kumara Kahatapitiya et.al. | 2304.02560v1 | null |
2023-04-05 | Detecting and Grounding Multi-Modal Media Manipulation | Rui Shao et.al. | 2304.02556v1 | link |
2023-04-05 | Self-Supervised Siamese Autoencoders | Friederike Baier et.al. | 2304.02549v1 | null |
2023-04-05 | Multi-annotator Deep Learning: A Probabilistic Framework for Classification | Marek Herde et.al. | 2304.02539v1 | link |
2023-04-04 | NPC: Neural Point Characters from Video | Shih-Yang Su et.al. | 2304.02013v1 | null |
2023-04-04 | EGC: Image Generation and Classification via a Single Energy-Based Model | Qiushan Guo et.al. | 2304.02012v1 | link |
2023-04-04 | FakET: Simulating Cryo-Electron Tomograms with Neural Style Transfer | Pavol Harar et.al. | 2304.02011v1 | link |
2023-04-04 | MonoHuman: Animatable Human Neural Field from Monocular Video | Zhengming Yu et.al. | 2304.02001v1 | null |
2023-04-05 | Waving Goodbye to Low-Res: A Diffusion-Wavelet Approach for Image Super-Resolution | Brian Moser et.al. | 2304.01994v2 | link |
2023-04-04 | Cross-modulated Few-shot Image Generation for Colorectal Tissue Classification | Amandeep Kumar et.al. | 2304.01992v1 | link |
2023-04-04 | Side Channel-Assisted Inference Leakage from Machine Learning-based ECG Classification | Jialin Liu et.al. | 2304.01990v1 | null |
2023-04-04 | Kinetic relaxation and Bose-star formation in multicomponent dark matter- I | Mudit Jain et.al. | 2304.01985v1 | null |
2023-04-04 | Reactive Multi-agent Coordination using Auction-based Task Allocation and Behavior Trees | Niklas Dahlquist et.al. | 2304.01976v1 | null |
2023-04-04 | MEGClass: Text Classification with Extremely Weak Supervision via Mutually-Enhancing Text Granularities | Priyanka Kargupta et.al. | 2304.01969v1 | link |
2023-04-03 | Neural Volumetric Memory for Visual Locomotion Control | Ruihan Yang et.al. | 2304.01201v1 | null |
2023-04-03 | Video Instance Segmentation in an Open-World | Omkar Thawakar et.al. | 2304.01200v1 | link |
2023-04-03 | Zero-Shot Semantic Segmentation with Decoupled One-Pass Network | Cong Han et.al. | 2304.01198v1 | null |
2023-04-03 | Bringing Telepresence to Every Desk | Shengze Wang et.al. | 2304.01197v1 | null |
2023-04-03 | Follow Your Pose: Pose-Guided Text-to-Video Generation using Pose-Free Videos | Yue Ma et.al. | 2304.01186v1 | link |
2023-04-03 | WeakTr: Exploring Plain Vision Transformer for Weakly-supervised Semantic Segmentation | Lianghui Zhu et.al. | 2304.01184v1 | link |
2023-04-03 | Hate Speech Targets Detection in Parler using BERT | Nadav Schneider et.al. | 2304.01179v1 | link |
2023-04-03 | DribbleBot: Dynamic Legged Manipulation in the Wild | Yandong Ji et.al. | 2304.01159v1 | null |
2023-04-03 | Algebraic and Geometric Models for Space Networking | William Bernardoni et.al. | 2304.01150v1 | null |
2023-04-03 | Use Your Head: Improving Long-Tail Video Recognition | Toby Perrett et.al. | 2304.01143v1 | null |
2023-03-31 | Where are we in the search for an Artificial Visual Cortex for Embodied Intelligence? | Arjun Majumdar et.al. | 2303.18240v1 | null |
2023-03-31 | DIME-FM: DIstilling Multimodal and Efficient Foundation Models | Ximeng Sun et.al. | 2303.18232v1 | null |
2023-03-31 | Procedure-Aware Pretraining for Instructional Video Understanding | Honglu Zhou et.al. | 2303.18230v1 | link |
2023-03-31 | A Closer Look at Few-Shot 3D Point Cloud Classification | Chuangguan Ye et.al. | 2303.18210v1 | null |
2023-03-31 | SimTS: Rethinking Contrastive Representation Learning for Time Series Forecasting | Xiaochen Zheng et.al. | 2303.18205v1 | link |
2023-03-31 | Towards a Classification of Charge-3 Monopoles with Symmetry | H. W. Braden et.al. | 2303.18189v1 | null |
2023-04-03 | Three-dimensional coherent diffraction snapshot imaging using extreme ultraviolet radiation from a free electron laser | Danny Fainozzi et.al. | 2303.18166v2 | null |
2023-03-31 | Towards Nonlinear-Motion-Aware and Occlusion-Robust Rolling Shutter Correction | Delin Qu et.al. | 2303.18125v1 | null |
2023-03-31 | BERTino: an Italian DistilBERT model | Matteo Muffo et.al. | 2303.18121v1 | link |
2023-03-31 | A two-head loss function for deep Average-K classification | Camille Garcin et.al. | 2303.18118v1 | null |
2023-03-30 | Zero-Shot Video Editing Using Off-The-Shelf Image Diffusion Models | Wen Wang et.al. | 2303.17599v1 | link |
2023-03-30 | Consistent View Synthesis with Pose-Guided Diffusion Models | Hung-Yu Tseng et.al. | 2303.17598v1 | null |
2023-03-30 | MobileInst: Video Instance Segmentation on the Mobile | Renhong Zhang et.al. | 2303.17594v1 | null |
2023-03-30 | Anatomically aware dual-hop learning for pulmonary embolism detection in CT pulmonary angiograms | Florin Condrea et.al. | 2303.17593v1 | null |
2023-03-30 | SoftCLIP: Softer Cross-modal Alignment Makes CLIP Stronger | Yuting Gao et.al. | 2303.17561v1 | null |
2023-03-30 | DAE-Talker: High Fidelity Speech-Driven Talking Face Generation with Diffusion Autoencoder | Chenpng Du et.al. | 2303.17550v1 | null |
2023-03-30 | Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment | Kim Sung-Bin et.al. | 2303.17490v1 | null |
2023-03-30 | PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Qitao Zhao et.al. | 2303.17472v1 | link |
2023-03-30 | NN-Copula-CD: A Copula-Guided Interpretable Neural Network for Change Detection in Heterogeneous Remote Sensing Images | Weiming Li et.al. | 2303.17448v1 | null |
2023-03-30 | Steered Mixture of Experts Regression for Image Denoising with Multi-Model-Inference | Aytac Özkan et.al. | 2303.17409v1 | null |
2023-03-29 | Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos | Kun Su et.al. | 2303.16897v1 | null |
2023-03-29 | Towards Understanding the Effect of Pretraining Label Granularity | Guan Zhe Hong et.al. | 2303.16887v1 | null |
2023-03-29 | End-to-End |
Yuhang Jiang et.al. | 2303.16886v1 | link |
2023-03-29 | CheckerPose: Progressive Dense Keypoint Localization for Object Pose Estimation with Graph Neural Network | Ruyi Lian et.al. | 2303.16874v1 | null |
2023-03-29 | A Video-based End-to-end Pipeline for Non-nutritive Sucking Action Recognition and Segmentation in Young Infants | Shaotong Zhu et.al. | 2303.16867v1 | link |
2023-03-29 | The puzzle of the formation of T8 dwarf Ross 458c | Josefine Gaarn et.al. | 2303.16863v1 | null |
2023-03-29 | Beyond Empirical Risk Minimization: Local Structure Preserving Regularization for Improving Adversarial Robustness | Wei Wei et.al. | 2303.16861v1 | null |
2023-03-29 | Robust Dancer: Long-term 3D Dance Synthesis Using Unpaired Data | Bin Feng et.al. | 2303.16856v1 | link |
2023-03-30 | MaMMUT: A Simple Architecture for Joint Learning for MultiModal Tasks | Weicheng Kuo et.al. | 2303.16839v2 | null |
2023-03-29 | bAIoimage analysis: elevating the rate of scientific discovery -- as a community | Damian Edward Dalle Nogare et.al. | 2303.16743v1 | null |
2023-03-29 | Your Diffusion Model is Secretly a Zero-Shot Classifier | Alexander C. Li et.al. | 2303.16203v2 | link |
2023-03-28 | Natural Selection Favors AIs over Humans | Dan Hendrycks et.al. | 2303.16200v1 | null |
2023-03-28 | Forecasting localized weather impacts on vegetation as seen from space with meteo-guided video prediction | Vitus Benson et.al. | 2303.16198v1 | link |
2023-03-28 | Automorphisms of del Pezzo surfaces in odd characteristic | Igor Dolgachev et.al. | 2303.16170v1 | null |
2023-03-28 | Comparison of HDR quality metrics in Per-Clip Lagrangian multiplier optimisation with AV1 | Vibhoothi et.al. | 2303.16163v1 | link |
2023-03-28 | A Comparative Study of Federated Learning Models for COVID-19 Detection | Erfan Darzidehkalani et.al. | 2303.16141v1 | null |
2023-03-28 | XRBcats: Galactic High Mass X-ray Binary Catalogue | Marvin Neumann et.al. | 2303.16137v1 | null |
2023-03-28 | Transformer and Snowball Graph Convolution Learning for Biomedical Graph Classification | Jinlong Hu et.al. | 2303.16132v1 | null |
2023-03-28 | Evaluating the Effectiveness of 2D and 3D Features for Predicting Tumor Response to Chemotherapy | Neman Abdoli et.al. | 2303.16123v1 | null |
2023-03-28 | CycleACR: Cycle Modeling of Actor-Context Relations for Video Action Detection | Lei Chen et.al. | 2303.16118v1 | null |
2023-03-27 | GeoNet: Benchmarking Unsupervised Adaptation across Geographies | Tarun Kalluri et.al. | 2303.15443v1 | null |
2023-03-27 | Zero-shot Model Diagnosis | Jinqi Luo et.al. | 2303.15441v1 | null |
2023-03-27 | TextMI: Textualize Multimodal Information for Integrating Non-verbal Cues in Pre-trained Language Models | Md Kamrul Hasan et.al. | 2303.15430v1 | null |
2023-03-27 | JAWS: Just A Wild Shot for Cinematic Transfer in Neural Radiance Fields | Xi Wang et.al. | 2303.15427v1 | null |
2023-03-27 | ACAT: Adversarial Counterfactual Attention for Classification and Detection in Medical Imaging | Alessandro Fontanella et.al. | 2303.15421v1 | null |
2023-03-27 | 3D Video Object Detection with Learnable Object-Centric Global Optimization | Jiawei He et.al. | 2303.15416v1 | link |
2023-03-27 | Classifier Robustness Enhancement Via Test-Time Transformation | Tsachi Blau et.al. | 2303.15409v1 | null |
2023-03-27 | Generalizable Neural Voxels for Fast Human Radiance Fields | Taoran Yi et.al. | 2303.15387v1 | null |
2023-03-27 | List Online Classification | Shay Moran et.al. | 2303.15383v1 | null |
2023-03-27 | Quantum-inspired classification based on quantum state discrimination | Emmanuel Zambrini Cruzeiro et.al. | 2303.15353v1 | null |
2023-03-24 | FastViT: A Fast Hybrid Vision Transformer using Structural Reparameterization | Pavan Kumar Anasosalu Vasu et.al. | 2303.14189v1 | null |
2023-03-24 | BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Bowen Wen et.al. | 2303.14158v1 | null |
2023-03-24 | Adversarial Attack and Defense for Medical Image Analysis: Methods and Applications | Junhao Dong et.al. | 2303.14133v1 | null |
2023-03-24 | CIFAKE: Image Classification and Explainable Identification of AI-Generated Synthetic Images | Jordan J. Bird et.al. | 2303.14126v1 | null |
2023-03-24 | Towards Scalable Neural Representation for Diverse Videos | Bo He et.al. | 2303.14124v1 | null |
2023-03-24 | Prediction of the morphological evolution of a splashing drop using an encoder-decoder | Jingzu Yee et.al. | 2303.14109v1 | null |
2023-03-24 | Parallel and totally umbilical hypersurfaces of the four-dimensional Thurston geometry |
Marie D'haene et.al. | 2303.14105v1 | null |
2023-03-24 | PanoVPR: Towards Unified Perspective-to-Equirectangular Visual Place Recognition via Sliding Windows across the Panoramic View | Ze Shi et.al. | 2303.14095v1 | link |
2023-03-24 | CoLa-Diff: Conditional Latent Diffusion Model for Multi-Modal MRI Synthesis | Lan Jiang et.al. | 2303.14081v1 | null |
2023-03-27 | The ImSPOC snapshot imaging spectrometer: image formation model and device characterization | Daniele Picone et.al. | 2303.14076v2 | link |
2023-03-23 | Learning and Verification of Task Structure in Instructional Videos | Medhini Narasimhan et.al. | 2303.13519v1 | null |
2023-03-23 | DreamBooth3D: Subject-Driven Text-to-3D Generation | Amit Raj et.al. | 2303.13508v1 | null |
2023-03-23 | A Large-scale Study of Spatiotemporal Representation Learning with a New Benchmark on Action Recognition | Andong Deng et.al. | 2303.13505v1 | link |
2023-03-23 | ReBotNet: Fast Real-time Video Enhancement | Jeya Maria Jose Valanarasu et.al. | 2303.13504v1 | null |
2023-03-23 | TriPlaneNet: An Encoder for EG3D Inversion | Ananta R. Bhattarai et.al. | 2303.13497v1 | null |
2023-03-23 | The effectiveness of MAE pre-pretraining for billion-scale pretraining | Mannat Singh et.al. | 2303.13496v1 | null |
2023-03-23 | The strength of a simplex is the key to a continuous isometry classification of Euclidean clouds of unlabelled points | Vitaliy Kurlin et.al. | 2303.13486v1 | null |
2023-03-23 | TactoFind: A Tactile Only System for Object Retrieval | Sameer Pai et.al. | 2303.13482v1 | null |
2023-03-23 | Plotting Behind the Scenes: Towards Learnable Game Engines | Willi Menapace et.al. | 2303.13472v1 | null |
2023-03-23 | Egocentric Audio-Visual Object Localization | Chao Huang et.al. | 2303.13471v1 | link |
2023-03-22 | CiCo: Domain-Aware Sign Language Retrieval via Cross-Lingual Contrastive Learning | Yiting Cheng et.al. | 2303.12793v1 | link |
2023-03-22 | SHERF: Generalizable Human NeRF from a Single Image | Shoukang Hu et.al. | 2303.12791v1 | link |
2023-03-22 | Tube-Link: A Flexible Cross Tube Baseline for Universal Video Segmentation | Xiangtai Li et.al. | 2303.12782v1 | link |
2023-03-22 | Active particles with delayed attractions form quaking crystallites | Pin-Chuan Chen et.al. | 2303.12780v1 | null |
2023-03-22 | LSTM-based Video Quality Prediction Accounting for Temporal Distortions in Videoconferencing Calls | Gabriel Mittag et.al. | 2303.12761v1 | link |
2023-03-22 | Uncertainty Aware Active Learning for Reconfiguration of Pre-trained Deep Object-Detection Networks for New Target Domains | Jiaming Na et.al. | 2303.12760v1 | null |
2023-03-22 | Multiscale Relevance of Natural Images | Samy Lakhal et.al. | 2303.12717v1 | null |
2023-03-22 | Automatic Identification of Crystal Structures and Interfaces via Artificial-Intelligence-based Electron Microscopy | Andreas Leitherer et.al. | 2303.12702v1 | null |
2023-03-22 | Pix2Video: Video Editing using Image Diffusion | Duygu Ceylan et.al. | 2303.12688v1 | null |
2023-03-22 | The miniJPAS survey quasar selection III: Classification with artificial neural networks and hybridisation | G. Martínez-Solaeche et.al. | 2303.12684v1 | null |
2023-03-21 | Natural Language-Assisted Sign Language Recognition | Ronglai Zuo et.al. | 2303.12080v1 | link |
2023-03-21 | OmniTracker: Unifying Object Tracking by Tracking-with-Detection | Junke Wang et.al. | 2303.12079v1 | null |
2023-03-21 | Two-shot Video Object Segmentation | Kun Yan et.al. | 2303.12078v1 | null |
2023-03-21 | Dexterity from Touch: Self-Supervised Pre-Training of Tactile Representations with Robotic Play | Irmak Guzey et.al. | 2303.12076v1 | null |
2023-03-21 | 3D Mitochondria Instance Segmentation with Spatio-Temporal Transformers | Omkar Thawakar et.al. | 2303.12073v1 | null |
2023-03-21 | Machine Learning for Brain Disorders: Transformers and Visual Transformers | Robin Courant et.al. | 2303.12068v1 | null |
2023-03-21 | VideoXum: Cross-modal Visual and Textural Summarization of Videos | Jingyang Lin et.al. | 2303.12060v1 | null |
2023-03-21 | Motion Matters: Neural Motion Transfer for Better Camera Physiological Sensing | Akshay Paruchuri et.al. | 2303.12059v1 | null |
2023-03-21 | Influencer Backdoor Attack on Semantic Segmentation | Haoheng Lan et.al. | 2303.12054v1 | null |
2023-03-21 | Calibration of the convective parameters in stellar pulsation hydrocodes | Gábor B. Kovács et.al. | 2303.12049v1 | null |
2023-03-20 | Legs as Manipulator: Pushing Quadrupedal Agility Beyond Locomotion | Xuxin Cheng et.al. | 2303.11330v1 | null |
2023-03-20 | Over-the-Air Federated Edge Learning with Error-Feedback One-Bit Quantization and Power Control | Yuding Liu et.al. | 2303.11319v1 | null |
2023-03-20 | Learning Audio-Visual Source Localization via False Negative Aware Contrastive Learning | Weixuan Sun et.al. | 2303.11302v1 | null |
2023-03-20 | Reliability in Semantic Segmentation: Are We on the Right Track? | Pau de Jorge et.al. | 2303.11298v1 | link |
2023-03-20 | Resource Saving via Ensemble Techniques for Quantum Neural Networks | Massimiliano Incudini et.al. | 2303.11283v1 | null |
2023-03-20 | Cascading Hierarchical Networks with Multi-task Balanced Loss for Fine-grained hashing | Xianxian Zeng et.al. | 2303.11274v1 | link |
2023-03-20 | Rethinking the backbone architecture for tiny object detection | Jinlai Ning et.al. | 2303.11267v1 | null |
2023-03-20 | Robust Imaging of Speed-of-Sound Using Virtual Source Transmission | Dieter Schweizer et.al. | 2303.11262v1 | null |
2023-03-20 | Bronchoscopic video synchronization for interactive multimodal inspection of bronchial lesions | Qi Chang et.al. | 2303.11258v1 | null |
2023-03-20 | Towards End-to-End Generative Modeling of Long Videos with Memory-Efficient Bidirectional Transformers | Jaehoon Yoo et.al. | 2303.11251v1 | link |
2023-03-17 | Toward Super-Resolution for Appearance-Based Gaze Estimation | Galen O'Shea et.al. | 2303.10151v1 | null |
2023-03-17 | Spectrum-inspired Low-light Image Translation for Saliency Detection | Kitty Varghese et.al. | 2303.10145v1 | null |
2023-03-20 | GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models | Tyna Eloundou et.al. | 2303.10130v2 | null |
2023-03-17 | Retrieving false claims on Twitter during the Russia-Ukraine conflict | Valerio La Gatta et.al. | 2303.10121v1 | null |
2023-03-17 | Causal Discovery from Temporal Data: An Overview and New Perspectives | Chang Gong et.al. | 2303.10112v1 | null |
2023-03-17 | Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation | Liulei Li et.al. | 2303.10100v1 | null |
2023-03-17 | Efficient Neural Generation of 4K Masks for Homogeneous Diffusion Inpainting | Karl Schrader et.al. | 2303.10096v1 | null |
2023-03-17 | Posterior Estimation Using Deep Learning: A Simulation Study of Compartmental Modeling in Dynamic PET | Xiaofeng Liu et.al. | 2303.10057v1 | null |
2023-03-20 | Uncertainty-informed Mutual Learning for Joint Medical Image Classification and Segmentation | Kai Ren et.al. | 2303.10049v2 | null |
2023-03-17 | Multi-modal Expression Recognition with Ensemble Method | Chuanhe Liu et.al. | 2303.10033v1 | null |
2023-03-16 | Deep Metric Learning for Unsupervised Remote Sensing Change Detection | Wele Gedara Chaminda Bandara et.al. | 2303.09536v1 | link |
2023-03-17 | FateZero: Fusing Attentions for Zero-shot Text-based Video Editing | Chenyang Qi et.al. | 2303.09535v2 | link |
2023-03-16 | Fast 3D Volumetric Image Reconstruction from 2D MRI Slices by Parallel Processing | Somoballi Ghoshal et.al. | 2303.09523v1 | null |
2023-03-16 | MATIS: Masked-Attention Transformers for Surgical Instrument Segmentation | Nicolás Ayobi et.al. | 2303.09514v1 | null |
2023-03-16 | LDMVFI: Video Frame Interpolation with Latent Diffusion Models | Duolikun Danier et.al. | 2303.09508v1 | null |
2023-03-16 | Knowledge Distillation for Adaptive MRI Prostate Segmentation Based on Limit-Trained Multi-Teacher Models | Eddardaa Ben Loussaief et.al. | 2303.09494v1 | null |
2023-03-16 | Classification of tight contact structures on some Seifert fibered manifolds: I | Tanushree Shah et.al. | 2303.09490v1 | null |
2023-03-16 | Effectively Modeling Time Series with Simple Discrete State Spaces | Michael Zhang et.al. | 2303.09489v1 | link |
2023-03-16 | Combining Distance to Class Centroids and Outlier Discounting for Improved Learning with Noisy Labels | Farooq Ahmad Wani et.al. | 2303.09470v1 | link |
2023-03-16 | Diffusion-Shock Inpainting | Kristina Schaefer et.al. | 2303.09450v1 | null |
2023-03-16 | DeepMIM: Deep Supervision for Masked Image Modeling | Sucheng Ren et.al. | 2303.08817v2 | link |
2023-03-15 | BiFormer: Vision Transformer with Bi-Level Routing Attention | Lei Zhu et.al. | 2303.08810v1 | link |
2023-03-15 | Mesh Strikes Back: Fast and Efficient Human Reconstruction from RGB videos | Rohit Jena et.al. | 2303.08808v1 | null |
2023-03-15 | Building an Effective Email Spam Classification Model with spaCy | Kazem Taghandiki et.al. | 2303.08792v1 | null |
2023-03-15 | PLEX: Making the Most of the Available Data for Robotic Manipulation Pretraining | Garrett Thomas et.al. | 2303.08789v1 | null |
2023-03-15 | Exploiting 4D CT Perfusion for segmenting infarcted areas in patients with suspected acute ischemic stroke | Luca Tomasetti et.al. | 2303.08757v1 | null |
2023-03-15 | 2D and 3D CNN-Based Fusion Approach for COVID-19 Severity Prediction from 3D CT-Scans | Fares Bougourzi et.al. | 2303.08740v1 | link |
2023-03-15 | Evaluating gesture-generation in a large-scale open challenge: The GENEA Challenge 2022 | Taras Kucherenko et.al. | 2303.08737v1 | null |
2023-03-15 | A machine-learning approach to thunderstorm forecasting through post-processing of simulation data | Kianusch Vahid Yousefnia et.al. | 2303.08736v1 | null |
2023-03-16 | UniCT DMI Solution for 3rd COV19D Competition on COVID-19 Detection trough attention deep learning for CT Scan | Alessia Rondinella et.al. | 2303.08728v2 | null |
2023-03-15 | Manipulate by Seeing: Creating Manipulation Controllers from Pre-Trained Representations | Jianren Wang et.al. | 2303.08135v2 | null |
2023-03-14 | InstMove: Instance Motion for Object-centric Video Segmentation | Qihao Liu et.al. | 2303.08132v1 | link |
2023-03-14 | Blind Video Deflickering by Neural Filtering with a Flawed Atlas | Chenyang Lei et.al. | 2303.08120v1 | link |
2023-03-14 | Homeomorphic Image Registration via Conformal-Invariant Hyperelastic Regularisation | Jing Zou et.al. | 2303.08113v1 | null |
2023-03-15 | Alias-Free Convnets: Fractional Shift Invariance via Polynomial Activations | Hagay Michaeli et.al. | 2303.08085v2 | link |
2023-03-14 | Point Cloud Diffusion Models for Automatic Implant Generation | Paul Friedrich et.al. | 2303.08061v1 | null |
2023-03-14 | Subjective and Objective Quality Assessment for in-the-Wild Computer Graphics Images | Zicheng Zhang et.al. | 2303.08050v1 | null |
2023-03-14 | BODEGA: Benchmark for Adversarial Example Generation in Credibility Assessment | Piotr Przybyła et.al. | 2303.08032v1 | null |
2023-03-14 | Optimizing Deep Learning Model Parameters with the Bees Algorithm for Improved Medical Text Classification | Mai A. Shaaban et.al. | 2303.08021v1 | link |
2023-03-14 | Leveraging Pretrained Representations with Task-related Keywords for Alzheimer's Disease Detection | Jinchao Li et.al. | 2303.08019v1 | null |
2023-03-13 | TriDet: Temporal Action Detection with Relative Boundary Modeling | Dingfeng Shi et.al. | 2303.07347v1 | link |
2023-03-13 | Unsupervised HDR Image and Video Tone Mapping via Contrastive Learning | Cong Cao et.al. | 2303.07327v1 | null |
2023-03-13 | Collision Cross-entropy and EM Algorithm for Self-labeled Classification | Zhongwen Zhang et.al. | 2303.07321v1 | null |
2023-03-13 | Model-tuning Via Prompts Makes NLP Models Adversarially Robust | Mrigank Raman et.al. | 2303.07320v1 | null |
2023-03-13 | Nearest-Neighbor Inter-Intra Contrastive Learning from Unlabeled Videos | David Fan et.al. | 2303.07317v1 | null |
2023-03-13 | Transformer Models for Acute Brain Dysfunction Prediction | Brandon Silva et.al. | 2303.07305v1 | null |
2023-03-13 | One-form symmetries in |
Antonio Amariti et.al. | 2303.07299v1 | null |
2023-03-13 | Transformer-based approaches to Sentiment Detection | Olumide Ebenezer Ojo et.al. | 2303.07292v1 | null |
2023-03-13 | Align and Attend: Multimodal Summarization with Dual Contrastive Losses | Bo He et.al. | 2303.07284v1 | link |
2023-03-13 | Vision-Language Models as Success Detectors | Yuqing Du et.al. | 2303.07280v1 | null |
2023-03-10 | Learning to Select Camera Views: Efficient Multiview Understanding at Few Glances | Yunzhong Hou et.al. | 2303.06145v1 | link |
2023-03-10 | Warm-Starting and Quantum Computing: A Systematic Mapping Study | Felix Truger et.al. | 2303.06133v1 | null |
2023-03-10 | Supersolid phases of bosonic particles in a bubble trap | Matteo Ciardi et.al. | 2303.06113v1 | null |
2023-03-10 | Statistical Study of the Correlation between Solar Energetic Particles and Properties of Active Regions | Russell D. Marroquin et.al. | 2303.06100v1 | null |
2023-03-10 | Communication-Critical Planning via Multi-Agent Trajectory Exchange | Nathaniel Moore Glaser et.al. | 2303.06080v1 | null |
2023-03-10 | Long-tailed Classification from a Bayesian-decision-theory Perspective | Bolian Li et.al. | 2303.06075v1 | null |
2023-03-10 | MVImgNet: A Large-scale Dataset of Multi-view Images | Xianggang Yu et.al. | 2303.06042v1 | null |
2023-03-10 | Importance of Aligning Training Strategy with Evaluation for Diffusion Models in 3D Multiclass Segmentation | Yunguan Fu et.al. | 2303.06040v1 | link |
2023-03-10 | Tactile-Filter: Interactive Tactile Perception for Part Mating | Kei Ota et.al. | 2303.06034v1 | null |
2023-03-10 | Material Identification From Radiographs Without Energy Resolution | Michael T. McCann et.al. | 2303.06005v1 | null |
2023-03-09 | PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification | Xuan Li et.al. | 2303.05512v1 | null |
2023-03-09 | Cherry-Picking with Reinforcement Learning | Yunchu Zhang et.al. | 2303.05508v1 | null |
2023-03-09 | Learning Arm-Assisted Fall Damage Reduction and Recovery for Legged Mobile Manipulators | Yuntao Ma et.al. | 2303.05486v1 | null |
2023-03-09 | Spawrious: A Benchmark for Fine Control of Spurious Correlation Biases | Aengus Lynch et.al. | 2303.05470v1 | link |
2023-03-09 | Resolving quantitative MRI model degeneracy with machine learning via training data distribution design | Michele Guerreri et.al. | 2303.05464v1 | null |
2023-03-09 | Understanding the Challenges and Opportunities of Pose-based Anomaly Detection | Ghazal Alinezhad Noghre et.al. | 2303.05463v1 | null |
2023-03-09 | Presentation Attack Detection with Advanced CNN Models for Noncontact-based Fingerprint Systems | Sandip Purnapatra et.al. | 2303.05459v1 | null |
2023-03-09 | The Impact of Feature Selection and Transformation on Machine Learning Methods in Determining the Credit Scoring | Oguz Koc et.al. | 2303.05427v1 | null |
2023-03-09 | FaceXHuBERT: Text-less Speech-driven E(X)pressive 3D Facial Animation Synthesis Using Self-Supervised Speech Representation Learning | Kazi Injamamul Haque et.al. | 2303.05416v1 | link |
2023-03-09 | Deep Equilibrium Learning of Explicit Regularizers for Imaging Inverse Problems | Zihao Zou et.al. | 2303.05386v1 | null |
2023-03-08 | Automatic Debiased Learning from Positive, Unlabeled, and Exposure Data | Masahiro Kato et.al. | 2303.04797v1 | null |
2023-03-08 | Video-P2P: Video Editing with Cross-attention Control | Shaoteng Liu et.al. | 2303.04761v1 | null |
2023-03-08 | Multimodal Parameter-Efficient Few-Shot Class Incremental Learning | Marco D'Alessandro et.al. | 2303.04751v1 | null |
2023-03-08 | A General Theory of Correct, Incorrect, and Extrinsic Equivariance | Dian Wang et.al. | 2303.04745v1 | null |
2023-03-08 | Model Predictive Control with Gaussian-Process-Supported Dynamical Constraints for Autonomous Vehicles | Johanna Bethge et.al. | 2303.04725v1 | null |
2023-03-08 | A lattice model for condensation in Levin-Wen systems | Jessica Christian et.al. | 2303.04711v1 | null |
2023-03-08 | VOLTA: an Environment-Aware Contrastive Cell Representation Learning for Histopathology | Ramin Nakhli et.al. | 2303.04696v1 | null |
2023-03-08 | STPDnet: Spatial-temporal convolutional primal dual network for dynamic PET image reconstruction | Rui Hu et.al. | 2303.04667v1 | null |
2023-03-08 | Centroid-centered Modeling for Efficient Vision Transformer Pre-training | Xin Yan et.al. | 2303.04664v1 | null |
2023-03-08 | DULDA: Dual-domain Unsupervised Learned Descent Algorithm for PET image reconstruction | Rui Hu et.al. | 2303.04661v1 | null |
2023-03-07 | From Copilot to Pilot: Towards AI Supported Software Development | Rohith Pudari et.al. | 2303.04142v1 | null |
2023-03-07 | Predicted Embedding Power Regression for Large-Scale Out-of-Distribution Detection | Hong Yang et.al. | 2303.04115v1 | null |
2023-03-07 | Introspective Cross-Attention Probing for Lightweight Transfer of Pre-trained Models | Yonatan Dukler et.al. | 2303.04105v1 | null |
2023-03-07 | VOCALExplore: Pay-as-You-Go Video Data Exploration and Model Building | Maureen Daum et.al. | 2303.04068v1 | null |
2023-03-07 | Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Bill Noble et.al. | 2303.04053v1 | null |
2023-03-07 | DINet: Deformation Inpainting Network for Realistic Face Visually Dubbing on High Resolution Video | Zhimeng Zhang et.al. | 2303.03988v1 | null |
2023-03-07 | Comparing 3D deformations between longitudinal daily CBCT acquisitions using CNN for head and neck radiotherapy toxicity prediction | William Trung Le et.al. | 2303.03965v1 | null |
2023-03-07 | On the connectedness of multistationarity regions of small reaction networks | Allison Dennis et.al. | 2303.03960v1 | null |
2023-03-07 | ChatGPT: Beginning of an End of Manual Annotation? Use Case of Automatic Genre Identification | Taja Kuzman et.al. | 2303.03953v1 | null |
2023-03-07 | A survey on automated detection and classification of acute leukemia and WBCs in microscopic blood cells | Mohammad Zolfaghari et.al. | 2303.03916v1 | null |
2023-03-06 | Simultaneous Tactile Estimation and Control of Extrinsic Contact | Sangwoon Kim et.al. | 2303.03385v1 | null |
2023-03-06 | Referring Multi-Object Tracking | Dongming Wu et.al. | 2303.03366v1 | link |
2023-03-06 | Keep It Simple: CNN Model Complexity Studies for Interference Classification Tasks | Taiwo Oyedare et.al. | 2303.03326v1 | null |
2023-03-06 | Robust Immersive Telepresence and Mobile Telemanipulation: NimbRo wins ANA Avatar XPRIZE Finals | Max Schwarz et.al. | 2303.03297v1 | null |
2023-03-06 | Liouville Theorem for semilinear elliptic inequalities with the fractional Hardy operators | Huyuan Chen et.al. | 2303.03246v1 | null |
2023-03-06 | Evaluating the Fairness of Deep Learning Uncertainty Estimates in Medical Image Analysis | Raghav Mehta et.al. | 2303.03242v1 | null |
2023-03-06 | Stargazer: An Interactive Camera Robot for Capturing How-To Videos Based on Subtle Instructor Cues | Jiannan Li et.al. | 2303.03221v1 | null |
2023-03-06 | Combination of Single and Multi-frame Image Super-resolution: An Analytical Perspective | Mohammad Mahdi Afrasiabi et.al. | 2303.03212v1 | null |
2023-03-06 | Continuous Sign Language Recognition with Correlation Network | Lianyu Hu et.al. | 2303.03202v1 | null |
2023-03-06 | A Unified Algebraic Perspective on Lipschitz Neural Networks | Alexandre Araujo et.al. | 2303.03169v1 | link |
2023-03-03 | Prompt, Generate, then Cache: Cascade of Foundation Models makes Strong Few-shot Learners | Renrui Zhang et.al. | 2303.02151v1 | link |
2023-03-03 | Quantum utility -- definition and assessment of a practical quantum advantage | Nils Herrmann et.al. | 2303.02138v1 | null |
2023-03-03 | TRUSformer: Improving Prostate Cancer Detection from Micro-Ultrasound Using Attention and Self-Supervision | Mahdi Gilany et.al. | 2303.02128v1 | link |
2023-03-03 | Dispersed, condensed and self-limiting states of geometrically frustrated assembly | Nicholas W. Hackney et.al. | 2303.02121v1 | null |
2023-03-03 | Need for Objective Task-based Evaluation of Deep Learning-Based Denoising Methods: A Study in the Context of Myocardial Perfusion SPECT | Zitong Yu et.al. | 2303.02110v1 | null |
2023-03-03 | Bi-parametric prostate MR image synthesis using pathology and sequence-conditioned stable diffusion | Shaheer U. Saeed et.al. | 2303.02094v1 | null |
2023-03-03 | Unsupervised Deep Digital Staining For Microscopic Cell Images Via Knowledge Distillation | Ziwang Xu et.al. | 2303.02057v1 | null |
2023-03-03 | On the complexity of PAC learning in Hilbert spaces | Sergei Chubanov et.al. | 2303.02047v1 | null |
2023-03-03 | Uncertainty Estimation by Fisher Information-based Evidential Deep Learning | Danruo Deng et.al. | 2303.02045v1 | null |
2023-03-03 | Single-photon Image Super-resolution via Self-supervised Learning | Yiwei Chen et.al. | 2303.02033v1 | null |
2023-03-02 | ABAW: Valence-Arousal Estimation, Expression Recognition, Action Unit Detection & Emotional Reaction Intensity Estimation Challenges | Dimitrios Kollias et.al. | 2303.01498v1 | null |
2023-03-02 | Teach a Robot to FISH: Versatile Imitation from One Minute of Demonstrations | Siddhant Haldar et.al. | 2303.01497v1 | null |
2023-03-02 | Transferring Models Trained on Natural Images to 3D MRI via Position Encoded Slice Models | Umang Gupta et.al. | 2303.01491v1 | link |
2023-03-02 | Self-Improving Robots: End-to-End Autonomous Visuomotor Reinforcement Learning | Archit Sharma et.al. | 2303.01488v1 | null |
2023-03-02 | Consistency Models | Yang Song et.al. | 2303.01469v1 | null |
2023-03-02 | MoSFPAD: An end-to-end Ensemble of MobileNet and Support Vector Classifier for Fingerprint Presentation Attack Detection | Anuj Rai et.al. | 2303.01465v1 | null |
2023-03-02 | VARAHA: A Fast Non-Markovian sampler for estimating Gravitational-Wave posteriors | Vaibhav Tiwari et.al. | 2303.01463v1 | null |
2023-03-02 | Emergent Order in Classical Data Representations on Ising Spin Models | Jorja J. Kirk et.al. | 2303.01461v1 | null |
2023-03-02 | Subgoal-Driven Navigation in Dynamic Environments Using Attention-Based Deep Reinforcement Learning | Jorge de Heuvel et.al. | 2303.01443v1 | null |
2023-03-02 | PuSHR: A Multirobot System for Nonprehensile Rearrangement | Sidharth Talia et.al. | 2303.01428v1 | null |
2023-03-01 | READ Avatars: Realistic Emotion-controllable Audio Driven Avatars | Jack Saunders et.al. | 2303.00744v1 | null |
2023-03-01 | SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks | Kai-Wei Chang et.al. | 2303.00733v1 | null |
2023-03-01 | OSRE: Object-to-Spot Rotation Estimation for Bike Parking Assessment | Saghir Alfasly et.al. | 2303.00725v1 | null |
2023-03-01 | Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis | Renrui Zhang et.al. | 2303.00703v1 | null |
2023-03-01 | On the Importance of Feature Representation for Flood Mapping using Classical Machine Learning Approaches | Kevin Iselborn et.al. | 2303.00691v1 | link |
2023-03-01 | Fairness Evaluation in Text Classification: Machine Learning Practitioner Perspectives of Individual and Group Fairness | Zahra Ashktorab et.al. | 2303.00673v1 | null |
2023-03-01 | MEGA-DAgger: Imitation Learning with Multiple Imperfect Experts | Xiatao Sun et.al. | 2303.00638v1 | null |
2023-03-01 | The TESS Triple-9 Catalog II: a new set of 999 uniformly-vetted exoplanet candidates | Christian Magliano et.al. | 2303.00624v1 | null |
2023-03-01 | Level Up the Deepfake Detection: a Method to Effectively Discriminate Images Generated by GAN Architectures and Diffusion Models | Luca Guarnera et.al. | 2303.00608v1 | null |
2023-03-01 | Comprehensive Comparisons of Uniform Quantization in Deep Image Compression | Koki Tsubota et.al. | 2303.00587v1 | null |
2023-02-28 | BakedSDF: Meshing Neural SDFs for Real-Time View Synthesis | Lior Yariv et.al. | 2302.14859v1 | null |
2023-02-28 | Automatic Scoring of Dream Reports' Emotional Content with Large Language Models | Lorenzo Bertolini et.al. | 2302.14828v1 | null |
2023-02-28 | Incremental Learning of Acoustic Scenes and Sound Events | Manjunath Mulimani et.al. | 2302.14815v1 | null |
2023-02-28 | Opto-UNet: Optimized UNet for Segmentation of Varicose Veins in Optical Coherence Tomography | Maryam Viqar et.al. | 2302.14808v1 | null |
2023-02-28 | Framelet Message Passing | Xinliang Liu et.al. | 2302.14806v1 | null |
2023-02-28 | Generic-to-Specific Distillation of Masked Autoencoders | Wei Huang et.al. | 2302.14771v1 | null |
2023-02-28 | The Classification of Short and Long-term Driving Behavior for an Advanced Driver Assistance System by Analyzing Bidirectional Driving Features | Mudasser Seraj et.al. | 2302.14743v1 | null |
2023-02-28 | IMU-based online multi-lidar calibration without lidar odometry | Sandipan Das et.al. | 2302.14735v1 | null |
2023-02-28 | Automatically Classifying Emotions based on Text: A Comparative Exploration of Different Datasets | Anna Koufakou et.al. | 2302.14727v1 | null |
2023-02-28 | IntrinsicNGP: Intrinsic Coordinate based Hash Encoding for Human NeRF | Bo Peng et.al. | 2302.14683v1 | null |
2023-02-27 | Internet Explorer: Targeted Representation Learning on the Open Web | Alexander C. Li et.al. | 2302.14051v1 | link |
2023-02-27 | Language Is Not All You Need: Aligning Perception with Language Models | Shaohan Huang et.al. | 2302.14045v1 | null |
2023-02-27 | Learning to Generalize towards Unseen Domains via a Content-Aware Style Invariant Framework for Disease Detection from Chest X-rays | Mohammad Zunaed et.al. | 2302.13991v1 | link |
2023-02-27 | UMIFormer: Mining the Correlations between Similar Tokens for Multi-View 3D Reconstruction | Zhenwei Zhu et.al. | 2302.13987v1 | null |
2023-02-27 | Rotating hematite cube chains | Mārtiņš Brics et.al. | 2302.13978v1 | null |
2023-02-27 | Exploring the influence of different velocity fields on Wolf-Rayet star spectra | Roel R. Lefever et.al. | 2302.13964v1 | null |
2023-02-27 | Make Every Example Count: On Stability and Utility of Self-Influence for Learning from Noisy NLP Datasets | Irina Bejan et.al. | 2302.13959v1 | null |
2023-02-27 | Supervised topological data analysis for MALDI imaging applications | Gideon Klaila et.al. | 2302.13948v1 | null |
2023-02-27 | Inseq: An Interpretability Toolkit for Sequence Generation Models | Gabriele Sarti et.al. | 2302.13942v1 | link |
2023-02-27 | Data re-uploading with a single qudit | Noah L. Wach et.al. | 2302.13932v1 | null |
2023-02-27 | Improving Massively Multilingual ASR With Auxiliary CTC Objectives | William Chen et.al. | 2302.12829v2 | link |
2023-02-24 | Decoupling Human and Camera Motion from Videos in the Wild | Vickie Ye et.al. | 2302.12827v1 | null |
2023-02-24 | GraphSR: A Data Augmentation Algorithm for Imbalanced Node Classification | Mengting Zhou et.al. | 2302.12814v1 | null |
2023-02-24 | STA: Self-controlled Text Augmentation for Improving Text Classifications | Congcong Wang et.al. | 2302.12784v1 | link |
2023-02-24 | Language-Driven Representation Learning for Robotics | Siddharth Karamcheti et.al. | 2302.12766v1 | link |
2023-02-24 | Definition and Classification of Fermi Surface Anomalies | Da-Chuan Lu et.al. | 2302.12731v1 | null |
2023-02-24 | LightTS: Lightweight Time Series Classification with Adaptive Ensemble Distillation -- Extended Version | David Campos et.al. | 2302.12721v1 | link |
2023-02-24 | Designing and simulating realistic spatial frequency domain imaging systems using open-source 3D rendering software | Jane Crowley et.al. | 2302.12705v1 | null |
2023-02-24 | Reflexive polytopes and discrete polymatroids | Jürgen Herzog et.al. | 2302.12703v1 | null |
2023-02-24 | Video4MRI: An Empirical Study on Brain Magnetic Resonance Image Analytics with CNN-based Video Classification Frameworks | Yuxuan Zhang et.al. | 2302.12688v1 | null |
2023-02-23 | Boosting Adversarial Transferability using Dynamic Cues | Muzammal Naseer et.al. | 2302.12252v1 | null |
2023-02-23 | MERF: Memory-Efficient Radiance Fields for Real-time View Synthesis in Unbounded Scenes | Christian Reiser et.al. | 2302.12249v1 | null |
2023-02-24 | Learning Neural Volumetric Representations of Dynamic Humans in Minutes | Chen Geng et.al. | 2302.12237v2 | null |
2023-02-23 | RSFDM-Net: Real-time Spatial and Frequency Domains Modulation Network for Underwater Image Enhancement | Jingxia Jiang et.al. | 2302.12186v1 | null |
2023-02-23 | Unified Chest X-ray and Radiology Report Generation Model with Multi-view Chest X-rays | Hyungyung Lee et.al. | 2302.12172v1 | null |
2023-02-24 | Automated Statement Extraction from Press Briefings | Jüri Keller et.al. | 2302.12131v2 | link |
2023-02-23 | Local and Global Explainability Metrics for Machine Learning Predictions | Cristian Munoz et.al. | 2302.12094v1 | null |
2023-02-23 | Deep learning model for Mongolian Citizens Feedback Analysis using Word Vector Embeddings | Zolzaya Dashdorj et.al. | 2302.12069v1 | null |
2023-02-23 | Teaching CLIP to Count to Ten | Roni Paiss et.al. | 2302.12066v1 | null |
2023-02-23 | Attention Mechanism for Contrastive Learning in GAN-based Image-to-Image Translation | Hanzhen Zhang et.al. | 2302.12052v1 | null |
2023-02-22 | Inferring HIV Transmission Patterns from Viral Deep-Sequence Data via Latent Typed Point Processes | Fan Bu et.al. | 2302.11567v1 | null |
2023-02-22 | Vid2Avatar: 3D Avatar Reconstruction from Videos in the Wild via Self-supervised Scene Decomposition | Chen Guo et.al. | 2302.11566v1 | null |
2023-02-22 | Scaling Robot Learning with Semantically Imagined Experience | Tianhe Yu et.al. | 2302.11550v1 | null |
2023-02-22 | Slim U-Net: Efficient Anatomical Feature Preserving U-net Architecture for Ultrasound Image Segmentation | Deepak Raina et.al. | 2302.11524v1 | null |
2023-02-22 | Evaluation of Extra Pixel Interpolation with Mask Processing for Medical Image Segmentation with Deep Learning | Olivier Rukundo et.al. | 2302.11522v1 | null |
2023-02-22 | A Global and Patch-wise Contrastive Loss for Accurate Automated Exudate Detection | Wei Tang et.al. | 2302.11517v1 | null |
2023-02-22 | S3I-PointHop: SO(3)-Invariant PointHop for 3D Point Cloud Classification | Pranav Kadam et.al. | 2302.11506v1 | null |
2023-02-22 | On The Role of Alias and Band-Shift for Sentinel-2 Super-Resolution | Ngoc Long Nguyen et.al. | 2302.11494v1 | null |
2023-02-22 | Magnification Invariant Medical Image Analysis: A Comparison of Convolutional Networks, Vision Transformers, and Token Mixers | Pranav Jeevan et.al. | 2302.11488v1 | null |
2023-02-22 | Improving Model Choice in Classification: An Approach Based on Clustering of Covariance Matrices | David Rodríguez-Vítores et.al. | 2302.11487v1 | null |
2023-02-21 | Efficient CTC Regularization via Coarse Labels for End-to-End Speech Translation | Biao Zhang et.al. | 2302.10871v1 | null |
2023-02-21 | SF2Former: Amyotrophic Lateral Sclerosis Identification From Multi-center MRI Data Using Spatial and Frequency Fusion Transformer | Rafsanjany Kushol et.al. | 2302.10859v1 | null |
2023-02-21 | There Are No Post-Quantum Weakly Pseudo-Free Families in Any Nontrivial Variety of Expanded Groups | Mikhail Anokhin et.al. | 2302.10847v1 | null |
2023-02-21 | Performant feature extraction for photometric time series | Anastasia Lavrukhina et.al. | 2302.10837v1 | null |
2023-02-21 | Weakly Supervised Temporal Convolutional Networks for Fine-grained Surgical Activity Recognition | Sanat Ramesh et.al. | 2302.10834v1 | null |
2023-02-21 | A Local Machine Learning Approach for Fingerprint-based Indoor Localization | Nora Agah et.al. | 2302.10810v1 | null |
2023-02-21 | Learning 3D Photography Videos via Self-supervised Diffusion on Single Images | Xiaodong Wang et.al. | 2302.10781v1 | null |
2023-02-21 | Quantifying Jump Height Using Markerless Motion Capture with a Single Smartphone | Timilehin B. Aderinola et.al. | 2302.10749v1 | null |
2023-02-21 | MalProtect: Stateful Defense Against Adversarial Query Attacks in ML-based Malware Detection | Aqib Rashid et.al. | 2302.10739v1 | null |
2023-02-21 | Depth Estimation and Image Restoration by Deep Learning from Defocused Images | Saqib Nazir et.al. | 2302.10730v1 | null |
2023-02-20 | Towards Universal Fake Image Detectors that Generalize Across Generative Models | Utkarsh Ojha et.al. | 2302.10174v1 | null |
2023-02-20 | Hashtag-Guided Low-Resource Tweet Classification | Shizhe Diao et.al. | 2302.10143v1 | link |
2023-02-20 | Large-scale Multi-Modal Pre-trained Models: A Comprehensive Survey | Xiao Wang et.al. | 2302.10035v1 | link |
2023-02-20 | Hunting for C-rich long-period variable stars in the Milky Way's bar-bulge using unsupervised classification of Gaia BP/RP spectra | Jason L. Sanders et.al. | 2302.10022v1 | null |
2023-02-20 | Boosting classification reliability of NLP transformer models in the long run | Zoltán Kmetty et.al. | 2302.10016v1 | null |
2023-02-20 | Simulating analogue film damage to analyse and improve artefact restoration on high-resolution scans | Daniela Ivanova et.al. | 2302.10004v1 | null |
2023-02-20 | STB-VMM: Swin Transformer Based Video Motion Magnification | Ricard Lado-Roigé et.al. | 2302.10001v1 | null |
2023-02-20 | Observation of near room temperature thin film superconductivity of atmospherically stable Ag-Au mesoscopic thin film | Sobhan Hazra et.al. | 2302.09974v1 | null |
2023-02-20 | Social4Rec: Distilling User Preference from Social Graph for Video Recommendation in Tencent | Xuanji Xiao et.al. | 2302.09971v1 | null |
2023-02-20 | Why is the prediction wrong? Towards underfitting case explanation via meta-classification | Sheng Zhou et.al. | 2302.09952v1 | null |
2023-02-17 | Automated Graph Genetic Algorithm based Puzzle Validation for Faster Game Desig | Karine Levonyan et.al. | 2302.09040v1 | null |
2023-02-17 | Self-supervised Action Representation Learning from Partial Spatio-Temporal Skeleton Sequences | Yujie Zhou et.al. | 2302.09018v1 | null |
2023-02-17 | The asymptotic topology of the multineighbor complex of a random graph | Wojciech Matysiak et.al. | 2302.09017v1 | null |
2023-02-17 | Automated classification of Chandra X-ray point sources using machine learning methods | Shivam Kumaran et.al. | 2302.09008v1 | null |
2023-02-17 | Towards Fine-Grained Information: Identifying the Type and Location of Translation Errors | Keqin Bao et.al. | 2302.08975v1 | null |
2023-02-17 | sMRI-PatchNet: A novel explainable patch-based deep learning network for Alzheimer's disease diagnosis and discriminative atrophy localisation with Structural MRI | Xin Zhang et.al. | 2302.08967v1 | null |
2023-02-17 | Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts | Zhihong Chen et.al. | 2302.08958v1 | link |
2023-02-17 | Like a Good Nearest Neighbor: Practical Content Moderation with Sentence Transformers | Luke Bates et.al. | 2302.08957v1 | link |
2023-02-17 | AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages | Shamsuddeen Hassan Muhammad et.al. | 2302.08956v1 | null |
2023-02-17 | Tropical Feynman integration in the Minkowski regime | Michael Borinsky et.al. | 2302.08955v1 | null |
2023-02-16 | FOSI: Hybrid First and Second Order Optimization | Hadar Sivan et.al. | 2302.08484v1 | null |
2023-02-16 | Kernelized Back-Projection Networksfor Blind Super Resolution | Tomoki Yoshida et.al. | 2302.08478v1 | null |
2023-02-16 | The tilted CHSH games: an operator algebraic classification | Alexander Frei et.al. | 2302.08475v1 | null |
2023-02-16 | Emergent spinor fields from exotic spin structures | J. M. Hoff da Silva et.al. | 2302.08473v1 | null |
2023-02-16 | Learning a Meta-Controller for Dynamic Grasping | Yinsen Jia et.al. | [2 |
kruss84 / cv-arxiv-daily Goto Github PK
View Code? Open in Web Editor NEWThis project forked from dwctod/cv-arxiv-daily