Following is the list of accepted ICASSP 2021 papers, sorted by paper title.
You can use the search feature of your web browser to find your paper number.
Notifications to all authors have also been sent by email. If you have not
received your notification of the results by email, please contact us at papers@2021.ieeeicassp.org.
1269 (W)EARABLE MICROPHONE ARRAY AND ULTRASONIC ECHO LOCALIZATION FOR COARSE INDOOR ENVIRONMENT MAPPING Felix Pfreundtner, Jing Yang, Gábor Sörös 1269 | (W)EARABLE MICROPHONE ARRAY AND ULTRASONIC ECHO LOCALIZATION FOR COARSE INDOOR ENVIRONMENT MAPPING |
4035 “YOU SHOULD PROBABLY READ THIS”: HEDGE DETECTION IN TEXT Denys Katerenchuk, Rivka Levitan 4035 | “YOU SHOULD PROBABLY READ THIS”: HEDGE DETECTION IN TEXT |
2490 2D-FRFT BASED FREQUENCY SHIFT-INVARIANT DIGITAL IMAGE ENCRYPTION Lei Gao, Lin Qi, Ling Guan 2490 | 2D-FRFT BASED FREQUENCY SHIFT-INVARIANT DIGITAL IMAGE ENCRYPTION |
1705 3D MULTIZONE SOUNDFIELD REPRODUCTION IN A REVERBERANT ENVIRONMENT USING INTENSITY MATCHING METHOD Huanyu Zuo, Thushara D. Abhayapala, Prasanga N. Samarasinghe 1705 | 3D MULTIZONE SOUNDFIELD REPRODUCTION IN A REVERBERANT ENVIRONMENT USING INTENSITY MATCHING METHOD |
2715 A Bayesian Inference Approach for Location-Based Micro Motions using Radio Frequency Sensing David A. Maluf, Amr Elnakeeb, Matt Silverman 2715 | A Bayesian Inference Approach for Location-Based Micro Motions using Radio Frequency Sensing |
2422 A BAYESIAN INTERPRETATION OF THE LIGHT GATED RECURRENT UNIT Alexandre Bittar, Philip Garner 2422 | A BAYESIAN INTERPRETATION OF THE LIGHT GATED RECURRENT UNIT |
4082 A Better and Faster End-to-End Model for Streaming ASR Bo Li, Anmol Gulati, Jiahui Yu, Tara Sainath, Chung-Cheng Chiu, Arun Narayanan, Shuo-Yiin Chang, Ruoming Pang, Yanzhang He, James Qin, Wei Han, Qiao Liang, Yu Zhang, Trevor Strohman, Yonghui Wu 4082 | A Better and Faster End-to-End Model for Streaming ASR |
5065 A BIAS-REDUCING LOSS FUNCTION FOR CT IMAGE DENOISING Madhuri Nagare, Roman Melnyk, Obaidullah Rahman, Ken D. Sauer, Charles A. Bouman 5065 | A BIAS-REDUCING LOSS FUNCTION FOR CT IMAGE DENOISING |
3363 A CAPSULE NETWORK BASED APPROACH FOR DETECTION OF AUDIO SPOOFING ATTACKS Anwei Luo, Enlei Li, Yongliang Liu, Xiangui Kang, Z. Jane Wang 3363 | A CAPSULE NETWORK BASED APPROACH FOR DETECTION OF AUDIO SPOOFING ATTACKS |
2087 A CAUSAL DEEP LEARNING FRAMEWORK FOR CLASSIFYING PHONEMES IN COCHLEAR IMPLANTS Kevin Chu, Leslie Collins, Boyla Mainsah 2087 | A CAUSAL DEEP LEARNING FRAMEWORK FOR CLASSIFYING PHONEMES IN COCHLEAR IMPLANTS |
3014 A CHAPTER-WISE UNDERSTANDING SYSTEM FOR TEXT-TO-SPEECH IN CHINESE NOVELS Junjie Pan, Lin Wu, Xiang Yin, Pengfei Wu, Chenchang Xu, Zejun Ma 3014 | A CHAPTER-WISE UNDERSTANDING SYSTEM FOR TEXT-TO-SPEECH IN CHINESE NOVELS |
4461 A CLASSIFIER FOR IMPROVING CAUSE AND EFFECT IN SSVEP-BASED BCIS FOR INDIVIDUALS WITH COMPLEX COMMUNICATION DISORDERS Hadi Habibzadeh, Olivia Zhou, James J. S. Norton, Theresa M. Vaughan, Daphney-Stavroula Zois 4461 | A CLASSIFIER FOR IMPROVING CAUSE AND EFFECT IN SSVEP-BASED BCIS FOR INDIVIDUALS WITH COMPLEX COMMUNICATION DISORDERS |
3835 A CLOSED-LOOP GAIN-CONTROL FEEDBACK MODEL FOR THE MEDIAL EFFERENT SYSTEM OF THE DESCENDING AUDITORY PATHWAY Afagh Farhadi, Skyler G. Jennings, Elizabeth A. Strickland, Laurel H. Carney 3835 | A CLOSED-LOOP GAIN-CONTROL FEEDBACK MODEL FOR THE MEDIAL EFFERENT SYSTEM OF THE DESCENDING AUDITORY PATHWAY |
2665 A CLOSER LOOK AT AUDIO-VISUAL MULTI-PERSON SPEECH RECOGNITION AND ACTIVE SPEAKER SELECTION Otavio Braga, Olivier Siohan 2665 | A CLOSER LOOK AT AUDIO-VISUAL MULTI-PERSON SPEECH RECOGNITION AND ACTIVE SPEAKER SELECTION |
2052 A CO-INTERACTIVE TRANSFORMER FOR JOINT SLOT FILLING AND INTENT DETECTION Libo Qin, Tailu Liu, Wanxiang Che, Bingbing Kang, Sendong Zhao, Ting Liu 2052 | A CO-INTERACTIVE TRANSFORMER FOR JOINT SLOT FILLING AND INTENT DETECTION |
2403 A COLOR DOPPLER PROCESSING ENGINE WITH AN ADAPTIVE CLUTTER FILTER FOR PORTABLE ULTRASOUND IMAGING DEVICES Yi-Lin Lo, Chia-Hsiang Yang 2403 | A COLOR DOPPLER PROCESSING ENGINE WITH AN ADAPTIVE CLUTTER FILTER FOR PORTABLE ULTRASOUND IMAGING DEVICES |
4778 A COMPACT JOINT DISTILLATION NETWORK FOR VISUAL FOOD RECOGNITION Heng Zhao, Kim-Hui Yap, Alex Chichung Kot 4778 | A COMPACT JOINT DISTILLATION NETWORK FOR VISUAL FOOD RECOGNITION |
3882 A COMPARATIVE STUDY OF ACOUSTIC AND LINGUISTIC FEATURES CLASSIFICATION FOR ALZHEIMER’S DISEASE DETECTION Jinchao Li, Jianwei Yu, Ye Zi, Simon Wong, Manwai Mak, Brian Mak, Xunying Liu, Helen Meng 3882 | A COMPARATIVE STUDY OF ACOUSTIC AND LINGUISTIC FEATURES CLASSIFICATION FOR ALZHEIMER’S DISEASE DETECTION |
3630 A COMPARISON OF CONVOLUTIONAL NEURAL NETWORKS FOR GLOTTAL CLOSURE INSTANT DETECTION FROM RAW SPEECH Jindrich Matousek, Daniel Tihelka 3630 | A COMPARISON OF CONVOLUTIONAL NEURAL NETWORKS FOR GLOTTAL CLOSURE INSTANT DETECTION FROM RAW SPEECH |
2210 A Comparison of Discrete Latent Variable Models for Speech Representation Learning Henry Zhou, Alexei Baevski, Michael Auli 2210 | A Comparison of Discrete Latent Variable Models for Speech Representation Learning |
5143 A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET Rudolf A Braun, Srikanth Madikeri, Petr Motlicek 5143 | A COMPARISON OF METHODS FOR OOV-WORD RECOGNITION ON A NEW PUBLIC DATASET |
4262 A COMPARISON STUDY ON INFANT-PARENT VOICE DIARIZATION Junzhe Zhu, Mark Hasegawa-Johnson, Nancy McElwain 4262 | A COMPARISON STUDY ON INFANT-PARENT VOICE DIARIZATION |
3645 A CONSENSUS EQUILIBRIUM SOLUTION FOR DEEP IMAGE PRIOR POWERED BY RED Rakib Hyder, Hassan Mansour, Yanting Ma, Petros Boufounos, Pu Wang 3645 | A CONSENSUS EQUILIBRIUM SOLUTION FOR DEEP IMAGE PRIOR POWERED BY RED |
2384 A CONVEX PENALTY FOR BLOCK-SPARSE SIGNALS WITH UNKNOWN STRUCTURES Hiroki Kuroda, Daichi Kitahara, Akira Hirabayashi 2384 | A CONVEX PENALTY FOR BLOCK-SPARSE SIGNALS WITH UNKNOWN STRUCTURES |
2719 A CORRENTROPY BASED ALGORITHM FOR ROBUST LOCALIZATION IN WIRELESS NETWORKS Mahboobeh Sedighizad, Babak Seyfe, Shahrokh Valaee 2719 | A CORRENTROPY BASED ALGORITHM FOR ROBUST LOCALIZATION IN WIRELESS NETWORKS |
3389 A CURATED DATASET OF URBAN SCENES FOR AUDIO-VISUAL SCENE ANALYSIS Shanshan Wang, Annamaria Mesaros, Toni Heittola, Tuomas Virtanen 3389 | A CURATED DATASET OF URBAN SCENES FOR AUDIO-VISUAL SCENE ANALYSIS |
4386 A DECENTRALIZED VARIANCE-REDUCED METHOD FOR STOCHASTIC OPTIMIZATION OVER DIRECTED GRAPHS Muhammad Qureshi, Ran Xin, Soummya Kar, Usman Khan 4386 | A DECENTRALIZED VARIANCE-REDUCED METHOD FOR STOCHASTIC OPTIMIZATION OVER DIRECTED GRAPHS |
4113 A DEEP REINFORCEMENT LEARNING APPROACH TO AUDIO-BASED NAVIGATION IN A MULTI-SPEAKER ENVIRONMENT Petros Giannakopoulos, Aggelos Pikrakis, Yannis Cotronis 4113 | A DEEP REINFORCEMENT LEARNING APPROACH TO AUDIO-BASED NAVIGATION IN A MULTI-SPEAKER ENVIRONMENT |
2732 A DEEP SPATIO-TEMPORAL MODEL FOR EEG-BASED IMAGINED SPEECH RECOGNITION Pradeep Kumar, Erik Scheme 2732 | A DEEP SPATIO-TEMPORAL MODEL FOR EEG-BASED IMAGINED SPEECH RECOGNITION |
2189 A Diffusion FxLMS Algorithm for Multi-Channel Active Noise Control and Variable Spatial Smoothing Yijing CHU, S. C. CHAN, C. M. MAK, Ming Wu 2189 | A Diffusion FxLMS Algorithm for Multi-Channel Active Noise Control and Variable Spatial Smoothing |
2106 A DNN AUTOENCODER FOR AUTOMOTIVE RADAR INTERFERENCE MITIGATION Shengyi Chen, Jalal Taghia, Tai Fei, Uwe Kühnau, Nils Pohl, Rainer Martin 2106 | A DNN AUTOENCODER FOR AUTOMOTIVE RADAR INTERFERENCE MITIGATION |
3750 A Dynamical Systems Perspective on Online Bayesian Nonparametric Estimators with Adaptive Hyperparameters Alec Koppel, Amrit Singh Bedi, Vikram Krishnamurthy 3750 | A Dynamical Systems Perspective on Online Bayesian Nonparametric Estimators with Adaptive Hyperparameters |
2039 A FAST AND EFFICIENT NETWORK FOR SINGLE IMAGE DERAINING Youzhao Yang, Hong Lu 2039 | A FAST AND EFFICIENT NETWORK FOR SINGLE IMAGE DERAINING |
3483 A FAST RANDOMIZED ADAPTIVE CP DECOMPOSITION FOR STREAMING TENSORS Trung Thanh LE, Karim Abed-Meraim, Linh Trung Nguyen, Adel Hafiane 3483 | A FAST RANDOMIZED ADAPTIVE CP DECOMPOSITION FOR STREAMING TENSORS |
2391 A FEATURES DECOUPLING METHOD FOR MULTIPLE MANIPULATIONS IDENTIFICATION IN IMAGE OPERATION CHAINS Jiaxin Chen, Xin Liao, Wei Wang, Zheng Qin 2391 | A FEATURES DECOUPLING METHOD FOR MULTIPLE MANIPULATIONS IDENTIFICATION IN IMAGE OPERATION CHAINS |
3293 A FLOW-BASED NEURAL NETWORK FOR TIME DOMAIN SPEECH ENHANCEMENT Martin Strauss, Bernd Edler 3293 | A FLOW-BASED NEURAL NETWORK FOR TIME DOMAIN SPEECH ENHANCEMENT |
2261 A FRAMEWORK FOR PRUNING DEEP NEURAL NETWORKS USING ENERGY-BASED MODELS Hojjat Salehinejad, Shahrokh Valaee 2261 | A FRAMEWORK FOR PRUNING DEEP NEURAL NETWORKS USING ENERGY-BASED MODELS |
3155 A FURTHER STUDY OF UNSUPERVISED PRETRAINING FOR TRANSFORMER BASED SPEECH RECOGNITION Dongwei Jiang, Wubo Li, Ruixiong Zhang, Miao Cao, Ne Luo, Yang Han, Wei Zou, Kun Han, Xiangang Li 3155 | A FURTHER STUDY OF UNSUPERVISED PRETRAINING FOR TRANSFORMER BASED SPEECH RECOGNITION |
4036 A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks Yun Tang, Juan Pino, Changhan Wang, Xutai Ma, Dmitriy Genzel 4036 | A General Multi-Task Learning Framework to Leverage Text Data for Speech to Text Tasks |
3799 A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK Thi Ngoc Tho Nguyen, Ngoc Khanh Nguyen, Huy Phan, Lam Pham, Kenneth Ooi, Douglas L. Jones, Woon-Seng Gan 3799 | A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK |
5624 A GENERALIZED ACCELERATED COMPOSITE GRADIENT METHOD: UNITING NESTEROV'S FAST GRADIENT METHOD AND FISTA Mihai I. Florea, Sergiy A. Vorobyov 5624 | A GENERALIZED ACCELERATED COMPOSITE GRADIENT METHOD: UNITING NESTEROV'S FAST GRADIENT METHOD AND FISTA |
2965 A GLOBAL CAYLEY PARAMETRIZATION OF STIEFEL MANIFOLD FOR DIRECT UTILIZATION OF OPTIMIZATION MECHANISMS OVER VECTOR SPACE Keita Kume, Isao Yamada 2965 | A GLOBAL CAYLEY PARAMETRIZATION OF STIEFEL MANIFOLD FOR DIRECT UTILIZATION OF OPTIMIZATION MECHANISMS OVER VECTOR SPACE |
1849 A GLOBAL-LOCAL ATTENTION FRAMEWORK FOR WEAKLY LABELLED AUDIO TAGGING Helin Wang, Yuexian Zou, Wenwu Wang 1849 | A GLOBAL-LOCAL ATTENTION FRAMEWORK FOR WEAKLY LABELLED AUDIO TAGGING |
4457 A Graph Learning Algorithm Based on Gaussian Markov Random Fields and Minimax Concave Penalty Tatsuya Koyakumaru, Masahiro Yukawa, Eduardo Pavez, Antonio Ortega 4457 | A Graph Learning Algorithm Based on Gaussian Markov Random Fields and Minimax Concave Penalty |
4058 A HIERARCHICAL SUBSPACE MODEL FOR LANGUAGE-ATTUNED ACOUSTIC UNIT DISCOVERY Bolaji Yusuf, Lucas Ondel, Lukas Burget, Jan Cernocky, Murat Saraclar 4058 | A HIERARCHICAL SUBSPACE MODEL FOR LANGUAGE-ATTUNED ACOUSTIC UNIT DISCOVERY |
3242 A HIGH-FRAME-RATE EYE-TRACKING FRAMEWORK FOR MOBILE DEVICES Yuhu Chang, Changyang He, Tun Lu, Ning Gu 3242 | A HIGH-FRAME-RATE EYE-TRACKING FRAMEWORK FOR MOBILE DEVICES |
2687 A HOMOGENEITY-BASED MULTISCALE HYPERSPECTRAL IMAGE REPRESENTATION FOR SPARSE SPECTRAL UNMIXING Luciano Ayres, Sérgio de Almeida, José Bermudez, Ricardo Borsoi 2687 | A HOMOGENEITY-BASED MULTISCALE HYPERSPECTRAL IMAGE REPRESENTATION FOR SPARSE SPECTRAL UNMIXING |
4185 A Hybrid Approach to Coded Compressed Sensing where Coupling Takes Place via the Outer Code Jamison Ebert, Vamsi Amalladinne, Jean-Francois Chamberland, Krishna Narayanan 4185 | A Hybrid Approach to Coded Compressed Sensing where Coupling Takes Place via the Outer Code |
1545 A HYBRID CNN-BILSTM VOICE ACTIVITY DETECTOR Nicholas Wilkinson, Thomas Niesler 1545 | A HYBRID CNN-BILSTM VOICE ACTIVITY DETECTOR |
3449 A HYBRID FEATURE ENHANCEMENT METHOD FOR GLAND SEGMENTATION IN HISTOPATHOLOGY IMAGES Xiangjiang Wu, Xuanya Li, Kai Hu, Zhineng Chen, Xieping Gao 3449 | A HYBRID FEATURE ENHANCEMENT METHOD FOR GLAND SEGMENTATION IN HISTOPATHOLOGY IMAGES |
5122 A JOINT CONVOLUTIONAL AND SPATIAL QUAD-DIRECTIONAL LSTM NETWORK FOR PHASE UNWRAPPING Malsha V. Perera, Ashwin De Silva 5122 | A JOINT CONVOLUTIONAL AND SPATIAL QUAD-DIRECTIONAL LSTM NETWORK FOR PHASE UNWRAPPING |
2907 A JOINT TRAINING FRAMEWORK OF MULTI-LOOK SEPARATOR AND SPEAKER EMBEDDING EXTRACTOR FOR OVERLAPPED SPEECH Naijun Zheng, Na Li, Bo Wu, Meng Yu, JianWei Yu, Chao Weng, Dan Su, XunYing Liu, Helen Meng 2907 | A JOINT TRAINING FRAMEWORK OF MULTI-LOOK SEPARATOR AND SPEAKER EMBEDDING EXTRACTOR FOR OVERLAPPED SPEECH |
3679 A LARGE-DIMENSIONAL ANALYSIS OF SYMMETRIC SNE Charles Séjourné, Romain Couillet, Pierre Comon 3679 | A LARGE-DIMENSIONAL ANALYSIS OF SYMMETRIC SNE |
3532 A Large-Scale Chinese Long-text Extractive Summarization Corpus Kai Chen, Guanyu Fu, Qingcai Chen, Baotian Hu 3532 | A Large-Scale Chinese Long-text Extractive Summarization Corpus |
5332 A Layered Embedding-Based Scheme To Cope With Intra-frame Distortion Drift In IPM-Based HEVC Steganography Xiaoqing Jia, Jie Wang, Yongliang Liu, Xiangui Kang, Yunqing Shi 5332 | A Layered Embedding-Based Scheme To Cope With Intra-frame Distortion Drift In IPM-Based HEVC Steganography |
3118 A LOW-COMPLEXITY ADMM-BASED MASSIVE MIMO DETECTORS VIA DEEP NEURAL NETWORKS Isayiyas Nigatu Tiba, Quan Zhang, Jing Jiang, Yongchao Wang 3118 | A LOW-COMPLEXITY ADMM-BASED MASSIVE MIMO DETECTORS VIA DEEP NEURAL NETWORKS |
3000 A LOW-COMPLEXITY MIMO DUAL FUNCTION RADAR COMMUNICATION SYSTEM VIA ONE-BIT SAMPLING Siyu Zhu, Feng Xi, Shengyao Chen, Arye Nehorai 3000 | A LOW-COMPLEXITY MIMO DUAL FUNCTION RADAR COMMUNICATION SYSTEM VIA ONE-BIT SAMPLING |
4592 A META-LEARNING FRAMEWORK FOR FEW-SHOT CLASSIFICATION OF REMOTE SENSING SCENE Pei Zhang, Yunpeng Bai, Dong Wang, Bendu Bai, Ying LI 4592 | A META-LEARNING FRAMEWORK FOR FEW-SHOT CLASSIFICATION OF REMOTE SENSING SCENE |
1195 A METHOD FOR DETERMINING PERIODICALLY TIME-VARYING BIAS AND ITS APPLICATIONS IN ACOUSTIC FEEDBACK CANCELLATION Meng Guo 1195 | A METHOD FOR DETERMINING PERIODICALLY TIME-VARYING BIAS AND ITS APPLICATIONS IN ACOUSTIC FEEDBACK CANCELLATION |
5625 A MNEMONIC KALMAN FILTER FOR NON-LINEAR SYSTEMS WITH EXTENSIVE TEMPORAL DEPENDENCIES Steffen Jung, Isabel Schlangen, Alexander Charlish 5625 | A MNEMONIC KALMAN FILTER FOR NON-LINEAR SYSTEMS WITH EXTENSIVE TEMPORAL DEPENDENCIES |
4678 A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT Tyler Vuong, Yangyang Xia, Richard Stern 4678 | A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT |
4280 A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION You Wang, Chuyao Feng, David Anderson 4280 | A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION |
4970 A Multi-layer Multi-channel Attentive Network for Gender and Age Recognition Jia Chen, Haiping Yu, Yimei Kang 4970 | A Multi-layer Multi-channel Attentive Network for Gender and Age Recognition |
4443 A MULTIPLE ACCESS CHANNEL GAME USING LATENCY METRIC Andrey Garnaev, Athina Petropulu, Wade Trappe 4443 | A MULTIPLE ACCESS CHANNEL GAME USING LATENCY METRIC |
4138 A MULTI-VIEW APPROACH TO AUDIO-VISUAL SPEAKER VERIFICATION Leda Sari, Kritika Singh, Jiatong Zhou, Lorenzo Torresani, Nayan Singhal, Yatharth Saraf 4138 | A MULTI-VIEW APPROACH TO AUDIO-VISUAL SPEAKER VERIFICATION |
2723 A NEURAL ACOUSTIC ECHO CANCELLER OPTIMIZED USING AN AUTOMATIC SPEECH RECOGNIZER AND LARGE SCALE SYNTHETIC DATA Nathan Howard, Alex Park, Turaj Shabestary, Alexander Gruenstein, Rohit Prabhavalkar 2723 | A NEURAL ACOUSTIC ECHO CANCELLER OPTIMIZED USING AN AUTOMATIC SPEECH RECOGNIZER AND LARGE SCALE SYNTHETIC DATA |
4290 A NEURAL TEXT-TO-SPEECH MODEL UTILIZING BROADCAST DATA MIXED WITH BACKGROUND MUSIC Hanbin Bae, Jae-Sung Bae, Young-Sun Joo, Young-Ik Kim, Hoon-Young Cho 4290 | A NEURAL TEXT-TO-SPEECH MODEL UTILIZING BROADCAST DATA MIXED WITH BACKGROUND MUSIC |
2556 A NEW AUTOMOTIVE RADAR 4D POINT CLOUDS DETECTOR BY USING DEEP LEARNING Yuwei Cheng, Jingran Su, Hongyu Chen, Yimin Liu 2556 | A NEW AUTOMOTIVE RADAR 4D POINT CLOUDS DETECTOR BY USING DEEP LEARNING |
2568 A NEW DCASE 2017 RARE SOUND EVENT DETECTION BENCHMARK UNDER EQUAL TRAINING DATA: CRNN WITH MULTI-WIDTH KERNELS Jan Baumann, Patrick Meyer, Timo Lohrenz, Alexander Roy, Michael Papendieck, Tim Fingscheidt 2568 | A NEW DCASE 2017 RARE SOUND EVENT DETECTION BENCHMARK UNDER EQUAL TRAINING DATA: CRNN WITH MULTI-WIDTH KERNELS |
5597 A NEW DIFFUSION VARIABLE SPATIAL REGULARIZED QRRLS ALGORITHM Yijing CHU, S. C. CHAN, Yi Zhou, Ming WU 5597 | A NEW DIFFUSION VARIABLE SPATIAL REGULARIZED QRRLS ALGORITHM |
1558 A NEW FRAMEWORK BASED ON TRANSFER LEARNING FOR CROSS-DATABASE PNEUMONIA DETECTION Xinxin Shan, Ying Wen 1558 | A NEW FRAMEWORK BASED ON TRANSFER LEARNING FOR CROSS-DATABASE PNEUMONIA DETECTION |
2025 A NEW HIGH QUALITY TRAJECTORY TILING BASED HYBRID TTS IN REAL TIME Feng-Long Xie, Xin-Hui Li, Wen-Chao Su, Li Lu, Frank Soong 2025 | A NEW HIGH QUALITY TRAJECTORY TILING BASED HYBRID TTS IN REAL TIME |
2997 A NEW TUBULAR STRUCTURE TRACKING ALGORITHM BASED ON CURVATURE-PENALIZED PERCEPTUAL GROUPING Li Liu, Da Chen, Minglei Shu, Huazhong Shu, Laurent Cohen 2997 | A NEW TUBULAR STRUCTURE TRACKING ALGORITHM BASED ON CURVATURE-PENALIZED PERCEPTUAL GROUPING |
3230 A NOISE-ROBUST SIGNAL PROCESSING STRATEGY FOR COCHLEAR IMPLANTS USING NEURAL NETWORKS Nengheng Zheng, Yupeng Shi, Yuyong Kang, Qinglin Meng 3230 | A NOISE-ROBUST SIGNAL PROCESSING STRATEGY FOR COCHLEAR IMPLANTS USING NEURAL NETWORKS |
3409 A NOVEL ATTENTION-BASED GATED RECURRENT UNIT AND ITS EFFICACY IN SPEECH EMOTION RECOGNITION Srividya Tirunellai Rajamani, Kumar T. Rajamani, Adria Mallol-Ragolta, Shuo Liu, Björn Schuller 3409 | A NOVEL ATTENTION-BASED GATED RECURRENT UNIT AND ITS EFFICACY IN SPEECH EMOTION RECOGNITION |
3861 A NOVEL BAYESIAN APPROACH FOR THE TWO-DIMENSIONAL HARMONIC RETRIEVAL PROBLEM Rohan R. Pote, Bhaskar D. Rao 3861 | A NOVEL BAYESIAN APPROACH FOR THE TWO-DIMENSIONAL HARMONIC RETRIEVAL PROBLEM |
4745 A NOVEL CONVOLUTIONAL NEURAL NETWORK MODEL TO REMOVE MUSCLE ARTIFACTS FROM EEG Haoming Zhang, Chen Wei, Mingqi Zhao, Haiyan Wu, Quanying Liu 4745 | A NOVEL CONVOLUTIONAL NEURAL NETWORK MODEL TO REMOVE MUSCLE ARTIFACTS FROM EEG |
4260 A NOVEL END-TO-END SPEECH EMOTION RECOGNITION NETWORK WITH STACKED TRANSFORMER LAYERS Xianfeng Wang, Min Wang, Wenbo Qi, Wanqi Su, Xiangqian Wang, Huan Zhou 4260 | A NOVEL END-TO-END SPEECH EMOTION RECOGNITION NETWORK WITH STACKED TRANSFORMER LAYERS |
4922 A NOVEL NMF-HMM SPEECH ENHANCEMENT ALGORITHM BASED ON POISSON MIXTURE MODEL Yang Xiang, Liming Shi, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Græsbøll Christensen 4922 | A NOVEL NMF-HMM SPEECH ENHANCEMENT ALGORITHM BASED ON POISSON MIXTURE MODEL |
5267 A NOVEL VIEWPORT-ADAPTIVE MOTION COMPENSATION TECHNIQUE FOR FISHEYE VIDEO Andy Regensky, Christian Herglotz, André Kaup 5267 | A NOVEL VIEWPORT-ADAPTIVE MOTION COMPENSATION TECHNIQUE FOR FISHEYE VIDEO |
4020 A PARALLEL ALGORITHM FOR PHASE RETRIEVAL WITH DICTIONARY LEARNING Tianyi Liu, Andreas M. Tillmann, Yang Yang, Yonina C. Eldar, Marius Pesavento 4020 | A PARALLEL ALGORITHM FOR PHASE RETRIEVAL WITH DICTIONARY LEARNING |
4992 A PARALLELIZABLE LATTICE RESCORING STRATEGY WITH NEURAL LANGUAGE MODELS Ke Li, Daniel Povey, Sanjeev Khudanpur 4992 | A PARALLELIZABLE LATTICE RESCORING STRATEGY WITH NEURAL LANGUAGE MODELS |
1230 A parametric unconstrained binaural beamformer based noise reduction and spatial cue preservation for hearing-assistive devices Jie Zhang 1230 | A parametric unconstrained binaural beamformer based noise reduction and spatial cue preservation for hearing-assistive devices |
2637 A PARTIALLY COLLAPSED GIBBS SAMPLER FOR UNSUPERVISED NONNEGATIVE SPARSE SIGNAL RESTORATION Mehdi Chahine AMROUCHE, Hervé CARFANTAN, Jérôme IDIER 2637 | A PARTIALLY COLLAPSED GIBBS SAMPLER FOR UNSUPERVISED NONNEGATIVE SPARSE SIGNAL RESTORATION |
3146 A PARTIALLY-RELAXED ROBUST DOA ESTIMATOR UNDER NON-GAUSSIAN LOW-RANK INTERFERENCE AND NOISE Minh Trinh-Hoang, Mohammed Nabil El Korso, Marius Pesavento 3146 | A PARTIALLY-RELAXED ROBUST DOA ESTIMATOR UNDER NON-GAUSSIAN LOW-RANK INTERFERENCE AND NOISE |
3653 A PATIENT-INVARIANT MODEL FOR FREEZING OF GAIT DETECTION AIDED BY WAVELET DECOMPOSITION Nasimuddin Ahmed, Shivam Singhal, Varsha Sharma, Sakyajit Bhattacharya, Aniruddha Sinha, Avik Ghose 3653 | A PATIENT-INVARIANT MODEL FOR FREEZING OF GAIT DETECTION AIDED BY WAVELET DECOMPOSITION |
4476 A PERIODIC FRAME LEARNING APPROACH FOR ACCURATE LANDMARK LOCALIZATION IN M-MODE ECHOCARDIOGRAPHY Yinbing Tian, Shibiao Xu, Li Guo, Fuze Cong 4476 | A PERIODIC FRAME LEARNING APPROACH FOR ACCURATE LANDMARK LOCALIZATION IN M-MODE ECHOCARDIOGRAPHY |
2359 A PLUG AND PLAY FAST INTERSECTION OVER UNION LOSS FOR BOUNDARY BOX REGRESSION Zengsheng Kuang, Xian Fang, Ruixun Zhang, Xiuli Shao, Hongpeng Wang 2359 | A PLUG AND PLAY FAST INTERSECTION OVER UNION LOSS FOR BOUNDARY BOX REGRESSION |
3114 A PLUG-AND-PLAY DEEP IMAGE PRIOR Zhaodong Sun, Fabian Latorre, Thomas Sanchez, Volkan Cevher 3114 | A PLUG-AND-PLAY DEEP IMAGE PRIOR |
1983 A PROBABILISTIC MODEL FOR SEGMENTATION OF AMBIGUOUS 3D LUNG NODULE Xiaojiang Long, Wei Chen, Qiuli Wang, Xiaohong Zhang, Chen Liu, Yucong Li, Jiuquan Zhang 1983 | A PROBABILISTIC MODEL FOR SEGMENTATION OF AMBIGUOUS 3D LUNG NODULE |
4673 A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION Zhaoxu Nian, Yan-Hui Tu, Jun Du, Chin-Hui Lee 4673 | A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION |
4000 A QUANTITATIVE ANALYSIS OF THE ROBUSTNESS OF NEURAL NETWORKS FOR TABULAR DATA Kavya Gupta, Beatrice Pesquet-Popescu, Fateh Kaakai, Jean-Christophe Pesquet 4000 | A QUANTITATIVE ANALYSIS OF THE ROBUSTNESS OF NEURAL NETWORKS FOR TABULAR DATA |
4701 A QUANTITATIVE METRIC FOR PRIVACY LEAKAGE IN FEDERATED LEARNING Yong Liu, Xinghua Zhu, Jianzong Wang, Jing Xiao 4701 | A QUANTITATIVE METRIC FOR PRIVACY LEAKAGE IN FEDERATED LEARNING |
2031 A RANK-CONSTRAINED CLUSTERING ALGORITHM WITH ADAPTIVE EMBEDDING Shenfei Pei, Feiping Nie, Rong Wang, Xuelong Li 2031 | A RANK-CONSTRAINED CLUSTERING ALGORITHM WITH ADAPTIVE EMBEDDING |
4212 A RANKED SIMILARITY LOSS FUNCTION WITH PAIR WEIGHTING FOR DEEP METRIC LEARNING Jian Wang, Zhichao Zhang, Dongmei Huang, Quanmiao Wei 4212 | A RANKED SIMILARITY LOSS FUNCTION WITH PAIR WEIGHTING FOR DEEP METRIC LEARNING |
4539 A real-time speaker diarization system based on spatial spectrum Siqi Zheng, Weilong Huang, Xianliang Wang, Hongbin Suo, Jinwei Feng, Zhijie Yan 4539 | A real-time speaker diarization system based on spatial spectrum |
4848 A ReLU Dense Layer to Improve the Performance of Neural Networks Alireza M. Javid, Sandipan Das, Mikael Skoglund, Saikat Chatterjee 4848 | A ReLU Dense Layer to Improve the Performance of Neural Networks |
4949 A ROBUST AND EFFICIENT MULTI-SCALE SEASONAL-TREND DECOMPOSITION Linxiao Yang, Qingsong Wen, Bo Yang, Liang Sun 4949 | A ROBUST AND EFFICIENT MULTI-SCALE SEASONAL-TREND DECOMPOSITION |
4993 A ROBUST COPULA MODEL FOR RADAR-BASED LANDMINE DETECTION Afief D. Pambudi, Fauzia Ahmad, Abdelhak M. Zoubir 4993 | A ROBUST COPULA MODEL FOR RADAR-BASED LANDMINE DETECTION |
3964 A ROBUST TO NOISE ADVERSARIAL RECURRENT MODEL FOR NON-INTRUSIVE LOAD MONITORING Maria Kaselimi, Athanasios Voulodimos, Nikolaos Doulamis, Anastasios Doulamis, Eftychios Protopapadakis 3964 | A ROBUST TO NOISE ADVERSARIAL RECURRENT MODEL FOR NON-INTRUSIVE LOAD MONITORING |
3911 A SAMPLE-EFFICIENT SCHEME FOR CHANNEL RESOURCE ALLOCATION IN NETWORKED ESTIMATION Marcos Vasconcelos, Urbashi Mitra 3911 | A SAMPLE-EFFICIENT SCHEME FOR CHANNEL RESOURCE ALLOCATION IN NETWORKED ESTIMATION |
4575 A SCALE INVARIANT MEASURE OF FLATNESS FOR DEEP NETWORK MINIMA Akshay Rangamani, Nam Nguyen, Abhishek Kumar, Dzung Phan, Sang Chin, Trac Tran 4575 | A SCALE INVARIANT MEASURE OF FLATNESS FOR DEEP NETWORK MINIMA |
1753 A SECURE SEARCHABLE IMAGE RETRIEVAL SCHEME WITH CORRECT RETRIEVAL IDENTITY Liejun Wang, Haitao Yu 1753 | A SECURE SEARCHABLE IMAGE RETRIEVAL SCHEME WITH CORRECT RETRIEVAL IDENTITY |
1748 A SEQUENTIAL CONTRASTIVE LEARNING FRAMEWORK FOR ROBUST DYSARTHRIC SPEECH RECOGNITION Lidan Wu, Daoming Zong, Jing Zhao, Shiliang Sun 1748 | A SEQUENTIAL CONTRASTIVE LEARNING FRAMEWORK FOR ROBUST DYSARTHRIC SPEECH RECOGNITION |
4415 A short tutorial on the Weisfeiler-Lehman test and its variants Ningyuan (Teresa) Huang, Soledad Villar 4415 | A short tutorial on the Weisfeiler-Lehman test and its variants |
3043 A SIMPLIFIED WIENER BEAMFORMER BASED ON COVARIANCE MATRIX MODELLING Fan Zhang, Chao Pan, Jacob Benesty, Jingdong Chen 3043 | A SIMPLIFIED WIENER BEAMFORMER BASED ON COVARIANCE MATRIX MODELLING |
2821 A SPARSE CODING APPROACH TO AUTOMATIC DIET MONITORING WITH CONTINUOUS GLUCOSE MONITORS Anurag Das, Bobak Mortazavi, Theodora Chaspari, Seyedhooman Sajjadi, Projna Paromita, Laura Ruebush, Nicolaas Deutz, Ricardo Gutierrez-Osuna 2821 | A SPARSE CODING APPROACH TO AUTOMATIC DIET MONITORING WITH CONTINUOUS GLUCOSE MONITORS |
2350 A STAGE MATCH FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION BASED ON STRUCTURE INFORMATION OF QUERY Junyao Zhan, Qianhua He, Jianbin Su, Yanxiong Li 2350 | A STAGE MATCH FOR QUERY-BY-EXAMPLE SPOKEN TERM DETECTION BASED ON STRUCTURE INFORMATION OF QUERY |
1911 A Stochastic Compositional Optimization Method with Applications to Meta Learning Yuejiao Sun, Tianyi Chen, Wotao Yin 1911 | A Stochastic Compositional Optimization Method with Applications to Meta Learning |
2606 A structure-guided and sparse-representation-based 3D seismic inversion method Bin She, Yaojun Wang, Guangmin Hu 2606 | A structure-guided and sparse-representation-based 3D seismic inversion method |
1068 A TECHNIQUE FOR OFDM SYMBOL SLICING Ana Perez-Neira, Miguel A. Lagunas 1068 | A TECHNIQUE FOR OFDM SYMBOL SLICING |
1324 A Time-domain Convolutional Recurrent Network for Packet Loss Concealment Ju Lin, Yun Wang, Kaustubh Kalgaonkar, Gil Keren, Didi Zhang, Christian Fuegen 1324 | A Time-domain Convolutional Recurrent Network for Packet Loss Concealment |
2398 A Triplet Appearance Parsing Network for Person Re-Identification Mingfu Xiong, Zhongyuan Wang, Ruhan He, Xinrong Hu, Ming Cheng, Xiao Qin, Jia Chen 2398 | A Triplet Appearance Parsing Network for Person Re-Identification |
2953 A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION Hu Hu, Chao-Han Yang, Xianjun Xia, Xue Bai, Xin Tang, Yajian Wang, Shutong Niu, Li Chai, Juanjuan Li, Hongning Zhu, Feng Bao, Yuanjun Zhao, Sabato Marco Siniscalchi, Yannan Wang, Jun Du, Chin-Hui Lee 2953 | A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION |
3260 A Two-Stage Deep Modeling Approach to Articulatory Inversion Abdolreza Sabzi Shahrebabaki, Negar Olfati, Ali Shariq Imran, Magne Hallstein Johnsen, Sabato Marco Siniscalchi, Torbjørn Karl Svendsen 3260 | A Two-Stage Deep Modeling Approach to Articulatory Inversion |
2413 A Tyler-type estimator of location and scatter leveraging Riemannian optimization Antoine Collas, Florent Bouchard, Arnaud Breloy, Chengfang Ren, Guillaume Ginolhac, Jean-Philippe Ovarlez 2413 | A Tyler-type estimator of location and scatter leveraging Riemannian optimization |
2110 A UNIFIED APPROACH TO TRANSLATE CLASSICAL BANDIT ALGORITHMS TO STRUCTURED BANDITS Samarth Gupta, Shreyas Chaudhari, Subhojyoti Mukherjee, Gauri Joshi, Osman Yagan 2110 | A UNIFIED APPROACH TO TRANSLATE CLASSICAL BANDIT ALGORITHMS TO STRUCTURED BANDITS |
5005 A Universal BERT-BASED Front-end Model for Mandarin Text-To-Speech Synthesis Zilong Bai, Beibei Hu 5005 | A Universal BERT-BASED Front-end Model for Mandarin Text-To-Speech Synthesis |
4306 A WIRELESS REFERENCE ACTIVE NOISE CONTROL HEADPHONE USING COHERENCE BASED SELECTION TECHNIQUE Xiaoyi Shen, Dongyuan Shi, Woon-Seng Gan 4306 | A WIRELESS REFERENCE ACTIVE NOISE CONTROL HEADPHONE USING COHERENCE BASED SELECTION TECHNIQUE |
2249 ABSOLUTE 3D POSE ESTIMATION AND LENGTH MEASUREMENT OF SEVERELY DEFORMED FISH FROM MONOCULAR VIDEOS IN LONGLINE FISHING Jie Mei, Jenq-Neng Hwang, Suzanne Romain, Craig Rose, Braden Moore, Kelsey Magrane 2249 | ABSOLUTE 3D POSE ESTIMATION AND LENGTH MEASUREMENT OF SEVERELY DEFORMED FISH FROM MONOCULAR VIDEOS IN LONGLINE FISHING |
3024 ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION Kazuki Shimada, Yuichiro Koyama, Naoya Takahashi, Shusuke Takahashi, Yuki Mitsufuji 3024 | ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION |
1974 ACCELERATING AUXILIARY FUNCTION-BASED INDEPENDENT VECTOR ANALYSIS Andreas Brendel, Walter Kellermann 1974 | ACCELERATING AUXILIARY FUNCTION-BASED INDEPENDENT VECTOR ANALYSIS |
1518 ACCELERATING FRANK-WOLFE WITH WEIGHTED AVERAGE GRADIENTS Yilang Zhang, Bingcong Li, Georgios Giannakis 1518 | ACCELERATING FRANK-WOLFE WITH WEIGHTED AVERAGE GRADIENTS |
3815 ACOUSTIC ANALYSIS AND DATASET OF TRANSITIONS BETWEEN COUPLED ROOMS Thomas McKenzie, Sebastian J. Schlecht, Ville Pulkki 3815 | ACOUSTIC ANALYSIS AND DATASET OF TRANSITIONS BETWEEN COUPLED ROOMS |
2871 Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease Paula Andrea Pérez-Toro, Juan Camilo Vásquez-Correa, Tomás Arias-Vergara, Philipp Klumpp, Melissa Sierra-Castrillón, Mildred Estefania Roldán-López, David Aguillón, Liliana Hincapié-Henao, Carlos Andrés Tobón-Quintero, Tobias Bocklet, Maria Schuster, Juan Rafael Orozco-Arroyave, Elmar Nöth 2871 | Acoustic and Linguistic Analyses to Assess Early-Onset and Genetic Alzheimer's Disease |
4070 Acoustic echo cancellation with the dual-signal transformation LSTM network Nils L. Westhausen, Bernd T. Meyer 4070 | Acoustic echo cancellation with the dual-signal transformation LSTM network |
3143 ACOUSTIC REFLECTORS LOCALIZATION FROM STEREO RECORDINGS USING NEURAL NETWORKS Giovanni Bologni, Richard Heusdens, Jorge Martinez 3143 | ACOUSTIC REFLECTORS LOCALIZATION FROM STEREO RECORDINGS USING NEURAL NETWORKS |
3650 ACOUSTICS BASED INTENT RECOGNITION USING DISCOVERED PHONETIC UNITS FOR LOW RESOURCE LANGUAGES Akshat Gupta, Xinjian Li, SaiKrishna Rallabandi, Alan Black 3650 | ACOUSTICS BASED INTENT RECOGNITION USING DISCOVERED PHONETIC UNITS FOR LOW RESOURCE LANGUAGES |
4934 ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA Sarthak Kumar Maharana, Aravind Illa, Renuka Mannem, Yamini Belur, Preetie ShettyVeeramani Preethish Kumar, Seena Vengalil, Kiran Polavarapu, Nalini Atchayaram, Prasanta Kumar Ghosh 4934 | ACOUSTIC-TO-ARTICULATORY INVERSION FOR DYSARTHRIC SPEECH BY USING CROSS-CORPUS ACOUSTIC-ARTICULATORY DATA |
4014 ACTION STATE UPDATE APPROACH TO DIALOGUE MANAGEMENT Svetlana Stoyanchev, Simon Keizer, Rama Doddipatla 4014 | ACTION STATE UPDATE APPROACH TO DIALOGUE MANAGEMENT |
5414 Active Estimation from Multimodal Data Arpan Mukherjee, Ali Tajer, Pin-Yu Chen, Payel Das 5414 | Active Estimation from Multimodal Data |
4920 ACTIVE PRIVACY-UTILITY TRADE-OFF AGAINST A HYPOTHESIS TESTING ADVERSARY Ecenaz Erdemir, Pier Luigi Dragotti, Deniz Gunduz 4920 | ACTIVE PRIVACY-UTILITY TRADE-OFF AGAINST A HYPOTHESIS TESTING ADVERSARY |
3051 ACUTE LYMPHOBLASTIC LEUKEMIA DETECTION BASED ON ADAPTIVE UNSHARPENING AND DEEP LEARNING Angelo Genovese, Mahdi S. Hosseini, Vincenzo Piuri, Konstantinos N. Plataniotis, Fabio Scotti 3051 | ACUTE LYMPHOBLASTIC LEUKEMIA DETECTION BASED ON ADAPTIVE UNSHARPENING AND DEEP LEARNING |
1794 ADAPTABLE ENSEMBLE DISTILLATION yankai wang, dawei yang, wei zhang, zhe jiang, wenqiang zhang 1794 | ADAPTABLE ENSEMBLE DISTILLATION |
1298 ADAPTABLE MULTI-DOMAIN LANGUAGE MODEL FOR TRANSFORMER ASR Taewoo Lee, Min-Joong Lee, Tae Gyoon Kang, Seokyeoung Jung, Minseok Kwon, Yeona Hong, Jungin Lee, Kyoung-Gu Woo, Ho-Gyeong Kim, Jiseung Jeong, Jihyun Lee, Hosik Lee, Young Sang Choi 1298 | ADAPTABLE MULTI-DOMAIN LANGUAGE MODEL FOR TRANSFORMER ASR |
2362 ADAPTIVE BI-DIRECTIONAL ATTENTION: EXPLORING MULTI-GRANULARITY REPRESENTATIONS FOR MACHINE READING COMPREHENSION Nuo Chen, Fenglin Liu, Chenyu You, Peilin Zhou, Yuexian Zou 2362 | ADAPTIVE BI-DIRECTIONAL ATTENTION: EXPLORING MULTI-GRANULARITY REPRESENTATIONS FOR MACHINE READING COMPREHENSION |
3704 ADAPTIVE CONTENTION WINDOW DESIGN USING DEEP Q-LEARNING Abhishek Kumar, Gunjan Verma, Chirag Rao, Ananthram Swami, Santiago Segarra 3704 | ADAPTIVE CONTENTION WINDOW DESIGN USING DEEP Q-LEARNING |
1069 ADAPTIVE DUAL TREE STRUCTURE FOR SCREEN CONTENT CODING Weijia Zhu, Jizheng Xu, Li Zhang, Yue Wang 1069 | ADAPTIVE DUAL TREE STRUCTURE FOR SCREEN CONTENT CODING |
1100 ADAPTIVE FEATURE WEIGHT LEARNING FOR ROBUST CLUSTERING PROBLEM WITH SPARSE CONSTRAINT Feiping Nie, Wei Chang, Xuelong Li, Jin Xu, Gongfu Li 1100 | ADAPTIVE FEATURE WEIGHT LEARNING FOR ROBUST CLUSTERING PROBLEM WITH SPARSE CONSTRAINT |
4089 ADAPTIVE GOP SIZE DECISION FOR MULTI-PASS VIDEO CODING BASED ON HIDDEN MARKOV MODEL Bohan Li, Jingning Han, Yaowu Xu 4089 | ADAPTIVE GOP SIZE DECISION FOR MULTI-PASS VIDEO CODING BASED ON HIDDEN MARKOV MODEL |
4282 Adaptive importance sampling via auto-regressive generative models and Gaussian processes Hechuan Wang, Monica Bugallo, Petar Djuric 4282 | Adaptive importance sampling via auto-regressive generative models and Gaussian processes |
1715 ADAPTIVE MULTI-DOMAIN LEARNING FOR OUTDOOR 3D HUMAN POSE AND SHAPE ESTIMATION Zhaoyang Gui, Shanshan Zhang, Kangkan Wang, Jian Yang, Pong Chi Yuen 1715 | ADAPTIVE MULTI-DOMAIN LEARNING FOR OUTDOOR 3D HUMAN POSE AND SHAPE ESTIMATION |
4521 ADAPTIVE QUANTIZATION OF MODEL UPDATES FOR COMMUNICATION-EFFICIENT FEDERATED LEARNING Divyansh Jhunjhunwala, Advait Gadhikar, Gauri Joshi, Yonina C. Eldar 4521 | ADAPTIVE QUANTIZATION OF MODEL UPDATES FOR COMMUNICATION-EFFICIENT FEDERATED LEARNING |
1041 ADAPTIVE REAL-TIME FILTER FOR PARTIALLY-OBSERVED BOOLEAN DYNAMICAL SYSTEMS Mahdi Imani, Seyede Fatemeh Ghoreishi 1041 | ADAPTIVE REAL-TIME FILTER FOR PARTIALLY-OBSERVED BOOLEAN DYNAMICAL SYSTEMS |
2507 ADAPTIVE RE-BALANCING NETWORK WITH GATE MECHANISM FOR LONG-TAILED VISUAL QUESTION ANSWERING Hongyu Chen, Ruifang Liu, Han Fang, Ximing Zhang 2507 | ADAPTIVE RE-BALANCING NETWORK WITH GATE MECHANISM FOR LONG-TAILED VISUAL QUESTION ANSWERING |
5588 ADAPTIVE REVERBERATION ABSORPTION USING NON-STATIONARY MASKING COMPONENTS DETECTION FOR INTELLIGIBILITY IMPROVEMENT Guilherme Zucatelli, Rosângela Coelho 5588 | ADAPTIVE REVERBERATION ABSORPTION USING NON-STATIONARY MASKING COMPONENTS DETECTION FOR INTELLIGIBILITY IMPROVEMENT |
5030 ADAPTIVE RF FINGERPRINT DECOMPOSITION IN MICRO UAV DETECTION BASED ON MACHINE LEARNING Chengtao Xu, Fengyu He, Bowen Chen, Yushan Jiang, Houbing Song 5030 | ADAPTIVE RF FINGERPRINT DECOMPOSITION IN MICRO UAV DETECTION BASED ON MACHINE LEARNING |
3978 ADAPTIVE SUBSAMPLING OF MULTIDOMAIN SIGNALS WITH GRAPH PRODUCTS Théo Gnassounou, Pierre Humbert, Laurent Oudre 3978 | ADAPTIVE SUBSAMPLING OF MULTIDOMAIN SIGNALS WITH GRAPH PRODUCTS |
2317 ADAPT-THEN-COMBINE FULL WAVEFORM INVERSION FOR DISTRIBUTED SUBSURFACE IMAGING IN SEISMIC NETWORKS Ban-Sok Shin, Dmitriy Shutin 2317 | ADAPT-THEN-COMBINE FULL WAVEFORM INVERSION FOR DISTRIBUTED SUBSURFACE IMAGING IN SEISMIC NETWORKS |
4216 ADA-SISE: ADAPTIVE SEMANTIC INPUT SAMPLING FOR EFFICIENT EXPLANATION OF CONVOLUTIONAL NEURAL NETWORKS Mahesh Sudhakar, Sam Sattarzadeh, Konstantinos N. Plataniotis, Jongseong Jang, Yeonjeong Jeong, Hyunwoo Kim 4216 | ADA-SISE: ADAPTIVE SEMANTIC INPUT SAMPLING FOR EFFICIENT EXPLANATION OF CONVOLUTIONAL NEURAL NETWORKS |
3202 ADASPEECH 2: ADAPTIVE TEXT TO SPEECH WITH UNTRANSCRIBED DATA Yuzi Yan, Xu Tan, Bohan Li, Tao Qin, Sheng Zhao, Yuan Shen, Tie-Yan Liu 3202 | ADASPEECH 2: ADAPTIVE TEXT TO SPEECH WITH UNTRANSCRIBED DATA |
1240 ADL-MVDR: ALL DEEP LEARNING MVDR BEAMFORMER FOR TARGET SPEECH SEPARATION Zhuohuang Zhang, Yong Xu, Meng Yu, Shi-Xiong Zhang, Lianwu Chen, Dong Yu 1240 | ADL-MVDR: ALL DEEP LEARNING MVDR BEAMFORMER FOR TARGET SPEECH SEPARATION |
4533 ADMM-BASED FAST ALGORITHM FOR ROBUST MULTI-GROUP MULTICAST BEAMFORMING Niloofar Mohamadi, Min Dong, Shahram ShahbazPanahi 4533 | ADMM-BASED FAST ALGORITHM FOR ROBUST MULTI-GROUP MULTICAST BEAMFORMING |
3745 ADMM-BASED ML DECODING: FROM THEORY TO PRACTICE Kira Kraft, Norbert Wehn 3745 | ADMM-BASED ML DECODING: FROM THEORY TO PRACTICE |
3819 ADVANCES IN MORPHOLOGICAL NEURAL NETWORKS: TRAINING, PRUNING AND ENFORCING SHAPE CONSTRAINTS Nikolaos Dimitriadis, Petros Maragos 3819 | ADVANCES IN MORPHOLOGICAL NEURAL NETWORKS: TRAINING, PRUNING AND ENFORCING SHAPE CONSTRAINTS |
1655 Advances in Nonstationary Source Separation Reza Sameni, Christian Jutten 1655 | Advances in Nonstationary Source Separation |
2747 ADVANCING RNN TRANSDUCER TECHNOLOGY FOR SPEECH RECOGNITION George Saon, Zoltan Tueske, Daniel Bolanos, Brian Kingsbury 2747 | ADVANCING RNN TRANSDUCER TECHNOLOGY FOR SPEECH RECOGNITION |
1839 ADVERSARIAL ATTACKS ON AUDIO SOURCE SEPARATION Naoya Takahashi, Shota Inoue, Yuki Mitsufuji 1839 | ADVERSARIAL ATTACKS ON AUDIO SOURCE SEPARATION |
2819 ADVERSARIAL ATTACKS ON COARSE-TO-FINE CLASSIFIERS Ismail Alkhouri, George Atia 2819 | ADVERSARIAL ATTACKS ON COARSE-TO-FINE CLASSIFIERS |
2990 ADVERSARIAL ATTACKS ON OBJECT DETECTORS WITH LIMITED PERTURBATIONS Zhenbo Shi, Wei Yang, Zhenbo Xu, Zhi Chen, Yingjie Li, Haoran Zhu, Liusheng Huang 2990 | ADVERSARIAL ATTACKS ON OBJECT DETECTORS WITH LIMITED PERTURBATIONS |
3398 ADVERSARIAL DEFENSE FOR AUTOMATIC SPEAKER VERIFICATION BY CASCADED SELF-SUPERVISED LEARNING MODELS Haibin Wu, Xu Li, Andy Liu, Zhiyong Wu, Helen Meng, Hung-yi Lee 3398 | ADVERSARIAL DEFENSE FOR AUTOMATIC SPEAKER VERIFICATION BY CASCADED SELF-SUPERVISED LEARNING MODELS |
4676 ADVERSARIAL DEFENSE FOR DEEP SPEAKER RECOGNITION USING HYBRID ADVERSARIAL TRAINING Monisankha Pal, Arindam Jati, Raghuveer Peri, Chin-Cheng Hsu, Wael AbdAlmageed, Shrikanth Narayanan 4676 | ADVERSARIAL DEFENSE FOR DEEP SPEAKER RECOGNITION USING HYBRID ADVERSARIAL TRAINING |
2288 Adversarial Examples Detection beyond Image Space Kejiang Chen, Yuefeng Chen, Hang Zhou, Chuan Qin, Xiaofeng Mao, Weiming Zhang, NengHai Yu 2288 | Adversarial Examples Detection beyond Image Space |
4359 ADVERSARIAL GENERATIVE DISTANCE-BASED CLASSIFIER FOR ROBUST OUT-OF-DOMAIN DETECTION Zhiyuan Zeng, Hong Xu, Keqing He, Yuanmeng Yan, Sihong Liu, Zijun Liu, Weiran Xu 4359 | ADVERSARIAL GENERATIVE DISTANCE-BASED CLASSIFIER FOR ROBUST OUT-OF-DOMAIN DETECTION |
4153 Adversarial Learning via Probabilistic Proximity Analysis Jarrod Hollis, Jinsub Kim, Raviv Raich 4153 | Adversarial Learning via Probabilistic Proximity Analysis |
4520 ADVERSARIALLY ROBUST CLASSIFICATION BASED ON GLRT Bhagyashree Puranik, Upamanyu Madhow, Ramtin Pedarsani 4520 | ADVERSARIALLY ROBUST CLASSIFICATION BASED ON GLRT |
2417 AEC IN A NETSHELL: ON TARGET AND TOPOLOGY CHOICES FOR FCRN ACOUSTIC ECHO CANCELLATION Jan Franzen, Ernst Seidel, Tim Fingscheidt 2417 | AEC IN A NETSHELL: ON TARGET AND TOPOLOGY CHOICES FOR FCRN ACOUSTIC ECHO CANCELLATION |
3680 AFC-28K: A DATASET FOR DETECTING DISFLUENCIES IN CONVERSATIONAL SPEECH Colin Lea, Vikramjit Mitra, Aparna Joshi, Sachin Kajarekar, Jeffrey Bigham 3680 | AFC-28K: A DATASET FOR DETECTING DISFLUENCIES IN CONVERSATIONAL SPEECH |
3241 AFFINE PROJECTION SUBSPACE TRACKING Marc Vilà, Carlos Alejandro López, Jaume Riba 3241 | AFFINE PROJECTION SUBSPACE TRACKING |
1785 AGAIN-VC: A ONE-SHOT VOICE CONVERSION USING ACTIVATION GUIDANCE AND ADAPTIVE INSTANCE NORMALIZATION Yen-Hao Chen, Da-Yi Wu, Tsung-Han Wu, Hung-yi Lee 1785 | AGAIN-VC: A ONE-SHOT VOICE CONVERSION USING ACTIVATION GUIDANCE AND ADAPTIVE INSTANCE NORMALIZATION |
1729 AGENT-ENVIRONMENT NETWORK FOR TEMPORAL ACTION PROPOSAL GENERATION Viet-Khoa Vo-Ho, Hoang-Ngan Le, Kashu Kamazaki, Akihiro Sugimoto, Minh-Triet Tran 1729 | AGENT-ENVIRONMENT NETWORK FOR TEMPORAL ACTION PROPOSAL GENERATION |
2108 AGE-VOX-CELEB: MULTI-MODAL CORPUS FOR FACIAL AND SPEECH ESTIMATION Naohiro Tawara, Atsunori Ogawa, Yuki Kitagishi, Hosana Kamiyama 2108 | AGE-VOX-CELEB: MULTI-MODAL CORPUS FOR FACIAL AND SPEECH ESTIMATION |
1683 AGGREGATION ARCHITECTURE AND ALL-TO-ONE NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION Kuntao Cao, Xi Huang, Jie Shao 1683 | AGGREGATION ARCHITECTURE AND ALL-TO-ONE NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION |
5163 AISPEECH-SJTU ACCENT IDENTIFICATION SYSTEM FOR THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE Houjun Huang, Xu Xiang, Yexin Yang, Rao Ma, Yanmin Qian 5163 | AISPEECH-SJTU ACCENT IDENTIFICATION SYSTEM FOR THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE |
4853 AISPEECH-SJTU ASR system for the Accented English Speech Recognition Challenge Tian Tan, Yizhou Lu, Rao Ma, Sen Zhu, Jiaqi Guo, Yanmin Qian 4853 | AISPEECH-SJTU ASR system for the Accented English Speech Recognition Challenge |
4351 ALIGN OR ATTEND? TOWARD MORE EFFICIENT AND ACCURATE SPOKEN WORD DISCOVERY USING SPEECH-TO-IMAGE RETRIEVAL Liming Wang, Xinsheng Wang, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak 4351 | ALIGN OR ATTEND? TOWARD MORE EFFICIENT AND ACCURATE SPOKEN WORD DISCOVERY USING SPEECH-TO-IMAGE RETRIEVAL |
3148 ALIGNING SETS OF TEMPORAL SIGNALS WITH RIEMANNIAN GEOMETRY AND KOOPMAN OPERATOR Ohad Rahamim, Ronen Talmon 3148 | ALIGNING SETS OF TEMPORAL SIGNALS WITH RIEMANNIAN GEOMETRY AND KOOPMAN OPERATOR |
5154 ALIGNING THE TRAINING AND EVALUATION OF UNSUPERVISED TEXT STYLE TRANSFER Wanhui Qian, Fuqing Zhu, Jinzhu Yang, Jizhong Han, Songlin Hu 5154 | ALIGNING THE TRAINING AND EVALUATION OF UNSUPERVISED TEXT STYLE TRANSFER |
4407 All for One and One for All: Improving Music Separation by Bridging Networks Ryosuke Sawata, Stefan Uhlich, Shusuke Takahashi, Yuki Mitsufuji 4407 | All for One and One for All: Improving Music Separation by Bridging Networks |
2066 ALLOCATING DNN LAYERS COMPUTATION BETWEEN FRONT-END DEVICES AND THE CLOUD SERVER FOR VIDEO BIG DATA PROCESSING Peiyin Xing, Xiaofei Liu, Peixi Peng, Tiejun Huang, Yonghong Tian 2066 | ALLOCATING DNN LAYERS COMPUTATION BETWEEN FRONT-END DEVICES AND THE CLOUD SERVER FOR VIDEO BIG DATA PROCESSING |
5580 All-Pass Filter Design Using Blaschke Interpolation Kumar Appaiah, Debasattam Pal 5580 | All-Pass Filter Design Using Blaschke Interpolation |
4983 ALTERNATING PROJECTIONS GRIDLESS COVARIANCE-BASED ESTIMATION FOR DOA YONGSUNG PARK, PETER GERSTOFT 4983 | ALTERNATING PROJECTIONS GRIDLESS COVARIANCE-BASED ESTIMATION FOR DOA |
1602 AMPLITUDE MATCHING: MAJORIZATION-MINIMIZATION ALGORITHM FOR SOUND FIELD CONTROL ONLY WITH AMPLITUDE CONSTRAINT Shoichi Koyama, Takashi Amakasu, Natsuki Ueno, Hiroshi Saruwatari 1602 | AMPLITUDE MATCHING: MAJORIZATION-MINIMIZATION ALGORITHM FOR SOUND FIELD CONTROL ONLY WITH AMPLITUDE CONSTRAINT |
2843 An Actor-Critic Reinforcement Learning Approach to Minimum Age of Information Scheduling in Energy Harvesting Networks Shiyang Leng, Aylin Yener 2843 | An Actor-Critic Reinforcement Learning Approach to Minimum Age of Information Scheduling in Energy Harvesting Networks |
1613 AN ADAPTIVE DISCRIMINANT AND SPARSITY FEATURE DESCRIPTOR FOR FINGER VEIN RECOGNITION Shuyi Li, Bob Zhang 1613 | AN ADAPTIVE DISCRIMINANT AND SPARSITY FEATURE DESCRIPTOR FOR FINGER VEIN RECOGNITION |
3406 AN ADAPTIVE MULTI-SCALE AND MULTI-LEVEL FEATURES FUSION NETWORK WITH PERCEPTUAL LOSS FOR CHANGE DETECTION Jialang Xu, Yang Luo, Xinyue Chen, Chunbo Luo 3406 | AN ADAPTIVE MULTI-SCALE AND MULTI-LEVEL FEATURES FUSION NETWORK WITH PERCEPTUAL LOSS FOR CHANGE DETECTION |
2105 AN ADAPTIVE NON-LINEAR PROCESS FOR UNDER-DETERMINED VIRTUAL MICROPHONE BEAMFORMING Mehdi Bekrani, Anh H. T. Nguyen, Andy W. H. Khong 2105 | AN ADAPTIVE NON-LINEAR PROCESS FOR UNDER-DETERMINED VIRTUAL MICROPHONE BEAMFORMING |
1288 AN ADAPTIVE PART-BASED MODEL FOR PERSON RE-IDENTIFICATION Xipeng Lin, Yubin Yang 1288 | AN ADAPTIVE PART-BASED MODEL FOR PERSON RE-IDENTIFICATION |
2427 AN ADAPTIVE PYRAMID SINGLE-VIEW DEPTH LOOKUP TABLE CODING METHOD Yangang Cai, Ronggang Wang, Jian Zhang, Wen Gao 2427 | AN ADAPTIVE PYRAMID SINGLE-VIEW DEPTH LOOKUP TABLE CODING METHOD |
5239 An adaptive Regularization Approach to Portfolio Optimization Tarig Ballal, Abdelrahman Abdelrahman, Ali Muqaibel, Tareq Al-Naffouri 5239 | An adaptive Regularization Approach to Portfolio Optimization |
4061 AN ADMM BASED NETWORK FOR HYPERSPECTRAL UNMIXING TASKS Chao Zhou, Miguel R.D. Rodrigues 4061 | AN ADMM BASED NETWORK FOR HYPERSPECTRAL UNMIXING TASKS |
4886 AN ASYMPTOTICALLY POINTWISE OPTIMAL PROCEDURE FOR SEQUENTIAL JOINT DETECTION AND ESTIMATION Dominik Reinhard, Michael Fauß, Abdelhak M. Zoubir 4886 | AN ASYMPTOTICALLY POINTWISE OPTIMAL PROCEDURE FOR SEQUENTIAL JOINT DETECTION AND ESTIMATION |
4714 AN ASYNCHRONOUS WFST-BASED DECODER FOR AUTOMATIC SPEECH RECOGNITION Hang Lv, Zhehuai Chen, Hainan Xu, Daniel Povey, Lei Xie, Sanjeev Khudanpur 4714 | AN ASYNCHRONOUS WFST-BASED DECODER FOR AUTOMATIC SPEECH RECOGNITION |
1322 AN ATTENTION BASED WAVELET CONVOLUTIONAL MODEL FOR VISUAL SALIENCY DETECTION RESHMI BHOOSHAN, SURESH K. 1322 | AN ATTENTION BASED WAVELET CONVOLUTIONAL MODEL FOR VISUAL SALIENCY DETECTION |
3812 AN ATTENTION MODEL FOR HYPERNASALITY PREDICTION IN CHILDREN WITH CLEFT PALATE Vikram C Mathad, Nancy Scherer, Kathy Chapman, Julie Liss, Visar Berisha 3812 | AN ATTENTION MODEL FOR HYPERNASALITY PREDICTION IN CHILDREN WITH CLEFT PALATE |
2202 AN ATTENTION-SEQ2SEQ MODEL BASED ON CRNN ENCODING FOR AUTOMATIC LABANOTATION GENERATION FROM MOTION CAPTURE DATA Min Li, Zhenjiang Miao, Xiao-Ping Zhang, Wanru Xu 2202 | AN ATTENTION-SEQ2SEQ MODEL BASED ON CRNN ENCODING FOR AUTOMATIC LABANOTATION GENERATION FROM MOTION CAPTURE DATA |
3757 AN EFFECTIVE DEEP EMBEDDING LEARNING METHOD BASED ON DENSE-RESIDUAL NETWORKS FOR SPEAKER VERIFICATION Ying Liu, Yan Song, Ian McLoughlin, Lin Liu, Li-rong Dai 3757 | AN EFFECTIVE DEEP EMBEDDING LEARNING METHOD BASED ON DENSE-RESIDUAL NETWORKS FOR SPEAKER VERIFICATION |
2348 AN EFFICIENT ACTIVE SET ALGORITHM FOR COVARIANCE BASED JOINT DATA AND ACTIVITY DECTION FOR MASSIVE RANDOM ACCESS WITH MASSIVE MIMO Ziyue Wang, Zhilin Chen, Ya-Feng Liu, Foad Sohrabi, Wei Yu 2348 | AN EFFICIENT ACTIVE SET ALGORITHM FOR COVARIANCE BASED JOINT DATA AND ACTIVITY DECTION FOR MASSIVE RANDOM ACCESS WITH MASSIVE MIMO |
2810 An Efficient Algorithm for Device Detection and Channel Estimation in Asynchronous IoT Systems Liang Liu, Ya-Feng Liu 2810 | An Efficient Algorithm for Device Detection and Channel Estimation in Asynchronous IoT Systems |
2957 AN EFFICIENT ALTERNATING DIRECTION METHOD FOR GRAPH LEARNING FROM SMOOTH SIGNALS Xiaolu Wang, Chaorui Yao, Haoyu Lei, Anthony Man-Cho So 2957 | AN EFFICIENT ALTERNATING DIRECTION METHOD FOR GRAPH LEARNING FROM SMOOTH SIGNALS |
4349 AN EFFICIENT LINEAR PROGRAMMING ROUNDING-AND-REFINEMENT ALGORITHM FOR LARGE-SCALE NETWORK SLICING PROBLEM Wei-Kun Chen, Ya-Feng Liu, Yu-Hong Dai, Zhi-Quan Luo 4349 | AN EFFICIENT LINEAR PROGRAMMING ROUNDING-AND-REFINEMENT ALGORITHM FOR LARGE-SCALE NETWORK SLICING PROBLEM |
2820 AN EFFICIENT PAPER ANTI-COUNTERFEITING METHOD BASED ON MICROSTRUCTURE ORIENTATION ESTIMATION Yuhao Sun, Xin Liao, Jianfeng Liu 2820 | AN EFFICIENT PAPER ANTI-COUNTERFEITING METHOD BASED ON MICROSTRUCTURE ORIENTATION ESTIMATION |
3917 AN EMPIRICAL STUDY OF END-TO-END SIMULTANEOUS SPEECH TRANSLATION DECODING STRATEGIES Ha Nguyen, Yannick Estève, Laurent Besacier 3917 | AN EMPIRICAL STUDY OF END-TO-END SIMULTANEOUS SPEECH TRANSLATION DECODING STRATEGIES |
1965 AN EMPIRICAL STUDY OF VISUAL FEATURES FOR DNN BASED AUDIO-VISUAL SPEECH ENHANCEMENT IN MULTI-TALKER ENVIRONMENTS Shrishti Saha Shetu, Soumitro Chakrabarty, Emanuël A.P. Habets 1965 | AN EMPIRICAL STUDY OF VISUAL FEATURES FOR DNN BASED AUDIO-VISUAL SPEECH ENHANCEMENT IN MULTI-TALKER ENVIRONMENTS |
5135 An Empirical Study on Task-Oriented Dialogue Translation Siyou Liu 5135 | An Empirical Study on Task-Oriented Dialogue Translation |
3997 An End-to-End Actor-Critic-Based Neural Coreference Resolution System Yu Wang, Yilin Shen, Hongxia Jin 3997 | An End-to-End Actor-Critic-Based Neural Coreference Resolution System |
5619 An End-to-End Dense-InceptionNet for Image Copy-Move Forgery Detection Jun-Liu Zhong, Chi-Man Pun 5619 | An End-to-End Dense-InceptionNet for Image Copy-Move Forgery Detection |
3995 AN END-TO-END NON-INTRUSIVE MODEL FOR SUBJECTIVE AND OBJECTIVE REAL-WORLD SPEECH ASSESSMENT USING A MULTI-TASK FRAMEWORK Zhuohuang Zhang, Piyush Vyas, Xuan Dong, Donald S. Williamson 3995 | AN END-TO-END NON-INTRUSIVE MODEL FOR SUBJECTIVE AND OBJECTIVE REAL-WORLD SPEECH ASSESSMENT USING A MULTI-TASK FRAMEWORK |
3307 AN END-TO-END SPEECH ACCENT RECOGNITION METHOD BASED ON HYBRID CTC/ATTENTION TRANSFORMER ASR Qiang Gao, Haiwei Wu, Yanqing Sun, Yitao Duan 3307 | AN END-TO-END SPEECH ACCENT RECOGNITION METHOD BASED ON HYBRID CTC/ATTENTION TRANSFORMER ASR |
5585 AN ENHANCED SPATIAL SMOOTHING TECHNIQUE WITH ESPRIT ALGORITHM FOR DIRECTION OF ARRIVAL ESTIMATION IN COHERENT SCENARIOS Jingjing PAN, Meng SUN, Yide WANG, Xiaofei ZHANG 5585 | AN ENHANCED SPATIAL SMOOTHING TECHNIQUE WITH ESPRIT ALGORITHM FOR DIRECTION OF ARRIVAL ESTIMATION IN COHERENT SCENARIOS |
2777 AN EXTENSION OF SPARSE AUDIO DECLIPPER TO MULTIPLE MEASUREMENT VECTORS Satoru Emura, Noboru Harada 2777 | AN EXTENSION OF SPARSE AUDIO DECLIPPER TO MULTIPLE MEASUREMENT VECTORS |
4445 An F Test for Polynomial Frequency Modulation Kian Blanchette, Wesley Burr, Glen Takahara 4445 | An F Test for Polynomial Frequency Modulation |
2325 AN HRNET-BLSTM MODEL WITH TWO-STAGE TRAINING FOR SINGING MELODY EXTRACTION Yongwei Gao, Xingjian Du, Bilei Zhu, Xiaoheng Sun, Wei Li, Zejun Ma 2325 | AN HRNET-BLSTM MODEL WITH TWO-STAGE TRAINING FOR SINGING MELODY EXTRACTION |
3595 An Improved Data Driven Dynamic SIRD model for Predictive Monitoring of COVID-19 Pushpendra Singh, Amit Singhal, Binish Fatimah, Anubha Gupta 3595 | An Improved Data Driven Dynamic SIRD model for Predictive Monitoring of COVID-19 |
3428 AN IMPROVED DEEP RELATION NETWORK FOR ACTION RECOGNITION IN STILL IMAGES Wei Wu, Jiale Yu 3428 | AN IMPROVED DEEP RELATION NETWORK FOR ACTION RECOGNITION IN STILL IMAGES |
4224 An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection Yin Cao, Turab Iqbal, Qiuqiang Kong, Fengyan An, Wenwu Wang, Mark Plumbley 4224 | An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection |
3272 AN IMPROVED MEAN TEACHER BASED METHOD FOR LARGE SCALE WEAKLY LABELED SEMI-SUPERVISED SOUND EVENT DETECTION Xu Zheng, Yan Song, Ian McLoughlin, Lin Liu, Li-Rong Dai 3272 | AN IMPROVED MEAN TEACHER BASED METHOD FOR LARGE SCALE WEAKLY LABELED SEMI-SUPERVISED SOUND EVENT DETECTION |
5280 AN INVESTIGATION OF END-TO-END MODELS FOR ROBUST SPEECH RECOGNITION Archiki Prasad, Preethi Jyothi, Rajbabu Velmurugan 5280 | AN INVESTIGATION OF END-TO-END MODELS FOR ROBUST SPEECH RECOGNITION |
3036 AN INVESTIGATION OF USING HYBRID MODELING UNITS FOR IMPROVING END-TO-END SPEECH RECOGNITION SYSTEM Shunfei Chen, Xinhui Hu, Sheng Li, Xinkang Xu 3036 | AN INVESTIGATION OF USING HYBRID MODELING UNITS FOR IMPROVING END-TO-END SPEECH RECOGNITION SYSTEM |
4352 AN ITERATIVE FRAMEWORK FOR SELF-SUPERVISED DEEP SPEAKER REPRESENTATION LEARNING Danwei Cai, Weiqing Wang, Ming Li 4352 | AN ITERATIVE FRAMEWORK FOR SELF-SUPERVISED DEEP SPEAKER REPRESENTATION LEARNING |
4315 AN ORDER-OPTIMAL ADAPTIVE TEST PLAN FOR NOISY GROUP TESTING UNDER UNKNOWN NOISE MODELS Sudeep Salgia, Qing Zhao 4315 | AN ORDER-OPTIMAL ADAPTIVE TEST PLAN FOR NOISY GROUP TESTING UNDER UNKNOWN NOISE MODELS |
5274 Analog Beamforming with Antenna Selection for Large-Scale Antenna Arrays Aakash Arora, Christos Tsinos, Bhavani Shankar Mysore R, Symeon Chatzinotas, Bjorn Ottersten 5274 | Analog Beamforming with Antenna Selection for Large-Scale Antenna Arrays |
3425 ANALYSING BIAS IN SPOKEN LANGUAGE ASSESSMENT USING CONCEPT ACTIVATION VECTORS Xizi Wei, Mark J. F. Gales, Kate M. Knill 3425 | ANALYSING BIAS IN SPOKEN LANGUAGE ASSESSMENT USING CONCEPT ACTIVATION VECTORS |
5584 Analysis and Detection of Pathological Voice Using Glottal Source Features Sudarsana Reddy Kadiri, Paavo Alku 5584 | Analysis and Detection of Pathological Voice Using Glottal Source Features |
3922 ANALYSIS OF THE BUT DIARIZATION SYSTEM FOR VOXCONVERSE CHALLENGE Federico Landini, Ondřej Glembek, Pavel Matějka, Johan Rohdin, Lukáš Burget, Mireia Diez, Anna Silnova 3922 | ANALYSIS OF THE BUT DIARIZATION SYSTEM FOR VOXCONVERSE CHALLENGE |
5342 ANALYSIS OF X-VECTORS FOR LOW-RESOURCE SPEECH RECOGNITION Martin Karafiat, Karel Vesely, Jan Cernocky, Jan Profant, Jiri Nytra, Miroslav Hlavacek, Tomas Pavlicek 5342 | ANALYSIS OF X-VECTORS FOR LOW-RESOURCE SPEECH RECOGNITION |
4294 ANGLE–OF–ARRIVAL (AOA) FACTORIZATION IN MULTIPATH CHANNELS Yu-Lin Wei, Romit Roy Choudhury 4294 | ANGLE–OF–ARRIVAL (AOA) FACTORIZATION IN MULTIPATH CHANNELS |
4335 ANTENNA SELECTION FOR MASSIVE MIMO SYSTEMS BASED ON POMDP FRAMEWORK Sara Sharifi, Shahram ShahbazPanahi, Min Dong 4335 | ANTENNA SELECTION FOR MASSIVE MIMO SYSTEMS BASED ON POMDP FRAMEWORK |
1384 ANY-TO-ONE SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING SELF-SUPERVISED DISCRETE SPEECH REPRESENTATIONS Wen-Chin Huang, Yi-Chiao Wu, Tomoki Hayashi, Tomoki Toda 1384 | ANY-TO-ONE SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING SELF-SUPERVISED DISCRETE SPEECH REPRESENTATIONS |
3243 APPLICATION-LAYER DDOS ATTACKS WITH MULTIPLE EMULATION DICTIONARIES Michele Cirillo, Mario Di Mauro, Vincenzo Matta, Marco Tambasco 3243 | APPLICATION-LAYER DDOS ATTACKS WITH MULTIPLE EMULATION DICTIONARIES |
3063 APPLIED METHODS FOR SPARSE SAMPLING OF HEAD-RELATED TRANSFER FUNCTIONS Lior Arbel, Zamir Ben-Hur, David Lou Alon, Boaz Rafaely 3063 | APPLIED METHODS FOR SPARSE SAMPLING OF HEAD-RELATED TRANSFER FUNCTIONS |
1474 APPROXIMATE WEIGHTED CR CODED MATRIX MULTIPLICATION Neophytos Charalambides, Mert Pilanci, Alfred Hero 1474 | APPROXIMATE WEIGHTED CR CODED MATRIX MULTIPLICATION |
2067 ARRAYS OF FIRST-ORDER STEERABLE DIFFERENTIAL MICROPHONES Federico Borra, Alberto Bernardini, Ivan Bertuletti, Fabio Antonacci, Augusto Sarti 2067 | ARRAYS OF FIRST-ORDER STEERABLE DIFFERENTIAL MICROPHONES |
4647 ARRHYTHMIA CLASSIFICATION WITH HEARTBEAT-AWARE TRANSFORMER Bin Wang, Chang Liu, Chuanyan Hu, Xudong Liu, Jun Cao 4647 | ARRHYTHMIA CLASSIFICATION WITH HEARTBEAT-AWARE TRANSFORMER |
3223 ARTIFICIALLY SYNTHESISING DATA FOR AUDIO CLASSIFICATION AND SEGMENTATION TO IMPROVE SPEECH AND MUSIC DETECTION IN RADIO BROADCAST Satvik Venkatesh, David Moffat, Alexis Kirke, Gözel Shakeri, Stephen Brewster, Jörg Fachner, Helen Odell-Miller, Alex Street, Nicolas Farina, Sube Banerjee, Eduardo Reck Miranda 3223 | ARTIFICIALLY SYNTHESISING DATA FOR AUDIO CLASSIFICATION AND SEGMENTATION TO IMPROVE SPEECH AND MUSIC DETECTION IN RADIO BROADCAST |
4140 ASR n-best Fusion Nets Xinyue Liu, Mingda Li, Luoxin Chen, Prashan Wanigasekara, Weitong Ruan, Haidar Khan, Wael Hamza, Chengwei Su 4140 | ASR n-best Fusion Nets |
1435 ASSESSMENT OF BIPOLAR DISORDER USING HETEROGENEOUS DATA OF SMARTPHONE-BASED DIGITAL PHENOTYPING Hung-Yi Su, Chung-Hsien Wu, Cheng-Ray Liou, Esther Ching-Lan Lin, Po-See Chen 1435 | ASSESSMENT OF BIPOLAR DISORDER USING HETEROGENEOUS DATA OF SMARTPHONE-BASED DIGITAL PHENOTYPING |
2092 Assisted Learning: Cooperative AI with Autonomy Jiaying Zhou, Xun Xian, Na Li, Jie Ding 2092 | Assisted Learning: Cooperative AI with Autonomy |
5200 ASV-SUBTOOLS: OPEN SOURCE TOOLKIT FOR AUTOMATIC SPEAKER VERIFICATION FUCHUAN TONG, MIAO ZHAO, JIANFENG ZHOU, HAO LU, ZHENG LI, LIN LI, QINGYANG HONG 5200 | ASV-SUBTOOLS: OPEN SOURCE TOOLKIT FOR AUTOMATIC SPEAKER VERIFICATION |
4693 ASYMPTOTIC DISTRIBUTION OF GENERALIZED LIKELIHOOD RATIO TEST UNDER MODEL MISSPECIFICATION WITH APPLICATION TO COOPERATIVE RADAR-COMMUNICATIONS Akshay Bondre, Christ Richmond 4693 | ASYMPTOTIC DISTRIBUTION OF GENERALIZED LIKELIHOOD RATIO TEST UNDER MODEL MISSPECIFICATION WITH APPLICATION TO COOPERATIVE RADAR-COMMUNICATIONS |
1004 ASYNCHRONOUS ACOUSTIC ECHO CANCELLATION OVER WIRELESS CHANNELS Robert Ayrapetian, Philip Hilmes, Mohamed Mansour, Trausti Kristjansson, Carlo Murgia 1004 | ASYNCHRONOUS ACOUSTIC ECHO CANCELLATION OVER WIRELESS CHANNELS |
5375 ATTACK ON PRACTICAL SPEAKER VERIFICATION SYSTEM USING UNIVERSAL ADVERSARIAL PERTURBATIONS Weiyi Zhang, Shuning Zhao, Le Liu, Jianmin Li, Xingliang Cheng, Thomas Fang Zheng, Xiaolin Hu 5375 | ATTACK ON PRACTICAL SPEAKER VERIFICATION SYSTEM USING UNIVERSAL ADVERSARIAL PERTURBATIONS |
2318 ATTACKING AND DEFENDING BEHIND A PSYCHOACOUSTICS-BASED CAPTCHA Chih-Hsiang Huang, Po-Hao Wu, Yi-Wen Liu, Shan-Hung Wu 2318 | ATTACKING AND DEFENDING BEHIND A PSYCHOACOUSTICS-BASED CAPTCHA |
3503 ATTENTION ENHANCED SPATIAL TEMPORAL NEURAL NETWORK FOR HRRP RECOGNITION Yuchen Chu, Zunhua Guo 3503 | ATTENTION ENHANCED SPATIAL TEMPORAL NEURAL NETWORK FOR HRRP RECOGNITION |
4069 ATTENTION IS ALL YOU NEED IN SPEECH SEPARATION Cem Subakan, Mirco Ravanelli, Samuele Cornell, Mirko Bronzi, Jianyuan Zhong 4069 | ATTENTION IS ALL YOU NEED IN SPEECH SEPARATION |
5263 ATTENTION ON ATTENTION SPARSE DENSE CONVOLUTIONAL NETWORK FOR FINANCIAL SIGNAL PROCESSING Tianlei Zhu, Jiawei Li, Xinji Liu, Yong Jiang, Shu-Tao Xia 5263 | ATTENTION ON ATTENTION SPARSE DENSE CONVOLUTIONAL NETWORK FOR FINANCIAL SIGNAL PROCESSING |
2919 ATTENTION-BASED MULTI-ENCODER AUTOMATIC PRONUNCIATION ASSESSMENT Binghuai Lin, Liyuan Wang 2919 | ATTENTION-BASED MULTI-ENCODER AUTOMATIC PRONUNCIATION ASSESSMENT |
4366 ATTENTION-EMBEDDED DECOMPOSED NETWORK WITH UNPAIRED CT IMAGES PRIOR FOR METAL ARTIFACT REDUCTION Binyu Zhao, Qianqian Ren, Jinbao Li, Yafeng Zhao 4366 | ATTENTION-EMBEDDED DECOMPOSED NETWORK WITH UNPAIRED CT IMAGES PRIOR FOR METAL ARTIFACT REDUCTION |
1665 ATTENTION-GUIDED SECOND-ORDER POOLING CONVOLUTIONAL NETWORKS Shannan Chen, Qiule Sun, Cunhua Li, Jianxin Zhang, Qiang Zhang 1665 | ATTENTION-GUIDED SECOND-ORDER POOLING CONVOLUTIONAL NETWORKS |
4188 ATTENTIONLITE: TOWARDS EFFICIENT SELF-ATTENTION MODELS FOR VISION Souvik Kundu, Sairam Sundaresan 4188 | ATTENTIONLITE: TOWARDS EFFICIENT SELF-ATTENTION MODELS FOR VISION |
1746 ATTENTIVE SEMANTIC EXPLORING FOR MANIPULATED FACE DETECTION Zehao Chen, Hua Yang 1746 | ATTENTIVE SEMANTIC EXPLORING FOR MANIPULATED FACE DETECTION |
3481 ATTRIBUTE DECOMPOSITION FOR FLOW-BASED DOMAIN MAPPING Sheng-Jhe Huang, Jen-Tzung Chien 3481 | ATTRIBUTE DECOMPOSITION FOR FLOW-BASED DOMAIN MAPPING |
5048 ATVIO: ATTENTION GUIDED VISUAL-INERTIAL ODOMETRY Li Liu, Ge Li, Thomas H Li 5048 | ATVIO: ATTENTION GUIDED VISUAL-INERTIAL ODOMETRY |
5279 AUDIO DEQUANTIZATION USING (CO)SPARSE (NON)CONVEX METHODS Pavel Záviška, Pavel Rajmic, Ondřej Mokrý 5279 | AUDIO DEQUANTIZATION USING (CO)SPARSE (NON)CONVEX METHODS |
5621 Audio Replay Spoof Attack Detection by Joint Segment-Based Linear Filter Bank Feature Extraction and Attention-Enhanced DenseNet-BiLSTM Network Lian Huang, Chi-Man Pun 5621 | Audio Replay Spoof Attack Detection by Joint Segment-Based Linear Filter Bank Feature Extraction and Attention-Enhanced DenseNet-BiLSTM Network |
3824 Audio-Visual Event Recognition through the lens of Adversary Juncheng Li, Kaixin Ma, Shuhui Qu, Po-Yao Huang, Florian Metze 3824 | Audio-Visual Event Recognition through the lens of Adversary |
4160 AUDIOVISUAL HIGHLIGHT DETECTION IN VIDEOS Karel Mundnich, Alexandra Fenster, Aparna Khare, Shiva Sundaram 4160 | AUDIOVISUAL HIGHLIGHT DETECTION IN VIDEOS |
4968 AUDIO-VISUAL SPEECH ENHANCEMENT METHOD CONDITIONED ON THE LIP MOTION AND SPEAKER-DISCRIMINATIVE EMBEDDINGS Koichiro Ito, Masaaki Yamamoto, Kenji Nagamatsu 4968 | AUDIO-VISUAL SPEECH ENHANCEMENT METHOD CONDITIONED ON THE LIP MOTION AND SPEAKER-DISCRIMINATIVE EMBEDDINGS |
1217 AUDIO-VISUAL SPEECH INPAINTING WITH DEEP LEARNING Giovanni Morrone, Daniel Michelsanti, Zheng-Hua Tan, Jesper Jensen 1217 | AUDIO-VISUAL SPEECH INPAINTING WITH DEEP LEARNING |
4178 AUDIO-VISUAL SPEECH SEPARATION USING CROSS-MODAL CORRESPONDENCE LOSS Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi, Ryo Masumura 4178 | AUDIO-VISUAL SPEECH SEPARATION USING CROSS-MODAL CORRESPONDENCE LOSS |
1948 AUDITORY FILTERBANKS BENEFIT UNIVERSAL SOUND SOURCE SEPARATION Han Li, Kean Chen, Bernhard U. Seeber 1948 | AUDITORY FILTERBANKS BENEFIT UNIVERSAL SOUND SOURCE SEPARATION |
4941 Augmentation as Sanitization: Strong Data Augmentation Breaks Poisoning and Backdoor Attacks for Free Eitan Borgnia, Valeriia Cherepanova, Liam Fowl, Jonas Geiping, Amin Ghiasi, Micah Goldblum, Tom Goldstein, Arjun Gupta 4941 | Augmentation as Sanitization: Strong Data Augmentation Breaks Poisoning and Backdoor Attacks for Free |
4302 AUGMENTED GAUSSIAN LINEAR MIXTURE MODEL FOR SPECTRAL VARIABILITY IN HYPERSPECTRAL UNMIXING yaser esmaeili salehani, ehsan arabnejad, saeed gazor 4302 | AUGMENTED GAUSSIAN LINEAR MIXTURE MODEL FOR SPECTRAL VARIABILITY IN HYPERSPECTRAL UNMIXING |
5185 AUGMENTING TRANSFERRED REPRESENTATIONS FOR STOCK CLASSIFICATION Elizabeth Fons, Paula Dawson, Xiao-jun Zeng, John Keane, Alexandros Iosifidis 5185 | AUGMENTING TRANSFERRED REPRESENTATIONS FOR STOCK CLASSIFICATION |
3724 Autoencoder for Vibrotactile Signal Compression Zhuoran Li, Rania Hassen, Zhou Wang 3724 | Autoencoder for Vibrotactile Signal Compression |
1706 AutoKWS: Keyword Spotting with Differentiable Architecture Search Bo Zhang, Wenfeng Li, Qingyuan Li, Weiji Zhuang, Xiangxiang Chu, Yujun Wang 1706 | AutoKWS: Keyword Spotting with Differentiable Architecture Search |
4203 AUTOMATED MULTI-ORGAN SEGMENTATION IN PET IMAGES USING CASCADED TRAINING OF A 3D U-NET AND CONVOLUTIONAL AUTOENCODER Annika Liebgott, Charlotte Lorenz, Sergios Gatidis, Viet Chau Vu, Konstantin Nikolaou, Bin Yang 4203 | AUTOMATED MULTI-ORGAN SEGMENTATION IN PET IMAGES USING CASCADED TRAINING OF A 3D U-NET AND CONVOLUTIONAL AUTOENCODER |
5074 AUTOMATIC AND PERCEPTUAL DISCRIMINATION BETWEEN DYSARTHRIA, APRAXIA OF SPEECH, AND NEUROTYPICAL SPEECH Ina Kodrasi, Michaela Pernon, Marina Laganaro, Hervé Bourlard 5074 | AUTOMATIC AND PERCEPTUAL DISCRIMINATION BETWEEN DYSARTHRIA, APRAXIA OF SPEECH, AND NEUROTYPICAL SPEECH |
4772 AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS Parvaneh Janbakhshi, Ina Kodrasi, Hervé Bourlard 4772 | AUTOMATIC DYSARTHRIC SPEECH DETECTION EXPLOITING PAIRWISE DISTANCE-BASED CONVOLUTIONAL NEURAL NETWORKS |
4831 AUTOMATIC ELICITATION COMPLIANCE FOR SHORT-DURATION SPEECH BASED DEPRESSION DETECTION Brian Stasak, Zhaocheng Huang, Dale Joachim, Julien Epps 4831 | AUTOMATIC ELICITATION COMPLIANCE FOR SHORT-DURATION SPEECH BASED DEPRESSION DETECTION |
4424 Automatic Fine-grained Localization of Utility Pole Landmarks on Distributed Acoustic Sensing Traces Based on Bilinear ResNets You Lu, Yue Tian, Shaobo Han, Eric Cosatto, Sarper Ozharar, Yangmin Ding 4424 | Automatic Fine-grained Localization of Utility Pole Landmarks on Distributed Acoustic Sensing Traces Based on Bilinear ResNets |
2521 AUTOMATIC MULTITRACK MIXING WITH A DIFFERENTIABLE MIXING CONSOLE OF NEURAL AUDIO EFFECTS Christian J. Steinmetz, Jordi Pons, Santiago Pascual, Joan Serrà 2521 | AUTOMATIC MULTITRACK MIXING WITH A DIFFERENTIABLE MIXING CONSOLE OF NEURAL AUDIO EFFECTS |
2771 AUTOMATIC ORDER SELECTION IN AUTOREGRESSIVE MODELING WITH APPLICATION IN EEG SLEEP-STAGE CLASSIFICATION Farah Nassif, Soosan Beheshti 2771 | AUTOMATIC ORDER SELECTION IN AUTOREGRESSIVE MODELING WITH APPLICATION IN EEG SLEEP-STAGE CLASSIFICATION |
4015 AUTOMATIC REGISTRATION AND CONVEX CLUSTERING OF TIME SERIES Michael Weylandt, George Michailidis 4015 | AUTOMATIC REGISTRATION AND CONVEX CLUSTERING OF TIME SERIES |
2270 AUTOREGRESSIVE FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR JOINT BLIND SOURCE SEPARATION AND DEREVERBERATION Kouhei Sekiguchi, Yoshiaki Bando, Aditya Arie Nugraha, Mathieu Fontaine, Kazuyoshi Yoshii 2270 | AUTOREGRESSIVE FAST MULTICHANNEL NONNEGATIVE MATRIX FACTORIZATION FOR JOINT BLIND SOURCE SEPARATION AND DEREVERBERATION |
5594 AUTO-TUNING SPECTRAL CLUSTERING FOR SPEAKER DIARIZATION USING NORMALIZED MAXIMUM EIGENGAP Taejin Park, Kyu Han, Manoj Kumar, Shrikanth Narayanan 5594 | AUTO-TUNING SPECTRAL CLUSTERING FOR SPEAKER DIARIZATION USING NORMALIZED MAXIMUM EIGENGAP |
3369 BACKDOOR ATTACK AGAINST SPEAKER VERIFICATION Tongqing Zhai, Yiming Li, Ziqi Zhang, Baoyuan Wu, Yong Jiang, Shu-Tao Xia 3369 | BACKDOOR ATTACK AGAINST SPEAKER VERIFICATION |
1129 BAITRADAR: A MULTI-MODEL CLICKBAIT DETECTION ALGORITHM USING DEEP LEARNING Bhanuka Gamage, Adnan Labib, Aisha Joomun, Chern Hong Lim, KokSheik Wong 1129 | BAITRADAR: A MULTI-MODEL CLICKBAIT DETECTION ALGORITHM USING DEEP LEARNING |
4656 BANDWIDTH EXTENSION IS ALL YOU NEED Jiaqi Su, Yunyun Wang, Adam Finkelstein, Zeyu Jin 4656 | BANDWIDTH EXTENSION IS ALL YOU NEED |
1134 BanRAW: Band-Limited Radar Waveform Design via Phase Retrieval Samuel Pinilla, Kumar Vijay Mishra, Brian Sadler, Henry Arguello 1134 | BanRAW: Band-Limited Radar Waveform Design via Phase Retrieval |
1659 BAYESIAN ESTIMATION OF A TAIL-INDEX WITH MARGINALIZED THRESHOLD Douglas Johnston, Petar Djuric 1659 | BAYESIAN ESTIMATION OF A TAIL-INDEX WITH MARGINALIZED THRESHOLD |
3215 BAYESIAN MULTIPLE CHANGE-POINT DETECTION OF PROPAGATING EVENTS Topi Halme, Eyal Nitzan, Visa Koivunen 3215 | BAYESIAN MULTIPLE CHANGE-POINT DETECTION OF PROPAGATING EVENTS |
1539 BAYESIAN TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION Boyang Xue, Jianwei Yu, Junhao Xu, Shansong Liu, Shoukang Hu, Zi Ye, Mengzhe Geng, Xunying Liu, Helen Meng 1539 | BAYESIAN TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION |
4189 BAYES-OPTIMAL METHODS FOR FINDING THE SOURCE OF A CASCADE Anirudh Sridhar, H. Vincent Poor 4189 | BAYES-OPTIMAL METHODS FOR FINDING THE SOURCE OF A CASCADE |
2758 Beam Focusing for Multi-User MIMO Communications with Dynamic Metasurface Antennas Haiyang Zhang, Nir Shlezinger, Francesco Guidi, Davide Dardari, Mohammadreza F. Imani, Yonina C. Eldar 2758 | Beam Focusing for Multi-User MIMO Communications with Dynamic Metasurface Antennas |
5380 BEAMFORMING FOR BIDIRECTIONAL MIMO FULL DUPLEX UNDER THE JOINT SUM POWER AND PER ANTENNA POWER CONSTRAINTS Chandan Kumar Sheemar, Dirk Slock 5380 | BEAMFORMING FOR BIDIRECTIONAL MIMO FULL DUPLEX UNDER THE JOINT SUM POWER AND PER ANTENNA POWER CONSTRAINTS |
5075 Benign overfitting in binary classification of Gaussian mixtures Ke Wang, Christos Thrampoulidis 5075 | Benign overfitting in binary classification of Gaussian mixtures |
4050 BI-APC: BIDIRECTIONAL AUTOREGRESSIVE PREDICTIVE CODING FOR UNSUPERVISED PRE-TRAINING AND ITS APPLICATION TO CHILDREN’S ASR Ruchao Fan, Amber Afshan, Abeer Alwan 4050 | BI-APC: BIDIRECTIONAL AUTOREGRESSIVE PREDICTIVE CODING FOR UNSUPERVISED PRE-TRAINING AND ITS APPLICATION TO CHILDREN’S ASR |
1740 BIDIRECTIONAL FOCUSED SEMANTIC ALIGNMENT ATTENTION NETWORK FOR CROSS-MODAL RETRIEVAL Shuli Cheng, Liejun Wang, Anyu Du, Yongming Li 1740 | BIDIRECTIONAL FOCUSED SEMANTIC ALIGNMENT ATTENTION NETWORK FOR CROSS-MODAL RETRIEVAL |
4381 BIFOCAL RNN-T: EXPLOITING KEYWORD SPOTTING FOR ASR INFERENCE OPTIMIZATION Grant Strimel, Jon Macoskey, Ariya Rastrow 4381 | BIFOCAL RNN-T: EXPLOITING KEYWORD SPOTTING FOR ASR INFERENCE OPTIMIZATION |
5153 BI-LEVEL STYLE AND PROSODY DECOUPLING MODELING FOR PERSONALIZED END-TO-END SPEECH SYNTHESIS Ruibo Fu, Jianhua Tao, Zhengqi Wen, Jiangyan Yi, Tao Wang, Chunyu Qiang 5153 | BI-LEVEL STYLE AND PROSODY DECOUPLING MODELING FOR PERSONALIZED END-TO-END SPEECH SYNTHESIS |
2544 BINARY CONTROL AND DIGITAL-TO-ANALOG CONVERSION USING COMPOSITE NUV PRIORS AND ITERATIVE GAUSSIAN MESSAGE PASSING Raphael Keusch, Hampus Malmberg, Hans-Andrea Loeliger 2544 | BINARY CONTROL AND DIGITAL-TO-ANALOG CONVERSION USING COMPOSITE NUV PRIORS AND ITERATIVE GAUSSIAN MESSAGE PASSING |
4458 BISHIFT-NET FOR IMAGE INPAINTING Xue Zhou, Tao Dai, Shutao Xia, Yong Jiang 4458 | BISHIFT-NET FOR IMAGE INPAINTING |
2023 Bit Constrained Communication Receivers in Joint Radar Communications Systems Dingyou Ma, Nir Shlezinger, Tianyao Huang, Yimin Liu, Yonina C. Eldar 2023 | Bit Constrained Communication Receivers in Joint Radar Communications Systems |
1624 BLEND-RES^2NET: BLENDED REPRESENTATION SPACE BY TRANSFORMATION OF RESIDUAL MAPPING WITH RESTRAINED LEARNING FOR TIME SERIES CLASSIFICATION Arijit Ukil, Antonio J. Jara, Leandro Marin 1624 | BLEND-RES^2NET: BLENDED REPRESENTATION SPACE BY TRANSFORMATION OF RESIDUAL MAPPING WITH RESTRAINED LEARNING FOR TIME SERIES CLASSIFICATION |
2949 BLIND AMPLITUDE ESTIMATION OF EARLY ROOM REFLECTIONS USING ALTERNATING LEAST SQUARES Tom Shlomo, Boaz Rafaely 2949 | BLIND AMPLITUDE ESTIMATION OF EARLY ROOM REFLECTIONS USING ALTERNATING LEAST SQUARES |
1840 BLIND AND NEURAL NETWORK-GUIDED CONVOLUTIONAL BEAMFORMER FOR JOINT DENOISING, DEREVERBERATION, AND SOURCE SEPARATION Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Hiroshi Sawada, Shoko Araki 1840 | BLIND AND NEURAL NETWORK-GUIDED CONVOLUTIONAL BEAMFORMER FOR JOINT DENOISING, DEREVERBERATION, AND SOURCE SEPARATION |
2188 Blind Carbon Copy on Dirty Paper: Seamless Spectrum Underlay via Canonical Correlation Analysis Mohamed Salah Ibrahim, Nicholas D. Sidiropoulos 2188 | Blind Carbon Copy on Dirty Paper: Seamless Spectrum Underlay via Canonical Correlation Analysis |
5260 Blind Deinterleaving of Signals in Time Series with Self-attention Based Soft Min-cost Flow Learning Oğul Can, Yeti Z. Gürbüz, Berkin Yıldırım, A. Aydın Alatan 5260 | Blind Deinterleaving of Signals in Time Series with Self-attention Based Soft Min-cost Flow Learning |
3324 BLIND EXTRACTION OF MOVING AUDIO SOURCE IN A CHALLENGING ENVIRONMENT SUPPORTED BY SPEAKER IDENTIFICATION VIA X-VECTORS Jiri Malek, Jakub Jansky, Tomas Kounovsky, Zbynek Koldovsky, Jindrich Zdansky 3324 | BLIND EXTRACTION OF MOVING AUDIO SOURCE IN A CHALLENGING ENVIRONMENT SUPPORTED BY SPEAKER IDENTIFICATION VIA X-VECTORS |
4008 Blind Extraction of Moving Sources via Independent Component and Vector Analysis: Examples Nesrine Amor, Jaroslav Cmejla, Vaclav Kautsky, Zbynek Koldovsky, Tomas Kounovsky 4008 | Blind Extraction of Moving Sources via Independent Component and Vector Analysis: Examples |
4669 BLIND IMAGE QUALITY EVALUATOR WITH SCALE ROBUSTNESS Ci Wang, Mei Li 4669 | BLIND IMAGE QUALITY EVALUATOR WITH SCALE ROBUSTNESS |
4584 Blind Sound Source Localization based on Deep Learning Yifan Wu, Roshan Ayyalasomayajula, Michael Bianco, Dinesh Bharadia, Peter Gerstoft 4584 | Blind Sound Source Localization based on Deep Learning |
4016 BLOCK KALMAN FILTER: AN ASYMPTOTIC BLOCK PARTICLE FILTER IN THE LINEAR GAUSSIAN CASE Rui MIN, Christelle GARNIER, François SEPTIER, John KLEIN 4016 | BLOCK KALMAN FILTER: AN ASYMPTOTIC BLOCK PARTICLE FILTER IN THE LINEAR GAUSSIAN CASE |
2908 BLSTM-BASED CONFIDENCE ESTIMATION FOR END-TO-END SPEECH RECOGNITION Atsunori Ogawa, Naohiro Tawara, Takatomo Kano, Marc Delcroix 2908 | BLSTM-BASED CONFIDENCE ESTIMATION FOR END-TO-END SPEECH RECOGNITION |
1164 Bluetooth Low Energy and CNN-based Angle of Arrival Localization in Presence of Rayleigh Fading Zohreh HajiAkhondi-Meybodi, Mohammad Salimibeni, Arash Mohammadi, Konstantinos N. Plataniotis 1164 | Bluetooth Low Energy and CNN-based Angle of Arrival Localization in Presence of Rayleigh Fading |
1123 BOOSTING LOW-RESOURCE INTENT DETECTION WITH IN-SCOPE PROTOTYPICAL NETWORKS Hongzhan Lin, Yuanmeng Yan, Guang Chen 1123 | BOOSTING LOW-RESOURCE INTENT DETECTION WITH IN-SCOPE PROTOTYPICAL NETWORKS |
3068 Branchy-GNN: a Device-Edge Co-Inference Framework for Efficient Point Cloud Processing Jiawei Shao, Haowei Zhang, Yuyi Mao, Jun Zhang 3068 | Branchy-GNN: a Device-Edge Co-Inference Framework for Efficient Point Cloud Processing |
5142 Bridging Unpaired Facial Photos and Sketches by Line-drawings Fei Gao, Meimei Shang, Xiang Li, Jingjie Zhu, Lingna Dai 5142 | Bridging Unpaired Facial Photos and Sketches by Line-drawings |
4168 B-SMALL: A BAYESIAN NEURAL NETWORK APPROACH TO SPARSE MODEL-AGNOSTIC META-LEARNING Anish Madan, Ranjitha Prasad 4168 | B-SMALL: A BAYESIAN NEURAL NETWORK APPROACH TO SPARSE MODEL-AGNOSTIC META-LEARNING |
2744 BW-EDA-EEND: STREAMING END-TO-END NEURAL SPEAKER DIARIZATIONFOR A VARIABLE NUMBER OF SPEAKERS Eunjung Han, Chul Lee, Andreas Stocke 2744 | BW-EDA-EEND: STREAMING END-TO-END NEURAL SPEAKER DIARIZATIONFOR A VARIABLE NUMBER OF SPEAKERS |
5617 BYRDIE: BYZANTINE-RESILIENT DISTRIBUTED COORDINATE DESCENT FOR DECENTRALIZED LEARNING Zhixiong Yang, Waheed Bajwa 5617 | BYRDIE: BYZANTINE-RESILIENT DISTRIBUTED COORDINATE DESCENT FOR DECENTRALIZED LEARNING |
3719 BYTECOVER: COVER SONG IDENTIFICATION VIA MULTI-LOSS TRAINING Xingjian Du, Zhesong Yu, Bilei Zhu, Xiaoou Chen, Zejun Ma 3719 | BYTECOVER: COVER SONG IDENTIFICATION VIA MULTI-LOSS TRAINING |
3514 Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation Zhaoxian Wu, Han Shen, Tianyi Chen, Qing Ling 3514 | Byzantine-Resilient Decentralized TD Learning with Linear Function Approximation |
2135 CAMERA CALIBRATION WITH POSE GUIDANCE Yuzhuo Ren, Feng Hu 2135 | CAMERA CALIBRATION WITH POSE GUIDANCE |
2639 CAMP: A TWO-STAGE APPROACH TO MODELLING PROSODY IN CONTEXT Zack Hodari, Alexis Moinet, Sri Karlapati, Jaime Lorenzo-Trueba, Thomas Merritt, Arnaud Joly, Ammar Abbas, Penny Karanasou, Thomas Drugman 2639 | CAMP: A TWO-STAGE APPROACH TO MODELLING PROSODY IN CONTEXT |
1687 CANET: CONTEXT-AWARE LOSS FOR DESCRIPTOR LEARNING Tianyou Chen, Xiaoguang Hu, Jin Xiao, Guofeng Zhang, Hui Ruan 1687 | CANET: CONTEXT-AWARE LOSS FOR DESCRIPTOR LEARNING |
5096 Canonical Polyadic Tensor Decomposition with Low-Rank Factor Matrices ANH-HUY PHAN, Petr Tichavsky, Konstantin Sobolev, Konstantin Sozykin, Dmitry Ermilov, Andrzej Cichocki 5096 | Canonical Polyadic Tensor Decomposition with Low-Rank Factor Matrices |
4761 CAPTURING BANDING IN IMAGES: DATABASE CONSTRUCTION AND OBJECTIVE ASSESSMENT Akshay Kapoor, Jatin Sapra, Zhou Wang 4761 | CAPTURING BANDING IN IMAGES: DATABASE CONSTRUCTION AND OBJECTIVE ASSESSMENT |
3859 CAPTURING MULTI-RESOLUTION CONTEXT BY DILATED SELF-ATTENTION Niko Moritz, Takaaki Hori, Jonathan Le Roux 3859 | CAPTURING MULTI-RESOLUTION CONTEXT BY DILATED SELF-ATTENTION |
3508 CAPTURING TEMPORAL DEPENDENCIES THROUGH FUTURE PREDICTION FOR CNN-BASED AUDIO CLASSIFIERS Hongwei Song, Jiqing Han, Shiwen Deng, Zhihao Du 3508 | CAPTURING TEMPORAL DEPENDENCIES THROUGH FUTURE PREDICTION FOR CNN-BASED AUDIO CLASSIFIERS |
2926 Cascade Attention Fusion for Fine-grained Image Captioning based on Multi-layer LSTM Shuang Wang, Yun Meng, Yu Gu, Lei Zhang, Xiutiao Ye, Jingxian Tian, Licheng Jiao 2926 | Cascade Attention Fusion for Fine-grained Image Captioning based on Multi-layer LSTM |
5206 CASCADED ALL-PASS FILTERS WITH RANDOMIZED CENTER FREQUENCIES AND PHASE POLARITY FOR ACOUSTIC AND SPEECH MEASUREMENT AND DATA AUGMENTATION Hideki Kawahara, Kohei Yatabe 5206 | CASCADED ALL-PASS FILTERS WITH RANDOMIZED CENTER FREQUENCIES AND PHASE POLARITY FOR ACOUSTIC AND SPEECH MEASUREMENT AND DATA AUGMENTATION |
3784 Cascaded encoders for unifying streaming and non-streaming ASR Arun Narayanan, Tara Sainath, Ruoming Pang, Jiahui Yu, Chung-Cheng Chiu, Rohit Prabhavalkar, Ehsan Variani, Trevor Strohman 3784 | Cascaded encoders for unifying streaming and non-streaming ASR |
2406 CASCADED MODELS WITH CYCLIC FEEDBACK FOR DIRECT SPEECH TRANSLATION Tsz Kin Lam, Shigehiko Schamoni, Stefan Riezler 2406 | CASCADED MODELS WITH CYCLIC FEEDBACK FOR DIRECT SPEECH TRANSLATION |
4130 CASCADED TIME + TIME-FREQUENCY UNET FOR SPEECH ENHANCEMENT: JOINTLY ADDRESSING CLIPPING, CODEC DISTORTIONS, AND GAPS Arun Asokan Nair, Kazuhito Koishida 4130 | CASCADED TIME + TIME-FREQUENCY UNET FOR SPEECH ENHANCEMENT: JOINTLY ADDRESSING CLIPPING, CODEC DISTORTIONS, AND GAPS |
4525 CASS-NAT: CTC ALIGNMENT-BASED SINGLE STEP NON-AUTOREGRESSIVE TRANSFORMER FOR SPEECH RECOGNITION Ruchao Fan, Wei Chu, Peng Chang, Jing Xiao 4525 | CASS-NAT: CTC ALIGNMENT-BASED SINGLE STEP NON-AUTOREGRESSIVE TRANSFORMER FOR SPEECH RECOGNITION |
5484 CATILOC: CAMERA IMAGE TRANSFORMER FOR INDOOR LOCALIZATION Ali Ghofrani, Rahil Mahdian Toroghi, Seyed Mojtaba Tabatabaie 5484 | CATILOC: CAMERA IMAGE TRANSFORMER FOR INDOOR LOCALIZATION |
2794 Centrality based number of cluster estimation in Graph clustering Mahdi Shamsi, Soosan Beheshti 2794 | Centrality based number of cluster estimation in Graph clustering |
2855 CGAN-NET: CLASS-GUIDED ASYMMETRIC NON-LOCAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION Hanlin Chen, Qingyong Hu, Jungang Yang, Jing Wu, Yulan Guo 2855 | CGAN-NET: CLASS-GUIDED ASYMMETRIC NON-LOCAL NETWORK FOR REAL-TIME SEMANTIC SEGMENTATION |
1707 CHANNEL ATTENTION RESIDUAL U-NET FOR RETINAL VESSEL SEGMENTATION Changlu Guo, Márton Szemenyei, Yangtao Hu, Wenle Wang, Wei Zhou, Yugen Yi 1707 | CHANNEL ATTENTION RESIDUAL U-NET FOR RETINAL VESSEL SEGMENTATION |
4852 CHANNEL-WISE MIX-FUSION DEEP NEURAL NETWORKS FOR ZERO-SHOT LEARNING Guowei Wang, Naiyang Guan, Hanjia Ye, Hang Cheng, Junjie Zhu 4852 | CHANNEL-WISE MIX-FUSION DEEP NEURAL NETWORKS FOR ZERO-SHOT LEARNING |
1797 CHARACTERIZATION OF MEMS MICROPHONE SENSITIVITY AND PHASE DISTRIBUTIONS WITH APPLICATIONS IN ARRAY PROCESSING Patrick W.A. Wijnings, Sander Stuijk, Rick Scholte, Henk Corporaal 1797 | CHARACTERIZATION OF MEMS MICROPHONE SENSITIVITY AND PHASE DISTRIBUTIONS WITH APPLICATIONS IN ARRAY PROCESSING |
3617 CHECKING PRNU USABILITY ON MODERN DEVICES Chiara Albisani, Massimo Iuliani, Alessandro Piva 3617 | CHECKING PRNU USABILITY ON MODERN DEVICES |
3605 CIF-BASED COLLABORATIVE DECODING FOR END-TO-END CONTEXTUAL SPEECH RECOGNITION Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu 3605 | CIF-BASED COLLABORATIVE DECODING FOR END-TO-END CONTEXTUAL SPEECH RECOGNITION |
1445 CLASS AWARE ROBUST TRAINING Zhikang Xia, Bin Chen, Tao Dai, Shutao Xia 1445 | CLASS AWARE ROBUST TRAINING |
3989 Class-Conditional Defense GAN Against End-to-End Speech Attacks Mohammad Esmaeilpour, Patrick Cardinal, Alessandro Lameiras Koerich 3989 | Class-Conditional Defense GAN Against End-to-End Speech Attacks |
2372 CLASSIFICATION OF EXPERT-NOVICE LEVEL USING EYE TRACKING AND MOTION DATA VIA CONDITIONAL MULTIMODAL VARIATIONAL AUTOENCODER Yusuke Akamatsu, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama 2372 | CLASSIFICATION OF EXPERT-NOVICE LEVEL USING EYE TRACKING AND MOTION DATA VIA CONDITIONAL MULTIMODAL VARIATIONAL AUTOENCODER |
2273 CLASSIFYING SPEECH INTELLIGIBILITY LEVELS OF CHILDREN IN TWO CONTINUOUS SPEECH STYLES Yeh-Sheng Lin, Shu-Chuan Tseng 2273 | CLASSIFYING SPEECH INTELLIGIBILITY LEVELS OF CHILDREN IN TWO CONTINUOUS SPEECH STYLES |
4393 CLASS-IMBALANCED CLASSIFIERS USING ENSEMBLES OF GAUSSIAN PROCESSES AND GAUSSIAN PROCESS LATENT VARIABLE MODELS Liu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djurić 4393 | CLASS-IMBALANCED CLASSIFIERS USING ENSEMBLES OF GAUSSIAN PROCESSES AND GAUSSIAN PROCESS LATENT VARIABLE MODELS |
4636 CLOSE-TALKING RECORDING WITH PLANARLY DISTRIBUTED MICROPHONES Takuma Okamoto 4636 | CLOSE-TALKING RECORDING WITH PLANARLY DISTRIBUTED MICROPHONES |
1166 CLUSTERING A COLLECTION OF NETWORKS WITH MIXTURES OF L1-SPARSE GRAPHICAL MODELS Zuogong Yue, Victor Solo 1166 | CLUSTERING A COLLECTION OF NETWORKS WITH MIXTURES OF L1-SPARSE GRAPHICAL MODELS |
4002 CNN-BASED SPOKEN TERM DETECTION AND LOCALIZATION WITHOUT DYNAMIC PROGRAMMING Tzeviya Sylvia Fuchs, Yael Segal, Joseph Keshet 4002 | CNN-BASED SPOKEN TERM DETECTION AND LOCALIZATION WITHOUT DYNAMIC PROGRAMMING |
3246 COARSE-TO-CAREFUL: SEEKING SEMANTIC-RELATED KNOWLEDGE FOR OPEN-DOMAIN COMMONSENSE QUESTION ANSWERING Luxi Xing, Yue Hu, Jing Yu, Yuqiang Xie, Wei Peng 3246 | COARSE-TO-CAREFUL: SEEKING SEMANTIC-RELATED KNOWLEDGE FOR OPEN-DOMAIN COMMONSENSE QUESTION ANSWERING |
5292 CO-ATTENTIONAL TRANSFORMERS FOR STORY-BASED VIDEO UNDERSTANDING Björn Bebensee, Byoung-Tak Zhang 5292 | CO-ATTENTIONAL TRANSFORMERS FOR STORY-BASED VIDEO UNDERSTANDING |
5485 CO-CAPSULE NETWORKS BASED KNOWLEDGE TRANSFER FOR CROSS-DOMAIN RECOMMENDATION Huiyuan Li, Li Yu, Youfang Leng, Qihan Du 5485 | CO-CAPSULE NETWORKS BASED KNOWLEDGE TRANSFER FOR CROSS-DOMAIN RECOMMENDATION |
3873 CODEBOOK DESIGN FOR DUAL-POLARIZED ULTRA-MASSIVE MIMO COMMUNICATIONS AT MILLIMETER WAVE AND TERAHERTZ BANDS Shuai Nie, Ian Akyildiz 3873 | CODEBOOK DESIGN FOR DUAL-POLARIZED ULTRA-MASSIVE MIMO COMMUNICATIONS AT MILLIMETER WAVE AND TERAHERTZ BANDS |
1621 Code-Switch Speech Rescoring With Monolingual Data Guoyu Liu, Lixin Cao 1621 | Code-Switch Speech Rescoring With Monolingual Data |
4104 Cognitive Memory Constrained Human Decision Making based on Multi-source Information BAOCHENG GENG, Chen Quan, Pramod Varshney 4104 | Cognitive Memory Constrained Human Decision Making based on Multi-source Information |
4873 COLD START REVISITED: A DEEP HYBRID RECOMMENDER WITH COLD-WARM ITEM HARMONIZATION Oren Barkan, Roy Hirsch, Ori Katz, Avi Caciularu, Yoni Weill, Noam Koenigstein 4873 | COLD START REVISITED: A DEEP HYBRID RECOMMENDER WITH COLD-WARM ITEM HARMONIZATION |
1291 COLLABORATIVE INFERENCE VIA ENSEMBLES ON THE EDGE Nir Shlezinger, Erez Farhan, Hai Morgenstern, Yonina Eldar 1291 | COLLABORATIVE INFERENCE VIA ENSEMBLES ON THE EDGE |
3984 COLLABORATIVE INTELLIGENCE: CHALLENGES AND OPPORTUNITIES Ivan Bajic, Weisi Lin, Yonghong Tian 3984 | COLLABORATIVE INTELLIGENCE: CHALLENGES AND OPPORTUNITIES |
1827 COLLABORATIVE LEARNING TO GENERATE AUDIO-VIDEO JOINTLY Vinod Kurmi, Vipul Bajaj, Badri Patro, Venkatesh K Subramanian, Vinay Namboodiri, Preethi Jyothi 1827 | COLLABORATIVE LEARNING TO GENERATE AUDIO-VIDEO JOINTLY |
1292 COMBINED DIFFERENTIAL BEAMFORMING WITH UNIFORM LINEAR MICROPHONE ARRAYS Gongping Huang, Yuzhu Wang, Jacob Benesty, Israel Cohen, Jingdong Chen 1292 | COMBINED DIFFERENTIAL BEAMFORMING WITH UNIFORM LINEAR MICROPHONE ARRAYS |
3596 COMBINING ADAPTIVE FILTERING AND COMPLEX-VALUED DEEP POSTFILTERING FOR ACOUSTIC ECHO CANCELLATION Mhd Modar Halimeh, Thomas Haubner, Annika Briegleb, Alexander Schmidt, Walter Kellermann 3596 | COMBINING ADAPTIVE FILTERING AND COMPLEX-VALUED DEEP POSTFILTERING FOR ACOUSTIC ECHO CANCELLATION |
1562 COMBINING DYNAMIC IMAGE AND PREDICTION ENSEMBLE FOR CROSS-DOMAIN FACE ANTI-SPOOFING Lingling Lv, Youjun Xiang, Xianfeng Li, Hanye Huang, Rongju Ruan, Xiaoyan Xu, Yuli Fu 1562 | COMBINING DYNAMIC IMAGE AND PREDICTION ENSEMBLE FOR CROSS-DOMAIN FACE ANTI-SPOOFING |
1154 COMMUNICATION OVER BLOCK FADING CHANNELS - AN ALGORITHMIC PERSPECTIVE ON OPTIMAL TRANSMISSION SCHEMES Holger Boche, Rafael F. Schaefer, H. Vincent Poor 1154 | COMMUNICATION OVER BLOCK FADING CHANNELS - AN ALGORITHMIC PERSPECTIVE ON OPTIMAL TRANSMISSION SCHEMES |
4759 COMMUNICATION-COST AWARE MICROPHONE SELECTION FOR NEURAL SPEECH ENHANCEMENT WITH AD-HOC MICROPHONE ARRAYS Jonah Casebeer, Jamshed Kaikaus, Paris Smaragdis 4759 | COMMUNICATION-COST AWARE MICROPHONE SELECTION FOR NEURAL SPEECH ENHANCEMENT WITH AD-HOC MICROPHONE ARRAYS |
1623 COMPACT GRAPH ARCHITECTURE FOR SPEECH EMOTION RECOGNITION Amir Shirian, Tanaya Guha 1623 | COMPACT GRAPH ARCHITECTURE FOR SPEECH EMOTION RECOGNITION |
2999 COMPARATIVE STUDY OF DIFFERENT EPOCH EXTRACTION METHODS FOR SPEECH ASSOCIATED WITH VOICE DISORDERS Purva Barche, Krishna Gurugubelli, Anil Kumar Vuppala 2999 | COMPARATIVE STUDY OF DIFFERENT EPOCH EXTRACTION METHODS FOR SPEECH ASSOCIATED WITH VOICE DISORDERS |
3941 COMPARISON OF DEEP CO-TRAINING AND MEAN-TEACHER APPROACHES FOR SEMI-SUPERVISED AUDIO TAGGING Léo Cances, Thomas Pellegrini 3941 | COMPARISON OF DEEP CO-TRAINING AND MEAN-TEACHER APPROACHES FOR SEMI-SUPERVISED AUDIO TAGGING |
5610 Comparison of Wavelet and RID-Rihaczek Based Methods for Phase-Amplitude Coupling Tamanna Tabassum Khan Munia, Selin Aviyente 5610 | Comparison of Wavelet and RID-Rihaczek Based Methods for Phase-Amplitude Coupling |
2587 COMPLEX RATIO MASKING FOR SINGING VOICE SEPARATION Yixuan Zhang, Yuzhou Liu, DeLiang Wang 2587 | COMPLEX RATIO MASKING FOR SINGING VOICE SEPARATION |
4997 COMPLEX-VALUED VS. REAL-VALUED NEURAL NETWORKS FOR CLASSIFICATION PERSPECTIVES: AN EXAMPLE ON NON-CIRCULAR DATA Jose Agustin BARRACHINA, Chengfang REN, Christele Morisseau, Gilles Vieillard, Jean-Philippe Ovarlez 4997 | COMPLEX-VALUED VS. REAL-VALUED NEURAL NETWORKS FOR CLASSIFICATION PERSPECTIVES: AN EXAMPLE ON NON-CIRCULAR DATA |
2674 COMPOSITIONAL EMBEDDING MODELS FOR SPEAKER IDENTIFICATION AND DIARIZATION WITH SIMULTANEOUS SPEECH FROM 2+ SPEAKERS Zeqian Li, Jacob Whitehill 2674 | COMPOSITIONAL EMBEDDING MODELS FOR SPEAKER IDENTIFICATION AND DIARIZATION WITH SIMULTANEOUS SPEECH FROM 2+ SPEAKERS |
4938 COMPRESSED REPRESENTATION OF CEPSTRAL COEFFICIENTS VIA RECURRENT NEURAL NETWORKS FOR INFORMED SPEECH ENHANCEMENT Carol Chermaz, Dario Leuchtmann, Simon Tanner, Roger Wattenhofer 4938 | COMPRESSED REPRESENTATION OF CEPSTRAL COEFFICIENTS VIA RECURRENT NEURAL NETWORKS FOR INFORMED SPEECH ENHANCEMENT |
2129 COMPRESSING DEEP NEURAL NETWORKS FOR EFFICIENT SPEECH ENHANCEMENT Ke Tan, DeLiang Wang 2129 | COMPRESSING DEEP NEURAL NETWORKS FOR EFFICIENT SPEECH ENHANCEMENT |
1811 COMPRESSING LOCAL DESCRIPTOR MODELS FOR MOBILE APPLICATIONS Roy Miles, Krystian Mikolajczyk 1811 | COMPRESSING LOCAL DESCRIPTOR MODELS FOR MOBILE APPLICATIONS |
2424 COMPRESSIVE SIGNAL RECOVERY UNDER SENSING MATRIX ERRORS COMBINED WITH UNKNOWN MEASUREMENT GAINS Jian Vora, Ajit Rajwade 2424 | COMPRESSIVE SIGNAL RECOVERY UNDER SENSING MATRIX ERRORS COMBINED WITH UNKNOWN MEASUREMENT GAINS |
2985 COMPRESSIVE WIDEBAND SPECTRUM SENSING AND CARRIER FREQUENCY ESTIMATION WITH UNKNOWN MIMO CHANNELS Hongwei Wang, Jilin Wang, Jun Fang, Hongbin Li 2985 | COMPRESSIVE WIDEBAND SPECTRUM SENSING AND CARRIER FREQUENCY ESTIMATION WITH UNKNOWN MIMO CHANNELS |
4013 COMPUTATIONALLY EFFICIENT DNN-BASED APPROXIMATION OF AN AUDITORY MODEL FOR APPLICATIONS IN SPEECH PROCESSING Anil Nagathil, Florian Göbel, Alexandru Nelus, Ian C. Bruce 4013 | COMPUTATIONALLY EFFICIENT DNN-BASED APPROXIMATION OF AN AUDITORY MODEL FOR APPLICATIONS IN SPEECH PROCESSING |
3877 CONFIDENCE ESTIMATION FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS FOR SPEECH RECOGNITION Qiujia Li, David Qiu, Yu Zhang, Bo Li, Yanzhang He, Phil Woodland, Liangliang Cao, Trevor Strohman 3877 | CONFIDENCE ESTIMATION FOR ATTENTION-BASED SEQUENCE-TO-SEQUENCE MODELS FOR SPEECH RECOGNITION |
5626 Consensus Based Distributed Spectral Radius Estimation Gowtham Muniraju, Cihan Tepedelenlioglu, Andreas Spanias 5626 | Consensus Based Distributed Spectral Radius Estimation |
2946 Constant approximation algorithm for minimizing concave impurity Thuan Nguyen, Hoang Le, Thinh Nguyen 2946 | Constant approximation algorithm for minimizing concave impurity |
3069 CONSTRAINED TENSOR DECOMPOSITION FOR 2D DOA ESTIMATION IN TRANSMIT BEAMSPACE MIMO RADAR WITH SUBARRAYS Feng Xu, Sergiy Vorobyov 3069 | CONSTRAINED TENSOR DECOMPOSITION FOR 2D DOA ESTIMATION IN TRANSMIT BEAMSPACE MIMO RADAR WITH SUBARRAYS |
1351 CONSTRUCTION OF A LARGE-SCALE JAPANESE ASR CORPUS ON TV RECORDINGS Shintaro Ando, Hiromasa Fujihara 1351 | CONSTRUCTION OF A LARGE-SCALE JAPANESE ASR CORPUS ON TV RECORDINGS |
1846 CONSTRUCTION OF UNIT-NORM TIGHT FRAME BASED PRECONDITIONER FOR SPARSE CODING Huang Bai, Chuanrong Hong, Xiumei Li 1846 | CONSTRUCTION OF UNIT-NORM TIGHT FRAME BASED PRECONDITIONER FOR SPARSE CODING |
4653 CONTACT TRACING ENHANCES THE EFFICIENCY OF COVID-19 GROUP TESTING Ritesh Goenka, Shu-Jie Cao, Chau-Wai Wong, Ajit Rajwade, Dror Baron 4653 | CONTACT TRACING ENHANCES THE EFFICIENCY OF COVID-19 GROUP TESTING |
4112 Content-Aware Speaker Embeddings for Speaker Diarisation Guangzhi Sun, Danyi Liu, Chao Zhang, Phil Woodland 4112 | Content-Aware Speaker Embeddings for Speaker Diarisation |
3949 CONTEXT-AWARE PROSODY CORRECTION FOR TEXT-BASED SPEECH EDITING Max Morrison, Lucas Rencker, Zeyu Jin, Nicholas Bryan, Juan-Pablo Caceres, Bryan Pardo 3949 | CONTEXT-AWARE PROSODY CORRECTION FOR TEXT-BASED SPEECH EDITING |
3568 Context-Aware Speech Stress Detection in Hospital Workers Using Bi-LSTM Classifiers Amr Gaballah, Abhishek Tiwari, Shrikanth Narayanan, Tiago Falk 3568 | Context-Aware Speech Stress Detection in Hospital Workers Using Bi-LSTM Classifiers |
2150 CONTINUOUS CNN FOR NONUNIFORM TIME SERIES Hui Shi, Yang Zhang, Hao Wu, Shiyu Chang, Kaizhi Qian, Mark Hasegawa-Johnson, Jishen Zhao 2150 | CONTINUOUS CNN FOR NONUNIFORM TIME SERIES |
5149 CONTINUOUS FACE AGING GENERATIVE ADVERSARIAL NETWORKS Seogkyu Jeon, Pilhyeon Lee, Kibeom Hong, Hyeran Byun 5149 | CONTINUOUS FACE AGING GENERATIVE ADVERSARIAL NETWORKS |
1894 CONTINUOUS SPEECH SEPARATION WITH CONFORMER Sanyuan Chen, Yu Wu, Zhuo Chen, Jian Wu, Jinyu Li, Takuya Yoshioka, Chengyi Wang, Shujie Liu, Ming Zhou 1894 | CONTINUOUS SPEECH SEPARATION WITH CONFORMER |
3525 CONTINUOUS-TIME SELF-ATTENTION IN NEURAL DIFFERENTIAL EQUATION Jen-Tzung Chien, Yi-Hsiang Chen 3525 | CONTINUOUS-TIME SELF-ATTENTION IN NEURAL DIFFERENTIAL EQUATION |
5171 CONTRASTIVE EMBEDDIND LEARNING METHOD FOR RESPIRATORY SOUND CLASSIFICATION Wenjie Song, Jiqing Han, Hongwei Song 5171 | CONTRASTIVE EMBEDDIND LEARNING METHOD FOR RESPIRATORY SOUND CLASSIFICATION |
3982 Contrastive learning for perceptual audio similarity Pranay Manocha, Zeyu Jin, Richard Zhang, Adam Finkelstein 3982 | Contrastive learning for perceptual audio similarity |
2114 CONTRASTIVE LEARNING OF GENERAL-PURPOSE AUDIO REPRESENTATIONS Aaqib Saeed, David Grangier, Neil Zeghidour 2114 | CONTRASTIVE LEARNING OF GENERAL-PURPOSE AUDIO REPRESENTATIONS |
5300 Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations Janek Ebbers, Michael Kuhlmann, Tobias Cord-Landwehr, Reinhold Haeb-Umbach 5300 | Contrastive Predictive Coding Supported Factorized Variational Autoencoder for Unsupervised Learning of Disentangled Speech Representations |
2553 Contrastive Self-supervised Learning for Text-independent Speaker Verification Haoran Zhang, Yuexian Zou, Helin Wang 2553 | Contrastive Self-supervised Learning for Text-independent Speaker Verification |
5025 CONTRASTIVE SELF-SUPERVISED LEARNING FOR WIRELESS POWER CONTROL Navid Naderializadeh 5025 | CONTRASTIVE SELF-SUPERVISED LEARNING FOR WIRELESS POWER CONTROL |
4042 CONTRASTIVE SEMI-SUPERVISED LEARNING FOR ASR Alex Xiao, Christian Fuegen, Abdelrahman Mohamed 4042 | CONTRASTIVE SEMI-SUPERVISED LEARNING FOR ASR |
5179 CONTRASTIVE SEPARATIVE CODING FOR SELF-SUPERVISED REPRESENTATION LEARNING Jun Wang, Max W. Y. Lam, Dan Su, Dong Yu 5179 | CONTRASTIVE SEPARATIVE CODING FOR SELF-SUPERVISED REPRESENTATION LEARNING |
3434 CONTRASTIVE UNSUPERVISED LEARNING FOR SPEECH EMOTION RECOGNITION Mao Li, Bo Yang, Joshua Levy, Andreas Stolcke, Viktor Rozgic, Spyros Matsoukas, Constantinos Papayiannis, Daniel Bone, Chao Wang 3434 | CONTRASTIVE UNSUPERVISED LEARNING FOR SPEECH EMOTION RECOGNITION |
1514 CONTROL ARCHITECTURE OF THE DOUBLE-CROSS-CORRELATION PROCESSOR FOR SAMPLING-RATE-OFFSET ESTIMATION IN ACOUSTIC SENSOR NETWORKS Aleksej Chinaev, Sven Wienand, Gerald Enzner 1514 | CONTROL ARCHITECTURE OF THE DOUBLE-CROSS-CORRELATION PROCESSOR FOR SAMPLING-RATE-OFFSET ESTIMATION IN ACOUSTIC SENSOR NETWORKS |
2116 CONTROLLED TESTING AND ISOLATION FOR SUPPRESSING COVID-19 Kobi Cohen, Amir Leshem 2116 | CONTROLLED TESTING AND ISOLATION FOR SUPPRESSING COVID-19 |
3490 CONVERGENCE ANALYSIS OF THE GRAPH-TOPOLOGY-INFERENCE KERNEL LMS ALGORITHM Mircea Moscu, Ricardo Borsoi, Cédric Richard 3490 | CONVERGENCE ANALYSIS OF THE GRAPH-TOPOLOGY-INFERENCE KERNEL LMS ALGORITHM |
3735 CONVERSATIONAL QUERY REWRITING WITH SELF-SUPERVISED LEARNING Hang Liu, Meng Chen, Youzheng Wu, Xiaodong He, Bowen Zhou 3735 | CONVERSATIONAL QUERY REWRITING WITH SELF-SUPERVISED LEARNING |
5131 CONVEX NEURAL AUTOREGRESSIVE MODELS: TOWARDS TRACTABLE, EXPRESSIVE, AND THEORETICALLY-BACKED MODELS FOR SEQUENTIAL FORECASTING AND GENERATION Vikul Gupta, Burak Bartan, Tolga Ergen, Mert Pilanci 5131 | CONVEX NEURAL AUTOREGRESSIVE MODELS: TOWARDS TRACTABLE, EXPRESSIVE, AND THEORETICALLY-BACKED MODELS FOR SEQUENTIAL FORECASTING AND GENERATION |
4166 CONVOLUTIONAL DROPOUT AND WORDPIECE AUGMENTATION FOR END-TO-END SPEECH RECOGNITION Hainan Xu, Yinghui Huang, Yun Zhu, Kartik Audhkhasi, Bhuvana Ramabhadran 4166 | CONVOLUTIONAL DROPOUT AND WORDPIECE AUGMENTATION FOR END-TO-END SPEECH RECOGNITION |
3298 CONVOLUTIONAL NEURAL NETWORK-AIDED BIT-FLIPPING FOR BELIEF PROPAGATION DECODING OF POLAR CODES Chieh-Fang Teng, Andrew Kuan-Shiuan Ho, Chen-Hsi Wu, Sin-Sheng Wong, An-Yeu Wu 3298 | CONVOLUTIONAL NEURAL NETWORK-AIDED BIT-FLIPPING FOR BELIEF PROPAGATION DECODING OF POLAR CODES |
3052 CONVOLUTIVE TRANSFER FUNCTION INVARIANT SDR TRAINING CRITERIA FOR MULTI-CHANNEL REVERBERANT SPEECH SEPARATION Christoph Boeddeker, Wangyou Zhang, Tomohiro Nakatani, Keisuke Kinoshita, Tsubasa Ochiai, Marc Delcroix, Naoyuki Kamo, Yanmin Qian, Reinhold Haeb-Umbach 3052 | CONVOLUTIVE TRANSFER FUNCTION INVARIANT SDR TRAINING CRITERIA FOR MULTI-CHANNEL REVERBERANT SPEECH SEPARATION |
5591 COOPERATIVE PARAMETER ESTIMATION ON THE UNIT SPHERE USING A NETWORK OF DIFFUSION PARTICLE FILTERS Caio de Figueredo, Claudio Bordin, Marcelo Bruno 5591 | COOPERATIVE PARAMETER ESTIMATION ON THE UNIT SPHERE USING A NETWORK OF DIFFUSION PARTICLE FILTERS |
1179 COOPERATIVE PARAMETER TRACKING ON THE UNIT SPHERE USING DISTRIBUTED ADAPT-THEN-COMBINE PARTICLE FILTERS AND PARALLEL TRANSPORT Caio Figueredo, Claudio Bordin, Marcelo Bruno 1179 | COOPERATIVE PARAMETER TRACKING ON THE UNIT SPHERE USING DISTRIBUTED ADAPT-THEN-COMBINE PARTICLE FILTERS AND PARALLEL TRANSPORT |
2733 Cooperative Scenarios For Multi-agent Reinforcement learning In Wireless Edge Caching Navneet Garg, Tharmalingam Ratnarajah 2733 | Cooperative Scenarios For Multi-agent Reinforcement learning In Wireless Edge Caching |
3066 COOPNET: MULTI-MODAL COOPERATIVE GENDER PREDICTION IN SOCIAL MEDIA USER PROFILING Lin Li, Kaixi Hu, Yunpei Zheng, Jianquan Liu, Kong Aik Lee 3066 | COOPNET: MULTI-MODAL COOPERATIVE GENDER PREDICTION IN SOCIAL MEDIA USER PROFILING |
4110 CopyPaste: An Augmentation Method for Speech Emotion Recognition Raghavendra Pappagari, Jesus Villalba, Piotr Zelasko, Laureano Moro-Velazquez, Najim Dehak 4110 | CopyPaste: An Augmentation Method for Speech Emotion Recognition |
2093 CORRELATION-BASED ROBUST LINEAR REGRESSION WITH ITERATIVE OUTLIER REMOVAL Jian Ding, Jianji Wang, Yue Zhang, Yuanjie Li, Nanning Zheng 2093 | CORRELATION-BASED ROBUST LINEAR REGRESSION WITH ITERATIVE OUTLIER REMOVAL |
5537 CORRUPTED CONTEXTUAL BANDITS: ONLINE LEARNING WITH CORRUPTED CONTEXT Djallel Bouneffouf 5537 | CORRUPTED CONTEXTUAL BANDITS: ONLINE LEARNING WITH CORRUPTED CONTEXT |
2960 Cost Affinity Learning Network for Stereo Matching Shenglun Chen, Baopu Li, Wei Wang, Hong Zhang, Haojie Li, Zhihui Wang 2960 | Cost Affinity Learning Network for Stereo Matching |
2745 COUGHWATCH: REAL-WORLD COUGH DETECTION USING SMARTWATCHES Daniyal Liaqat, Salaar Liaqat, Jun Lin Chen, Tina Sedaghat, Moshe Gabel, Frank Rudzicz, Eyal de Lara 2745 | COUGHWATCH: REAL-WORLD COUGH DETECTION USING SMARTWATCHES |
5479 COUNT AND SEPARATE: INCORPORATING SPEAKER COUNTING FOR CONTINUOUS SPEAKER SEPARATION Zhong-Qiu Wang, DeLiang Wang 5479 | COUNT AND SEPARATE: INCORPORATING SPEAKER COUNTING FOR CONTINUOUS SPEAKER SEPARATION |
5164 COUNT SKETCH WITH ZERO CHECKING: EFFICIENT RECOVERY OF HEAVY COMPONENTS Guanqiang Zhou, Zhi Tian 5164 | COUNT SKETCH WITH ZERO CHECKING: EFFICIENT RECOVERY OF HEAVY COMPONENTS |
4685 crank: an open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder Kazuhiro Kobayashi, Wen-Chin Huang, Yi-Chiao Wu, Patrick Lumban Tobing, Tomoki Hayashi, Tomoki Toda 4685 | crank: an open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder |
2659 Cross Scene Video Foreground Segmentation via Co-occurrence Probability Oriented Supervised and Unsupervised Model Interaction Dong Liang, Bin Kang, Xinyu Liu 2659 | Cross Scene Video Foreground Segmentation via Co-occurrence Probability Oriented Supervised and Unsupervised Model Interaction |
4878 Cross-Corpus Speech Emotion Recognition Using Joint Distribution Adaptive Regression Jiacheng Zhang, Lin Jiang, Yuan Zong, Wenming Zheng, Li Zhao 4878 | Cross-Corpus Speech Emotion Recognition Using Joint Distribution Adaptive Regression |
3360 Cross-Domain Semi-Supervised Deep Metric Learning for Image Sentiment Analysis Yun Liang, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama 3360 | Cross-Domain Semi-Supervised Deep Metric Learning for Image Sentiment Analysis |
5110 Cross-Domain Sentiment Classification With Contrastive Learning and Mutual Information Maximization Tian Li, Xiang Chen, Shanghang Zhang, Zhen Dong, Kurt Keutzer 5110 | Cross-Domain Sentiment Classification With Contrastive Learning and Mutual Information Maximization |
2155 CROSS-MODAL INFORMATION MAXIMIZATION FOR MEDICAL IMAGING: CMIM Tristan Sylvain, Francis Dutil, Tess Berthier, Lisa Di Jorio, Margaux Luck, Devon Hjelm, Yoshua Bengio 2155 | CROSS-MODAL INFORMATION MAXIMIZATION FOR MEDICAL IMAGING: CMIM |
1572 CROSS-MODAL KNOWLEDGE DISTILLATION FOR FINE-GRAINED ONE-SHOT CLASSIFICATION Jiabao Zhao, Xin Lin, Yifan Yang, Jing Yang, Liang He 1572 | CROSS-MODAL KNOWLEDGE DISTILLATION FOR FINE-GRAINED ONE-SHOT CLASSIFICATION |
1491 Cross-Modal Representation Reconstruction for Zero-Shot Classification Yu Wang, Shengjie Zhao 1491 | Cross-Modal Representation Reconstruction for Zero-Shot Classification |
5033 CROSS-SILO FEDERATED TRAINING IN THE CLOUD WITH DIVERSITY SCALING AND SEMI-SUPERVISED LEARNING Kishore Nandury, Anand Mohan, Frederick Weber 5033 | CROSS-SILO FEDERATED TRAINING IN THE CLOUD WITH DIVERSITY SCALING AND SEMI-SUPERVISED LEARNING |
3498 CROSS-TEAGER ENERGY CEPSTRAL COEFFICIENTS FOR REPLAY SPOOF DETECTION ON VOICE ASSISTANTS Rajul Acharya, Harsh Kotta, Ankur T. Patil, Hemant A. Patil 3498 | CROSS-TEAGER ENERGY CEPSTRAL COEFFICIENTS FOR REPLAY SPOOF DETECTION ON VOICE ASSISTANTS |
1759 Crowd Counting via multi-level regression with Latent Gaussian maps Yukang Gao, Hua Yang 1759 | Crowd Counting via multi-level regression with Latent Gaussian maps |
4622 Crowdsourcing approach for subjective evaluation of echo impairment Ross Cutler, Babak Nadari, Markus Loide, Sten Sootla, Ando Saabas 4622 | Crowdsourcing approach for subjective evaluation of echo impairment |
3747 CRYPTO-ORIENTED NEURAL ARCHITECTURE DESIGN Avital Shafran, Gil Segev, Shmuel Peleg, Yedid Hoshen 3747 | CRYPTO-ORIENTED NEURAL ARCHITECTURE DESIGN |
2648 CT-CAPS: Feature Extraction-based Automated Framework for COVID-19 Disease Identification from Chest CT Scans using Capsule Networks Shahin Heidarian, Parnian Afshar, Arash Mohammadi, Moezedin Javad Rafiee, Anastasia Oikonomou, Konstantinos N. Plataniotis, Farnoosh Naderkhani 2648 | CT-CAPS: Feature Extraction-based Automated Framework for COVID-19 Disease Identification from Chest CT Scans using Capsule Networks |
3787 CUE-PRESERVING MMSE FILTER WITH BAYESIAN SNR MARGINALIZATION FOR BINAURAL SPEECH ENHANCEMENT Stefan Thaleiser, Gerald Enzner 3787 | CUE-PRESERVING MMSE FILTER WITH BAYESIAN SNR MARGINALIZATION FOR BINAURAL SPEECH ENHANCEMENT |
2615 CYCLE GENERATIVE ADVERSARIAL NETWORK APPROACHES TO PRODUCE NOVEL PORTABLE CHEST X-RAYS IMAGES FOR COVID-19 DIAGNOSIS Daniel I. Morís, Joaquim de Moura, Jorge Novo, Marcos Ortega 2615 | CYCLE GENERATIVE ADVERSARIAL NETWORK APPROACHES TO PRODUCE NOVEL PORTABLE CHEST X-RAYS IMAGES FOR COVID-19 DIAGNOSIS |
4597 DAG-GAN: CAUSAL STRUCTURE LEARNING WITH GENERATIVE ADVERSARIAL NETS Yinghua Gao, Li Shen, Shu-Tao Xia 4597 | DAG-GAN: CAUSAL STRUCTURE LEARNING WITH GENERATIVE ADVERSARIAL NETS |
1802 DATA AUGMENTATION WITH SIGNAL COMPANDING FOR DETECTION OF LOGICAL ACCESS ATTACKS Rohan Kumar Das, Jichen Yang, Haizhou Li 1802 | DATA AUGMENTATION WITH SIGNAL COMPANDING FOR DETECTION OF LOGICAL ACCESS ATTACKS |
2535 DATA DISCOVERY USING LOSSLESS COMPRESSION-BASED SPARSE REPRESENTATION Elyas Sabeti, Peter Song, Alfred Hero 2535 | DATA DISCOVERY USING LOSSLESS COMPRESSION-BASED SPARSE REPRESENTATION |
3828 DATA FUSION FOR AUDIOVISUAL SPEAKER LOCALIZATION: EXTENDING DYNAMIC STREAM WEIGHTS TO THE SPATIAL DOMAIN Julio Wissing, Benedikt Boenninghoff, Dorothea Kolossa, Tsubasa Ochiai, Marc Delcroix, Keisuke Kinoshita, Tomohiro Nakatani, Shoko Araki, Christopher Schymura 3828 | DATA FUSION FOR AUDIOVISUAL SPEAKER LOCALIZATION: EXTENDING DYNAMIC STREAM WEIGHTS TO THE SPATIAL DOMAIN |
1297 Data-Driven Adaptive Network Resource Slicing for Multi-Tenant Networks Navid Reyhanian, Hamid Farmanbar, Zhi-Quan Luo 1297 | Data-Driven Adaptive Network Resource Slicing for Multi-Tenant Networks |
3624 DATA-EFFICIENT FRAMEWORK FOR REAL-WORLD MULTIPLE SOUND SOURCE 2D LOCALIZATION Guillaume Le Moing, Phongtharin Vinayavekhin, Don Joven Agravante, Tadanobu Inoue, Jayakorn Vongkulbhisal, Asim Munawar, Ryuki Tachibana 3624 | DATA-EFFICIENT FRAMEWORK FOR REAL-WORLD MULTIPLE SOUND SOURCE 2D LOCALIZATION |
3123 DBNET: DOA-DRIVEN BEAMFORMING NETWORK FOR END-TO-END FARFIELD SOUND SOURCE SEPARATION Ali Aroudi, Sebastian Braun 3123 | DBNET: DOA-DRIVEN BEAMFORMING NETWORK FOR END-TO-END FARFIELD SOUND SOURCE SEPARATION |
1345 DCASENET: AN INTEGRATED PRETRAINED DEEP NEURAL NETWORK FOR DETECTING AND CLASSIFYING ACOUSTIC SCENES AND EVENTS Jee-weon Jung, Hye-jin Shim, Ju-ho Kim, Ha-Jin Yu 1345 | DCASENET: AN INTEGRATED PRETRAINED DEEP NEURAL NETWORK FOR DETECTING AND CLASSIFYING ACOUSTIC SCENES AND EVENTS |
3722 DEAAN: DISENTANGLED EMBEDDING AND ADVERSARIAL ADAPTATION NETWORK FOR ROBUST SPEAKER REPRESENTATION LEARNING Mufan Sang, Wei Xia, John H.L. Hansen 3722 | DEAAN: DISENTANGLED EMBEDDING AND ADVERSARIAL ADAPTATION NETWORK FOR ROBUST SPEAKER REPRESENTATION LEARNING |
2986 Decentralized Deep Learning using Momentum-Accelerated Consensus Aditya Balu, Zhanhong Jiang, Sin Yong Tan, Chinmay Hedge, Young M Lee, Soumik Sarkar 2986 | Decentralized Deep Learning using Momentum-Accelerated Consensus |
4613 Decentralized motion inference and registration of Neuropixel data Erdem Varol, Julien Boussard, Nishchal Dethe, Liam Paninski 4613 | Decentralized motion inference and registration of Neuropixel data |
4252 DECENTRALIZED OPTIMIZATION ON TIME-VARYING DIRECTED GRAPHS UNDER COMMUNICATION CONSTRAINTS Yiyue Chen, Abolfazl Hashemi, Haris Vikalo 4252 | DECENTRALIZED OPTIMIZATION ON TIME-VARYING DIRECTED GRAPHS UNDER COMMUNICATION CONSTRAINTS |
4654 DECENTRALIZED OPTIMIZATION OVER NOISY, RATE-CONSTRAINED NETWORKS: HOW WE AGREE BY TALKING ABOUT HOW WE DISAGREE Rajarshi Saha, Stefano Rini, Milind Rao, Andrea Goldsmith 4654 | DECENTRALIZED OPTIMIZATION OVER NOISY, RATE-CONSTRAINED NETWORKS: HOW WE AGREE BY TALKING ABOUT HOW WE DISAGREE |
1396 Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition Chao-Han Huck Yang, Jun Qi, Pin-Yu Chen, Yen-Chi Samuel Chen, Sabato Marco Siniscalchi, Xiaoli Ma, Chin-Hui Lee 1396 | Decentralizing Feature Extraction with Quantum Convolutional Neural Network for Automatic Speech Recognition |
3204 DECISION TREE BASED INTER PARTITION TERMINATION FOR AV1 ENCODING Xinyao Chen, Yiwei Zhang, Yanghao Li, Jiangtao Wen 3204 | DECISION TREE BASED INTER PARTITION TERMINATION FOR AV1 ENCODING |
4548 DECODING MUSIC ATTENTION FROM "EEG HEADPHONES": A USER-FRIENDLY AUDITORY BRAIN-COMPUTER INTERFACE Wenkang An, Barbara Shinn-Cunningham, Hannes Gamper, Dimitra Emmanouilidou, David Johnston, Mihai Jalobeanu, Edward Cutrell, Andrew Wilson, Kuan-Jung Chiang, Ivan Tashev 4548 | DECODING MUSIC ATTENTION FROM "EEG HEADPHONES": A USER-FRIENDLY AUDITORY BRAIN-COMPUTER INTERFACE |
2001 Decoding neural representations of rhythmic sounds from magnetoencephalography Pei-Chun Chang, Jia-Ren Chang, Po-Yu Chen, Li-Kai Cheng, Jen-Chuen Hsieh, Hsin-Yen Yu, Li-Fen Chen, Yong-Sheng Chen 2001 | Decoding neural representations of rhythmic sounds from magnetoencephalography |
3602 DECOMPOSING TEXTURES USING EXPONENTIAL ANALYSIS Yuan Hou, Annie Cuyt, Wen-shin Lee, Deepayan Bhowmik 3602 | DECOMPOSING TEXTURES USING EXPONENTIAL ANALYSIS |
2185 Decouple the High-Frequency and Low-Frequency Information of Images for Semantic Segmentation lianlei shan, xiaobin li, weiqiang wang 2185 | Decouple the High-Frequency and Low-Frequency Information of Images for Semantic Segmentation |
4894 DECOUPLING PRONUNCIATION AND LANGUAGE FOR END-TO-END CODE-SWITCHING AUTOMATIC SPEECH RECOGNITION Shuai Zhang, Jiangyan Yi, Zhengkun Tian, Ye Bai, Jianhua Tao, Zhengqi Wen 4894 | DECOUPLING PRONUNCIATION AND LANGUAGE FOR END-TO-END CODE-SWITCHING AUTOMATIC SPEECH RECOGNITION |
1397 DEEP ACTIVE LEARNING APPROACH TO ADAPTIVE BEAMFORMING FOR MMWAVE INITIAL ALIGNMENT Foad Sohrabi, Zhilin Chen, Wei Yu 1397 | DEEP ACTIVE LEARNING APPROACH TO ADAPTIVE BEAMFORMING FOR MMWAVE INITIAL ALIGNMENT |
2224 DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL Yu Zhou, Yong Feng, Mingliang Zhou, Baohua Qiang, Leong Hou U, Jiajie Zhu 2224 | DEEP ADVERSARIAL QUANTIZATION NETWORK FOR CROSS-MODAL RETRIEVAL |
5620 Deep and Ordinal Ensemble Learning for Human Age Estimation From Facial Images Jiu-Cheng Xie, Chi-Man Pun 5620 | Deep and Ordinal Ensemble Learning for Human Age Estimation From Facial Images |
1931 Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition Hatef Otroshi, Sébastien Marcel 1931 | Deep Auto-Encoding and Biohashing for Secure Finger Vein Recognition |
3282 DEEP COLOR CONSTANCY USING TEMPORAL GRADIENT UNDER AC LIGHT SOURCES Jeong-Won Ha, Jun-Sang Yoo, Jong-Ok Kim 3282 | DEEP COLOR CONSTANCY USING TEMPORAL GRADIENT UNDER AC LIGHT SOURCES |
5039 DEEP CONVOLUTIONAL AND RECURRENT NETWORKS FOR POLYPHONIC INSTRUMENT CLASSIFICATION FROM MONOPHONIC RAW AUDIO WAVEFORMS Kleanthis Avramidis, Agelos Kratimenos, Christos Garoufis, Athanasia Zlatintsi, Petros Maragos 5039 | DEEP CONVOLUTIONAL AND RECURRENT NETWORKS FOR POLYPHONIC INSTRUMENT CLASSIFICATION FROM MONOPHONIC RAW AUDIO WAVEFORMS |
4120 Deep Convolutional Gaussian Processes for mmWave Outdoor Localization Xuyu Wang, Mohini Patil, Chao Yang, Shiwen Mao, Palak Anilkumar Patel 4120 | Deep Convolutional Gaussian Processes for mmWave Outdoor Localization |
3749 DEEP DETERMINISTIC INFORMATION BOTTLENECK WITH MATRIX-BASED ENTROPY FUNCTIONAL Xi Yu, Shujian Yu, Jose Principe 3749 | DEEP DETERMINISTIC INFORMATION BOTTLENECK WITH MATRIX-BASED ENTROPY FUNCTIONAL |
3693 DEEP ENSEMBLE SIAMESE NETWORK FOR INCREMENTAL SIGNAL CLASSIFICATION Chen Yang, shuyuan yang 3693 | DEEP ENSEMBLE SIAMESE NETWORK FOR INCREMENTAL SIGNAL CLASSIFICATION |
1097 DEEP GENERATIVE DEMIXING: ERROR BOUNDS FOR DEMIXING SUBGAUSSIAN MIXTURES OF LIPSCHITZ SIGNALS Aaron Berk 1097 | DEEP GENERATIVE DEMIXING: ERROR BOUNDS FOR DEMIXING SUBGAUSSIAN MIXTURES OF LIPSCHITZ SIGNALS |
2142 Deep Generative Model Learning for Blind Spectrum Cartography with NMF-based Radio Map Disaggregation Sagar Shrestha, Xiao Fu, Mingyi Hong 2142 | Deep Generative Model Learning for Blind Spectrum Cartography with NMF-based Radio Map Disaggregation |
4460 DEEP HASHING FOR MOTION CAPTURE DATA RETRIEVAL Na Lv, Ying Wang, Zhiquan Feng, Jingliang Peng 4460 | DEEP HASHING FOR MOTION CAPTURE DATA RETRIEVAL |
4697 DEEP LEARNING ARCHITECTURAL DESIGNS FOR SUPER-RESOLUTION OF NOISY IMAGES Angel Villar-Corrales, Franziska Schirrmacher, Christian Riess 4697 | DEEP LEARNING ARCHITECTURAL DESIGNS FOR SUPER-RESOLUTION OF NOISY IMAGES |
4847 DEEP LEARNING BASED HYBRID PRECODING IN DUAL-BAND COMMUNICATION SYSTEMS Rafail Ismayilov, Renato L. G. Cavalcante, Sławomir Stańczak 4847 | DEEP LEARNING BASED HYBRID PRECODING IN DUAL-BAND COMMUNICATION SYSTEMS |
1841 Deep Learning for Linear Inverse Problems Using the Plug-and-Play Priors Framework Wei Chen, David Wipf, Miguel Rodrigues 1841 | Deep Learning for Linear Inverse Problems Using the Plug-and-Play Priors Framework |
5035 DEEP LEARNING-BASED CROSS-LAYER RESOURCE ALLOCATION FOR WIRED COMMUNICATION SYSTEMS Pourya Behmandpoor, Jeroen Verdyck, Marc Moonen 5035 | DEEP LEARNING-BASED CROSS-LAYER RESOURCE ALLOCATION FOR WIRED COMMUNICATION SYSTEMS |
3368 DEEP LUNG AUSCULTATION USING ACOUSTIC BIOMARKERS FOR ABNORMAL RESPIRATORY SOUND EVENT DETECTION Upasana Tiwari, Swapnil Bhosale, Rupayan Chakraborty, Sunil Kumar Kopparapu 3368 | DEEP LUNG AUSCULTATION USING ACOUSTIC BIOMARKERS FOR ABNORMAL RESPIRATORY SOUND EVENT DETECTION |
5066 DEEP MULTI-FRAME MVDR FILTERING FOR SINGLE-MICROPHONE SPEECH ENHANCEMENT Marvin Tammen, Simon Doclo 5066 | DEEP MULTI-FRAME MVDR FILTERING FOR SINGLE-MICROPHONE SPEECH ENHANCEMENT |
4856 Deep Multiway Canonical Correlation Analysis for Multi-subject EEG Normalization Jaswanth Reddy Katthi, Sriram Ganapathy 4856 | Deep Multiway Canonical Correlation Analysis for Multi-subject EEG Normalization |
2712 Deep Neural Network based Cough Detection using Bed-mounted Accelerometer Measurements Madhurananda Pahar, Igor Miranda, Andreas Diacon, Thomas Niesler 2712 | Deep Neural Network based Cough Detection using Bed-mounted Accelerometer Measurements |
3505 DEEP NEURAL NETWORK EMBEDDINGS FOR THE ESTIMATION OF THE DEGREE OF SLEEPINESS José Vicente Egas-López, Gábor Gosztolya 3505 | DEEP NEURAL NETWORK EMBEDDINGS FOR THE ESTIMATION OF THE DEGREE OF SLEEPINESS |
4526 DEEP NEURAL NETWORKS WITH FLEXIBLE COMPLEXITY WHILE TRAINING BASED ON NEURAL ORDINARY DIFFERENTIAL EQUATIONS Zhengbo Luo, Sei-ichiro Kamata, Zitang Sun 4526 | DEEP NEURAL NETWORKS WITH FLEXIBLE COMPLEXITY WHILE TRAINING BASED ON NEURAL ORDINARY DIFFERENTIAL EQUATIONS |
2394 DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION Amir Ivry, Israel Cohen, Baruch Berdugo 2394 | DEEP RESIDUAL ECHO SUPPRESSION WITH A TUNABLE TRADEOFF BETWEEN SIGNAL DISTORTION AND ECHO SUPPRESSION |
1551 DEEP S3PR: SIMULTANEOUS SOURCE SEPARATION AND PHASE RETRIEVAL USING DEEP GENERATIVE MODELS Christopher Metzler, Gordon Wetzstein 1551 | DEEP S3PR: SIMULTANEOUS SOURCE SEPARATION AND PHASE RETRIEVAL USING DEEP GENERATIVE MODELS |
3717 DEEP SEMI-SUPERVISED METRIC LEARNING VIA IDENTIFICATION OF MANIFOLD MEMBERSHIPS Furen Zhuang, Pierre Moulin 3717 | DEEP SEMI-SUPERVISED METRIC LEARNING VIA IDENTIFICATION OF MANIFOLD MEMBERSHIPS |
2770 DEEP TRANSFORM AND METRIC LEARNING NETWORKS Wen Tang, Emilie Chouzenoux, Jean-Christophe Pesquet, Hamid Krim 2770 | DEEP TRANSFORM AND METRIC LEARNING NETWORKS |
5011 Deep Unfolding Network for Block-Sparse Signal Recovery Rong Fu, Vincent Monardo, Tianyao Huang, Yimin Liu 5011 | Deep Unfolding Network for Block-Sparse Signal Recovery |
1795 DEEP WEIGHTED MMSE DOWNLINK BEAMFORMING Lissy Pellaco, Mats Bengtsson, Joakim Jaldén 1795 | DEEP WEIGHTED MMSE DOWNLINK BEAMFORMING |
4179 DeepEmoCluster: A Semi-Supervised Framework for Latent Cluster Representation of Speech Emotions Wei-Cheng Lin, Kusha Sridhar, Carlos Busso 4179 | DeepEmoCluster: A Semi-Supervised Framework for Latent Cluster Representation of Speech Emotions |
2201 DEEPF0: END-TO-END FUNDAMENTAL FREQUENCY ESTIMATION FOR MUSIC AND SPEECH SIGNALS Satwinder Singh, Ruili Wang, Yuanhang Qiu 2201 | DEEPF0: END-TO-END FUNDAMENTAL FREQUENCY ESTIMATION FOR MUSIC AND SPEECH SIGNALS |
2401 DeepNodule: Multi-task Learning of Segmentation Bootstrap for Pulmonary Nodule Detection Jingqin Li, Kun Wang, Dan Yang, Xiaohong Zhang, Luwen Huangfu, Chen Liu 2401 | DeepNodule: Multi-task Learning of Segmentation Bootstrap for Pulmonary Nodule Detection |
4229 DEEPTALK: VOCAL STYLE ENCODING FOR SPEAKER RECOGNITION AND SPEECH SYNTHESIS Anurag Chowdhury, Arun Ross, Prabu David 4229 | DEEPTALK: VOCAL STYLE ENCODING FOR SPEAKER RECOGNITION AND SPEECH SYNTHESIS |
5195 DEFICIENT BASIS ESTIMATION OF NOISE SPATIAL COVARIANCE MATRIX FOR RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION METHOD IN BLIND SPEECH EXTRACTION Yuto Kondo, Yuki Kubo, Norihiro Takamune, Daichi Kitamura, Hiroshi Saruwatari 5195 | DEFICIENT BASIS ESTIMATION OF NOISE SPATIAL COVARIANCE MATRIX FOR RANK-CONSTRAINED SPATIAL COVARIANCE MATRIX ESTIMATION METHOD IN BLIND SPEECH EXTRACTION |
5217 DEFORMABLE CONVOLUTION DENSE NETWORK FOR COMPRESSED VIDEO QUALITY ENHANCEMENT jiahui liu, mingcai zhou, peng lu, meng xiao, wang yin 5217 | DEFORMABLE CONVOLUTION DENSE NETWORK FOR COMPRESSED VIDEO QUALITY ENHANCEMENT |
2327 DEMYSTIFYING MODEL AVERAGING FOR COMMUNICATION-EFFICIENT FEDERATED MATRIX FACTORIZATION Shuai Wang, Richard Cornelius Suwandi, Tsung-Hui Chang 2327 | DEMYSTIFYING MODEL AVERAGING FOR COMMUNICATION-EFFICIENT FEDERATED MATRIX FACTORIZATION |
1895 DENOISPEECH: DENOISING TEXT TO SPEECH WITH FRAME-LEVEL NOISE MODELING Chen Zhang, Yi Ren, Xu Tan, Jinglin Liu, Kejun Zhang, Tao Qin, Sheng Zhao, Tie-Yan Liu 1895 | DENOISPEECH: DENOISING TEXT TO SPEECH WITH FRAME-LEVEL NOISE MODELING |
3631 Dense attention module for accurate pulmonary nodule detection Jiannan Liu, Jie Li, Fanyong Xue, Chentao Wu 3631 | Dense attention module for accurate pulmonary nodule detection |
1399 Dense Feature Pyramid Grids Network for Single Image Deraining Zhen Wang, Cong Wang, Zhixun Su, Junyang Chen 1399 | Dense Feature Pyramid Grids Network for Single Image Deraining |
5448 DENSELY CONNECTED MULTI-STAGE MODEL WITH CHANNEL WISE SUBBAND FEATURE FOR REAL-TIME SPEECH ENHANCEMENT JingDong Li, Dawei Luo, Yun Liu, YuanYuan Zhu, Zhaoxia Li, Guohui Cui, Wenqi Tang, Wei Chen 5448 | DENSELY CONNECTED MULTI-STAGE MODEL WITH CHANNEL WISE SUBBAND FEATURE FOR REAL-TIME SPEECH ENHANCEMENT |
3547 DEPENDENCE-GUIDED MULTI-VIEW CLUSTERING Xia Dong, Danyang Wu, Feiping Nie, Rong Wang, Xuelong Li 3547 | DEPENDENCE-GUIDED MULTI-VIEW CLUSTERING |
2463 DEPRESSION DETECTION BY ANALYSING EYE MOVEMENTS ON EMOTIONAL IMAGES Ruizhe Shen, Qi Zhan, Yu Wang, Huimin Ma 2463 | DEPRESSION DETECTION BY ANALYSING EYE MOVEMENTS ON EMOTIONAL IMAGES |
5384 DESIGN OF GRAPH SIGNAL SAMPLING MATRICES FOR ARBITRARY SIGNAL SUBSPACES Junya Hara, Koki Yamada, Shunsuke Ono, Yuichi Tanaka 5384 | DESIGN OF GRAPH SIGNAL SAMPLING MATRICES FOR ARBITRARY SIGNAL SUBSPACES |
5575 DESIGNING RANDOM FM RADAR WAVEFORMS WITH COMPACT SPECTRUM Charles Mohr, Shannon Blunt 5575 | DESIGNING RANDOM FM RADAR WAVEFORMS WITH COMPACT SPECTRUM |
3788 DETECTING ACOUSTIC REFLECTORS USING A ROBOT’S EGO-NOISE Usama Saqib, Antoine Deleforge, Jesper Rindom Jensen 3788 | DETECTING ACOUSTIC REFLECTORS USING A ROBOT’S EGO-NOISE |
5382 DETECTING ADVERSARIAL ATTACKS ON AUDIOVISUAL SPEECH RECOGNITION Pingchuan Ma, Petridis Stavros, Maja Pantic 5382 | DETECTING ADVERSARIAL ATTACKS ON AUDIOVISUAL SPEECH RECOGNITION |
3416 DETECTING ALZHEIMER'S DISEASE FROM SPEECH USING NEURAL NETWORKS WITH BOTTLENECK FEATURES AND DATA AUGMENTATION Zhaoci Liu, Zhiqiang Guo, Zhenhua Ling, Yunxia Li 3416 | DETECTING ALZHEIMER'S DISEASE FROM SPEECH USING NEURAL NETWORKS WITH BOTTLENECK FEATURES AND DATA AUGMENTATION |
5389 DETECTING SIGNAL CORRUPTIONS IN VOICE RECORDINGS FOR SPEECH THERAPY Helmer Nylén, Saikat Chatterjee, Sten Ternström 5389 | DETECTING SIGNAL CORRUPTIONS IN VOICE RECORDINGS FOR SPEECH THERAPY |
2772 DETECTION OF AUDIO-VIDEO SYNCHRONIZATION ERRORS VIA EVENT DETECTION Joshua Ebenezer, Yongjun Wu, Hai Wei, Sriram Sethuraman, Zongyi Liu 2772 | DETECTION OF AUDIO-VIDEO SYNCHRONIZATION ERRORS VIA EVENT DETECTION |
2828 DETECTION OF COVID-19 THROUGH THE ANALYSIS OF VOCAL FOLD OSCILLATIONS Mahmoud Al Ismail, Soham Deshmukh, Rita Singh 2828 | DETECTION OF COVID-19 THROUGH THE ANALYSIS OF VOCAL FOLD OSCILLATIONS |
4167 DETECTION OF MALICIOUS DNS AND WEB SERVERS USING GRAPH-BASED APPROACHES Jinyuan Jia, Zheng Dong, Jie Li, Jack W. Stokes 4167 | DETECTION OF MALICIOUS DNS AND WEB SERVERS USING GRAPH-BASED APPROACHES |
2748 DETECTION OF POST-TRAUMATIC STRESS DISORDER USING LEARNED TIME-FREQUENCY REPRESENTATIONS FROM PUPILLOMETRY Bilal Taha, Megan KirK, Paul Ritvo, Dimitrios Hatzinakos 2748 | DETECTION OF POST-TRAUMATIC STRESS DISORDER USING LEARNED TIME-FREQUENCY REPRESENTATIONS FROM PUPILLOMETRY |
1892 DEVELOPING REAL-TIME STREAMING TRANSFORMER TRANSDUCER FOR SPEECH RECOGNITION ON LARGE-SCALE DATASET Xie Chen, Yu Wu, Zhenghao Wang, Shujie Liu, Jinyu Li 1892 | DEVELOPING REAL-TIME STREAMING TRANSFORMER TRANSDUCER FOR SPEECH RECOGNITION ON LARGE-SCALE DATASET |
3488 DEVELOPMENT OF THE CUHK ELDERLY SPEECH RECOGNITION SYSTEM FOR NEUROCOGNITIVE DISORDER DETECTION USING THE DEMENTIABANK CORPUS Zi Ye, Shoukang Hu, Jinchao Li, Xurong Xie, Mengzhe Geng, Jianwei Yu, Junhao Xu, Boyang Xue, Shansong Liu, Xunying Liu, Helen Meng 3488 | DEVELOPMENT OF THE CUHK ELDERLY SPEECH RECOGNITION SYSTEM FOR NEUROCOGNITIVE DISORDER DETECTION USING THE DEMENTIABANK CORPUS |
1943 DFDM: A DEEP FEATURE DECOUPLING MODULE FOR LUNG NODULE SEGMENTATION Wei Chen, Qiuli Wang, Sheng Huang, Xiaohong Zhang, Yucong Li, Chen Liu, Luwen Huangfu 1943 | DFDM: A DEEP FEATURE DECOUPLING MODULE FOR LUNG NODULE SEGMENTATION |
3985 DHASP: DIFFERENTIABLE HEARING AID SPEECH PROCESSING Zehai Tu, Ning Ma, Jon Barker 3985 | DHASP: DIFFERENTIABLE HEARING AID SPEECH PROCESSING |
3510 DHCN: DEEP HIERARCHICAL CONTEXT NETWORKS FOR IMAGE ANNOTATION Mingyuan Jiu, Hichem Sahbi 3510 | DHCN: DEEP HIERARCHICAL CONTEXT NETWORKS FOR IMAGE ANNOTATION |
3354 DIDISPEECH: A LARGE SCALE MANDARIN SPEECH CORPUS Tingwei Guo, Cheng Wen, Dongwei Jiang, Ne Luo, Ruixiong Zhang, Shuaijiang Zhao, Wubo Li, Cheng Gong, Wei Zou, Kun Han, Xiangang Li 3354 | DIDISPEECH: A LARGE SCALE MANDARIN SPEECH CORPUS |
2096 Differentiable Signal Processing With Black-Box Audio Effects Marco A Martínez Ramírez, Oliver Wang, Paris Smaragdis, Nicholas J Bryan 2096 | Differentiable Signal Processing With Black-Box Audio Effects |
2352 DIFFERENTIAL CHAOS SHIFT KEYING-BASED WIRELESS POWER TRANSFER Priyadarshi Mukherjee, Constantinos Psomas, Ioannis Krikidis 2352 | DIFFERENTIAL CHAOS SHIFT KEYING-BASED WIRELESS POWER TRANSFER |
1834 Differential Convolution Feature Guided Deep Multi-scale Multiple Instance Learning for Aerial Scene Classification Beichen Zhou, Jingjun Yi, Qi Bi 1834 | Differential Convolution Feature Guided Deep Multi-scale Multiple Instance Learning for Aerial Scene Classification |
4011 Dimension Selected Subspace Clustering Shuoyang Li, Yuhui Luo, Jonathon Chambers, Wenwu Wang 4011 | Dimension Selected Subspace Clustering |
3959 DIRECTION OF ARRIVAL ESTIMATION FOR NON-COHERENT SUB-ARRAYS VIA JOINT SPARSE AND LOW-RANK SIGNAL RECOVERY Tom Tirer, Oded Bialer 3959 | DIRECTION OF ARRIVAL ESTIMATION FOR NON-COHERENT SUB-ARRAYS VIA JOINT SPARSE AND LOW-RANK SIGNAL RECOVERY |
2547 DIRECTION PRESERVING WIND NOISE REDUCTION OF B-FORMAT SIGNALS Adrian Herzog, Daniele Mirabilii, Emanuël Habets 2547 | DIRECTION PRESERVING WIND NOISE REDUCTION OF B-FORMAT SIGNALS |
4060 DIRECTIONAL ASR: A NEW PARADIGM FOR E2E MULTI-SPEAKER SPEECH RECOGNITION WITH SOURCE LOCALIZATION Aswin Shanmugam Subramanian, Chao Weng, Shinji Watanabe, Meng Yu, Yong Xu, Shi-Xiong Zhang, Dong Yu 4060 | DIRECTIONAL ASR: A NEW PARADIGM FOR E2E MULTI-SPEAKER SPEECH RECOGNITION WITH SOURCE LOCALIZATION |
1410 DIRECTIONAL SPARSE FILTERING USING WEIGHTED LEHMER MEAN FOR BLIND SEPARATION OF UNBALANCED SPEECH MIXTURES Karn Watcharasupat, Anh H. T. Nguyen, Ching-Hui Ooi, Andy W. H. Khong 1410 | DIRECTIONAL SPARSE FILTERING USING WEIGHTED LEHMER MEAN FOR BLIND SEPARATION OF UNBALANCED SPEECH MIXTURES |
3916 DISCRETE COSINE TRANSFORM BASED CAUSAL CONVOLUTIONAL NEURAL NETWORK FOR DRIFT COMPENSATION IN CHEMICAL SENSORS Diaa Badawi, Agamyrat Agambayev, Sule Ozev, Ahmet Enis Cetin 3916 | DISCRETE COSINE TRANSFORM BASED CAUSAL CONVOLUTIONAL NEURAL NETWORK FOR DRIFT COMPENSATION IN CHEMICAL SENSORS |
3352 DISCRIMINABILITY OF SINGLE-LAYER GRAPH NEURAL NETWORKS Samuel Pfrommer, Fernando Gama, Alejandro Ribeiro 3352 | DISCRIMINABILITY OF SINGLE-LAYER GRAPH NEURAL NETWORKS |
5256 DISENTANGLED SPEAKER AND LANGUAGE REPRESENTATIONS USING MUTUAL INFORMATION MINIMIZATION AND DOMAIN ADAPTATION FOR CROSS-LINGUAL TTS Detai Xin, Tatsuya Komatsu, Shinnosuke Takamichi, Hiroshi Saruwatari 5256 | DISENTANGLED SPEAKER AND LANGUAGE REPRESENTATIONS USING MUTUAL INFORMATION MINIMIZATION AND DOMAIN ADAPTATION FOR CROSS-LINGUAL TTS |
2265 DISENTANGLEMENT FOR AUDIO-VISUAL EMOTION RECOGNITION USING MULTITASK SETUP Raghuveer Peri, Srinivas Parthasarathy, Charles Bradshaw, Shiva Sundaram 2265 | DISENTANGLEMENT FOR AUDIO-VISUAL EMOTION RECOGNITION USING MULTITASK SETUP |
1425 DISENTANGLING SUBJECT-DEPENDENT/-INDEPENDENT REPRESENTATIONSFOR 2D MOTION RETARGETING Fanglu Xie, Go Irie, Tatsushi Matsubayashi 1425 | DISENTANGLING SUBJECT-DEPENDENT/-INDEPENDENT REPRESENTATIONSFOR 2D MOTION RETARGETING |
2727 DISTRIBUTED SCHEDULING USING GRAPH NEURAL NETWORKS Zhongyuan Zhao, Gunjan Verma, Chirag Rao, Ananthram Swami, Santiago Segarra 2727 | DISTRIBUTED SCHEDULING USING GRAPH NEURAL NETWORKS |
4132 DISTRIBUTED SPEECH SEPARATION IN SPATIALLY UNCONSTRAINED MICROPHONE ARRAYS Nicolas Furnon, Romain Serizel, Irina Illina, Slim Essid 4132 | DISTRIBUTED SPEECH SEPARATION IN SPATIALLY UNCONSTRAINED MICROPHONE ARRAYS |
2225 DISTRIBUTION-AWARE HIERARCHICAL WEIGHTING METHOD FOR DEEP METRIC LEARNING Yinong Zhu, Yong Feng, Mingliang Zhou, Baohua Qiang, Leong Hou U, Jiajie Zhu 2225 | DISTRIBUTION-AWARE HIERARCHICAL WEIGHTING METHOD FOR DEEP METRIC LEARNING |
5329 DIVIDE AND CONQUER: ONE-BIT MIMO-OFDM DETECTION BY INEXACT EXPECTATION MAXIMIZATION Mingjie Shao, Wing-Kin Ma 5329 | DIVIDE AND CONQUER: ONE-BIT MIMO-OFDM DETECTION BY INEXACT EXPECTATION MAXIMIZATION |
1398 DNANet: Dense Nested Attention Network for Single Image Dehazing Dongdong Ren, Jinbao Li, Meng Han, Minglei Shu 1398 | DNANet: Dense Nested Attention Network for Single Image Dehazing |
4095 DNSMOS: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS Chandan Karadagur Ananda Reddy, Vishak Gopal, Ross Cutler 4095 | DNSMOS: A NON-INTRUSIVE PERCEPTUAL OBJECTIVE SPEECH QUALITY METRIC TO EVALUATE NOISE SUPPRESSORS |
4267 DO AS I MEAN, NOT AS I SAY: SEQUENCE LOSS TRAINING FOR SPOKEN LANGUAGE UNDERSTANDING Milind Rao, Pranav Dheram, Gautam Tiwari, Anirudh Raju, Jasha Droppo, Ariya Rastrow, Andreas Stolcke 4267 | DO AS I MEAN, NOT AS I SAY: SEQUENCE LOSS TRAINING FOR SPOKEN LANGUAGE UNDERSTANDING |
2603 DOA ESTIMATION OF A HIDDEN RF SOURCE EXPLOITING SIMPLE BACKSCATTER RADIO TAGS Georgios Vougioukas, Aggelos Bletsas 2603 | DOA ESTIMATION OF A HIDDEN RF SOURCE EXPLOITING SIMPLE BACKSCATTER RADIO TAGS |
1128 Domain Adaptation for Learning Generator from Paired Few-Shot Data Chun-Chih Teng, Pin-Yu Chen, Wei-Chen Chiu 1128 | Domain Adaptation for Learning Generator from Paired Few-Shot Data |
4344 DOMAIN-ADVERSARIAL AUTOENCODER WITH ATTENTION BASED FEATURE LEVEL FUSION FOR SPEECH EMOTION RECOGNITION Yuan Gao, Jiaxing Liu, Longbiao Wang, Jianwu Dang 4344 | DOMAIN-ADVERSARIAL AUTOENCODER WITH ATTENTION BASED FEATURE LEVEL FUSION FOR SPEECH EMOTION RECOGNITION |
1394 DOMAIN-AWARE NEURAL LANGUAGE MODELS FOR SPEECH RECOGNITION Linda Liu, Yile Gu, Aditya Gourav, Ankur Gandhe, Shashank Kalmane, Denis Filimonov, Ariya Rastrow, Ivan Bulyko 1394 | DOMAIN-AWARE NEURAL LANGUAGE MODELS FOR SPEECH RECOGNITION |
1255 DOMESTIC ACTIVITIES CLUSTERING FROM AUDIO RECORDINGS USING CONVOLUTIONAL CAPSULE AUTOENCODER NETWORK Ziheng Lin, Yanxiong Li, Zhangjin Huang, Wenhao Zhang, Yufeng Tan, Yichun Chen, Qianhua He 1255 | DOMESTIC ACTIVITIES CLUSTERING FROM AUDIO RECORDINGS USING CONVOLUTIONAL CAPSULE AUTOENCODER NETWORK |
5014 DON’T LOOK BACK: AN ONLINE BEAT TRACKING METHOD USING RNN AND ENHANCED PARTICLE FILTERING Mojtaba Heydari, Zhiyao Duan 5014 | DON’T LOOK BACK: AN ONLINE BEAT TRACKING METHOD USING RNN AND ENHANCED PARTICLE FILTERING |
1903 DON’T SHOOT BUTTERFLY WITH RIFLES: MULTI-CHANNEL CONTINUOUS SPEECH SEPARATION WITH EARLY EXIT TRANSFORMER Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jinyu Li, Xiangzhan Yu 1903 | DON’T SHOOT BUTTERFLY WITH RIFLES: MULTI-CHANNEL CONTINUOUS SPEECH SEPARATION WITH EARLY EXIT TRANSFORMER |
1837 Double Multi-Head Attention for Speaker Verification Miquel India Massana, Pooyan Safari, Javier Hernando 1837 | Double Multi-Head Attention for Speaker Verification |
2618 Double-DCCCAE: Estimation of Body Gestures from Speech Waveform JinHong Lu, TianHang Liu, Shuzhuang Xu, Hiroshi Shimodaira 2618 | Double-DCCCAE: Estimation of Body Gestures from Speech Waveform |
5524 DOUBLE-LINEAR THOMPSON SAMPLING FOR CONTEXT-ATTENTIVE BANDITS Djallel Bouneffouf, Raphael Feraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish 5524 | DOUBLE-LINEAR THOMPSON SAMPLING FOR CONTEXT-ATTENTIVE BANDITS |
5360 DP-SIGNSGD: WHEN EFFICIENCY MEETS PRIVACY AND ROBUSTNESS Lingjuan Lyu 5360 | DP-SIGNSGD: WHEN EFFICIENCY MEETS PRIVACY AND ROBUSTNESS |
1358 DP-VTON: TOWARD DETAIL-PRESERVING IMAGE-BASED VIRTUAL TRY-ON NETWORK Yuan Chang, Tao Peng, Ruhan He, Xinrong Hu, Junping Liu, Zili Zhang, Minghua Jiang 1358 | DP-VTON: TOWARD DETAIL-PRESERVING IMAGE-BASED VIRTUAL TRY-ON NETWORK |
1743 DrawGAN: Text to Image Synthesis with Drawing Generative Adversarial Networks Zhiqiang Zhang, Jinjia Zhou, Wenxin Yu, Ning Jiang 1743 | DrawGAN: Text to Image Synthesis with Drawing Generative Adversarial Networks |
1487 Drawing Order Recovery from Trajectory Components Minghao Yang, Xukang Zhou, Yangchang Sun, Jinglong Chen, Baohua Qiang 1487 | Drawing Order Recovery from Trajectory Components |
3658 DUAL METRIC DISCRIMINATOR FOR OPEN SET VIDEO DOMAIN ADAPTATION Yatian Wang, Xiaolin Song, Yezhen Wang, Pengfei Xu, Runbo Hu, Hua Chai 3658 | DUAL METRIC DISCRIMINATOR FOR OPEN SET VIDEO DOMAIN ADAPTATION |
3542 DUALFORMER: A UNIFIED BIDIRECTIONAL SEQUENCE-TO-SEQUENCE LEARNING Jen-Tzung Chien, Wei-Hsiang Chang 3542 | DUALFORMER: A UNIFIED BIDIRECTIONAL SEQUENCE-TO-SEQUENCE LEARNING |
5107 Dual-Path Modeling for Long Recording Speech Separation in Meetings Chenda Li, Zhuo Chen, Yi Luo, Cong Han, Tianyan Zhou, Keisuke Kinoshita, Marc Delcroix, Shinji Watanabe, Yanmin Qian 5107 | Dual-Path Modeling for Long Recording Speech Separation in Meetings |
1411 DUAL-STREAM NETWORK BASED ON GLOBAL GUIDANCE FOR SALIENT OBJECT DETECTION shuyong gao, qianyu guo, wei zhang, wenqiang zhang 1411 | DUAL-STREAM NETWORK BASED ON GLOBAL GUIDANCE FOR SALIENT OBJECT DETECTION |
4953 DURAS: Deep Unfolded Radar Sensing Using Doppler Focusing Pranav Goyal, Satish Mulleti, Anubha Gupta, Yonina Eldar 4953 | DURAS: Deep Unfolded Radar Sensing Using Doppler Focusing |
2198 D-VDAMP: DENOISING-BASED APPROXIMATE MESSAGE PASSING FOR COMPRESSIVE MRI Christopher Metzler, Gordon Wetzstein 2198 | D-VDAMP: DENOISING-BASED APPROXIMATE MESSAGE PASSING FOR COMPRESSIVE MRI |
3795 DYNAMIC CURRICULUM LEARNING VIA DATA PARAMETERS FOR NOISE ROBUST KEYWORD SPOTTING Takuya Higuchi, Shreyas Saxena, Mehrez Souden, Tien Dung Tran, Masood Delfarah, Chandra Dhir 3795 | DYNAMIC CURRICULUM LEARNING VIA DATA PARAMETERS FOR NOISE ROBUST KEYWORD SPOTTING |
1722 Dynamic Graph Learning based on Graph Laplacian Bo Jiang, Yiyi Yu, Hamid Krim, Spencer Smith 1722 | Dynamic Graph Learning based on Graph Laplacian |
4787 DYNAMIC GRAPH MODELING OF SIMULTANEOUS EEG AND EYE-TRACKING DATA FOR READING TASK IDENTIFICATION Puneet Mathur, Trisha Mittal, Dinesh Manocha 4787 | DYNAMIC GRAPH MODELING OF SIMULTANEOUS EEG AND EYE-TRACKING DATA FOR READING TASK IDENTIFICATION |
5118 DYNAMIC POINT CLOUD COMPRESSION USING A CUBOID ORIENTED DISCRETE COSINE BASED MOTION MODEL Ashek Ahmmed, Manoranjan Paul, Manzur Murshed, David Taubman 5118 | DYNAMIC POINT CLOUD COMPRESSION USING A CUBOID ORIENTED DISCRETE COSINE BASED MOTION MODEL |
3870 DYNAMIC RESOURCE OPTIMIZATION FOR ADAPTIVE FEDERATED LEARNING AT THE WIRELESS NETWORK EDGE Paolo Di Lorenzo, Claudio Battiloro, Mattia Merluzzi, Sergio Barbarossa 3870 | DYNAMIC RESOURCE OPTIMIZATION FOR ADAPTIVE FEDERATED LEARNING AT THE WIRELESS NETWORK EDGE |
2788 Dynamic Sparsity Neural Networks for Automatic Speech Recognition Zhaofeng Wu, Ding Zhao, Qiao Liang, Jiahui Yu, Anmol Gulati, Ruoming Pang 2788 | Dynamic Sparsity Neural Networks for Automatic Speech Recognition |
1131 DYNAMIC TEXTURE RECOGNITION VIA NUCLEAR DISTANCES ON KERNELIZED SCATTERING HISTOGRAM SPACES Alexander Sagel, Julian Wörmann, Hao Shen 1131 | DYNAMIC TEXTURE RECOGNITION VIA NUCLEAR DISTANCES ON KERNELIZED SCATTERING HISTOGRAM SPACES |
3067 EADNET: EFFICIENT ASYMMETRIC DILATED NETWORK FOR SEMANTIC SEGMENTATION Qihang Yang, Tao Chen, Jiayuan Fan, Ye Lu, Chongyan Zuo, Qinghua Chi 3067 | EADNET: EFFICIENT ASYMMETRIC DILATED NETWORK FOR SEMANTIC SEGMENTATION |
4781 EAT: ENHANCED ASR-TTS FOR SELF-SUPERVISED SPEECH RECOGNITION Murali Karthick Baskar, Lukas Burget, Shinji Watanabe, Ramon Astudillo, Jan ``Honza'' \v{C}ernock\'{y} 4781 | EAT: ENHANCED ASR-TTS FOR SELF-SUPERVISED SPEECH RECOGNITION |
2170 ECCL: EXPLICIT CORRELATION-BASED CONVOLUTION BOUNDARY LOCATOR FOR MOMENT LOCALIZATION Xinfang Liu, Xiushan Nie, Junya Teng, Fanchang Hao, Yilong Yin 2170 | ECCL: EXPLICIT CORRELATION-BASED CONVOLUTION BOUNDARY LOCATOR FOR MOMENT LOCALIZATION |
2174 ECG HEART-BEAT CLASSIFICATION USING MULTIMODAL IMAGE FUSION ZEESHAN AHMAD, Anika Tabassum, Ling Guan, Naimul Khan 2174 | ECG HEART-BEAT CLASSIFICATION USING MULTIMODAL IMAGE FUSION |
4561 ECHO STATE SPEECH RECOGNITION Harsh Shrivastava, Ankush Garg, Yuan Cao, Yu Zhang, Tara Sainath 4561 | ECHO STATE SPEECH RECOGNITION |
5373 EDGE-AWARE MULTI-SCALE PROGRESSIVE COLORIZATION Jun Xia, Guanghua Tan, Yi Xiao, Fangqiang Xu, Chi-Sing Leung 5373 | EDGE-AWARE MULTI-SCALE PROGRESSIVE COLORIZATION |
2878 EEG-BASED EMOTION CLASSIFICATION USING GRAPH SIGNAL PROCESSING Seyed Saman Saboksayr, Gonzalo Mateos, Mujdat Cetin 2878 | EEG-BASED EMOTION CLASSIFICATION USING GRAPH SIGNAL PROCESSING |
2583 Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms Babak Naderi, Gabriel Mittag, Rafael Zequeira Jiménez, Sebastian Möller 2583 | Effect of Language Proficiency on Subjective Evaluation of Noise Suppression Algorithms |
5450 EFFECT OF NOISE AND MODEL COMPLEXITY ON DETECTION OF AMYOTROPHIC LATERAL SCLEROSIS AND PARKINSON’S DISEASE USING PITCH AND MFCC Tanuka Bhattacharjee, Jhansi Mallela, Yamini Belur, Nalini Atchayaram, Ravi Yadav, Pradeep Reddy, Dipanjan Gope, Prasanta Kumar Ghosh 5450 | EFFECT OF NOISE AND MODEL COMPLEXITY ON DETECTION OF AMYOTROPHIC LATERAL SCLEROSIS AND PARKINSON’S DISEASE USING PITCH AND MFCC |
4272 EFFECT OF VIDEO PIXEL-BINNING ON SOURCE ATTRIBUTION OF MIXED MEDIA Samet Taspinar, Manoranjan Mohanty, Nasir Memon 4272 | EFFECT OF VIDEO PIXEL-BINNING ON SOURCE ATTRIBUTION OF MIXED MEDIA |
5161 EFFECTIVE RANK-BASED ESTIMATION OF THE COHERENT-TO-DIFFUSE POWER RATIO Heinrich Loellmann, Andreas Brendel, Walter Kellermann 5161 | EFFECTIVE RANK-BASED ESTIMATION OF THE COHERENT-TO-DIFFUSE POWER RATIO |
4889 Efficient Adversarial Audio Synthesis via Progressive Upsampling Youngwoo Cho, Minwook Chang, Sanghyeon Lee, Hyoungwoo Lee, Gerard Jounghyun Kim, Jaegul Choo 4889 | Efficient Adversarial Audio Synthesis via Progressive Upsampling |
3512 EFFICIENT CLIENT CONTRIBUTION EVALUATION FOR HORIZONTAL FEDERATED LEARNING Jie Zhao, Xinghua Zhu, Jianzong Wang, Jing Xiao 3512 | EFFICIENT CLIENT CONTRIBUTION EVALUATION FOR HORIZONTAL FEDERATED LEARNING |
4144 EFFICIENT END-TO-END AUDIO EMBEDDINGS GENERATION FOR AUDIO CLASSIFICATION ON TARGET APPLICATIONS Paulo Lopez-Meyer, Juan A. Del Hoyo Ontiveros, Hong Lu, Georg Stemmer 4144 | EFFICIENT END-TO-END AUDIO EMBEDDINGS GENERATION FOR AUDIO CLASSIFICATION ON TARGET APPLICATIONS |
2909 EFFICIENT FACE MANIPULATION VIA DEEP FEATURE DISENTANGLEMENT AND REINTEGRATION NET Bin Cheng, Tao Dai, Bin Chen, Shutao Xia, Xiu Li 2909 | EFFICIENT FACE MANIPULATION VIA DEEP FEATURE DISENTANGLEMENT AND REINTEGRATION NET |
4182 EFFICIENT KNOWLEDGE DISTILLATION FOR RNN-TRANSDUCER MODELS SANKARAN PANCHAPAGESAN, Daniel Park, Chung-Cheng Chiu, Yuan Shangguan, Qiao Liang, Alexander Gruenstein 4182 | EFFICIENT KNOWLEDGE DISTILLATION FOR RNN-TRANSDUCER MODELS |
1226 EFFICIENT LONG PERIODIC BINARY SEQUENCE DESIGNS FOR AUTOMOTIVE RADAR Yutao Chen, Ronghao Lin, Jian Li 1226 | EFFICIENT LONG PERIODIC BINARY SEQUENCE DESIGNS FOR AUTOMOTIVE RADAR |
4535 EFFICIENT MIGRATION TO THE NEXT GENERATION OF NETWORKS BASED ON DIGITAL ANNEALING Mohammad Javad-Kalbasi, Shahrokh Valaee 4535 | EFFICIENT MIGRATION TO THE NEXT GENERATION OF NETWORKS BASED ON DIGITAL ANNEALING |
3079 EFFICIENT MULTI-OBJECTIVE GANS FOR IMAGE RESTORATION Jingwen Su, Hujun Yin 3079 | EFFICIENT MULTI-OBJECTIVE GANS FOR IMAGE RESTORATION |
4107 EFFICIENT NETWORK PROTECTION GAMES AGAINST MULTIPLE TYPES OF STRATEGIC ATTACKERS Zhifan Xu, Melike Baykal-Gursoy 4107 | EFFICIENT NETWORK PROTECTION GAMES AGAINST MULTIPLE TYPES OF STRATEGIC ATTACKERS |
2780 EFFICIENT POWER ALLOCATION USING GRAPH NEURAL NETWORKS AND DEEP ALGORITHM UNFOLDING Arindam Chowdhury, Gunjan Verma, Chirag Rao, Ananthram Swami, Santiago Segarra 2780 | EFFICIENT POWER ALLOCATION USING GRAPH NEURAL NETWORKS AND DEEP ALGORITHM UNFOLDING |
1671 EFFICIENT REAL-TIME VIDEO STABILIZATION WITH A NOVEL LEAST SQUARES FORMULATION Jianwei Ke, Alex Watras, Jae-Jun Kim, Hewei Liu, Hongrui Jiang, Yu Hen Hu 1671 | EFFICIENT REAL-TIME VIDEO STABILIZATION WITH A NOVEL LEAST SQUARES FORMULATION |
1431 EFFICIENT SPEECH EMOTION RECOGNITION USING MULTI-SCALE CNN AND ATTENTION Zixuan Peng, Yu Lu, Shengfeng Pan, Yunfeng Liu 1431 | EFFICIENT SPEECH EMOTION RECOGNITION USING MULTI-SCALE CNN AND ATTENTION |
1958 EFFICIENT TRAINING DATA GENERATION FOR PHASE-BASED DOA ESTIMATION Fabian Hübner, Wolfgang Mack, Emanuël Habets 1958 | EFFICIENT TRAINING DATA GENERATION FOR PHASE-BASED DOA ESTIMATION |
3134 Efficient Use of End-to-end Data in Spoken Language Processing Yiting Lu, Yu Wang, Mark Gales 3134 | Efficient Use of End-to-end Data in Spoken Language Processing |
3666 EGO-BASED ENTROPY MEASURES FOR STRUCTURAL REPRESENTATIONS ON GRAPHS George Dasoulas, Giannis Nikolentzos, Kevin Scaman, Aladin Virmaux, Michalis Vazirgiannis 3666 | EGO-BASED ENTROPY MEASURES FOR STRUCTURAL REPRESENTATIONS ON GRAPHS |
4094 EGO-GNNS: EXPLOITING EGO STRUCTURES IN GRAPH NEURAL NETWORKS Dylan Sandfelder, Priyesh Vijayan, William Hamilton 4094 | EGO-GNNS: EXPLOITING EGO STRUCTURES IN GRAPH NEURAL NETWORKS |
5616 EIGENVECTORS OF ORDINARY, GENERALIZED, CENTERED AND OFFSET DISCRETE FOURIER TRANSFORMS BASED ON LOOKUP TABLE METHODS: EFFICIENCY AND APPROXIMATION USES Wen-Liang Hsue 5616 | EIGENVECTORS OF ORDINARY, GENERALIZED, CENTERED AND OFFSET DISCRETE FOURIER TRANSFORMS BASED ON LOOKUP TABLE METHODS: EFFICIENCY AND APPROXIMATION USES |
4506 EKFNET: LEARNING SYSTEM NOISE STATISTICS FROM MEASUREMENT DATA Liang Xu, Ruixin Niu 4506 | EKFNET: LEARNING SYSTEM NOISE STATISTICS FROM MEASUREMENT DATA |
5068 ELBERT: FAST ALBERT WITH CONFIDENCE-WINDOW BASED EARLY EXIT Keli Xie, Siyuan Lu, Meiqi Wang, Zhongfeng Wang 5068 | ELBERT: FAST ALBERT WITH CONFIDENCE-WINDOW BASED EARLY EXIT |
5483 Elliptical Shape Recovery from Blurred Pixels using Deep Learning Hojatollah Zamani, Peyman Rostami, Arash Amini, Farokh Marvasti 5483 | Elliptical Shape Recovery from Blurred Pixels using Deep Learning |
1831 Embedding Semantic Hierarchy in Discrete Optimal Transport for Risk Minimization Xiaofeng Liu, Yubin Ge, Xuyang Li, Wanqing Xie, Fangfang Fan, Jane You 1831 | Embedding Semantic Hierarchy in Discrete Optimal Transport for Risk Minimization |
1684 EMFORMER: EFFICIENT MEMORY TRANSFORMER BASED ACOUSTIC MODEL FORLOW LATENCY STREAMING SPEECH RECOGNITION Yangyang Shi, Yongqiang Wang, Chunyang Wu, Ching-Feng Yeh, Julian Chan, Frank Zhang, Duc Le, Mike Seltzer 1684 | EMFORMER: EFFICIENT MEMORY TRANSFORMER BASED ACOUSTIC MODEL FORLOW LATENCY STREAMING SPEECH RECOGNITION |
1863 EMOTION CONTROLLABLE SPEECH SYNTHESIS USING EMOTION-UNLABELED DATASET WITH THE ASSISTANCE OF CROSS-DOMAIN SPEECH EMOTION RECOGNITION Xiong Cai, Dongyang Dai, Zhiyong Wu, Xiang Li, Jingbei Li, Helen Meng 1863 | EMOTION CONTROLLABLE SPEECH SYNTHESIS USING EMOTION-UNLABELED DATASET WITH THE ASSISTANCE OF CROSS-DOMAIN SPEECH EMOTION RECOGNITION |
2923 EMOTION RECOGNITION BY FUSING TIME SYNCHRONOUS AND TIME ASYNCHRONOUS REPRESENTATIONS Wen Wu, Chao Zhang, Philip C. Woodland, 2923 | EMOTION RECOGNITION BY FUSING TIME SYNCHRONOUS AND TIME ASYNCHRONOUS REPRESENTATIONS |
3479 EMPIRICALLY ACCELERATING SCALED GRADIENT PROJECTION USING DEEP NEURAL NETWORK FOR INVERSE PROBLEMS IN IMAGE PROCESSING Byung Hyun Lee, Se Young Chun 3479 | EMPIRICALLY ACCELERATING SCALED GRADIENT PROJECTION USING DEEP NEURAL NETWORK FOR INVERSE PROBLEMS IN IMAGE PROCESSING |
3110 Enabling Efficient and Expressive Spatial Keyword Queries on Encrypted Data Xiangyu Wang, Jianfeng Ma, Ximeng Liu 3110 | Enabling Efficient and Expressive Spatial Keyword Queries on Encrypted Data |
4524 ENCODER-DECODER BASED PITCH TRACKING AND JOINT MODEL TRAINING FOR MANDARIN TONE CLASSIFICATION Hao Huang, Kai Wang, Ying Hu, Sheng Li 4524 | ENCODER-DECODER BASED PITCH TRACKING AND JOINT MODEL TRAINING FOR MANDARIN TONE CLASSIFICATION |
1550 END TO END LEARNING FOR CONVOLUTIVE MULTI-CHANNEL WIENER FILTERING Masahito Togami 1550 | END TO END LEARNING FOR CONVOLUTIVE MULTI-CHANNEL WIENER FILTERING |
3182 END2END ACOUSTIC TO SEMANTIC TRANSDUCTION Valentin Pelloin, Nathalie Camelin, Antoine Laurent, Renato De Mori, Antoine Caubrière, Yannick Estève, Sylvain Meignier 3182 | END2END ACOUSTIC TO SEMANTIC TRANSDUCTION |
2383 END-2-END MODELING OF SPEECH AND GAIT FROM PATIENTS WITH PARKINSON'S DISEASE: COMPARISON BETWEEN HIGH QUALITY VS. SMARTPHONE DATA Juan Camilo Vasquez-Correa, Tomás Arias-Vergara, Philipp Klumpp, Paula Andrea Perez-Toro, Juan Rafael Orozco-Arroyave, Elmar Nöth 2383 | END-2-END MODELING OF SPEECH AND GAIT FROM PATIENTS WITH PARKINSON'S DISEASE: COMPARISON BETWEEN HIGH QUALITY VS. SMARTPHONE DATA |
3729 END-TO-END ANTI-SPOOFING WITH RAWNET2 Hemlata Tak, Jose Patino, Massimiliano Todisco, Andreas Nautsch, Nicholas Evans, Anthony Larcher 3729 | END-TO-END ANTI-SPOOFING WITH RAWNET2 |
4989 END-TO-END AUDIO-VISUAL SPEECH RECOGNITION WITH CONFORMERS Pingchuan Ma, Stavros Petridis, Maja Pantic 4989 | END-TO-END AUDIO-VISUAL SPEECH RECOGNITION WITH CONFORMERS |
3384 END-TO-END DEREVERBERATION, BEAMFORMING, AND SPEECH RECOGNITION WITH IMPROVED NUMERICAL STABILITY AND ADVANCED FRONTEND Wangyou Zhang, Christoph Boeddeker, Shinji Watanabe, Tomohiro Nakatani, Marc Delcroix, Keisuke Kinoshita, Tsubasa Ochiai, Naoyuki Kamo, Reinhold Haeb-Umbach, Yanmin Qian 3384 | END-TO-END DEREVERBERATION, BEAMFORMING, AND SPEECH RECOGNITION WITH IMPROVED NUMERICAL STABILITY AND ADVANCED FRONTEND |
4614 END-TO-END DIARIZATION FOR VARIABLE NUMBER OF SPEAKERS WITH LOCAL-GLOBAL NETWORKS AND DISCRIMINATIVE SPEAKER EMBEDDINGS Soumi Maiti, Hakan Erdogan, Kevin Wilson, Scott Wisdom, Shinji Watanabe, John Hershey 4614 | END-TO-END DIARIZATION FOR VARIABLE NUMBER OF SPEAKERS WITH LOCAL-GLOBAL NETWORKS AND DISCRIMINATIVE SPEAKER EMBEDDINGS |
3887 End-to-end learning of variational models and solvers for the resolution of interpolation problems ronan fablet, Lucas Drumetz, Francois Rousseau 3887 | End-to-end learning of variational models and solvers for the resolution of interpolation problems |
4691 End-to-end lyrics Recognition with Voice to Singing Style Transfer Sakya Basak, Shrutina Agarwal, Sriram Ganapathy, Naoya Takahashi 4691 | End-to-end lyrics Recognition with Voice to Singing Style Transfer |
5246 END-TO-END MULTI-ACCENT SPEECH RECOGNITION WITH UNSUPERVISED ACCENT MODELLING song li, beibei ouyang, dexin liao, shipeng xia, lin li, qingyang hong 5246 | END-TO-END MULTI-ACCENT SPEECH RECOGNITION WITH UNSUPERVISED ACCENT MODELLING |
2064 END-TO-END MULTI-CHANNEL TRANSFORMER FOR SPEECH RECOGNITION Feng-Ju Chang, Martin Radfar, Athanasios Mouchtaris, Brian King, Siegfried Kunzmann 2064 | END-TO-END MULTI-CHANNEL TRANSFORMER FOR SPEECH RECOGNITION |
3459 END-TO-END MULTILINGUAL AUTOMATIC SPEECH RECOGNITION FOR LESS-RESOURCED LANGUAGES: THE CASE OF FOUR ETHIOPIAN LANGUAGES Solomon Teferra Abate, Martha Yifiru Tachbelie, Tanja Schultz 3459 | END-TO-END MULTILINGUAL AUTOMATIC SPEECH RECOGNITION FOR LESS-RESOURCED LANGUAGES: THE CASE OF FOUR ETHIOPIAN LANGUAGES |
1408 END-TO-END SPEAKER DIARIZATION AS POST-PROCESSING Shota Horiguchi, Paola Garcia, Yusuke Fujita, Shinji Watanabe, Kenji Nagamatsu 1408 | END-TO-END SPEAKER DIARIZATION AS POST-PROCESSING |
4343 END-TO-END SPOKEN LANGUAGE UNDERSTANDING USING TRANSFORMER NETWORKS AND SELF-SUPERVISED PRE-TRAINED FEATURES EDMILSON MORAIS, Hong Kwang J Kuo, Samuel Thomas, Zoltan Tuske, Brian Kingsbury 4343 | END-TO-END SPOKEN LANGUAGE UNDERSTANDING USING TRANSFORMER NETWORKS AND SELF-SUPERVISED PRE-TRAINED FEATURES |
2872 END-TO-END TEXT-TO-SPEECH USING LATENT DURATION BASED ON VQ-VAE Yusuke Yasuda, Xin Wang, Junichi Yamagishi 2872 | END-TO-END TEXT-TO-SPEECH USING LATENT DURATION BASED ON VQ-VAE |
3474 ENERGY EFFICIENCY OPTIMIZATION TECHNIQUE FOR SWIPT-ENABLED MULTI-GROUP MULTICASTING SYSTEMS WITH HETEROGENEOUS USERS Sumit Gautam, Symeon Chatzinotas, Bjorn Ottersten 3474 | ENERGY EFFICIENCY OPTIMIZATION TECHNIQUE FOR SWIPT-ENABLED MULTI-GROUP MULTICASTING SYSTEMS WITH HETEROGENEOUS USERS |
2804 Energy Minimization for Federated Learning with IRS-Assisted Over-the-Air Computation Yuntao Hu, Ming Chen, Mingzhe Chen, Zhaohui Yang, Mohammad Shikh-Bahaei, H. Vincent Poor, Shuguang Cui 2804 | Energy Minimization for Federated Learning with IRS-Assisted Over-the-Air Computation |
3493 Enhanced Automotive Target Detection through Radar and Communications Sensor Fusion Sayed Hossein Dokhanchi, bhavani shankar mysore, kumar vijay mishra, bjorn ottersten 3493 | Enhanced Automotive Target Detection through Radar and Communications Sensor Fusion |
1320 ENHANCED BLIND CALIBRATION OF UNIFORM LINEAR ARRAYS WITH ONE-BIT QUANTIZATION BY KULLBACK-LEIBLER DIVERGENCE COVARIANCE FITTING Amir Weiss, Arie Yeredor 1320 | ENHANCED BLIND CALIBRATION OF UNIFORM LINEAR ARRAYS WITH ONE-BIT QUANTIZATION BY KULLBACK-LEIBLER DIVERGENCE COVARIANCE FITTING |
3974 ENHANCED STANDARD ESPRIT FOR OVERCOMING IMPERFECTIONS IN DOA ESTIMATION Majdoddin Esfandiari, Sergiy A. Vorobyov 3974 | ENHANCED STANDARD ESPRIT FOR OVERCOMING IMPERFECTIONS IN DOA ESTIMATION |
4052 ENHANCING AUDIO AUGMENTATION METHODS WITH CONSISTENCY LEARNING Turab Iqbal, Karim Helwani, Arvindh Krishnaswamy, Wenwu Wang 4052 | ENHANCING AUDIO AUGMENTATION METHODS WITH CONSISTENCY LEARNING |
4747 ENHANCING DATA-FREE ADVERSARIAL DISTILLATION WITH ACTIVATION REGULARIZATION AND VIRTUAL INTERPOLATION Xiaoyang Qu, Jianzong Wang, Jing Xiao 4747 | ENHANCING DATA-FREE ADVERSARIAL DISTILLATION WITH ACTIVATION REGULARIZATION AND VIRTUAL INTERPOLATION |
2734 Enhancing Deep Paraphrase Identification via Leveraging Word Alignment Information Boxin Li, Tingwen Liu, Bin Wang, Lihong Wang 2734 | Enhancing Deep Paraphrase Identification via Leveraging Word Alignment Information |
1280 ENHANCING IMAGE STEGANOGRAPHY VIA STEGO GENERATION AND SELECTION Tingting Song, Minglin Liu, Weiqi Luo, Peijia Zheng 1280 | ENHANCING IMAGE STEGANOGRAPHY VIA STEGO GENERATION AND SELECTION |
4682 ENHANCING INTO THE CODEC: NOISE ROBUST SPEECH CODING WITH VECTOR-QUANTIZED AUTOENCODERS Jonah Casebeer, Vinjai Vale, Umut Isik, Jean-Marc Valin, Ritwik Giri, Arvindh Krishnaswamy 4682 | ENHANCING INTO THE CODEC: NOISE ROBUST SPEECH CODING WITH VECTOR-QUANTIZED AUTOENCODERS |
1563 ENHANCING MODEL ROBUSTNESS BY INCORPORATING ADVERSARIAL KNOWLEDGE INTO SEMANTIC REPRESENTATION Jinfeng Li, Tianyu Du, Xiangyu Liu, Rong Zhang, Hui Xue, Shouling Ji 1563 | ENHANCING MODEL ROBUSTNESS BY INCORPORATING ADVERSARIAL KNOWLEDGE INTO SEMANTIC REPRESENTATION |
3649 ENHANCING MULTI-CHANNEL EEG CLASSIFICATION WITH GRAMIAN TEMPORAL GENERATIVE ADVERSARIAL NETWORKS Chi Nok Enoch Kan, Richard Povinelli, Dong Hye Ye 3649 | ENHANCING MULTI-CHANNEL EEG CLASSIFICATION WITH GRAMIAN TEMPORAL GENERATIVE ADVERSARIAL NETWORKS |
1576 ENSEMBLE COMBINATION BETWEEN DIFFERENT TIME SEGMENTATIONS Jeremy Heng Meng Wong, Dimitrios Dimitriadis, Kenichi Kumatani, Yashesh Gaur, George Polovets, Partha Parthasarathy, Eric Sun, Jinyu Li, Yifan Gong 1576 | ENSEMBLE COMBINATION BETWEEN DIFFERENT TIME SEGMENTATIONS |
2740 ENSEMBLE DISTILLATION APPROACHES FOR GRAMMATICAL ERROR CORRECTION Yassir Fathullah, Mark Gales, Andrey Malinin 2740 | ENSEMBLE DISTILLATION APPROACHES FOR GRAMMATICAL ERROR CORRECTION |
1608 ENSEMBLING OBJECT DETECTORS FOR IMAGE AND VIDEO DATA ANALYSIS Kateryna Chumachenko, Jenni Raitoharju, Alexandros Iosifidis, Moncef Gabbouj 1608 | ENSEMBLING OBJECT DETECTORS FOR IMAGE AND VIDEO DATA ANALYSIS |
3953 ENSURE: Ensemble Stein's Unbiased Risk Estimator for Unsupervised Learning Hemant Kumar Aggarwal, Aniket Pramanik, Mathews Jacob 3953 | ENSURE: Ensemble Stein's Unbiased Risk Estimator for Unsupervised Learning |
2345 ENVIRONMENT-INDEPENDENT WI-FI HUMAN ACTIVITY RECOGNITION WITH ADVERSARIAL NETWORK Zhengyang Wang, Sheng Chen, Wei Yang, Yang Xu 2345 | ENVIRONMENT-INDEPENDENT WI-FI HUMAN ACTIVITY RECOGNITION WITH ADVERSARIAL NETWORK |
4795 ERROR ESTIMATES IN SECOND-ORDER CONTINUOUS-TIME SIGMA-DELTA MODULATORS Dilshad Surroop, Pascal Combes, Philippe Martin 4795 | ERROR ESTIMATES IN SECOND-ORDER CONTINUOUS-TIME SIGMA-DELTA MODULATORS |
5425 ERROR-DRIVEN FIXED-BUDGET PERSONALIZATION FOR ACCENTED SPEAKERS Abhijeet Awasthi, Aman Kansal, Sunita Sarawagi, Preethi Jyothi 5425 | ERROR-DRIVEN FIXED-BUDGET PERSONALIZATION FOR ACCENTED SPEAKERS |
3062 Error-driven Pruning of Language Models for Virtual Assistants Sashank Gondala, Lyan Verwimp, Ernest Pusateri, Manos Tsagkias, Christophe Van Gysel 3062 | Error-driven Pruning of Language Models for Virtual Assistants |
3538 Estimating Fiedler value on large networks based on random walk observations Alexandre Reiffers-Masson, Thierry Chonavel, Yezekael Hayel 3538 | Estimating Fiedler value on large networks based on random walk observations |
5604 ESTIMATING NETWORK PROCESSES VIA BLIND IDENTIFICATION OF MULTIPLE GRAPH FILTERS Yu Zhu, Fernando J. Iglesias Garcia, Antonio G. Marques, Santiago Segarra 5604 | ESTIMATING NETWORK PROCESSES VIA BLIND IDENTIFICATION OF MULTIPLE GRAPH FILTERS |
4566 ESTIMATING SEVERITY OF DEPRESSION FROM ACOUSTIC FEATURES AND EMBEDDINGS OF NATURAL SPEECH Sri Harsha Dumpala, Sheri Rempel, Katerina Dikaios, Mehri Sajjadian, Rudolf Uher, Sageev Oore 4566 | ESTIMATING SEVERITY OF DEPRESSION FROM ACOUSTIC FEATURES AND EMBEDDINGS OF NATURAL SPEECH |
5072 ESTIMATION OF GROUNDWATER STORAGE VARIATIONS IN INDUS RIVER BASIN USING GRACE DATA Yahya Sattar, Zubair Khalid 5072 | ESTIMATION OF GROUNDWATER STORAGE VARIATIONS IN INDUS RIVER BASIN USING GRACE DATA |
2654 ESTIMATION OF MICROPHONE CLUSTERS IN ACOUSTIC SENSOR NETWORKS USING UNSUPERVISED FEDERATED LEARNING Alexandru Nelus, Rene Glitza, Rainer Martin 2654 | ESTIMATION OF MICROPHONE CLUSTERS IN ACOUSTIC SENSOR NETWORKS USING UNSUPERVISED FEDERATED LEARNING |
5060 ESTIMATION OF VISUAL FEATURES OF VIEWED IMAGE FROM INDIVIDUAL AND SHARED BRAIN INFORMATION BASED ON FMRI DATA USING PROBABILISTIC GENERATIVE MODEL Takaaki Higashi, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama 5060 | ESTIMATION OF VISUAL FEATURES OF VIEWED IMAGE FROM INDIVIDUAL AND SHARED BRAIN INFORMATION BASED ON FMRI DATA USING PROBABILISTIC GENERATIVE MODEL |
3494 Evaluation and Comparison of Three Source Direction-of-Arrival Estimators Using Relative Harmonic Coefficients Yonggang Hu, Prasanga Samarasinghe, Sharon Gannot, Thushara Abhayapala 3494 | Evaluation and Comparison of Three Source Direction-of-Arrival Estimators Using Relative Harmonic Coefficients |
4715 EVENT-DRIVEN MODULO SAMPLING Dorian Florescu, Felix Krahmer, Ayush Bhandari 4715 | EVENT-DRIVEN MODULO SAMPLING |
2977 EVOLUTIONARY QUANTIZATION OF NEURAL NETWORKS WITH MIXED-PRECISION Zhenhua Liu, Xinfeng Zhang, Shanshe Wang, Siwei Ma, Wen Gao 2977 | EVOLUTIONARY QUANTIZATION OF NEURAL NETWORKS WITH MIXED-PRECISION |
2458 EVOLVING QUANTIZED NEURAL NETWORKS FOR IMAGE CLASSIFICATION USING A MULTI-OBJECTIVE GENETIC ALGORITHM Yong Wang, Xiaojing Wang, Xiaoyu He 2458 | EVOLVING QUANTIZED NEURAL NETWORKS FOR IMAGE CLASSIFICATION USING A MULTI-OBJECTIVE GENETIC ALGORITHM |
4186 Exact Linear Convergence Rate Analysis for Low-Rank Symmetric Matrix Completion via Gradient Descent Trung Vu, Raviv Raich 4186 | Exact Linear Convergence Rate Analysis for Low-Rank Symmetric Matrix Completion via Gradient Descent |
3918 EXPEDITING DISCOVERY IN NEURAL ARCHITECTURE SEARCH BY COMBINING LEARNING WITH PLANNING Farzaneh S. Fard, Vikrant Tomar 3918 | EXPEDITING DISCOVERY IN NEURAL ARCHITECTURE SEARCH BY COMBINING LEARNING WITH PLANNING |
3127 EXPLOITING NON-NEGATIVE MATRIX FACTORIZATION FOR BINAURAL SOUND LOCALIZATION IN THE PRESENCE OF DIRECTIONAL INTERFERENCE Ingvi Örnolfsson, Torsten Dau, Tobias May, Ning Ma 3127 | EXPLOITING NON-NEGATIVE MATRIX FACTORIZATION FOR BINAURAL SOUND LOCALIZATION IN THE PRESENCE OF DIRECTIONAL INTERFERENCE |
3268 EXPLOITING THE DUAL-TREE COMPLEX WAVELET TRANSFORM FOR SHIP WAKE DETECTION IN SAR IMAGERY Wanli Ma, Alin Achim, Oktay Karakuş 3268 | EXPLOITING THE DUAL-TREE COMPLEX WAVELET TRANSFORM FOR SHIP WAKE DETECTION IN SAR IMAGERY |
2361 EXPLORING AUTOMATIC COVID-19 DIAGNOSIS VIA VOICE AND SYMPTOMS FROM CROWDSOURCED DATA Jing Han, Chloe Brown, Jagmohan Chauhan, Andreas Grammenos, Apinan Hasthanasombat, Dimitris Spathis, Tong Xia, Pietro Cicuta, Cecilia Mascolo 2361 | EXPLORING AUTOMATIC COVID-19 DIAGNOSIS VIA VOICE AND SYMPTOMS FROM CROWDSOURCED DATA |
1115 Exploring the application of synthetic audio in training keyword spotters Andrew Werchniak, Roberto Barra Chicote, Yuriy Mishchenko, Jasha Droppo, Peng Liu, Jeff Condal, Anish Shah 1115 | Exploring the application of synthetic audio in training keyword spotters |
3866 EXPLORING THE USE OF COMMON LABEL SET TO IMPROVE SPEECH RECOGNITION OF LOW RESOURCE INDIAN LANGUAGES Vishwas M Shetty, Srinivasan Umesh 3866 | EXPLORING THE USE OF COMMON LABEL SET TO IMPROVE SPEECH RECOGNITION OF LOW RESOURCE INDIAN LANGUAGES |
1064 EXPLORING VISUAL-AUDIO COMPOSITION ALIGNMENT NETWORK FOR QUALITY FASHION RETRIEVAL IN VIDEO Yanhao Zhang, Jianmin Wu, Xiong Xiong, Dangwei Li, Chenwei Xie, Yun Zheng, Pan Pan, Yinghui Xu 1064 | EXPLORING VISUAL-AUDIO COMPOSITION ALIGNMENT NETWORK FOR QUALITY FASHION RETRIEVAL IN VIDEO |
1634 EXPOSING GAN-GENERATED FACES USING INCONSISTENT CORNEAL SPECULAR HIGHLIGHTS Shu Hu, Yuezun Li, Siwei Lyu 1634 | EXPOSING GAN-GENERATED FACES USING INCONSISTENT CORNEAL SPECULAR HIGHLIGHTS |
5586 EXTENDED NESTED ARRAYS FOR CONSECUTIVE VIRTUAL APERTURE ENHANCEMENT Shiwei Ren, Wentao Dong, Xiangnan Li, Weijiang Wang, Xiaoran Li 5586 | EXTENDED NESTED ARRAYS FOR CONSECUTIVE VIRTUAL APERTURE ENHANCEMENT |
3852 Extended Object Tracking with Automotive Radar Using B-Spline Chained Ellipses Model Gang Yao, Pu Wang, Karl Berntorp, Hassan Mansour, Petros Boufounos, Philip Orlik 3852 | Extended Object Tracking with Automotive Radar Using B-Spline Chained Ellipses Model |
3581 EXTENDING MUSIC BASED ON EMOTION AND TONALITY VIA GENERATIVE ADVERSARIAL NETWORK Bo-Wei Tseng, Yih-Liang Shen, Tai-Shih Chi 3581 | EXTENDING MUSIC BASED ON EMOTION AND TONALITY VIA GENERATIVE ADVERSARIAL NETWORK |
4284 EXTENDING PARROTRON: AN END-TO-END, SPEECH CONVERSION AND SPEECH RECOGNITION MODEL FOR ATYPICAL SPEECH Rohan Doshi, Youzheng Chen, Liyang Jiang, Xia Zhang, Fadi Biadsy, Bhuvana Ramabhadran, Fang Chu, Andrew Rosenberg, Pedro J. Moreno 4284 | EXTENDING PARROTRON: AN END-TO-END, SPEECH CONVERSION AND SPEECH RECOGNITION MODEL FOR ATYPICAL SPEECH |
2709 Extending the Reverse JPEG Compatibility Attack to Double Compressed Images Jan Butora, Jessica Fridrich 2709 | Extending the Reverse JPEG Compatibility Attack to Double Compressed Images |
1900 Factorized CRF with batch normalization based on the entire training data Eran Goldman, Jacob Goldberger 1900 | Factorized CRF with batch normalization based on the entire training data |
2130 FAILURE PREDICTION BY CONFIDENCE ESTIMATION OF UNCERTAINTY-AWARE DIRICHLET NETWORKS Theodoros Tsiligkaridis 2130 | FAILURE PREDICTION BY CONFIDENCE ESTIMATION OF UNCERTAINTY-AWARE DIRICHLET NETWORKS |
5583 Fast Adaptive Reparametrization (FAR) With Application to Human Action Recognition Enjie Ghorbel, Girum Demisse, Djamila Aouada, Björn Ottersten 5583 | Fast Adaptive Reparametrization (FAR) With Application to Human Action Recognition |
2277 Fast and Provable Robust PCA via Normalized Coherence Pursuit Mostafa Rahmani, Ping Li 2277 | Fast and Provable Robust PCA via Normalized Coherence Pursuit |
2607 FAST AND ROBUST ADMM FOR BLIND SUPER-RESOLUTION Yifan Ran, Wei Dai 2607 | FAST AND ROBUST ADMM FOR BLIND SUPER-RESOLUTION |
4133 FAST AND ROBUST STRATIFIED SELF-CALIBRATION USING TIME-DIFFERENCE-OF-ARRIVAL MEASUREMENTS Martin Larsson, Gabrielle Flood, Magnus Oskarsson, Kalle Åström 4133 | FAST AND ROBUST STRATIFIED SELF-CALIBRATION USING TIME-DIFFERENCE-OF-ARRIVAL MEASUREMENTS |
4829 FAST DCTTS: EFFICIENT DEEP CONVOLUTIONAL TEXT-TO-SPEECH Minsu Kang, Jihyun Lee, Simin Kim, Injung Kim 4829 | FAST DCTTS: EFFICIENT DEEP CONVOLUTIONAL TEXT-TO-SPEECH |
2098 FAST DECENTRALIZED LINEAR FUNCTIONS VIA SUCCESSIVE GRAPH SHIFT OPERATORS Siavash Mollaebrahim Ghari, Daniel Romero, Baltasar Beferull-Lozano 2098 | FAST DECENTRALIZED LINEAR FUNCTIONS VIA SUCCESSIVE GRAPH SHIFT OPERATORS |
2366 FAST GRAPH KERNEL WITH OPTICAL RANDOM FEATURES Hashem Ghanem, Nicolas Keriven, Nicolas Tremblay 2366 | FAST GRAPH KERNEL WITH OPTICAL RANDOM FEATURES |
2351 Fast Hierarchy Preserving Graph Embedding via Subspace Constraints Xu Chen, Lun Du, Mengyuan Chen, Yun Wang, QingQing Long, Kunqing Xie 2351 | Fast Hierarchy Preserving Graph Embedding via Subspace Constraints |
4101 FAST INVERSE MAPPING OF FACE GANS Nicky Bayat, Vahid Reza Khazaie, Yalda Mohsenzadeh 4101 | FAST INVERSE MAPPING OF FACE GANS |
2264 FAST LOCAL REPRESENTATION LEARNING WITH ADAPTIVE ANCHOR GRAPH Canyu Zhang, Feiping Nie, Zheng Wang, Rong Wang, Xuelong Li 2264 | FAST LOCAL REPRESENTATION LEARNING WITH ADAPTIVE ANCHOR GRAPH |
2951 Fast Manifold Landmarking Using Extreme Eigen-pairs Fen Wang, Gene Cheung, Yongchao Wang, Wai-Tian Tan 2951 | Fast Manifold Landmarking Using Extreme Eigen-pairs |
3370 FAST THRESHOLD OPTIMIZATION FOR MULTI-LABEL AUDIO TAGGING USING SURROGATE GRADIENT LEARNING Thomas Pellegrini, Timothée Masquelier 3370 | FAST THRESHOLD OPTIMIZATION FOR MULTI-LABEL AUDIO TAGGING USING SURROGATE GRADIENT LEARNING |
1995 FAST: FEATURE AGGREGATION FOR DETECTING SALIENT OBJECT IN REAL-TIME Lv Tang, Bo Li 1995 | FAST: FEATURE AGGREGATION FOR DETECTING SALIENT OBJECT IN REAL-TIME |
2175 FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang 2175 | FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization |
5296 FASTPITCH: PARALLEL TEXT-TO-SPEECH WITH PITCH PREDICTION Adrian Lancucki 5296 | FASTPITCH: PARALLEL TEXT-TO-SPEECH WITH PITCH PREDICTION |
2236 FC2RN: A FULLY CONVOLUTIONAL CORNER REFINEMENT NETWORK FOR ACCURATE MULTI-ORIENTED SCENE TEXT DETECTION Xugong Qin, Yu Zhou, Youhui Guo, Dayan Wu, Weiping Wang 2236 | FC2RN: A FULLY CONVOLUTIONAL CORNER REFINEMENT NETWORK FOR ACCURATE MULTI-ORIENTED SCENE TEXT DETECTION |
3410 FCL-TACO2: TOWARDS FAST, CONTROLLABLE AND LIGHTWEIGHT TEXT-TO-SPEECH SYNTHESIS Disong Wang, Liqun Deng, Yang Zhang, Nianzu Zheng, Yu Ting Yeung, Xiao Chen, Xunying Liu, Helen Meng 3410 | FCL-TACO2: TOWARDS FAST, CONTROLLABLE AND LIGHTWEIGHT TEXT-TO-SPEECH SYNTHESIS |
5391 FEATURE INTEGRATION VIA SEMI-SUPERVISED ORDINALLY MULTI-MODAL GAUSSIAN PROCESS LATENT VARIABLE MODEL Kyohei Kamikawa, Keisuke Maeda, Takahiro Ogawa, Miki Haseyama 5391 | FEATURE INTEGRATION VIA SEMI-SUPERVISED ORDINALLY MULTI-MODAL GAUSSIAN PROCESS LATENT VARIABLE MODEL |
5233 FEATURE REDUNDANCY MINING: DEEP LIGHT-WEIGHT IMAGE SUPER-RESOLUTION MODEL Jun Xiao, Wenqi Jia, Kin-Man Lam 5233 | FEATURE REDUNDANCY MINING: DEEP LIGHT-WEIGHT IMAGE SUPER-RESOLUTION MODEL |
4067 FEATURE REUSE FOR A RANDOMIZATION BASED NEURAL NETWORK Xinyue Liang, Mikael Skoglund, Saikat Chatterjee 4067 | FEATURE REUSE FOR A RANDOMIZATION BASED NEURAL NETWORK |
2852 FEDERATED ACOUSTIC MODELING FOR AUTOMATIC SPEECH RECOGNITION Xiaodong Cui, Songtao Lu, Brian Kingsbury 2852 | FEDERATED ACOUSTIC MODELING FOR AUTOMATIC SPEECH RECOGNITION |
4632 FEDERATED ALGORITHM WITH BAYESIAN APPROACH: OMNI-FEDGE Sai Anuroop Kesanapalli, B. N. Bharath 4632 | FEDERATED ALGORITHM WITH BAYESIAN APPROACH: OMNI-FEDGE |
2298 FEDERATED DROPOUT LEARNING FOR HYBRID BEAMFORMING WITH SPATIAL PATH INDEX MODULATION IN MULTI-USER MMWAVE-MIMO SYSTEMS Ahmet M. Elbir, Sinem Coleri, Kumar Vijay Mishra 2298 | FEDERATED DROPOUT LEARNING FOR HYBRID BEAMFORMING WITH SPATIAL PATH INDEX MODULATION IN MULTI-USER MMWAVE-MIMO SYSTEMS |
1532 FEDERATED LEARNING FROM BIG DATA OVER NETWORKS Yasmin SarcheshmehPour, Miika Leinonen, Alexander Jung 1532 | FEDERATED LEARNING FROM BIG DATA OVER NETWORKS |
2751 Federated Learning with Local Differential Privacy: Trade-offs between Privacy, Utility, and Communication Muah Kim, Onur Günlü, Rafael F. Schaefer 2751 | Federated Learning with Local Differential Privacy: Trade-offs between Privacy, Utility, and Communication |
2101 FEDERATED MARGINAL PERSONALIZATION FOR ASR RESCORING Zhe Liu, Fuchun Peng 2101 | FEDERATED MARGINAL PERSONALIZATION FOR ASR RESCORING |
5623 Feedforward Selective Fixed-Filter Active Noise Control: Algorithm and Implementation DONGYUAN SHI, Woon-Seng Gan, Bhan Lam, Shulin Wen 5623 | Feedforward Selective Fixed-Filter Active Noise Control: Algorithm and Implementation |
2966 FEW-SHOT CONTINUAL LEARNING FOR AUDIO CLASSIFICATION Yu Wang, Nicholas J. Bryan, Mark Cartwright, Juan Pablo Bello, Justin Salamon 2966 | FEW-SHOT CONTINUAL LEARNING FOR AUDIO CLASSIFICATION |
3874 FEW-SHOT IMAGE CLASSIFICATION WITH MULTI-FACET PROTOTYPES Kun Yan, Zied Bouraoui, Ping Wang, Shoaib Jameel, Steven Schockaert 3874 | FEW-SHOT IMAGE CLASSIFICATION WITH MULTI-FACET PROTOTYPES |
2823 Few-shot Learning for CT Scan based COVID-19 Diagnosis Yifan Jiang, Han Chen, David Han, Hanseok Ko 2823 | Few-shot Learning for CT Scan based COVID-19 Diagnosis |
1168 Few-Shot Learning for Decoding Surface Electromyography for Hand Gesture Recognition Elahe Rahimian, Soheil Zabihi, Amir Asif, Seyed Farokh Atashzar, Arash Mohammadi 1168 | Few-Shot Learning for Decoding Surface Electromyography for Hand Gesture Recognition |
3125 Fiber-Sampled Stochastic Mirror Descent For Tensor Decomposition with $\beta$-Divergence Wenqiang Pu, Shahana Ibrahim, Xiao Fu, Mingyi Hong 3125 | Fiber-Sampled Stochastic Mirror Descent For Tensor Decomposition with $\beta$-Divergence |
3853 FiGLearn: Filter and Graph Learning using Optimal Transport Matthias Minder, Zahra Farsijani, Dhruti Shah, Mireille El Gheche, Pascal Frossard 3853 | FiGLearn: Filter and Graph Learning using Optimal Transport |
3154 FINE-GRAINED MRI RECONSTRUCTION USING ATTENTIVE SELECTION GENERATIVE ADVERSARIAL NETWORKS Jingshuai Liu, Mehrdad Yaghoobi 3154 | FINE-GRAINED MRI RECONSTRUCTION USING ATTENTIVE SELECTION GENERATIVE ADVERSARIAL NETWORKS |
5242 Fine-Grained Pose Temporal Memory Module for Video Pose Estimation and Tracking Chaoyi Wang, Yang Hua, Tao Song, Zhengui Xue, Ruhui Ma, Neil Robertson, Haibing Guan 5242 | Fine-Grained Pose Temporal Memory Module for Video Pose Estimation and Tracking |
3765 FINE-TUNING OF PRE-TRAINED END-TO-END SPEECH RECOGNITION WITH GENERATIVE ADVERSARIAL NETWORKS Md. Akmal Haidar, Mehdi Rezagholizadeh 3765 | FINE-TUNING OF PRE-TRAINED END-TO-END SPEECH RECOGNITION WITH GENERATIVE ADVERSARIAL NETWORKS |
4226 FIRST-ORDER FAST ALGORITHM FOR STRUCTURALLY OPTIMAL MULTI-GROUP MULTICAST BEAMFORMING IN LARGE-SCALE SYSTEMS Chong Zhang, Min Dong, Ben Liang 4226 | FIRST-ORDER FAST ALGORITHM FOR STRUCTURALLY OPTIMAL MULTI-GROUP MULTICAST BEAMFORMING IN LARGE-SCALE SYSTEMS |
3235 FLOW-BASED SELF-SUPERVISED DENSITY ESTIMATION FOR ANOMALOUS SOUND DETECTION Kota Dohi, Takashi Endo, Harsh Purohit, Ryo Tanabe, Yohei Kawaguchi 3235 | FLOW-BASED SELF-SUPERVISED DENSITY ESTIMATION FOR ANOMALOUS SOUND DETECTION |
1916 FMA-ETA: ESTIMATING TRAVEL TIME ENTIRELY BASED ON FFN WITH ATTENTION Yiwen Sun, Yulu Wang, Kun Fu, Zheng Wang, Ziang Yan, Changshui Zhang, Jieping Ye 1916 | FMA-ETA: ESTIMATING TRAVEL TIME ENTIRELY BASED ON FFN WITH ATTENTION |
1047 F-NET: FUSION NEURAL NETWORK FOR VEHICLE TRAJECTORY PREDICTION IN AUTONOMOUS DRIVING Jue Wang, Ping Wang, Jun Li 1047 | F-NET: FUSION NEURAL NETWORK FOR VEHICLE TRAJECTORY PREDICTION IN AUTONOMOUS DRIVING |
2807 Focus on the present: a regularization method for the ASR source-target attention layer Nanxin Chen, Piotr Zelasko, Jesus Villalba, Najim Dehak 2807 | Focus on the present: a regularization method for the ASR source-target attention layer |
4618 Focus: Context-Aware Masking for Robust Speaker Verification Ya-Qi Yu, Siqi Zheng, Hongbin Suo, Yun Lei, Wu-Jun Li 4618 | Focus: Context-Aware Masking for Robust Speaker Verification |
5602 FOCUSING AND FREQUENCY SMOOTHING FOR ARBITRARY ARRAYS WITH APPLICATION TO SPEAKER LOCALIZATION Hanan Beit-On, Boaz Rafaely 5602 | FOCUSING AND FREQUENCY SMOOTHING FOR ARBITRARY ARRAYS WITH APPLICATION TO SPEAKER LOCALIZATION |
2482 FOCUSING-BASED WIDEBAND ADAPTIVE BEAMFORMING USING COVARIANCE MATRIX RECONSTRUCTION Peng Chen, Wei Wang, Jingjie Gao 2482 | FOCUSING-BASED WIDEBAND ADAPTIVE BEAMFORMING USING COVARIANCE MATRIX RECONSTRUCTION |
3077 FONTNET: ON-DEVICE FONT UNDERSTANDING AND PREDICTION PIPELINE Rakshith S, Rishabh Khurana, Vibhav Agarwal, Jayesh Rajkumar Vachhani, Guggilla Bhanodai 3077 | FONTNET: ON-DEVICE FONT UNDERSTANDING AND PREDICTION PIPELINE |
3850 FOOLHD: FOOLING SPEAKER IDENTIFICATION BY HIGHLY IMPERCEPTIBLE ADVERSARIAL DISTURBANCES Ali Shahin Shamsabadi, Francisco Sepúlveda Teixeira, Alberto Abad, Bhiksha Raj, Andrea Cavallaro, Isabel Trancoso 3850 | FOOLHD: FOOLING SPEAKER IDENTIFICATION BY HIGHLY IMPERCEPTIBLE ADVERSARIAL DISTURBANCES |
3691 FORENSICABILITY OF DEEP NEURAL NETWORK INFERENCE PIPELINES Alexander Schlögl, Tobias Kupek, Rainer Böhme 3691 | FORENSICABILITY OF DEEP NEURAL NETWORK INFERENCE PIPELINES |
4452 Four-Dimensional High-Resolution Automotive Radar Imaging Exploiting Joint Sparse-Frequency and Sparse-Array Design Shunqiao Sun, Yimin Zhang 4452 | Four-Dimensional High-Resolution Automotive Radar Imaging Exploiting Joint Sparse-Frequency and Sparse-Array Design |
4084 FOURIER TRANSFORMATION AUTOENCODERS FOR ANOMALY DETECTION Demetris Lappas, Vasileios Argyriou, Dimitrios Makris 4084 | FOURIER TRANSFORMATION AUTOENCODERS FOR ANOMALY DETECTION |
2193 FOVEAL AVASCULAR ZONE SEGMENTATION OF OCTA IMAGES USING DEEP LEARNING APPROACH WITH UNSUPERVISED VESSEL SEGMENTATION Zhijin Liang, Junkang Zhang, Cheolhong An 2193 | FOVEAL AVASCULAR ZONE SEGMENTATION OF OCTA IMAGES USING DEEP LEARNING APPROACH WITH UNSUPERVISED VESSEL SEGMENTATION |
2548 FPGA HARDWARE DESIGN FOR PLENOPTIC 3D IMAGE PROCESSING ALGORITHM TARGETING A MOBILE APPLICATION Faraz Bhatti, Thomas Greiner 2548 | FPGA HARDWARE DESIGN FOR PLENOPTIC 3D IMAGE PROCESSING ALGORITHM TARGETING A MOBILE APPLICATION |
1371 FRAGMENTVC: ANY-TO-ANY VOICE CONVERSION BY END-TO-END EXTRACTING AND FUSING FINE-GRAINED VOICE FRAGMENTS WITH ATTENTION Yist Y. Lin, Chung-Ming Chien, Jheng-Hao Lin, Hung-yi Lee, Lin-shan Lee 1371 | FRAGMENTVC: ANY-TO-ANY VOICE CONVERSION BY END-TO-END EXTRACTING AND FUSING FINE-GRAINED VOICE FRAGMENTS WITH ATTENTION |
3694 FRAME RATE UP-CONVERSION USING KEY POINT AGNOSTIC FREQUENCY-SELECTIVE MESH-TO-GRID RESAMPLING Viktoria Heimann, Andreas Spruck, André Kaup 3694 | FRAME RATE UP-CONVERSION USING KEY POINT AGNOSTIC FREQUENCY-SELECTIVE MESH-TO-GRID RESAMPLING |
1318 Frame-rate-aware Aggregation For Efficient Video Super-resolution Takashi Isobe, Fang Zhu, Shengjin Wang 1318 | Frame-rate-aware Aggregation For Efficient Video Super-resolution |
5596 FREQUENCY ESTIMATION IN COHERENT, PERIODIC PULSE TRAINS Ian Clarkson, Songsri Sirianunpiboon, Stephen Howard 5596 | FREQUENCY ESTIMATION IN COHERENT, PERIODIC PULSE TRAINS |
4814 Frequency-Temporal Attention Network for Singing Melody Extraction Shuai Yu, Xiaoheng Sun, Yi Yu, Wei Li 4814 | Frequency-Temporal Attention Network for Singing Melody Extraction |
5258 Full-Duplex Multifunction Transceiver with Joint Constant Envelope Transmission and Wideband Reception Jaakko Marin, Micael Bernhardt, Taneli Riihonen 5258 | Full-Duplex Multifunction Transceiver with Joint Constant Envelope Transmission and Wideband Reception |
4270 FULLSUBNET: A FULL-BAND AND SUB-BAND FUSION MODEL FOR REAL-TIME SINGLE-CHANNEL SPEECH ENHANCEMENT Xiang Hao, Xiangdong Su, Radu Horaud, Xiaofei Li 4270 | FULLSUBNET: A FULL-BAND AND SUB-BAND FUSION MODEL FOR REAL-TIME SINGLE-CHANNEL SPEECH ENHANCEMENT |
3093 FULLY-NEURAL APPROACH TO VEHICLE WEIGHING AND STRAIN PREDICTION ON BRIDGES USING WIRELESS ACCELEROMETERS Takaya Kawakatsu, Kenro Aihara, Atsuhiro Takasu, Jun Adachi, Haoqi Wang, Tomonori Nagayama 3093 | FULLY-NEURAL APPROACH TO VEHICLE WEIGHING AND STRAIN PREDICTION ON BRIDGES USING WIRELESS ACCELEROMETERS |
5160 FUNDAMENTAL FREQUENCY FEATURE NORMALIZATION AND DATA AUGMENTATION FOR CHILD SPEECH RECOGNITION Gary Yeung, Ruchao Fan, Abeer Alwan 5160 | FUNDAMENTAL FREQUENCY FEATURE NORMALIZATION AND DATA AUGMENTATION FOR CHILD SPEECH RECOGNITION |
4696 FUNDAMENTAL TRADE-OFFS IN NOISY SUPER-RESOLUTION WITH SYNTHETIC APERTURES Sina Shahsavari, Jacob Millhiser, Piya Pal 4696 | FUNDAMENTAL TRADE-OFFS IN NOISY SUPER-RESOLUTION WITH SYNTHETIC APERTURES |
3751 FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION Wentao Yu, Steffen Zeiler, Dorothea Kolossa 3751 | FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION |
2036 FUSING MULTITASK MODELS BY RECURSIVE LEAST SQUARES Xiaobin Li, Lianlei Shan, Weiqiang Wang 2036 | FUSING MULTITASK MODELS BY RECURSIVE LEAST SQUARES |
1656 FUSION-BASED DIGITAL IMAGE CORRELATION FRAMEWORK FOR STRAIN MEASUREMENT Laixi Shi, Dehong Liu, Masaki Umeda, Norihiko Hana 1656 | FUSION-BASED DIGITAL IMAGE CORRELATION FRAMEWORK FOR STRAIN MEASUREMENT |
3769 FWB-NET: FRONT WHITE BALANCE NETWORK FOR COLOR SHIFT CORRECTION IN SINGLE IMAGE DEHAZING VIA ATMOSPHERIC LIGHT ESTIMATION Cong Wang, Yan Huang, Yuexian Zou, Yong Xu 3769 | FWB-NET: FRONT WHITE BALANCE NETWORK FOR COLOR SHIFT CORRECTION IN SINGLE IMAGE DEHAZING VIA ATMOSPHERIC LIGHT ESTIMATION |
5269 GAN-BASED OUT-OF-DOMAIN DETECTION USING BOTH IN-DOMAIN AND OUT-OF-DOMAIN SAMPLES Chaojie Liang, Peijie Huang, Wenbin Lai, Ziheng Ruan 5269 | GAN-BASED OUT-OF-DOMAIN DETECTION USING BOTH IN-DOMAIN AND OUT-OF-DOMAIN SAMPLES |
3094 G-ARRAYS: GEOMETRIC ARRAYS FOR EFFICIENT POINT CLOUD PROCESSING Hoda Roodaki, Masoud Dehyadegari, Mahdi Nazm Bojnordi 3094 | G-ARRAYS: GEOMETRIC ARRAYS FOR EFFICIENT POINT CLOUD PROCESSING |
1252 GATE TRIMMING: ONE-SHOT CHANNEL PRUNING FOR EFFICIENT CONVOLUTIONAL NEURAL NETWORKS Fang Yu, Chuanqi Han, Pengcheng Wang, Xi Huang, Li Cui 1252 | GATE TRIMMING: ONE-SHOT CHANNEL PRUNING FOR EFFICIENT CONVOLUTIONAL NEURAL NETWORKS |
5085 Gating Feature Dense Network for Single Anisotropic MR Image Super-resolution Weidong He, Yangjinan Hu, Lulu Wang, Zhongshi He, Jinglong Du 5085 | Gating Feature Dense Network for Single Anisotropic MR Image Super-resolution |
4093 Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-based Speech Recognition Yosuke Kashiwagi, Emiru Tsunoo, Shinji Watanabe 4093 | Gaussian Kernelized Self-Attention for Long Sequence Data and Its Application to CTC-based Speech Recognition |
2711 Gaussian Process Temporal-Difference Learning with Scalability and Worst-Case Performance Guarantees Qin Lu, Georgios B. Giannakis 2711 | Gaussian Process Temporal-Difference Learning with Scalability and Worst-Case Performance Guarantees |
2858 GDTW: A NOVEL DIFFERENTIABLE DTW LOSS FOR TIME SERIES TASKS Xiang Liu, Naiqi Li, Shu-Tao Xia 2858 | GDTW: A NOVEL DIFFERENTIABLE DTW LOSS FOR TIME SERIES TASKS |
3768 GENERAL TOTAL VARIATION REGULARIZED SPARSE BAYESIAN LEARNING FOR ROBUST BLOCK-SPARSE SIGNAL RECOVERY Aditya Sant, Markus Leinonen, Bhaskar Rao 3768 | GENERAL TOTAL VARIATION REGULARIZED SPARSE BAYESIAN LEARNING FOR ROBUST BLOCK-SPARSE SIGNAL RECOVERY |
3028 GENERALIZED KNOWLEDGE DISTILLATION FROM AN ENSEMBLE OF SPECIALIZED TEACHERS LEVERAGING UNSUPERVISED NEURAL CLUSTERING Takashi Fukuda, Gakuto Kurata 3028 | GENERALIZED KNOWLEDGE DISTILLATION FROM AN ENSEMBLE OF SPECIALIZED TEACHERS LEVERAGING UNSUPERVISED NEURAL CLUSTERING |
5363 GENERALIZED POLYTOPIC MATRIX FACTORIZATION Gokcan Tatli, Alper T. Erdogan 5363 | GENERALIZED POLYTOPIC MATRIX FACTORIZATION |
1710 Generalized Thinned Coprime Array for DOA Estimation Junpeng Shi, Yongxiang Liu, Fangqing Wen, Zhenghui Gong, Panhe Hu 1710 | Generalized Thinned Coprime Array for DOA Estimation |
2311 GENERATING EMPATHETIC RESPONSES BY INJECTING ANTICIPATED EMOTION Yuhan Liu, Jiachen Du, Xiang Li, Ruifeng Xu 2311 | GENERATING EMPATHETIC RESPONSES BY INJECTING ANTICIPATED EMOTION |
5071 GENERATING HUMAN READABLE TRANSCRIPT FOR AUTOMATIC SPEECH RECOGNITION WITH PRE-TRAINED LANGUAGE MODEL Junwei Liao, Yu Shi, Ming Gong, Linjun Shou, Sefik Eskimez, Liyang Lu, Hong Qu, Michael Zeng 5071 | GENERATING HUMAN READABLE TRANSCRIPT FOR AUTOMATIC SPEECH RECOGNITION WITH PRE-TRAINED LANGUAGE MODEL |
3872 GENERATING NATURAL QUESTIONS FROM IMAGES FOR MULTIMODAL ASSISTANTS Alkesh Patel, Akanksha Bindal, Hadas Kotek, Christopher Klein, Jason Williams 3872 | GENERATING NATURAL QUESTIONS FROM IMAGES FOR MULTIMODAL ASSISTANTS |
5301 GENERATIVE INFORMATION FUSION Kenneth Tran, Wesam Sakla, Hamid Krim 5301 | GENERATIVE INFORMATION FUSION |
3105 GENERATIVE SPEECH CODING WITH PREDICTIVE VARIANCE REGULARIZATION W Bastiaan Kleijn, Andrew Storus, Michael Chinen, Tom Denton, Felicia S. C. Lim, Alejandro Luebs, Jan Skoglund, Hengchin Yeh 3105 | GENERATIVE SPEECH CODING WITH PREDICTIVE VARIANCE REGULARIZATION |
4021 GEOMETRIC SCATTERING ATTENTION NETWORKS Yimeng Min, Frederik Wenkel, Guy Wolf 4021 | GEOMETRIC SCATTERING ATTENTION NETWORKS |
1338 GEOMETRY CONSISTENCY OF AUGMENTED REALITY BASED ON SEMANTICS Hongyan Quan, Mingwei Yao, XiaoXiao Qian 1338 | GEOMETRY CONSISTENCY OF AUGMENTED REALITY BASED ON SEMANTICS |
3455 GEOM-SPIDER-EM: FASTER VARIANCE REDUCED STOCHASTIC EXPECTATION MAXIMIZATION FOR NONCONVEX FINITE-SUM OPTIMIZATION Gersende Fort, Eric Moulines, Hoi-To Wai 3455 | GEOM-SPIDER-EM: FASTER VARIANCE REDUCED STOCHASTIC EXPECTATION MAXIMIZATION FOR NONCONVEX FINITE-SUM OPTIMIZATION |
2299 GLOBAL-LOCALIZED AGENT GRAPH CONVOLUTION FOR MULTI-AGENT REINFORCEMENT LEARNING Yuntao Liu, Yong Dou, Siqi Shen, Peng Qiao 2299 | GLOBAL-LOCALIZED AGENT GRAPH CONVOLUTION FOR MULTI-AGENT REINFORCEMENT LEARNING |
2302 GLOBALLY OPTIMAL BEAMFORMING FOR RATE SPLITTING MULTIPLE ACCESS Bho Matthiesen, Yijie Mao, Petar Popovski, Bruno Clerckx 2302 | GLOBALLY OPTIMAL BEAMFORMING FOR RATE SPLITTING MULTIPLE ACCESS |
4062 GPS-DENIED NAVIGATION USING SAR IMAGES AND NEURAL NETWORKS Teresa White, Jesse Wheeler, Colton Lindstrom, Randall Christensen, Kevin Moon 4062 | GPS-DENIED NAVIGATION USING SAR IMAGES AND NEURAL NETWORKS |
2692 GRADRAKER-BASED PREDICTION ALGORITHMS ON MULTI-LAYER GRAPHS Yue Zhao, Ender Ayanoglu 2692 | GRADRAKER-BASED PREDICTION ALGORITHMS ON MULTI-LAYER GRAPHS |
4823 GRADUAL FEDERATED LEARNING USING SIMULATED ANNEALING Luong Trung Nguyen, Byonghyo Shim 4823 | GRADUAL FEDERATED LEARNING USING SIMULATED ANNEALING |
1447 GRAMIAN-BASED ADAPTIVE COMBINATION POLICIES FOR DIFFUSION LEARNING OVER NETWORKS Y. Efe Erginbas, Stefan Vlaski, Ali H. Sayed 1447 | GRAMIAN-BASED ADAPTIVE COMBINATION POLICIES FOR DIFFUSION LEARNING OVER NETWORKS |
4253 GRANGER CAUSALITY BASED DIRECTIONAL PHASE-AMPLITUDE COUPLING MEASURE Tamanna Tabassum Khan Munia, Selin Aviyente 4253 | GRANGER CAUSALITY BASED DIRECTIONAL PHASE-AMPLITUDE COUPLING MEASURE |
1736 GRAPH ATTENTION AND INTERACTION NETWORK WITH MULTI-TASK LEARNING FOR FACT VERIFICATION Rui Yang, Runze Wang, Zhen-Hua Ling 1736 | GRAPH ATTENTION AND INTERACTION NETWORK WITH MULTI-TASK LEARNING FOR FACT VERIFICATION |
4354 GRAPH ATTENTION NETWORKS FOR SPEAKER VERIFICATION Jee-weon Jung, Hee-Soo Heo, Ha-Jin Yu, Joon Son Chung 4354 | GRAPH ATTENTION NETWORKS FOR SPEAKER VERIFICATION |
4900 GRAPH EMBEDDING USING MULTI-LAYER ADJACENT POINT MERGING MODEL Jianming Huang, Hiroyuki Kasai 4900 | GRAPH EMBEDDING USING MULTI-LAYER ADJACENT POINT MERGING MODEL |
3759 GRAPH ENHANCED QUERY REWRITING FOR SPOKEN LANGUAGE UNDERSTANDING SYSTEM Siyang Yuan, Saurabh Gupta, Xing Fan, Derek Liu, Yang Liu, Chenlei Guo 3759 | GRAPH ENHANCED QUERY REWRITING FOR SPOKEN LANGUAGE UNDERSTANDING SYSTEM |
4333 GRAPH FREQUENCY ANALYSIS OF COVID-19 INCIDENCE TO IDENTIFY COUNTY-LEVEL CONTAGION PATTERNS IN THE UNITED STATES Yang Li, Gonzalo Mateos 4333 | GRAPH FREQUENCY ANALYSIS OF COVID-19 INCIDENCE TO IDENTIFY COUNTY-LEVEL CONTAGION PATTERNS IN THE UNITED STATES |
5556 Graph learning under spectral sparsity constraints Subbareddy Batreddy, Aditya Siripuram, Jingxin Zhang 5556 | Graph learning under spectral sparsity constraints |
3646 GRAPH NEURAL NETWORK FOR LARGE-SCALE NETWORK LOCALIZATION Wenzhong Yan, Di Jin, Zhidi Lin, Feng Yin 3646 | GRAPH NEURAL NETWORK FOR LARGE-SCALE NETWORK LOCALIZATION |
3446 GRAPH NEURAL NETWORKS FOR DECENTRALIZED CONTROLLERS Fernando Gama, Ekaterina Tolstaya, Alejandro Ribeiro 3446 | GRAPH NEURAL NETWORKS FOR DECENTRALIZED CONTROLLERS |
1222 Graph Signal Compression via Task-Based Quantization Pei Li, Nir Shlezinger, Haiyang Zhang, Baoyun Wang, Yonina C. Eldar 1222 | Graph Signal Compression via Task-Based Quantization |
5095 GRAPH SIGNAL DENOISING USING NESTED-STRUCTURED DEEP ALGORITHM UNROLLING Masatoshi Nagahama, Koki Yamada, Yuichi Tanaka, Stanley Chan, Yonina Eldar 5095 | GRAPH SIGNAL DENOISING USING NESTED-STRUCTURED DEEP ALGORITHM UNROLLING |
2689 Graph signal denoising via unrolling networks Siheng Chen, Yonina Eldar 2689 | Graph signal denoising via unrolling networks |
1676 Graph-Adaptive Incremental learning using an ensemble of Gaussian process experts Konstantinos D. Polyzos, Qin Lu, Georgios B. Giannakis 1676 | Graph-Adaptive Incremental learning using an ensemble of Gaussian process experts |
3555 Graph-Based Pyramid Global Context Reasoning with A Saliency-Aware Projection for COVID-19 Lung Infections Segmentation Huimin Huang, Ming Cai, Lanfen Lin, Jing Zheng, Xiongwei Mao, Xiaohan Qian, Zhiyi Peng, Jianying Zhou, Yutaro Iwamoto, Xian-Hua Han, Yen-Wei Chen, Ruofeng Tong 3555 | Graph-Based Pyramid Global Context Reasoning with A Saliency-Aware Projection for COVID-19 Lung Infections Segmentation |
3419 GRAPHCOMM: A GRAPH NEURAL NETWORK BASED METHOD FOR MULTI-AGENT REINFORCEMENT LEARNING Siqi SHEN, Yongquan Fu, Huayou Su, Hengyue Pan, Qiao Peng, Yong DouCheng Wang 3419 | GRAPHCOMM: A GRAPH NEURAL NETWORK BASED METHOD FOR MULTI-AGENT REINFORCEMENT LEARNING |
3429 Graph-Homomorphic Perturbations for Private Decentralized Learning Stefan Vlaski, Ali H. Sayed 3429 | Graph-Homomorphic Perturbations for Private Decentralized Learning |
3372 GraphNet: Graph Clustering with Deep Neural Networks Xianchao Zhang, Jie Mu, Han Liu, Xiaotong Zhang 3372 | GraphNet: Graph Clustering with Deep Neural Networks |
1842 GRAPHON AND GRAPH NEURAL NETWORK STABILITY Luana Ruiz, Zhiyang Wang, Alejandro Ribeiro 1842 | GRAPHON AND GRAPH NEURAL NETWORK STABILITY |
5299 GRAPHSPEECH: SYNTAX-AWARE GRAPH ATTENTION NETWORK FOR NEURAL SPEECH SYNTHESIS Rui liu, Berrak Sisman, Haizhou Li 5299 | GRAPHSPEECH: SYNTAX-AWARE GRAPH ATTENTION NETWORK FOR NEURAL SPEECH SYNTHESIS |
4348 Grid Optimization for Matrix-based Source Localization under Inhomogeneous Sensor Topology Hao Sun, Junting Chen 4348 | Grid Optimization for Matrix-based Source Localization under Inhomogeneous Sensor Topology |
5579 GROOVE2GROOVE: ONE-SHOT MUSIC STYLE TRANSFER WITH SUPERVISION FROM SYNTHETIC DATA Ondřej Cífka, Umut Şimşekli, Gaël Richard 5579 | GROOVE2GROOVE: ONE-SHOT MUSIC STYLE TRANSFER WITH SUPERVISION FROM SYNTHETIC DATA |
4338 GTA-NET: GRADUAL TEMPORAL AGGREGATION NETWORK FOR FAST VIDEO DERAINING Xinwei Xue, Xiangyu Meng, Long Ma, Risheng Liu, Xin Fan 4338 | GTA-NET: GRADUAL TEMPORAL AGGREGATION NETWORK FOR FAST VIDEO DERAINING |
5125 Guaranteed reconstruction from integrate-and-fire neurons with alpha synaptic activation Marek Hilton, Roxana Alexandru, Pier Luigi Dragotti 5125 | Guaranteed reconstruction from integrate-and-fire neurons with alpha synaptic activation |
4901 Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier Guillaume Carbajal, Julius Richter, Timo Gerkmann 4901 | Guided Variational Autoencoder for Speech Enhancement With a Supervised Classifier |
3644 HANDLING CLASS IMBALANCE IN LOW-RESOURCE DIALOGUE SYSTEMS BY COMBINING FEW-SHOT CLASSIFICATION AND INTERPOLATION Vishal Sunder, Eric Fosler-Lussier 3644 | HANDLING CLASS IMBALANCE IN LOW-RESOURCE DIALOGUE SYSTEMS BY COMBINING FEW-SHOT CLASSIFICATION AND INTERPOLATION |
4135 HANDWRITTEN DIGITS RECONSTRUCTION FROM UNLABELLED EMBEDDINGS Thomas Thebaud, Gaël Le Lan, Anthony Larcher 4135 | HANDWRITTEN DIGITS RECONSTRUCTION FROM UNLABELLED EMBEDDINGS |
5235 HARDWARE IMPLEMENTATION OF ITERATIVE PROJECTION-AGGREGATION DECODING OF REED-MULLER CODES Marzieh Hashemipour-Nazari, Kees Goossens, Alexios Balatsoukas-Stimming 5235 | HARDWARE IMPLEMENTATION OF ITERATIVE PROJECTION-AGGREGATION DECODING OF REED-MULLER CODES |
1871 HAVE YOU MADE A DECISION? WHERE? A PILOT STUDY ON INTERPRETABILITY OF POLARITY ANALYSIS BASED ON ADVISING PROBLEM Tianda LI, Jia-Chen Gu, Hui Liu, Quan Liu, Zhen-hua LIng, Zhiming Su, Xiaodan Zhu 1871 | HAVE YOU MADE A DECISION? WHERE? A PILOT STUDY ON INTERPRETABILITY OF POLARITY ANALYSIS BASED ON ADVISING PROBLEM |
4661 HCAG: A HIERARCHICAL CONTEXT-AWARE GRAPH ATTENTION MODEL FOR DEPRESSION DETECTION Meng Niu, Kai Chen, Qingcai Chen, Lufeng Yang 4661 | HCAG: A HIERARCHICAL CONTEXT-AWARE GRAPH ATTENTION MODEL FOR DEPRESSION DETECTION |
3892 HCGM-NET: A DEEP UNFOLDING NETWORK FOR FINANCIAL INDEX TRACKING RUBEN PAUWELS, EVAGGELIA TSILIGIANNI, NIKOS DELIGIANNIS 3892 | HCGM-NET: A DEEP UNFOLDING NETWORK FOR FINANCIAL INDEX TRACKING |
3587 HEAD-SYNCHRONOUS DECODING FOR TRANSFORMER-BASED STREAMING ASR Mohan Li, Catalin Zorila, Rama Doddipatla 3587 | HEAD-SYNCHRONOUS DECODING FOR TRANSFORMER-BASED STREAMING ASR |
2462 HEBBNET: A SIMPLIFIED HEBBIAN LEARNING FRAMEWORK TO DO BIOLOGICALLY PLAUSIBLE LEARNING Manas Gupta, Arulmurugan Ambikapathi, Savitha Ramasamy 2462 | HEBBNET: A SIMPLIFIED HEBBIAN LEARNING FRAMEWORK TO DO BIOLOGICALLY PLAUSIBLE LEARNING |
4543 Heterogeneous Two-Stream Network with Hierarchical Feature Prefusion for Multispectral Pan-Sharpening Dong Wang, Yunpeng Bai, Bendu Bai, Chanyue Wu, Ying Li 4543 | Heterogeneous Two-Stream Network with Hierarchical Feature Prefusion for Multispectral Pan-Sharpening |
1229 HFGCNet: High-frequency Graph Reasoning for Finer Semantic Image Segmentation Zitang Sun, Ruojing Wang, Zhengbo Luo 1229 | HFGCNet: High-frequency Graph Reasoning for Finer Semantic Image Segmentation |
1404 H-GPR: A HYBRID STRATEGY FOR LARGE-SCALE GAUSSIAN PROCESS REGRESSION Naiqi Li, Yinghua Gao, Wenjie Li, Yong Jiang, Shu-Tao Xia 1404 | H-GPR: A HYBRID STRATEGY FOR LARGE-SCALE GAUSSIAN PROCESS REGRESSION |
1582 HIDDEN MARKOV MODEL DIARISATION WITH SPEAKER LOCATION INFORMATION Jeremy Heng Meng Wong, Xiong Xiao, Yifan Gong 1582 | HIDDEN MARKOV MODEL DIARISATION WITH SPEAKER LOCATION INFORMATION |
4612 Hide Chopin in the Music: Efficient Image Concealing via Random Shuffling Zhun Sun, Chao Li, Qibin Zhao 4612 | Hide Chopin in the Music: Efficient Image Concealing via Random Shuffling |
1090 Hierarchical Attention Fusion for Geo-Localization Dongfang Liu, Yiming Cui, Liqi Yan, yingjie chen 1090 | Hierarchical Attention Fusion for Geo-Localization |
1587 HIERARCHICAL ATTENTION-BASED TEMPORAL CONVOLUTIONAL NETWORKS FOR EEG-BASED EMOTION RECOGNITION Chao Li, Boyang Chen, Ziping Zhao, Nicholas Cummins, Björn Schuller 1587 | HIERARCHICAL ATTENTION-BASED TEMPORAL CONVOLUTIONAL NETWORKS FOR EEG-BASED EMOTION RECOGNITION |
4892 HIERARCHICAL BIT-WISE DIFFERENTIAL CODING (HBDC) OF POINT CLOUD ATTRIBUTES Yan Huang, Bin Wang, C.-C. Jay Kuo, Hui Yuan, Jingliang Peng 4892 | HIERARCHICAL BIT-WISE DIFFERENTIAL CODING (HBDC) OF POINT CLOUD ATTRIBUTES |
4148 HIERARCHICAL CODED ELASTIC COMPUTING Shahrzad Kianidehkordi, Tharindu Adikari, Stark Draper 4148 | HIERARCHICAL CODED ELASTIC COMPUTING |
1955 HIERARCHICAL CONTEXT GUIDED AGGREGATION NETWORK FOR STEREO MATCHING Jun Peng, Wangduo Xie, Zijing Huang, Wei Chen, Yong Zhao 1955 | HIERARCHICAL CONTEXT GUIDED AGGREGATION NETWORK FOR STEREO MATCHING |
2862 HIERARCHICAL NETWORK BASED ON THE FUSION OF STATIC AND DYNAMIC FEATURES FOR SPEECH EMOTION RECOGNITION Qi Cao, Mixiao Hou, Bingzhi Chen, Zheng Zhang, Guangming Lu 2862 | HIERARCHICAL NETWORK BASED ON THE FUSION OF STATIC AND DYNAMIC FEATURES FOR SPEECH EMOTION RECOGNITION |
4173 HIERARCHICAL POSE CLASSIFICATION FOR INFANT ACTION ANALYSIS AND MENTAL DEVELOPMENT ASSESSMENT Zhongyu Jiang, Jianxiong Zhou, Jang-Hee Yoo, Jenq-Neng Hwang 4173 | HIERARCHICAL POSE CLASSIFICATION FOR INFANT ACTION ANALYSIS AND MENTAL DEVELOPMENT ASSESSMENT |
3059 HIERARCHICAL RECURRENT NEURAL NETWORK FOR HANDWRITTEN STROKES CLASSIFICATION Illya Degtyarenko, Ivan Deriuga, Andrii Grygoriev, Serhii Polotskyi, Volodymyr Melnyk, Dmytro Zakharchuk, Olga Radyvonenko 3059 | HIERARCHICAL RECURRENT NEURAL NETWORK FOR HANDWRITTEN STROKES CLASSIFICATION |
5150 Hierarchical Refined Attention For Scene Text Recognition Min Zhang, Meng Ma, Ping Wang 5150 | Hierarchical Refined Attention For Scene Text Recognition |
3686 HIERARCHICAL SIMILARITY LEARNING FOR LANGUAGE-BASED PRODUCT IMAGE RETRIEVAL Zhe Ma, Fenghao Liu, Jianfeng Dong, Xiaoye Qu, Yuan He, Shouling Ji 3686 | HIERARCHICAL SIMILARITY LEARNING FOR LANGUAGE-BASED PRODUCT IMAGE RETRIEVAL |
3476 HIERARCHICAL SPEAKER-AWARE SEQUENCE-TO-SEQUENCE MODEL FOR DIALOGUE SUMMARIZATION Yuejie Lei, Yuanmeng Yan, Zhiyuan Zeng, Keqing He, Ximing Zhang, Weiran Xu 3476 | HIERARCHICAL SPEAKER-AWARE SEQUENCE-TO-SEQUENCE MODEL FOR DIALOGUE SUMMARIZATION |
4933 HIERARCHICAL TRANSFORMER-BASED LARGE-CONTEXT END-TO-END ASR WITH LARGE-CONTEXT KNOWLEDGE DISTILLATION Ryo Masumura, Naoki Makishima, Mana Ihori, Akihiko Takashima, Tomohiro Tanaka, Shota Orihashi 4933 | HIERARCHICAL TRANSFORMER-BASED LARGE-CONTEXT END-TO-END ASR WITH LARGE-CONTEXT KNOWLEDGE DISTILLATION |
1533 HIGCNN: HIERARCHICAL INTERLEAVED GROUP CONVOLUTIONAL NEURAL NETWORKS FOR POINT CLOUDS ANALYSIS Jisheng Dang, Jun Yang 1533 | HIGCNN: HIERARCHICAL INTERLEAVED GROUP CONVOLUTIONAL NEURAL NETWORKS FOR POINT CLOUDS ANALYSIS |
2652 HIGH ACCURACY TRACKING FOR OUTDOOR TARGETS USING MASSIVE MIMO Xiaolu Zeng, Feng Zhang, Beibei Wang, K. J. Ray Liu 2652 | HIGH ACCURACY TRACKING FOR OUTDOOR TARGETS USING MASSIVE MIMO |
3126 High Fidelity Speech Regeneration with Application to Speech Enhancement Adam Polyak, Lior Wolf, Yossi Adi, Ori Kabeli, Yaniv Taigman 3126 | High Fidelity Speech Regeneration with Application to Speech Enhancement |
2870 HIGH-FREQUENCY ADVERSARIAL DEFENSE FOR SPEECH AND AUDIO Raphael Olivier, Bhiksha Raj, Muhammad Shah 2870 | HIGH-FREQUENCY ADVERSARIAL DEFENSE FOR SPEECH AND AUDIO |
4700 HIGH-INTELLIGIBILITY SPEECH SYNTHESIS FOR DYSARTHRIC SPEAKERS WITH LPCNET-BASED TTS AND CYCLEVAE-BASED VC Keisuke Matsubara, Takuma Okamoto, Ryoichi Takashima, Tetsuya Takiguchi, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai 4700 | HIGH-INTELLIGIBILITY SPEECH SYNTHESIS FOR DYSARTHRIC SPEAKERS WITH LPCNET-BASED TTS AND CYCLEVAE-BASED VC |
1610 Highly Efficient Protection of Biometric Face Samples with Selective JPEG2000 Encryption Heinz Hofbauer, Yoanna Martínez-Díaz, Simon Kirchgasser, Heydi Méndez-Vázquez, Andreas Uhl 1610 | Highly Efficient Protection of Biometric Face Samples with Selective JPEG2000 Encryption |
4298 HIGH-THROUGHPUT VLSI ARCHITECTURE FOR SOFT-DECISION DECODING WITH ORBGRAND Syed Mohsin Abbas, Thibaud Tonnellier, Furkan Ercan, Marwan Jalaleddine, Warren Gross 4298 | HIGH-THROUGHPUT VLSI ARCHITECTURE FOR SOFT-DECISION DECODING WITH ORBGRAND |
2936 HISTORY UTTERANCE EMBEDDING TRANSFORMER LM FOR SPEECH RECOGNITION Keqi Deng, Gaofeng Cheng, Haoran Miao, Pengyuan Zhang, Yonghong Yan 2936 | HISTORY UTTERANCE EMBEDDING TRANSFORMER LM FOR SPEECH RECOGNITION |
4382 HOCA: HIGHER-ORDER CHANNEL ATTENTION FOR SINGLE IMAGE SUPER-RESOLUTION Yalei Lv, Tao Dai, Bin Chen, Jian Lu, Shu-Tao Xia 4382 | HOCA: HIGHER-ORDER CHANNEL ATTENTION FOR SINGLE IMAGE SUPER-RESOLUTION |
1475 HOW CONVOLUTIONAL NEURAL NETWORKS DEAL WITH ALIASING Antônio H. Ribeiro, Thomas B. Schön 1475 | HOW CONVOLUTIONAL NEURAL NETWORKS DEAL WITH ALIASING |
3840 How Phonotactics Affect Multilingual and Zero-shot ASR Performance Siyuan Feng, Piotr Żelasko, Laureano Moro-Velázquez, Ali Abavisani, Mark Hasegawa-Johnson, Odette Scharenborg, Najim Dehak 3840 | How Phonotactics Affect Multilingual and Zero-shot ASR Performance |
2831 How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers? Shuhei Kato, Yusuke Yasuda, Xin Wang, Erica Cooper, Junichi Yamagishi 2831 | How Similar or Different Is Rakugo Speech Synthesizer to Professional Performers? |
3291 HOW TO MAKE TEXT-TO-SPEECH SYSTEM PRONOUNCE “VOLDEMORT”: AN EXPERIMENTAL APPROACH OF FOREIGN WORD PHONEMIZATION IN VIETNAMESE Dang-Khoa Mac, Van-Huy Nguyen, Dinh-Nghi Nguyen, Kim-Anh Nguyen 3291 | HOW TO MAKE TEXT-TO-SPEECH SYSTEM PRONOUNCE “VOLDEMORT”: AN EXPERIMENTAL APPROACH OF FOREIGN WORD PHONEMIZATION IN VIETNAMESE |
5262 How to Use Time Information Effectively? Combining with Time Shift Module for Lipreading Mingfeng Hao, Mutallip Mamut, Nurbiya Yadikar, Alimjan Aysa, Kurban Ubul 5262 | How to Use Time Information Effectively? Combining with Time Shift Module for Lipreading |
1439 HSAN: A HIERARCHICAL SELF-ATTENTION NETWORK FOR MULTI-TURN DIALOGUE GENERATION Yawei Kong, Lu Zhang, Can Ma, Cong Cao 1439 | HSAN: A HIERARCHICAL SELF-ATTENTION NETWORK FOR MULTI-TURN DIALOGUE GENERATION |
1387 HUBERT: HOW MUCH CAN A BAD TEACHER BENEFIT ASR PRE-TRAINING? Wei-Ning Hsu, Yao-Hung Hubert Tsai, Benjamin Bolte, Ruslan Salakhutdinov, Abdelrahman Mohamed 1387 | HUBERT: HOW MUCH CAN A BAD TEACHER BENEFIT ASR PRE-TRAINING? |
1681 HUMANACGAN: CONDITIONAL GENERATIVE ADVERSARIAL NETWORK WITH HUMAN-BASED AUXILIARY CLASSIFIER AND ITS EVALUATION IN PHONEME PERCEPTION Yota Ueda, Kazuki Fujii, Yuki Saito, Shinnosuke Takamichi, Yukino Baba, Hiroshi Saruwatari 1681 | HUMANACGAN: CONDITIONAL GENERATIVE ADVERSARIAL NETWORK WITH HUMAN-BASED AUXILIARY CLASSIFIER AND ITS EVALUATION IN PHONEME PERCEPTION |
2443 HUMAN-AWARE COARSE-TO-FINE ONLINE ACTION DETECTION Zichen Yang, Di Huang, Jie Qin, Yunhong Wang 2443 | HUMAN-AWARE COARSE-TO-FINE ONLINE ACTION DETECTION |
5460 Human-centered Favorite Music Classification Using EEG-based Individual Music Preference via Deep Time-series CCA Ryosuke Sawata, Takahiro Ogawa, Miki Haseyama 5460 | Human-centered Favorite Music Classification Using EEG-based Individual Music Preference via Deep Time-series CCA |
2696 Human-Expert-Level Brain Tumor Detection Using Deep Learning with Data Distillation and Augmentation Diyuan Lu, Nenad Polomac, Iskra Gacheva, Elke Hattingen, Jochen Triesch 2696 | Human-Expert-Level Brain Tumor Detection Using Deep Learning with Data Distillation and Augmentation |
1718 HVS-BASED PERCEPTUAL COLOR COMPRESSION OF IMAGE DATA Lee Prangnell, Victor Sanchez 1718 | HVS-BASED PERCEPTUAL COLOR COMPRESSION OF IMAGE DATA |
1239 HYBRID ANALOG-DIGITAL MIMO RADAR RECEIVERS WITH BIT-LIMITED ADCS Feng Xi, Nir Shlezinger, Yonina C. Eldar 1239 | HYBRID ANALOG-DIGITAL MIMO RADAR RECEIVERS WITH BIT-LIMITED ADCS |
1567 HYBRID BEAMFORMING FOR WIDEBAND OFDM DUAL FUNCTION RADAR COMMUNICATIONS Ziyang Cheng, Jinyang He, Shengnan Shi, Zishu He, Bin Liao 1567 | HYBRID BEAMFORMING FOR WIDEBAND OFDM DUAL FUNCTION RADAR COMMUNICATIONS |
1654 HYPERPARAMETER TUNING FOR THE CONTEXTUAL BANDIT Djallel Bouneffouf, Emmanuelle Claeys 1654 | HYPERPARAMETER TUNING FOR THE CONTEXTUAL BANDIT |
4439 HYPERSPECTRAL IMAGE SUPER-RESOLUTION VIA ADJACENT SPECTRAL FUSION STRATEGY Qiang Li, Qi Wang, Xuelong Li 4439 | HYPERSPECTRAL IMAGE SUPER-RESOLUTION VIA ADJACENT SPECTRAL FUSION STRATEGY |
1327 HYPOTHESIS STITCHER FOR END-TO-END SPEAKER-ATTRIBUTED ASR ON LONG-FORM MULTI-TALKER RECORDINGS Xuankai Chang, Naoyuki Kanda, Yashesh Gaur, Xiaofei Wang, Zhong Meng, Takuya Yoshioka 1327 | HYPOTHESIS STITCHER FOR END-TO-END SPEAKER-ATTRIBUTED ASR ON LONG-FORM MULTI-TALKER RECORDINGS |
3740 ICA WITH ORTHOGONALITY CONSTRAINT: IDENTIFIABILITY AND A NEW EFFICIENT ALGORITHM Benjamin Gabrielson, Mohammad Akhonda, Zois Boukouvalas, Seung Jun Kim, Tülay Adali 3740 | ICA WITH ORTHOGONALITY CONSTRAINT: IDENTIFIABILITY AND A NEW EFFICIENT ALGORITHM |
4929 ICASSP 2021 ACOUSTIC ECHO CANCELLATION CHALLENGE: DATASETS, TESTING FRAMEWORK, AND RESULTS Kusha Sridhar, Ross Cutler, Ando Saabas, Tanel Parnamaa, Markus Loide, Hannes Gamper, Sebastian Braun, Robert Aichner, Sriram Srinivasan 4929 | ICASSP 2021 ACOUSTIC ECHO CANCELLATION CHALLENGE: DATASETS, TESTING FRAMEWORK, AND RESULTS |
4536 ICASSP 2021 ACOUSTIC ECHO CANCELLATION CHALLENGE: INTEGRATED ADAPTIVE ECHO CANCELLATION WITH TIME ALIGNMENT AND DEEP LEARNING-BASED RESIDUAL ECHO PLUS NOISE SUPPRESSION Renhua Peng, Linjuan Cheng, Chengshi Zheng, Xiaodong Li 4536 | ICASSP 2021 ACOUSTIC ECHO CANCELLATION CHALLENGE: INTEGRATED ADAPTIVE ECHO CANCELLATION WITH TIME ALIGNMENT AND DEEP LEARNING-BASED RESIDUAL ECHO PLUS NOISE SUPPRESSION |
4147 ICASSP 2021 DEEP NOISE SUPPRESSION CHALLENGE Chandan Karadagur Ananda Reddy, Harishchandra Dubey, Vishak Gopal, Ross Cutler, Sebastian Braun, Hannes Gamper, Robert Aichner, Sriram Srinivasan 4147 | ICASSP 2021 DEEP NOISE SUPPRESSION CHALLENGE |
2859 ICASSP 2021 DEEP NOISE SUPPRESSION CHALLENGE: DECOUPLING MAGNITUDE AND PHASE OPTIMIZATION WITH A TWO-STAGE DEEP NETWORK Andong Li, Wenzhe Liu, Xiaoxue Luo, Chengshi Zheng, Xiaodong Li 2859 | ICASSP 2021 DEEP NOISE SUPPRESSION CHALLENGE: DECOUPLING MAGNITUDE AND PHASE OPTIMIZATION WITH A TWO-STAGE DEEP NETWORK |
2673 ICI-AWARE PARAMETER ESTIMATION FOR MIMO-OFDM RADAR VIA APES SPATIAL FILTERING Musa Furkan Keskin, Henk Wymeersch, Visa Koivunen 2673 | ICI-AWARE PARAMETER ESTIMATION FOR MIMO-OFDM RADAR VIA APES SPATIAL FILTERING |
2048 IDENTIFICATION OF DEEP BREATH WHILE MOVING FORWARD BASED ON MULTIPLE BODY REGIONS AND GRAPH SIGNAL ANALYSIS Yunlu Wang, Cheng Yang, Menghan Hu, Jian Zhang, Qingli Li, Guangtao Zhai, Xiao-Ping Zhang 2048 | IDENTIFICATION OF DEEP BREATH WHILE MOVING FORWARD BASED ON MULTIPLE BODY REGIONS AND GRAPH SIGNAL ANALYSIS |
4334 IDENTIFICATION OF UTERINE CONTRACTIONS BY AN ENSEMBLE OF GAUSSIAN PROCESSES Liu Yang, Cassandra Heiselman, J. Gerald Quirk, Petar M. Djurić 4334 | IDENTIFICATION OF UTERINE CONTRACTIONS BY AN ENSEMBLE OF GAUSSIAN PROCESSES |
5069 IDENTIFYING FIRST-ORDER LOWPASS GRAPH SIGNALS USING PERRON FROBENIUS THEOREM Yiran HE, Hoi-To WAI 5069 | IDENTIFYING FIRST-ORDER LOWPASS GRAPH SIGNALS USING PERRON FROBENIUS THEOREM |
1956 IDENTIFYING SPAMMERS TO BOOST CROWDSOURCED CLASSIFICATION Panagiotis Traganitis, Georgios B. Giannakis 1956 | IDENTIFYING SPAMMERS TO BOOST CROWDSOURCED CLASSIFICATION |
3195 IMAGE CODING FOR MACHINES: AN END-TO-END LEARNED APPROACH Nam Le, Honglei Zhang, Francesco Cricri, Ramin Ghaznavi Youvalari, Esa Rahtu 3195 | IMAGE CODING FOR MACHINES: AN END-TO-END LEARNED APPROACH |
4233 IMAGE CODING WITH NEURAL NETWORK-BASED COLORIZATION Diogo Lopes, João Ascenso, Catarina Brites, Fernando Pereira 4233 | IMAGE CODING WITH NEURAL NETWORK-BASED COLORIZATION |
3137 Image Denoising Based on Correlation Adaptive Sparse Modeling Hangfan Liu, Jian Zhang 3137 | Image Denoising Based on Correlation Adaptive Sparse Modeling |
2896 IMAGE GENERATION BASED ON TEXTURE GUIDED VAE-AGAN FOR REGIONS OF INTEREST DETECTION IN REMOTE SENSING IMAGES Libao Zhang, Yanan Liu 2896 | IMAGE GENERATION BASED ON TEXTURE GUIDED VAE-AGAN FOR REGIONS OF INTEREST DETECTION IN REMOTE SENSING IMAGES |
2181 IMAGE STEGANOGRAPHY BASED ON ITERATIVE ADVERSARIAL PERTURBATIONS ONTO A SYNCHRONIZED-DIRECTIONS SUB-IMAGE Xinghong Qin, Shunquan Tan, Weixuan Tang, Bin Li, Jiwu Huang 2181 | IMAGE STEGANOGRAPHY BASED ON ITERATIVE ADVERSARIAL PERTURBATIONS ONTO A SYNCHRONIZED-DIRECTIONS SUB-IMAGE |
4812 IMAGE SUPER-RESOLUTION USING MULTI-RESOLUTION ATTENTION NETWORK Anqi Liu, Sumei Li 4812 | IMAGE SUPER-RESOLUTION USING MULTI-RESOLUTION ATTENTION NETWORK |
4553 IMAGE-ASSISTED TRANSFORMER IN ZERO-RESOURCE MULTI-MODAL TRANSLATION Ping Huang, Shiliang Sun, Hao Yang 4553 | IMAGE-ASSISTED TRANSFORMER IN ZERO-RESOURCE MULTI-MODAL TRANSLATION |
2530 Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance Keisuke Imoto, Sakiko Mishima, Yumi Arai, Reishi Kondo 2530 | Impact of Sound Duration and Inactive Frames on Sound Event Detection Performance |
5177 Impact of speaking rate on the source filter Interaction in speech: a study Tilak Purohit, Achuth Rao M V, Prasanta Kumar Ghosh 5177 | Impact of speaking rate on the source filter Interaction in speech: a study |
2653 Implicit HRTF Modeling Using Temporal Convolutional Networks Israel D Gebru, Dejan Markovic, Alexander Richard, Steven Krenn, Gladstone Butler, Fernando De la Torre, Yaser Sheikh 2653 | Implicit HRTF Modeling Using Temporal Convolutional Networks |
3833 Improved Atomic Norm Based Channel Estimation for Time-varying Narrowband Leaked Channels Jianxiu Li, Urbashi Mitra 3833 | Improved Atomic Norm Based Channel Estimation for Time-varying Narrowband Leaked Channels |
5590 IMPROVED COVARIANCE MATRIX ESTIMATION WITH AN APPLICATION IN PORTFOLIO OPTIMIZATION Samruddhi Deshmukh, Amartansh Dubey 5590 | IMPROVED COVARIANCE MATRIX ESTIMATION WITH AN APPLICATION IN PORTFOLIO OPTIMIZATION |
4438 IMPROVED DATA SELECTION FOR DOMAIN ADAPTATION IN ASR Shannon Wotherspoon, William Hartmann, Matthew Snover, Owen Kimball 4438 | IMPROVED DATA SELECTION FOR DOMAIN ADAPTATION IN ASR |
2910 IMPROVED INTRA MODE CODING BEYOND AV1 Yize Jin, Liang Zhao, Xin Zhao, Shan Liu, Alan Bovik 2910 | IMPROVED INTRA MODE CODING BEYOND AV1 |
3669 IMPROVED MASK-CTC FOR NON-AUTOREGRESSIVE END-TO-END ASR Yosuke Higuchi, Hirofumi Inaguma, Shinji Watanabe, Tetsuji Ogawa, Tetsunori Kobayashi 3669 | IMPROVED MASK-CTC FOR NON-AUTOREGRESSIVE END-TO-END ASR |
3655 IMPROVED NEURAL LANGUAGE MODEL FUSION FOR STREAMING RECURRENT NEURAL NETWORK TRANSDUCER Suyoun Kim, Yuan Shangguan, Jay Mahadeokar, Antoine Bruguier, Christian Fuegen, Michael Seltzer, Duc Le 3655 | IMPROVED NEURAL LANGUAGE MODEL FUSION FOR STREAMING RECURRENT NEURAL NETWORK TRANSDUCER |
4906 Improved Probabilistic Context-Free Grammars for Passwords Using Word Extraction Haibo Cheng, Wenting Li, Ping Wang, Kaitai Liang 4906 | Improved Probabilistic Context-Free Grammars for Passwords Using Word Extraction |
2469 IMPROVED ROBUSTNESS TO DISFLUENCIES IN RNN-TRANSDUCER BASED SPEECH RECOGNITION Valentin Mendelev, Tina Raissi, Guglielmo Camporese, Manuel Giollo 2469 | IMPROVED ROBUSTNESS TO DISFLUENCIES IN RNN-TRANSDUCER BASED SPEECH RECOGNITION |
4980 IMPROVED STEP-SIZE SCHEDULES FOR NOISY GRADIENT METHODS Sarit Khirirat, Xiaoyu Wang, Sindri Magnússon, Mikael Johansson 4980 | IMPROVED STEP-SIZE SCHEDULES FOR NOISY GRADIENT METHODS |
2840 IMPROVED SUPERVISED TRAINING OF PHYSICS-GUIDED DEEP LEARNING IMAGE RECONSTRUCTION WITH MULTI-MASKING Burhaneddin Yaman, Seyed Amir Hossein Hosseini, Steen Moeller, Mehmet Akcakaya 2840 | IMPROVED SUPERVISED TRAINING OF PHYSICS-GUIDED DEEP LEARNING IMAGE RECONSTRUCTION WITH MULTI-MASKING |
4529 IMPROVEMENTS TO PROSODIC ALIGNMENT FOR AUTOMATIC DUBBING Yogesh Virkar, Marcello Federico, Robert Enyedi, Roberto Barra-Chicote 4529 | IMPROVEMENTS TO PROSODIC ALIGNMENT FOR AUTOMATIC DUBBING |
3786 IMPROVING AUDIO ANOMALIES RECOGNITION USING TEMPORAL CONVOLUTIONAL ATTENTION NETWORK Qiang Huang, Thomas Hain 3786 | IMPROVING AUDIO ANOMALIES RECOGNITION USING TEMPORAL CONVOLUTIONAL ATTENTION NETWORK |
4293 IMPROVING AUTOMATIC DRUM TRANSCRIPTION USING LARGE-SCALE AUDIO-TO-MIDI ALIGNED DATA I-CHIEH WEI, Chih-Wei Wu, Li Su 4293 | IMPROVING AUTOMATIC DRUM TRANSCRIPTION USING LARGE-SCALE AUDIO-TO-MIDI ALIGNED DATA |
3171 Improving Cross-domain Slot Filling with Common Syntactic Structure Luchen Liu, Xixun Lin, Peng Zhang, Bin Wang 3171 | Improving Cross-domain Slot Filling with Common Syntactic Structure |
4428 IMPROVING DEEP LEARNING SOUND EVENTS CLASSIFIERS USING GRAM MATRIX FEATURE-WISE CORRELATIONS Antonio Joia Neto, Andre Pacheco, Diogo Carbonera Luvizon 4428 | IMPROVING DEEP LEARNING SOUND EVENTS CLASSIFIERS USING GRAM MATRIX FEATURE-WISE CORRELATIONS |
4775 IMPROVING DIALOGUE RESPONSE GENERATION VIA KNOWLEDGE GRAPH FILTER Yanmeng Wang, Ye Wang, Xingyu Lou, Wenge Rong, Zhenghong Hao, Shaojun Wang 4775 | IMPROVING DIALOGUE RESPONSE GENERATION VIA KNOWLEDGE GRAPH FILTER |
4363 IMPROVING ENTITY RECALL IN AUTOMATIC SPEECH RECOGNITION WITH NEURAL EMBEDDINGS Christopher Li, Pat Rondon, Diamantino Caseiro, Leonid Velikovich, Xavier Velez, Petar Aleksic 4363 | IMPROVING ENTITY RECALL IN AUTOMATIC SPEECH RECOGNITION WITH NEURAL EMBEDDINGS |
3414 IMPROVING EVENT DETECTION BY EXPLOITING LABEL HIERARCHY Xiangyu Xi, Wei Ye, Tong Zhang, Quanxiu Wang, Shikun Zhang, Huixing Jiang, Wei Wu 3414 | IMPROVING EVENT DETECTION BY EXPLOITING LABEL HIERARCHY |
2127 IMPROVING IDENTIFICATION OF SYSTEM-DIRECTED SPEECH UTTERANCES BY DEEP LEARNING OF ASR-BASED WORD EMBEDDINGS AND CONFIDENCE METRICS Vilayphone Vilaysouk, Amr Nour-Eldin, Dermot Connolly 2127 | IMPROVING IDENTIFICATION OF SYSTEM-DIRECTED SPEECH UTTERANCES BY DEEP LEARNING OF ASR-BASED WORD EMBEDDINGS AND CONFIDENCE METRICS |
4425 IMPROVING INTRAOPERATIVE LIVER REGISTRATION IN IMAGE-GUIDED SURGERY WITH LEARNING-BASED RECONSTRUCTION Meng Jia, Matthew Kyan 4425 | IMPROVING INTRAOPERATIVE LIVER REGISTRATION IN IMAGE-GUIDED SURGERY WITH LEARNING-BASED RECONSTRUCTION |
3101 Improving memory banks for unsupervised learning with large mini-batch, consistency and hard negative mining Adrian Bulat, Enrique Sanchez-Lozano, Georgios Tzimiropoulos 3101 | Improving memory banks for unsupervised learning with large mini-batch, consistency and hard negative mining |
3731 IMPROVING MULTIMODAL SPEECH ENHANCEMENT BY INCORPORATING SELF-SUPERVISED AND CURRICULUM LEARNING Ying Cheng, Mengyu He, Jiashuo Yu, Rui Feng 3731 | IMPROVING MULTIMODAL SPEECH ENHANCEMENT BY INCORPORATING SELF-SUPERVISED AND CURRICULUM LEARNING |
3058 IMPROVING NATURALNESS AND CONTROLLABILITY OF SEQUENCE-TO-SEQUENCE SPEECH SYNTHESIS BY LEARNING LOCAL PROSODY REPRESENTATIONS Cheng Gong, Longbiao Wang, Zhenhua Ling, Shaotong Guo, Ju Zhang, Jianwu Dang 3058 | IMPROVING NATURALNESS AND CONTROLLABILITY OF SEQUENCE-TO-SEQUENCE SPEECH SYNTHESIS BY LEARNING LOCAL PROSODY REPRESENTATIONS |
1686 IMPROVING NER IN SOCIAL MEDIA VIA ENTITY TYPE-COMPATIBLE UNKNOWN WORD SUBSTITUTION Jian Xie, Kai Zhang, Lin Sun, Yindu Su, Chenxiang Xu 1686 | IMPROVING NER IN SOCIAL MEDIA VIA ENTITY TYPE-COMPATIBLE UNKNOWN WORD SUBSTITUTION |
3021 IMPROVING NEURAL TEXT NORMALIZATION WITH PARTIAL PARAMETER GENERATOR AND POINTER-GENERATOR NETWORK Weiwei Jiang, Junjie Li, Minchuan Chen, Jun Ma, Shaojun Wang, Jing Xiao 3021 | IMPROVING NEURAL TEXT NORMALIZATION WITH PARTIAL PARAMETER GENERATOR AND POINTER-GENERATOR NETWORK |
3240 IMPROVING PRONUNCIATION ASSESSMENT VIA ORDINAL REGRESSION WITH ANCHORED REFERENCE SAMPLES Bin Su, Shaoguang Mao, Frank Soong, Yan Xia, Jonathan Tien, Zhiyong Wu 3240 | IMPROVING PRONUNCIATION ASSESSMENT VIA ORDINAL REGRESSION WITH ANCHORED REFERENCE SAMPLES |
2513 IMPROVING PROSODY MODELLING WITH CROSS-UTTERANCE BERT EMBEDDINGS FOR END-TO-END SPEECH SYNTHESIS Guanghui Xu, Wei Song, Zhengchen Zhang, Chao Zhang, Xiaodong He, Bowen Zhou 2513 | IMPROVING PROSODY MODELLING WITH CROSS-UTTERANCE BERT EMBEDDINGS FOR END-TO-END SPEECH SYNTHESIS |
4807 IMPROVING RECONSTRUCTION LOSS BASED SPEAKER EMBEDDING IN UNSUPERVISED AND SEMI-SUPERVISED SCENARIOS Jaejin Cho, Piotr Zelasko, Jesus Villalba, Najim Dehak 4807 | IMPROVING RECONSTRUCTION LOSS BASED SPEAKER EMBEDDING IN UNSUPERVISED AND SEMI-SUPERVISED SCENARIOS |
2612 IMPROVING RNN TRANSDUCER MODELING FOR SMALL-FOOTPRINT KEYWORD SPOTTING Yao Tian, Haitao Yao, Meng Cai, Yaming Liu, Zejun Ma 2612 | IMPROVING RNN TRANSDUCER MODELING FOR SMALL-FOOTPRINT KEYWORD SPOTTING |
2776 IMPROVING RNN TRANSDUCER WITH TARGET SPEAKER EXTRACTION AND NEURAL UNCERTAINTY ESTIMATION Jiatong Shi, Chunlei Zhang, Chao Weng, Shinji Watanabe, Meng Yu, Dong Yu 2776 | IMPROVING RNN TRANSDUCER WITH TARGET SPEAKER EXTRACTION AND NEURAL UNCERTAINTY ESTIMATION |
3576 IMPROVING SOUND EVENT DETECTION METRICS: INSIGHTS FROM DCASE 2020 Giacomo Ferroni, Nicolas Turpault, Juan Azcarreta, Francesco Tuveri, Romain Serizel, Cagdas Bilen, Sacha Krstulovic 3576 | IMPROVING SOUND EVENT DETECTION METRICS: INSIGHTS FROM DCASE 2020 |
4620 IMPROVING SPEAKER VERIFICATION IN REVERBERANT ENVIRONMENTS Xiao Chen, Stephen Zahorian 4620 | IMPROVING SPEAKER VERIFICATION IN REVERBERANT ENVIRONMENTS |
1541 Improving Stability of Adversarial Li-ion Cell Usage Data Generation using Generative Latent Space Modelling Subhankar Chattoraj, Sawon Pratiher, Souvik Pratiher, Hubert Konik 1541 | Improving Stability of Adversarial Li-ion Cell Usage Data Generation using Generative Latent Space Modelling |
2785 IMPROVING STREAMING AUTOMATIC SPEECH RECOGNITION WITH NON-STREAMING MODEL DISTILLATION ON UNSUPERVISED DATA Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao 2785 | IMPROVING STREAMING AUTOMATIC SPEECH RECOGNITION WITH NON-STREAMING MODEL DISTILLATION ON UNSUPERVISED DATA |
3926 IMPROVING THE CLASSIFICATION OF RARE CHORDS WITH UNLABELED DATA Marcelo Bortolozzo, Rodrigo Schramm, Claudio R. Jung 3926 | IMPROVING THE CLASSIFICATION OF RARE CHORDS WITH UNLABELED DATA |
3714 IMPROVING THE ENERGY-EFFICIENCY OF A KALMAN FILTER USING UNRELIABLE MEMORIES Jonathan Kern, Elsa Dupraz, Abdeldjalil Aïssa-El-Bey, François Leduc-Primeau 3714 | IMPROVING THE ENERGY-EFFICIENCY OF A KALMAN FILTER USING UNRELIABLE MEMORIES |
5622 Improving the Harmony of the Composite Image by Spatial-Separated Attention Module Xiaodong Cun, Chi-Man Pun 5622 | Improving the Harmony of the Composite Image by Spatial-Separated Attention Module |
5193 Improving the Robustness of Right Whale Detection in Noisy Conditions using Denoising Autoencoders and Augmented Training William Vickers, Ben Milner, Robert Lee 5193 | Improving the Robustness of Right Whale Detection in Noisy Conditions using Denoising Autoencoders and Augmented Training |
4403 IMPROVING ULTRASOUND TONGUE CONTOUR EXTRACTION USING U-NET AND SHAPE CONSISTENCY-BASED REGULARIZER Ming Feng, Yin Wang, Kele Xu, Huaimin Wang, Bo Ding 4403 | IMPROVING ULTRASOUND TONGUE CONTOUR EXTRACTION USING U-NET AND SHAPE CONSISTENCY-BASED REGULARIZER |
3161 IMRNET: AN ITERATIVE MOTION COMPENSATION AND RESIDUAL RECONSTRUCTION NETWORK FOR VIDEO COMPRESSED SENSING Xin Yang, Chunling Yang 3161 | IMRNET: AN ITERATIVE MOTION COMPENSATION AND RESIDUAL RECONSTRUCTION NETWORK FOR VIDEO COMPRESSED SENSING |
1245 IN SITU CALIBRATION OF CROSS-SENSITIVE SENSORS IN MOBILE SENSOR ARRAYS USING FAST INFORMED NON-NEGATIVE MATRIX FACTORIZATION Olivier Vu thanh, Matthieu Puigt, Farouk Yahaya, Gilles Delmaire, Gilles Roussel 1245 | IN SITU CALIBRATION OF CROSS-SENSITIVE SENSORS IN MOBILE SENSOR ARRAYS USING FAST INFORMED NON-NEGATIVE MATRIX FACTORIZATION |
1125 In-bed Pressure-based Pose Estimation using Image Space Representation Learning Vandad Davoodnia, Saeed Ghorbani, Ali Etemad 1125 | In-bed Pressure-based Pose Estimation using Image Space Representation Learning |
5191 INCOMPLETE MULTI-VIEW SUBSPACE CLUSTERING WITH LOW-RANK TENSOR Jianlun Liu, Shaohua Teng, Wei Zhang, Xiaozhao Fang, Lunke Fei, Zhuxiu Zhang 5191 | INCOMPLETE MULTI-VIEW SUBSPACE CLUSTERING WITH LOW-RANK TENSOR |
1709 Incorporate Maximum Mean Discrepancy in Recurrent Latent Space for Sequential Generative Model Yuchi Zhang, Yongliang Wang, Yang Dong 1709 | Incorporate Maximum Mean Discrepancy in Recurrent Latent Space for Sequential Generative Model |
1813 INCORPORATING MULTIMODAL INFORMATION IN WORD REPRESENTATIONS USING GRAPH CONVOLUTIONAL NETWORKS Wenhao Zhu, Shuang Liu, Chaoming Liu 1813 | INCORPORATING MULTIMODAL INFORMATION IN WORD REPRESENTATIONS USING GRAPH CONVOLUTIONAL NETWORKS |
1726 INCORPORATING UNCERTAINTY IN DATA LABELING INTO DETECTION OF BRAIN INTERICTAL EPILEPTIFORM DISCHARGES FROM EEG USING WEIGHTED OPTIMIZATION Bahman Abdi-Sargezeh, Antonio Valentin, Gonzalo Alarcon, Saeid Sanei 1726 | INCORPORATING UNCERTAINTY IN DATA LABELING INTO DETECTION OF BRAIN INTERICTAL EPILEPTIFORM DISCHARGES FROM EEG USING WEIGHTED OPTIMIZATION |
3776 INDEPENDENT SIGN LANGUAGE RECOGNITION WITH 3D BODY, HANDS, AND FACE RECONSTRUCTION Agelos Kratimenos, Georgios Pavlakos, Petros Maragos 3776 | INDEPENDENT SIGN LANGUAGE RECOGNITION WITH 3D BODY, HANDS, AND FACE RECONSTRUCTION |
2018 INDEPENDENT VECTOR ANALYSIS USING SEMI-PARAMETRIC DENSITY ESTIMATION VIA MULTIVARIATE ENTROPY MAXIMIZATION Lucas Damasceno, Charles Cavalcante, Tülay Adali, Zois Boukouvalas 2018 | INDEPENDENT VECTOR ANALYSIS USING SEMI-PARAMETRIC DENSITY ESTIMATION VIA MULTIVARIATE ENTROPY MAXIMIZATION |
2032 Inertial Proximal Deep Learning Alternating Minimization for Efficient Neutral Network Training Linbo Qiao, Tao Sun, Hengyue Pan, Dongsheng Li 2032 | Inertial Proximal Deep Learning Alternating Minimization for Efficient Neutral Network Training |
4925 INFERRING HIGH-RESOLUTIONAL URBAN FLOW WITH INTERNET OF MOBILE THINGS Fan Zhou, Xin Jing, Liang Li, Ting Zhong 4925 | INFERRING HIGH-RESOLUTIONAL URBAN FLOW WITH INTERNET OF MOBILE THINGS |
2735 Information and Regularization in Restricted Boltzmann Machines Matias Vera, Leonardo Rey Vega, Pablo Piantanida 2735 | Information and Regularization in Restricted Boltzmann Machines |
2925 INFORMATION DECODING AND SDR IMPLEMENTATION OF DFRC SYSTEMS WITHOUT TRAINING SIGNALS Daniel Wong, Batu Chalise, Justin Metcalf, Moeness Amin 2925 | INFORMATION DECODING AND SDR IMPLEMENTATION OF DFRC SYSTEMS WITHOUT TRAINING SIGNALS |
2049 INJECTING WORD INFORMATION WITH MULTI-LEVEL WORD ADAPTER FOR CHINESE SPOKEN LANGUAGE UNDERSTANDING Dechuan Teng, Libo Qin, Wanxiang Che, Sendong Zhao, Ting Liu 2049 | INJECTING WORD INFORMATION WITH MULTI-LEVEL WORD ADAPTER FOR CHINESE SPOKEN LANGUAGE UNDERSTANDING |
1934 Instance segmentation with the number of clusters incorporated in embedding learning Jianfeng Cao, Hong Yan 1934 | Instance segmentation with the number of clusters incorporated in embedding learning |
4141 INSTRUMENT CLASSIFICATION OF SOLO SHEET MUSIC IMAGES Kevin Ji, Daniel Yang, TJ Tsai 4141 | INSTRUMENT CLASSIFICATION OF SOLO SHEET MUSIC IMAGES |
4026 Integer Carrier Frequency Offset Estimation In OFDM with Zadoff-Chu Sequences John Roth, David Garren, Clark Robertson 4026 | Integer Carrier Frequency Offset Estimation In OFDM with Zadoff-Chu Sequences |
1635 INTEGRATED CLASSIFICATION AND LOCALIZATION OF TARGETS USING BAYESIAN FRAMEWORK IN AUTOMOTIVE RADARS Anand Dubey, Jonas Fuchs, Maximilian Luebke, Robert Weigel, Fabian Lurz, Avik Santra 1635 | INTEGRATED CLASSIFICATION AND LOCALIZATION OF TARGETS USING BAYESIAN FRAMEWORK IN AUTOMOTIVE RADARS |
4169 INTEGRATED GRAD-CAM: SENSITIVITY-AWARE VISUAL EXPLANATION OF DEEP CONVOLUTIONAL NETWORKS VIA INTEGRATED GRADIENT-BASED SCORING Sam Sattarzadeh, Mahesh Sudhakar, Konstantinos N. Plataniotis, Jongseong Jang, Yeonjeong Jeong, Hyunwoo Kim 4169 | INTEGRATED GRAD-CAM: SENSITIVITY-AWARE VISUAL EXPLANATION OF DEEP CONVOLUTIONAL NETWORKS VIA INTEGRATED GRADIENT-BASED SCORING |
3817 INTEGRATING DEEP LEARNING WITH FIRST-ORDER LOGIC PROGRAMMED CONSTRAINTS FOR ZERO-DAY PHISHING ATTACK DETECTION Seok-Jun Bu, Sung-Bae Cho 3817 | INTEGRATING DEEP LEARNING WITH FIRST-ORDER LOGIC PROGRAMMED CONSTRAINTS FOR ZERO-DAY PHISHING ATTACK DETECTION |
3671 Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds Keisuke Kinoshita, Marc Delcroix, Naohiro Tawara 3671 | Integrating end-to-end neural and clustering-based diarization: Getting the best of both worlds |
3558 INTEGRATING SUBGRAPH-AWARE RELATION AND DIRECTION REASONING FOR QUESTION ANSWERING Xu Wang, Shuai Zhao, Bo Cheng, Jiale Han, Yingting Li, Hao Yang, Ivan SekulicGuoshun Nan 3558 | INTEGRATING SUBGRAPH-AWARE RELATION AND DIRECTION REASONING FOR QUESTION ANSWERING |
2071 INTERFERENCE ANALYSIS IN RECONFIGURABLE INTELLIGENT SURFACE-ASSISTED MULTIPLE-INPUT MULTIPLE-OUTPUT SYSTEMS Jiang Liu, Xuewen Qian, Marco Di Renzo 2071 | INTERFERENCE ANALYSIS IN RECONFIGURABLE INTELLIGENT SURFACE-ASSISTED MULTIPLE-INPUT MULTIPLE-OUTPUT SYSTEMS |
5115 INTERMEDIATE LOSS REGULARIZATION FOR CTC-BASED SPEECH RECOGNITION Jaesong Lee, Shinji Watanabe 5115 | INTERMEDIATE LOSS REGULARIZATION FOR CTC-BASED SPEECH RECOGNITION |
1878 INTERNAL LANGUAGE MODEL TRAINING FOR DOMAIN-ADAPTIVE END-TO-END SPEECH RECOGNITION Zhong Meng, Naoyuki Kanda, Yashesh Gaur, Sarangarajan Parthasarathy, Eric Sun, Liang Lu, Xie Chen, Jinyu Li, Yifan Gong 1878 | INTERNAL LANGUAGE MODEL TRAINING FOR DOMAIN-ADAPTIVE END-TO-END SPEECH RECOGNITION |
4001 INTERPOLATION OF IRREGULARLY SAMPLED FREQUENCY RESPONSE FUNCTIONS USING CONVOLUTIONAL NEURAL NETWORKS Matteo Acerbi, Raffaele Malvermi, Mirco Pezzoli, Fabio Antonacci, Augusto Sarti, Roberto Corradi 4001 | INTERPOLATION OF IRREGULARLY SAMPLED FREQUENCY RESPONSE FUNCTIONS USING CONVOLUTIONAL NEURAL NETWORKS |
4279 Interpreting glottal flow dynamics for detecting COVID-19 from voice Soham Deshmukh, Mahmoud Al Ismail, Rita Singh 4279 | Interpreting glottal flow dynamics for detecting COVID-19 from voice |
1389 Introducing Deep Reinforcement Learning to NLU Ranking Tasks Ge Yu, Chengwei Su, Emre Barut 1389 | Introducing Deep Reinforcement Learning to NLU Ranking Tasks |
4195 INVARIANT RISK MINIMIZATION-BASED TREATMENT EFFECT ESTIMATION Abhin Shah, Kartik Ahuja, Karthikeyan Shanmugam, Dennis Wei, Kush Varshney, Amit Dhurandhar 4195 | INVARIANT RISK MINIMIZATION-BASED TREATMENT EFFECT ESTIMATION |
3308 Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Zeyu Xie, Kai Yu 3308 | Investigating Local and Global Information for Automated Audio Captioning with Transfer Learning |
5311 INVESTIGATING THE EFFICACY OF MUSIC VERSION RETRIEVAL SYSTEMS FOR SETLIST IDENTIFICATION Furkan Yesiler, Emilio Molina, Joan Serrà, Emilia Gómez 5311 | INVESTIGATING THE EFFICACY OF MUSIC VERSION RETRIEVAL SYSTEMS FOR SETLIST IDENTIFICATION |
2921 INVESTIGATION OF FAST AND EFFICIENT METHODS FOR MULTI-SPEAKER MODELING AND SPEAKER ADAPTATION Yibin Zheng, Xinhui Li, Li Lu 2921 | INVESTIGATION OF FAST AND EFFICIENT METHODS FOR MULTI-SPEAKER MODELING AND SPEAKER ADAPTATION |
1030 ITERATIVE GEOMETRY CALIBRATION FROM DISTANCE ESTIMATES FOR WIRELESS ACOUSTIC SENSOR NETWORKS Tobias Gburrek, Joerg Schmalenstroeer, Reinhold Haeb-Umbach 1030 | ITERATIVE GEOMETRY CALIBRATION FROM DISTANCE ESTIMATES FOR WIRELESS ACOUSTIC SENSOR NETWORKS |
1420 ITERATIVE REWEIGHTED ALGORITHMS FOR JOINT USER IDENTIFICATION AND CHANNEL ESTIMATION IN SPATIALLY CORRELATED MASSIVE MTC Hamza Djelouat, Markus Leinonen, Markku Juntti 1420 | ITERATIVE REWEIGHTED ALGORITHMS FOR JOINT USER IDENTIFICATION AND CHANNEL ESTIMATION IN SPATIALLY CORRELATED MASSIVE MTC |
4835 JAMMING STRATEGY GENERATION FOR HIDDEN COMMUNICATION MODES VIA GRAPH CONVOLUTION NETWORKS Fanxiang Kong, Qiang Li, Huaizong Shao 4835 | JAMMING STRATEGY GENERATION FOR HIDDEN COMMUNICATION MODES VIA GRAPH CONVOLUTION NETWORKS |
4706 Joint Alignment Learning-Attention based Model for Grapheme-to-Phoneme Conversion Yonghe Wang, Feilong Bao, Hui Zhang, Guanglai Gao 4706 | Joint Alignment Learning-Attention based Model for Grapheme-to-Phoneme Conversion |
5615 JOINT AMPLITUDE AND PHASE REFINEMENT FOR MONAURAL SOURCE SEPARATION Yoshiki Masuyama, Kohei Yatabe, Kento Nagatomo, Yasuhiro Oikawa 5615 | JOINT AMPLITUDE AND PHASE REFINEMENT FOR MONAURAL SOURCE SEPARATION |
3170 JOINT ASR AND LANGUAGE IDENTIFICATION USING RNN-T: AN EFFICIENT APPROACH TO DYNAMIC LANGUAGE SWITCHING Surabhi Punjabi, Harish Arsikere, Zeynab Raeesy, Chander Chandak, Nikhil Bhave, Ankish Bansal, Markus Muller, Sergio Murillo, Ariya Rastrow, Andreas Stolcke, Jasha Droppo, Sri Garimella, Roland Maas, Mat Hans, Athanasios Mouchtaris, Siegfried Kunzmann 3170 | JOINT ASR AND LANGUAGE IDENTIFICATION USING RNN-T: AN EFFICIENT APPROACH TO DYNAMIC LANGUAGE SWITCHING |
3217 JOINT COMMUNICATIONS WITH FH-MIMO RADAR SYSTEMS : AN EXTENDED SIGNALING STRATEGY Xiangrong Wang, Jing Xu, Aboulnasr Hassanien, Elias Aboutanios 3217 | JOINT COMMUNICATIONS WITH FH-MIMO RADAR SYSTEMS : AN EXTENDED SIGNALING STRATEGY |
3265 JOINT COUPLED TRANSFORM LEARNING FRAMEWORK FOR MULTIMODAL IMAGE SUPER-RESOLUTION Andrew Gigie, Achanna Anil Kumar, Angshul Majumdar, Kriti Kumar, M Girish Chandra 3265 | JOINT COUPLED TRANSFORM LEARNING FRAMEWORK FOR MULTIMODAL IMAGE SUPER-RESOLUTION |
1933 JOINT DEREVERBERATION AND SEPARATION WITH ITERATIVE SOURCE STEERING Taishi Nakashima, Robin Scheibler, Masahito Togami, Nobutaka Ono 1933 | JOINT DEREVERBERATION AND SEPARATION WITH ITERATIVE SOURCE STEERING |
5609 Joint DOD and DOA Estimation in Slow-Time MIMO Radar via PARAFAC Decomposition Feng Xu, Sergiy Vorobyov, Xiaopeng Yang 5609 | Joint DOD and DOA Estimation in Slow-Time MIMO Radar via PARAFAC Decomposition |
1972 JOINT INTENT DETECTION AND SLOT FILLING BASED ON CONTINUAL LEARNING MODEL YANFEI HUI, Jianzong Wang, Ning Cheng, Fengying Yu, Tianbo Wu, Jing Xiao 1972 | JOINT INTENT DETECTION AND SLOT FILLING BASED ON CONTINUAL LEARNING MODEL |
1896 JOINT LEARNING OF IMAGE AESTHETIC QUALITY ASSESSMENT AND SEMANTIC RECOGNITION BASED ON FEATURE ENHANCEMENT Xiangfei Liu, Xiushan Nie, Zhen Shen, Yilong Yin 1896 | JOINT LEARNING OF IMAGE AESTHETIC QUALITY ASSESSMENT AND SEMANTIC RECOGNITION BASED ON FEATURE ENHANCEMENT |
2332 JOINT LOCALIZATION AND PREDICTIVE BEAMFORMING IN VEHICULAR NETWORKS: POWER ALLOCATION BEYOND WATER-FILLING Fan Liu, Christos Masouros 2332 | JOINT LOCALIZATION AND PREDICTIVE BEAMFORMING IN VEHICULAR NETWORKS: POWER ALLOCATION BEYOND WATER-FILLING |
4384 JOINT MASKED CPC AND CTC TRAINING FOR ASR Chaitanya Talnikar, Tatiana Likhomanenko, Ronan Collobert, Gabriel Synnaeve 4384 | JOINT MASKED CPC AND CTC TRAINING FOR ASR |
1034 JOINT MAXIMUM LIKELIHOOD ESTIMATION OF POWER SPECTRAL DENSITIES AND RELATIVE ACOUSTIC TRANSFER FUNCTIONS FOR ACOUSTIC BEAMFORMING Poul Hoang, Jesper Jensen, Zheng-Hua Tan, Jan Mark de Han 1034 | JOINT MAXIMUM LIKELIHOOD ESTIMATION OF POWER SPECTRAL DENSITIES AND RELATIVE ACOUSTIC TRANSFER FUNCTIONS FOR ACOUSTIC BEAMFORMING |
3345 JOINT MULTI-PITCH DETECTION AND SCORE TRANSCRIPTION FOR POLYPHONIC PIANO MUSIC Lele Liu, Veronica Morfi, Emmanouil Benetos 3345 | JOINT MULTI-PITCH DETECTION AND SCORE TRANSCRIPTION FOR POLYPHONIC PIANO MUSIC |
2009 JOINT OPTIMIZATION FOR FULL-DUPLEX CELLULAR COMMUNICATIONS VIA INTELLIGENT REFLECTING SURFACE Zhangjie Peng, Cunhua Pan, Zhenkun Zhang, Xianzhe Chen, Li Li, A. Lee Swindlehurst 2009 | JOINT OPTIMIZATION FOR FULL-DUPLEX CELLULAR COMMUNICATIONS VIA INTELLIGENT REFLECTING SURFACE |
3709 JOINT OPTIMIZATION OF SPECTRALLY CO-EXISTING MULTI-CARRIER RADAR AND COMMUNICATION SYSTEMS IN CLUTTERED ENVIRONMENTS Fangzhou Wang, Hongbin Li, Braham Himed 3709 | JOINT OPTIMIZATION OF SPECTRALLY CO-EXISTING MULTI-CARRIER RADAR AND COMMUNICATION SYSTEMS IN CLUTTERED ENVIRONMENTS |
2541 JOINT REINFORCEMENT LEARNING AND GAME THEORY BITRATE CONTROL METHOD FOR 360-DEGREE DYNAMIC ADAPTIVE STREAMING Xuekai WEI, Mingliang Zhou, Sam Kwong, Hui Yuan, Tao Xiang 2541 | JOINT REINFORCEMENT LEARNING AND GAME THEORY BITRATE CONTROL METHOD FOR 360-DEGREE DYNAMIC ADAPTIVE STREAMING |
2525 JOINTLY TRAINED TRANSFORMERS MODELS FOR SPOKEN LANGUAGE TRANSLATION Hari Krishna Vydana, Martin Karafiat, Katerina Zmolikova, Lukas Burget, Honza Cernocky 2525 | JOINTLY TRAINED TRANSFORMERS MODELS FOR SPOKEN LANGUAGE TRANSLATION |
3415 KALMAN FILTER BASED MIMO CSI PHASE RECOVERY FOR COTS WIFI DEVICES Chu Li, Jeremy Brauer, Aydin Sezgin, Christian Zenger 3415 | KALMAN FILTER BASED MIMO CSI PHASE RECOVERY FOR COTS WIFI DEVICES |
3375 Kalman Optimizer for Consistent Gradient Descent Xingyi Yang 3375 | Kalman Optimizer for Consistent Gradient Descent |
2257 KALMANNET: DATA-DRIVEN KALMAN FILTERING Guy Revach, Nir Shlezinger, Ruud J. G. van Sloun, Yonina C. Eldar 2257 | KALMANNET: DATA-DRIVEN KALMAN FILTERING |
3486 KAN: KNOWLEDGE-AUGMENTED NETWORKS FOR FEW-SHOT LEARNING Zeyang Zhu, Xin Lin 3486 | KAN: KNOWLEDGE-AUGMENTED NETWORKS FOR FEW-SHOT LEARNING |
4508 KARAOKE KEY RECOMMENDATION VIA PERSONALIZED COMPETENCE-BASED RATING PREDICTION Yuan Wang, Shigeki Tanaka, Keita Yokoyama, Hsin-Tai Wu, Yi Fang 4508 | KARAOKE KEY RECOMMENDATION VIA PERSONALIZED COMPETENCE-BASED RATING PREDICTION |
5374 KERNEL LEARNING WITH TENSOR NETWORKS Kriton Konstantinidis, Shengxi Li, Danilo Mandic 5374 | KERNEL LEARNING WITH TENSOR NETWORKS |
2038 KERNEL ORTHOGONAL NONNEGATIVE MATRIX FACTORIZATION: APPLICATION TO MULTISPECTRAL DOCUMENT IMAGE DECOMPOSITION Abderrahmane Rahiche, Mohamed Cheriet 2038 | KERNEL ORTHOGONAL NONNEGATIVE MATRIX FACTORIZATION: APPLICATION TO MULTISPECTRAL DOCUMENT IMAGE DECOMPOSITION |
4106 KERNEL REGRESSION ON GRAPHS IN RANDOM FOURIER FEATURES SPACE Vitor Elias, Vinay Gogineni, Wallace Martins, Stefan Werner 4106 | KERNEL REGRESSION ON GRAPHS IN RANDOM FOURIER FEATURES SPACE |
3627 KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING Rami Mowakeaa, Seung-Jun Kim, Darren Emge 3627 | KERNEL-BASED LIFELONG POLICY GRADIENT REINFORCEMENT LEARNING |
2600 KERNEL-INTERPOLATION-BASED FILTERED-X LEAST MEAN SQUARE FOR SPATIAL ACTIVE NOISE CONTROL IN TIME DOMAIN Jesper Brunnström, Shoichi Koyama 2600 | KERNEL-INTERPOLATION-BASED FILTERED-X LEAST MEAN SQUARE FOR SPATIAL ACTIVE NOISE CONTROL IN TIME DOMAIN |
2713 KLD MINIMIZATION-BASED CONSTRAINED MEASUREMENT FILTERING FOR TWO-STEP TDOA INDOOR TRACKING Rui Huang, Le Yang, Jun Tao, Yanbo Xue 2713 | KLD MINIMIZATION-BASED CONSTRAINED MEASUREMENT FILTERING FOR TWO-STEP TDOA INDOOR TRACKING |
2132 KNOWLEDGE DISTILLATION FOR IMPROVED ACCURACY IN SPOKEN QUESTION ANSWERING Chenyu You, Nuo Chen, Yuexian Zou 2132 | KNOWLEDGE DISTILLATION FOR IMPROVED ACCURACY IN SPOKEN QUESTION ANSWERING |
2399 KNOWLEDGE REASONING FOR SEMANTIC SEGMENTATION Shengjia Chen, Zhixin Li, Xiwei Yang 2399 | KNOWLEDGE REASONING FOR SEMANTIC SEGMENTATION |
2675 KNOWLEDGE TRANSFER FOR EFFICIENT ON-DEVICE FALSE TRIGGER MITIGATION Pranay Dighe, Erik Marchi, Srikanth Vishnubhotla, Sachin Kajarekar, Devang Naik 2675 | KNOWLEDGE TRANSFER FOR EFFICIENT ON-DEVICE FALSE TRIGGER MITIGATION |
2508 KNOWLEDGE-BASED CHAT DETECTION WITH FALSE MENTION DISCRIMINATION Wei Liu, Peijie Huang, Dongzhu Liang, Zihao Zhou 2508 | KNOWLEDGE-BASED CHAT DETECTION WITH FALSE MENTION DISCRIMINATION |
1357 Label-aware Text Representation for Multi-label Text Classification Hao Guo, Xiangyang Li, Lei Zhang, Jia Liu, Wei Chen 1357 | Label-aware Text Representation for Multi-label Text Classification |
1330 LABEL-GUIDED DICTIONARY PAIR LEARNING FOR ECG BIOMETRIC RECOGNITION Mingzhu Ma, Gongping Yang, Kuikui Wang, Yuwen Huang, Yilong Yin 1330 | LABEL-GUIDED DICTIONARY PAIR LEARNING FOR ECG BIOMETRIC RECOGNITION |
2809 LANGUAGE MODEL IS ALL YOU NEED: NATURAL LANGUAGE UNDERSTANDING AS QUESTION ANSWERING Mahdi Namazifar, Alexandros Papangelis, Gokhan Tur, Dilek Hakkani-Tur 2809 | LANGUAGE MODEL IS ALL YOU NEED: NATURAL LANGUAGE UNDERSTANDING AS QUESTION ANSWERING |
3378 Language-sensitive Music Emotion Recognition models: are we really there yet? Juan Sebastián Gómez-Cañón, Estefanía Cano, Ana Gabriela Pandrea, Perfecto Herrera, Emilia Gómez 3378 | Language-sensitive Music Emotion Recognition models: are we really there yet? |
4197 LAPLACIAN REGULARIZED TENSOR LOW-RANK MINIMIZATION FOR HYPERSPECTRAL SNAPSHOT COMPRESSIVE IMAGING Yi Yang, Fei Jiang, Hongtao Lu 4197 | LAPLACIAN REGULARIZED TENSOR LOW-RANK MINIMIZATION FOR HYPERSPECTRAL SNAPSHOT COMPRESSIVE IMAGING |
5612 Large Database Compression Based on Perceived Information Thomas Maugey, Laura Toni 5612 | Large Database Compression Based on Perceived Information |
2216 LARGE MARGIN TRAINING IMPROVES LANGUAGE MODELS FOR ASR Jilin Wang, Jiaji Huang, Kenneth Church 2216 | LARGE MARGIN TRAINING IMPROVES LANGUAGE MODELS FOR ASR |
4909 LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation Woosung Choi, Minseok Kim, Jaehwa Chung, Soonyoung Jung 4909 | LaSAFT: Latent Source Attentive Frequency Transformation for Conditioned Source Separation |
4307 LATENT SPACE MOTION ANALYSIS FOR COLLABORATIVE INTELLIGENCE Mateen Ulhaq, Ivan Bajic 4307 | LATENT SPACE MOTION ANALYSIS FOR COLLABORATIVE INTELLIGENCE |
4986 LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS Apoorv Vyas, Srikanth Madikeri, Hervé Bourlard 4986 | LATTICE-FREE MMI ADAPTATION OF SELF-SUPERVISED PRETRAINED ACOUSTIC MODELS |
3467 LAYER-WISE INTERPRETATION OF DEEP NEURAL NETWORKS USING IDENTITY INITIALIZATION Shohei Kubota, Hideaki Hayashi, Tomohiro Hayase, Seiichi Uchida 3467 | LAYER-WISE INTERPRETATION OF DEEP NEURAL NETWORKS USING IDENTITY INITIALIZATION |
4850 Leaky Integrator Dynamical Systems and Reachable Sets Brian Whiteaker, Peter Gerstoft 4850 | Leaky Integrator Dynamical Systems and Reachable Sets |
3537 LEARNED DECIMATION FOR NEURAL BELIEF PROPAGATION DECODERS Andreas Buchberger, Christian Häger, Henry D. Pfister, Laurent Schmalen, Alexandre Graell i Amat 3537 | LEARNED DECIMATION FOR NEURAL BELIEF PROPAGATION DECODERS |
1347 LEARNED TRANSFERABLE ARCHITECTURES CAN SURPASS HAND-DESIGNED ARCHITECTURES FOR LARGE SCALE SPEECH RECOGNITION Liqiang He, Dan Su, Dong Yu 1347 | LEARNED TRANSFERABLE ARCHITECTURES CAN SURPASS HAND-DESIGNED ARCHITECTURES FOR LARGE SCALE SPEECH RECOGNITION |
2019 Learning a Sparse Generative Non-Parametric Supervised Autoencoder Michel Barlaud, Frederic Guyard 2019 | Learning a Sparse Generative Non-Parametric Supervised Autoencoder |
1823 LEARNING A TREE OF NEURAL NETS Arman Zharmagambetov, Miguel Carreira-Perpiñán 1823 | LEARNING A TREE OF NEURAL NETS |
2229 Learning Audio Embeddings with User Listening Data for Content-based Music Recommendation Ke Chen, Beici Liang, Xiaoshuan Ma, Minwei Gu 2229 | Learning Audio Embeddings with User Listening Data for Content-based Music Recommendation |
1828 LEARNING AUDIO-VISUAL CORRELATIONS FROM VARIATIONAL CROSS-MODAL GENERATION Ye Zhu, Yu Wu, Hugo Latapie, Yi Yang, Yan Yan 1828 | LEARNING AUDIO-VISUAL CORRELATIONS FROM VARIATIONAL CROSS-MODAL GENERATION |
1366 LEARNING BINARY SEMANTIC EMBEDDING FOR BREAST HISTOLOGY IMAGE CLASSIFICATION AND RETRIEVAL Xiao Kang, Xingbo Liu, Xiushan Nie, Yilong Yin 1366 | LEARNING BINARY SEMANTIC EMBEDDING FOR BREAST HISTOLOGY IMAGE CLASSIFICATION AND RETRIEVAL |
2409 LEARNING BOLLOBÁS-RIORDAN GRAPHS UNDER PARTIAL OBSERVABILITY Michele Cirillo, Vincenzo Matta, Ali H. Sayed 2409 | LEARNING BOLLOBÁS-RIORDAN GRAPHS UNDER PARTIAL OBSERVABILITY |
4055 Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags Xavier Favory, Konstantinos Drossos, Tuomas Virtanen, Xavier Serra 4055 | Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags |
2330 LEARNING DISCRIMINATIVE FEATURES FOR SEMI-SUPERVISED ANOMALY DETECTION Zhe Feng, Jie Tang, Yishun Dou, Gangshan Wu 2330 | LEARNING DISCRIMINATIVE FEATURES FOR SEMI-SUPERVISED ANOMALY DETECTION |
3115 LEARNING DISENTANGLED FEATURE REPRESENTATIONS FOR SPEECH ENHANCEMENT VIA ADVERSARIAL TRAINING Nana Hou, Chenglin Xu, Eng Siong Chng, Haizhou Li 3115 | LEARNING DISENTANGLED FEATURE REPRESENTATIONS FOR SPEECH ENHANCEMENT VIA ADVERSARIAL TRAINING |
2792 LEARNING DISENTANGLED PHONE AND SPEAKER REPRESENTATIONS IN A SEMI-SUPERVISED VQ-VAE PARADIGM Jennifer Williams, Zhao Yi, Erica Cooper, Junichi Yamagishi 2792 | LEARNING DISENTANGLED PHONE AND SPEAKER REPRESENTATIONS IN A SEMI-SUPERVISED VQ-VAE PARADIGM |
3523 Learning double-compression video fingerprints left from social media platforms Irene Amerini, Aris Anagnostopoulos, Luca Maiano, Lorenzo Ricciardi Celsi 3523 | Learning double-compression video fingerprints left from social media platforms |
2590 LEARNING FROM HETEROGENEOUS EEG SIGNALS WITH DIFFERENTIABLE CHANNEL REORDERING Aaqib Saeed, David Grangier, Olivier Pietquin, Neil Zeghidour 2590 | LEARNING FROM HETEROGENEOUS EEG SIGNALS WITH DIFFERENTIABLE CHANNEL REORDERING |
3212 LEARNING INTEGRODIFFERENTIAL MODELS FOR IMAGE DENOISING Tobias Alt, Joachim Weickert 3212 | LEARNING INTEGRODIFFERENTIAL MODELS FOR IMAGE DENOISING |
3972 LEARNING MIXED MEMBERSHIP FROM ADJACENCY GRAPH VIA SYSTEMATIC EDGE QUERY: IDENTIFIABILITY AND ALGORITHM Shahana Ibrahim, Xiao Fu 3972 | LEARNING MIXED MEMBERSHIP FROM ADJACENCY GRAPH VIA SYSTEMATIC EDGE QUERY: IDENTIFIABILITY AND ALGORITHM |
5618 LEARNING MIXTURES OF SEPARABLE DICTIONARIES FOR TENSOR DATA: ANALYSIS AND ALGORITHMS Mohsen Ghassemi, Zahra Shakeri, Anand Sarwate, Waheed Bajwa 5618 | LEARNING MIXTURES OF SEPARABLE DICTIONARIES FOR TENSOR DATA: ANALYSIS AND ALGORITHMS |
3061 LEARNING MODEL-BLIND TEMPORAL DENOISERS WITHOUT GROUND TRUTHS Yanghao Li, Bichuan Guo, Jiangtao Wen, Zhen Xia, Shan Liu, Yuxing Han 3061 | LEARNING MODEL-BLIND TEMPORAL DENOISERS WITHOUT GROUND TRUTHS |
5204 LEARNING ON HETEROGENEOUS GRAPHS USING HIGH-ORDER RELATIONS See Hian Lee, Feng Ji, Wee Peng Tay 5204 | LEARNING ON HETEROGENEOUS GRAPHS USING HIGH-ORDER RELATIONS |
5117 LEARNING OPTIMAL LATTICE CODES FOR MIMO COMMUNICATIONS Laia Amorós, Mikko Pitkänen 5117 | LEARNING OPTIMAL LATTICE CODES FOR MIMO COMMUNICATIONS |
4208 LEARNING POSE-ADAPTIVE LIP SYNC WITH CASCADED TEMPORAL CONVOLUTIONAL NETWORK Ruobing Zheng, Bo Song, Changjiang Ji 4208 | LEARNING POSE-ADAPTIVE LIP SYNC WITH CASCADED TEMPORAL CONVOLUTIONAL NETWORK |
2466 LEARNING REPRESENTATION OF MULTI-SCALE OBJECT FOR FINE-GRAINED IMAGE RETRIEVAL Kangbo Sun, Jie Zhu 2466 | LEARNING REPRESENTATION OF MULTI-SCALE OBJECT FOR FINE-GRAINED IMAGE RETRIEVAL |
2765 LEARNING SEPARABLE TIME-FREQUENCY FILTERBANKS FOR AUDIO CLASSIFICATION Jie Pu, Yannis Panagakis, Maja Pantic 2765 | LEARNING SEPARABLE TIME-FREQUENCY FILTERBANKS FOR AUDIO CLASSIFICATION |
3960 LEARNING SPARSE GRAPH LAPLACIAN WITH K EIGENVECTOR PRIOR VIA ITERATIVE GLASSO AND PROJECTION Saghar Bagheri, Gene Cheung, Antonio Ortega, Fen Wang 3960 | LEARNING SPARSE GRAPH LAPLACIAN WITH K EIGENVECTOR PRIOR VIA ITERATIVE GLASSO AND PROJECTION |
1704 LEARNING SPARSIFYING TRANSFORMS FOR IMAGE RECONSTRUCTION IN ELECTRICAL IMPEDANCE TOMOGRAPHY Kaiyi Yang, Narong Borijindargoon, Boon Poh Ng, Saiprasad Ravishankar, Bihan Wen 1704 | LEARNING SPARSIFYING TRANSFORMS FOR IMAGE RECONSTRUCTION IN ELECTRICAL IMPEDANCE TOMOGRAPHY |
4251 LEARNING THE RELEVANT SUBSTRUCTURES FOR TASKS ON GRAPH DATA Lei Chen, Zhengdao Chen, Joan Bruna 4251 | LEARNING THE RELEVANT SUBSTRUCTURES FOR TASKS ON GRAPH DATA |
2141 Learning to Continuously Optimize Wireless Resource In Episodically Dynamic Environment Haoran Sun, Wenqiang Pu, Minghe Zhu, Xiao Fu, Tsung-Hui Chang, Mingyi Hong 2141 | Learning to Continuously Optimize Wireless Resource In Episodically Dynamic Environment |
2879 LEARNING TO ESTIMATE KERNEL SCALE AND ORIENTATION OF DEFOCUS BLUR WITH ASYMMETRIC CODED APERTURE Jisheng Li, Qi Dai, Jiangtao Wen 2879 | LEARNING TO ESTIMATE KERNEL SCALE AND ORIENTATION OF DEFOCUS BLUR WITH ASYMMETRIC CODED APERTURE |
2518 LEARNING TO SELECT CONTEXT IN A HIERARCHICAL AND GLOBAL PERSPECTIVE FOR OPEN-DOMAIN DIALOGUE GENERATION Lei Shen, Haolan Zhan, Xin Shen, Yang Feng 2518 | LEARNING TO SELECT CONTEXT IN A HIERARCHICAL AND GLOBAL PERSPECTIVE FOR OPEN-DOMAIN DIALOGUE GENERATION |
4289 LEARNING TO SELECT FOR MIMO RADAR BASED ON HYBRID ANALOG-DIGITAL BEAMFORMING Zhaoyi Xu, Fan Liu, Konstantinos Diamantaras, Christos Masouros, Athina Petropulu 4289 | LEARNING TO SELECT FOR MIMO RADAR BASED ON HYBRID ANALOG-DIGITAL BEAMFORMING |
3993 LEARNING WORD-LEVEL CONFIDENCE FOR SUBWORD END-TO-END ASR David Qiu, Qiujia Li, Yanzhang He, Yu Zhang, Bo Li, Liangliang Cao, Rohit Prabhavalkar, Deepti Bhatia, Wei Li, Ke Hu, Tara Sainath, Ian McGraw 3993 | LEARNING WORD-LEVEL CONFIDENCE FOR SUBWORD END-TO-END ASR |
3767 LEARNING-BASED LOSSLESS COMPRESSION OF 3D POINT CLOUD GEOMETRY Dat Thanh Nguyen, Maurice Quach, Giuseppe Valenzise, Pierre Duhamel 3767 | LEARNING-BASED LOSSLESS COMPRESSION OF 3D POINT CLOUD GEOMETRY |
2519 LENGTH NO LONGER MATTERS: A REAL LENGTH ADAPTIVE ARRHYTHMIA CLASSIFICATION MODEL WITH MULTI-SCALE CONVOLUTION Chuanqi Han, Fang Yu, Peng Wang, Ruoran Huang, Xi Huang, Li Cui 2519 | LENGTH NO LONGER MATTERS: A REAL LENGTH ADAPTIVE ARRHYTHMIA CLASSIFICATION MODEL WITH MULTI-SCALE CONVOLUTION |
3008 LESS IS MORE: IMPROVED RNN-T DECODING USING LIMITED LABEL CONTEXT AND PATH MERGING Rohit Prabhavalkar, Yanzhang He, David Rybach, Sean Campbell, Arun Narayanan, Trevor Strohman, Tara N. Sainath 3008 | LESS IS MORE: IMPROVED RNN-T DECODING USING LIMITED LABEL CONTEXT AND PATH MERGING |
4145 LEVERAGING A MULTIPLE-STRAIN MODEL WITH MUTATIONS IN ANALYZING THE SPREAD OF COVID-19 Anirudh Sridhar, Osman Yagan, Rashad Eletreby, Simon Levin, Joshua Plotkin, H. Vincent Poor 4145 | LEVERAGING A MULTIPLE-STRAIN MODEL WITH MUTATIONS IN ANALYZING THE SPREAD OF COVID-19 |
3454 LEVERAGING ACOUSTIC AND LINGUISTIC EMBEDDINGS FROM PRETRAINED SPEECH AND LANGUAGE MODELS FOR INTENT CLASSIFICATION Bidisha Sharma, Maulik Madhavi, Haizhou Li 3454 | LEVERAGING ACOUSTIC AND LINGUISTIC EMBEDDINGS FROM PRETRAINED SPEECH AND LANGUAGE MODELS FOR INTENT CLASSIFICATION |
1629 LEVERAGING THE STRUCTURE OF MUSICAL PREFERENCE IN CONTENT-AWARE MUSIC RECOMMENDATION Paul Magron, Cédric Févotte 1629 | LEVERAGING THE STRUCTURE OF MUSICAL PREFERENCE IN CONTENT-AWARE MUSIC RECOMMENDATION |
1833 LIFI: TOWARDS LINGUISTICALLY INFORMED FRAME INTERPOLATION Aradhya Mathur, Yaman Kumar, Devansh Batra, Rajiv Ratn Shah, Roger Zimmermann, Amanda Stent 1833 | LIFI: TOWARDS LINGUISTICALLY INFORMED FRAME INTERPOLATION |
4074 LIGHT FIELD STYLE TRANSFER WITH LOCAL ANGULAR CONSISTENCY Donal Egan, Martin Alain, Aljosa Smolic 4074 | LIGHT FIELD STYLE TRANSFER WITH LOCAL ANGULAR CONSISTENCY |
5209 LIGHT-TTS: LIGHTWEIGHT MULTI-SPEAKER MULTI-LINGUAL TEXT-TO-SPEECH song li, beibei ouyang, lin li, qingyang hong 5209 | LIGHT-TTS: LIGHTWEIGHT MULTI-SPEAKER MULTI-LINGUAL TEXT-TO-SPEECH |
1300 LIGHTWEIGHT AND ACCURATE SINGLE IMAGE SUPER-RESOLUTION WITH CHANNEL SEGREGATION NETWORK Zhong-Han Niu, Xi-Peng Lin, An-Ni Yu, Yu-Bin Yang 1300 | LIGHTWEIGHT AND ACCURATE SINGLE IMAGE SUPER-RESOLUTION WITH CHANNEL SEGREGATION NETWORK |
4364 LIGHTWEIGHT AND FAST TEXT TO SPEECH WITH NEURAL ARCHITECTURE SEARCH Renqian Luo, Xu Tan, Rui Wang, Tao Qin, Jinzhu Li, Sheng Zhao, Enhong Chen, Tie-Yan Liu 4364 | LIGHTWEIGHT AND FAST TEXT TO SPEECH WITH NEURAL ARCHITECTURE SEARCH |
1644 LIGHTWEIGHT AND INTERPRETABLE NEURAL MODELING OF AN AUDIO DISTORTION EFFECT USING HYPERCONDITIONED DIFFERENTIABLE BIQUADS Shahan Nercessian, Andy Sarroff, Kurt James Werner 1644 | LIGHTWEIGHT AND INTERPRETABLE NEURAL MODELING OF AN AUDIO DISTORTION EFFECT USING HYPERCONDITIONED DIFFERENTIABLE BIQUADS |
5083 Lightweight Dual-task Networks for Crowd Counting in Aerial Images Ye Tian, Chengzhen Duan, Ruilin Zhang, Zhiwei Wei, Hongpeng Wang 5083 | Lightweight Dual-task Networks for Crowd Counting in Aerial Images |
1429 LIGHTWEIGHT HUMAN POSE ESTIMATION UNDER RESOURCE-LIMITED SCENES Zhe Zhang, Jie Tang, Gangshan Wu 1429 | LIGHTWEIGHT HUMAN POSE ESTIMATION UNDER RESOURCE-LIMITED SCENES |
2834 LIGHTWEIGHT NON-LOCAL NETWORK FOR IMAGE SUPER-RESOLUTION Risheng Wang, Wenzheng Zhou, Tao Lei, Qi Wang, Hongying Meng, Asoke K. Nandi 2834 | LIGHTWEIGHT NON-LOCAL NETWORK FOR IMAGE SUPER-RESOLUTION |
1596 LINEAR COMPUTATION CODING Ralf Müller, Bernhard Gäde, Ali Bereyhi 1596 | LINEAR COMPUTATION CODING |
2381 LINEAR MULTICHANNEL BLIND SOURCE SEPARATION BASED ON TIME-FREQUENCY MASK OBTAINED BY HARMONIC/PERCUSSIVE SOUND SEPARATION Soichiro Oyabu, Daichi Kitamura, Kohei Yatabe 2381 | LINEAR MULTICHANNEL BLIND SOURCE SEPARATION BASED ON TIME-FREQUENCY MASK OBTAINED BY HARMONIC/PERCUSSIVE SOUND SEPARATION |
4782 LITESING: TOWARDS FAST, LIGHTWEIGHT AND EXPRESSIVE SINGING VOICE SYNTHESIS Xiaobin Zhuang, Tao Jiang, Szu-yu chou, Bin Wu, Peng Hu, Simon Lui 4782 | LITESING: TOWARDS FAST, LIGHTWEIGHT AND EXPRESSIVE SINGING VOICE SYNTHESIS |
4164 LOCALLY OPTIMAL DETECTION OF STOCHASTIC TARGETED UNIVERSAL ADVERSARIAL PERTURBATIONS Amish Goel, Pierre Moulin 4164 | LOCALLY OPTIMAL DETECTION OF STOCHASTIC TARGETED UNIVERSAL ADVERSARIAL PERTURBATIONS |
3037 LONG-SHORT TEMPORAL MODELING FOR EFFICIENT ACTION RECOGNITION Liyu Wu, Yuexian Zou, Can Zhang 3037 | LONG-SHORT TEMPORAL MODELING FOR EFFICIENT ACTION RECOGNITION |
2148 LONG-TERM EFFECTIVENESS OF VOICE EMBEDDINGS FOR SPEAKER VERIFICATION Noah B. Murad, Daniel J. Liebling, Dirk Padfield, Bradley Green 2148 | LONG-TERM EFFECTIVENESS OF VOICE EMBEDDINGS FOR SPEAKER VERIFICATION |
3700 LOOKING THROUGH WALLS: INFERRING SCENES FROM VIDEO-SURVEILLANCE ENCRYPTED TRAFFIC Daniele Mari, Samuele Giuliano Piazzetta, Sara Bordin, Luca Pajola, Sebastiano Verde, Simone Milani, Mauro Conti 3700 | LOOKING THROUGH WALLS: INFERRING SCENES FROM VIDEO-SURVEILLANCE ENCRYPTED TRAFFIC |
3603 LOOPNET: MUSICAL LOOP SYNTHESIS CONDITIONED ON INTUITIVE MUSICAL PARAMETERS Pritish Chandna, Antonio Ramires, Xavier Serra, Emilia Gomez 3603 | LOOPNET: MUSICAL LOOP SYNTHESIS CONDITIONED ON INTUITIVE MUSICAL PARAMETERS |
5137 LOW COMPLEXITY SECURE P-TENSOR PRODUCT COMPRESSED SENSING RECONSTRUCTION OUTSOURCING AND IDENTITY AUTHENTICATION IN CLOUD Mengdi Wang, Di Xiao, Jia Liang 5137 | LOW COMPLEXITY SECURE P-TENSOR PRODUCT COMPRESSED SENSING RECONSTRUCTION OUTSOURCING AND IDENTITY AUTHENTICATION IN CLOUD |
3048 Low Complexity SLM for OFDMA System with Implicit Side Information Shicheng Hu, Miao Yang, Kai Kang, Hua Qian 3048 | Low Complexity SLM for OFDMA System with Implicit Side Information |
1586 LOW LATENCY ONLINE BLIND SOURCE SEPARATION BASED ON JOINT OPTIMIZATION WITH BLIND DEREVERBERATION Tetsuya Ueda, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki, Shoji Makino 1586 | LOW LATENCY ONLINE BLIND SOURCE SEPARATION BASED ON JOINT OPTIMIZATION WITH BLIND DEREVERBERATION |
3323 LOW MUTUAL COUPLING SPARSE ARRAY DESIGN USING ULA FITTING Wanlu Shi, Yingsong Li, Sergiy Vorobyov 3323 | LOW MUTUAL COUPLING SPARSE ARRAY DESIGN USING ULA FITTING |
3738 LOW RESOURCE AUDIO-TO-LYRICS ALIGNMENT FROM POLYPHONIC MUSIC RECORDINGS Emir Demirel, Sven Ahlbäck, Simon Dixon 3738 | LOW RESOURCE AUDIO-TO-LYRICS ALIGNMENT FROM POLYPHONIC MUSIC RECORDINGS |
5589 LOW-COMPLEXITY METHODS FOR ESTIMATION AFTER PARAMETER SELECTION Nadav Harel, Tirza Routtenberg 5589 | LOW-COMPLEXITY METHODS FOR ESTIMATION AFTER PARAMETER SELECTION |
1768 LOW-COMPLEXITY PARAMETER LEARNING FOR OTFS MODULATION BASED AUTOMOTIVE RADAR Chenwen Liu, Shengheng Liu, Zihuan Mao, Yongming Huang, Haiming Wang 1768 | LOW-COMPLEXITY PARAMETER LEARNING FOR OTFS MODULATION BASED AUTOMOTIVE RADAR |
3878 LOW-COMPLEXITY, REAL-TIME JOINT NEURAL ECHO CONTROL AND SPEECH ENHANCEMENT BASED ON PERCEPNET Jean-Marc Valin, Srikanth Tenneti, Karim Helwani, Umut Isik, Arvindh Krishnaswamy 3878 | LOW-COMPLEXITY, REAL-TIME JOINT NEURAL ECHO CONTROL AND SPEECH ENHANCEMENT BASED ON PERCEPNET |
4176 LOW-DIMENSIONAL DENOISING EMBEDDING TRANSFORMER FOR ECG CLASSIFICATION Jian Guan, Wenbo Wang, Pengming Feng, Xinxin Wang, Wenwu Wang 4176 | LOW-DIMENSIONAL DENOISING EMBEDDING TRANSFORMER FOR ECG CLASSIFICATION |
4904 LOW-LATENCY POLAR DECODER USING OVERLAPPED SCL PROCESSING Dongyun Kam, Byeong Yong Kong, Youngjoo Lee 4904 | LOW-LATENCY POLAR DECODER USING OVERLAPPED SCL PROCESSING |
5020 LOW-RANK AND SPARSE DECOMPOSITION FOR JOINT DOA ESTIMATION AND CONTAMINATED SENSORS DETECTION WITH SPARSELY CONTAMINATED ARRAYS Huiping Huang, Abdelhak Zoubir 5020 | LOW-RANK AND SPARSE DECOMPOSITION FOR JOINT DOA ESTIMATION AND CONTAMINATED SENSORS DETECTION WITH SPARSELY CONTAMINATED ARRAYS |
4406 Low-rank on Graphs plus Temporally Smooth Sparse Decomposition for Anomaly Detection in Spatiotemporal Data Seyyid Emre Sofuoglu, Selin Aviyente 4406 | Low-rank on Graphs plus Temporally Smooth Sparse Decomposition for Anomaly Detection in Spatiotemporal Data |
3233 LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH USING DATA AUGMENTATION Goeric Huybrechts, Thomas Merritt, Giulia Comini, Bartek Perz, Raahil Shah, Jaime Lorenzo-Trueba 3233 | LOW-RESOURCE EXPRESSIVE TEXT-TO-SPEECH USING DATA AUGMENTATION |
1082 L-RED: EFFICIENT POST-TRAINING DETECTION OF IMPERCEPTIBLE BACKDOOR ATTACKS WITHOUT ACCESS TO THE TRAINING SET Zhen Xiang, David Miller, George Kesidis 1082 | L-RED: EFFICIENT POST-TRAINING DETECTION OF IMPERCEPTIBLE BACKDOOR ATTACKS WITHOUT ACCESS TO THE TRAINING SET |
3524 LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION Weiquan Fan, Xiangmin Xu, Xiaofen Xing, Weidong Chen, Dongyan Huang 3524 | LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION |
1918 LTAF-NET: LEARNING TASK-AWARE ADAPTIVE FEATURES AND REFINING MASK FOR FEW-SHOT SEMANTIC SEGMENTATION Binjie Mao, Lingfeng Wang, Shiming Xiang, Chunhong Pan 1918 | LTAF-NET: LEARNING TASK-AWARE ADAPTIVE FEATURES AND REFINING MASK FOR FEW-SHOT SEMANTIC SEGMENTATION |
3858 LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation Zhen Zeng, Jianzong Wang, Ning Cheng, Jing Xiao 3858 | LVCNet: Efficient Condition-Dependent Modeling Network for Waveform Generation |
4507 MACHINE TRANSLATION VERBOSITY CONTROL FOR AUTOMATIC DUBBING Surafel Melaku Lakew, Marcello Federico, Yue Wang, Cuong Hoang, Yogesh Virkar, Roberto Barra-Chicote, Robert Enyedi 4507 | MACHINE TRANSLATION VERBOSITY CONTROL FOR AUTOMATIC DUBBING |
1278 m-Activity: ACCURATE AND REAL-TIME HUMAN ACTIVITY RECOGNITION VIA MILLIMETER WAVE RADAR Yuheng Wang, Haipeng Liu, Kening Cui, Anfu Zhou, Wensheng Li, Huadong Ma 1278 | m-Activity: ACCURATE AND REAL-TIME HUMAN ACTIVITY RECOGNITION VIA MILLIMETER WAVE RADAR |
1919 MAEC: Multi-instance learning with an Adversarial Auto-encoder-based Classifier for Speech Emotion Recognition Changzeng Fu, Chaoran Liu, Carlos Toshinori Ishi, Hiroshi Ishiguro 1919 | MAEC: Multi-instance learning with an Adversarial Auto-encoder-based Classifier for Speech Emotion Recognition |
4265 MAKF-SR: Multi-Agent Adaptive Kalman Filtering-based Successor Representations Mohammad Salimibeni, Parvin Malekzadeh, Arash Mohammadi, Petros Spachos, Konstantinos N. Plataniotis 4265 | MAKF-SR: Multi-Agent Adaptive Kalman Filtering-based Successor Representations |
3083 MAKING PUNCTUATION RESTORATION ROBUST AND FAST WITH MULTI-TASK LEARNING AND KNOWLEDGE DISTILLATION Michael Hentschel, Emiru Tsunoo, Takao Okuda 3083 | MAKING PUNCTUATION RESTORATION ROBUST AND FAST WITH MULTI-TASK LEARNING AND KNOWLEDGE DISTILLATION |
4578 MAPGN: MAsked Pointer-Generator Network for Sequence-to-Sequence Pre-training Mana Ihori, Naoki Makishima, Tomohiro Tanaka, Akihiko Takashima, Shota Orihashi, Ryo Masumura 4578 | MAPGN: MAsked Pointer-Generator Network for Sequence-to-Sequence Pre-training |
2931 MARBLENET: DEEP 1D TIME-CHANNEL SEPARABLE CONVOLUTIONAL NEURAL NETWORK FOR VOICE ACTIVITY DETECTION Fei Jia, Somshubra Majumdar, Boris Ginsburg 2931 | MARBLENET: DEEP 1D TIME-CHANNEL SEPARABLE CONVOLUTIONAL NEURAL NETWORK FOR VOICE ACTIVITY DETECTION |
5601 Mask Combination of Multi-Layer Graphs for Global Structure Inference Eda Bayram, Dorina Thanou, Elif Vural, Pascal Frossard 5601 | Mask Combination of Multi-Layer Graphs for Global Structure Inference |
1208 MASK4D: 4D CONVOLUTION NETWORK FOR LIGHT FIELD OCCLUSION REMOVAL Yingjie Li, Wei Yang, Zhenbo Xu, Zhi Cheng, Zhenbo Shi, Yi Zhang, Liusheng Huang 1208 | MASK4D: 4D CONVOLUTION NETWORK FOR LIGHT FIELD OCCLUSION REMOVAL |
5310 MASKCYCLEGAN-VC: LEARNING NON-PARALLEL VOICE CONVERSION WITH FILLING IN FRAMES Takuhiro Kaneko, Hirokazu Kameoka, Kou Tanaka, Nobukatsu Hojo 5310 | MASKCYCLEGAN-VC: LEARNING NON-PARALLEL VOICE CONVERSION WITH FILLING IN FRAMES |
5225 MATCHING AS COLOR IMAGES: THERMAL IMAGE LOCAL FEATURE DETECTION AND DESCRIPTION Bhavesh Deshpande, Sourabh Hanamsheth, Yawen Lu, Guoyu Lu 5225 | MATCHING AS COLOR IMAGES: THERMAL IMAGE LOCAL FEATURE DETECTION AND DESCRIPTION |
5297 MAXIMUM A POSTERIORI ESTIMATOR FOR CONVOLUTIVE SOUND SOURCE SEPARATION WITH SUB-SOURCE BASED NTF MODEL AND THE LOCALIZATION PROPABILISTIC PRIOR ON THE MIXING MATRIX Mieszko Fraś, Konrad Kowalczyk 5297 | MAXIMUM A POSTERIORI ESTIMATOR FOR CONVOLUTIVE SOUND SOURCE SEPARATION WITH SUB-SOURCE BASED NTF MODEL AND THE LOCALIZATION PROPABILISTIC PRIOR ON THE MIXING MATRIX |
5607 MAXIMUM ENTROPY-BASED INTERFERENCE-PLUS-NOISE COVARIANCE MATRIX RECONSTRUCTION FOR ROBUST ADAPTIVE BEAMFORMING Saeed Mohammadzadeh, Vitor H. Nascimento, Rodrigo C. de Lamare, Osman Kukrer 5607 | MAXIMUM ENTROPY-BASED INTERFERENCE-PLUS-NOISE COVARIANCE MATRIX RECONSTRUCTION FOR ROBUST ADAPTIVE BEAMFORMING |
1149 MCR-Net: A Multi-Step Co-Interactive Relation Network for Unanswerable Questions on Machine Reading Comprehension Wei Peng, Yu Hu, Jing Yu, Zihao Zhu, Luxi Xing, Yajing Sun, Yuqiang Xie 1149 | MCR-Net: A Multi-Step Co-Interactive Relation Network for Unanswerable Questions on Machine Reading Comprehension |
1910 MEASUREMENT CODING FRAMEWORK WITH ADJACENT PIXELS BASED MEASUREMENT MATRIX FOR COMPRESSIVELY SENSED IMAGES Rentao Wan, Jinjia Zhou, Bowen Huang, Hui Zeng, Yibo Fan 1910 | MEASUREMENT CODING FRAMEWORK WITH ADJACENT PIXELS BASED MEASUREMENT MATRIX FOR COMPRESSIVELY SENSED IMAGES |
1386 MEASURE-TRANSFORMED COVARIANCE TEST FOR ROBUST SPECTRUM SENSING Yair Sorek, Koby Todros 1386 | MEASURE-TRANSFORMED COVARIANCE TEST FOR ROBUST SPECTRUM SENSING |
5582 Measure-Transformed MVDR Beamforming Nadav Yazdi, Koby Todros 5582 | Measure-Transformed MVDR Beamforming |
3015 MELODY HARMONIZATION USING ORDERLESS NADE, CHORD BALANCING, AND BLOCKED GIBBS SAMPLING Chung-En Sun, Yi-Wei Chen, Hung-Shin Lee, Yen-Hsing Chen, Hsin-Min Wang 3015 | MELODY HARMONIZATION USING ORDERLESS NADE, CHORD BALANCING, AND BLOCKED GIBBS SAMPLING |
1463 MELON PLAYLIST DATASET: A PUBLIC DATASET FOR AUDIO-BASED PLAYLIST GENERATION AND MUSIC TAGGING Andres Ferraro, Yuntae Kim, Soohyeon Lee, Biho Kim, Namjun Jo, Semi Lim, Suyon Lim, Jungtaek Jang, Sehwan Kim, Xavier Serra, Dmitry Bogdanov 1463 | MELON PLAYLIST DATASET: A PUBLIC DATASET FOR AUDIO-BASED PLAYLIST GENERATION AND MUSIC TAGGING |
3520 MEMORY LAYERS WITH MULTI-HEAD ATTENTION MECHANISMS FOR TEXT-DEPENDENT SPEAKER VERIFICATION Victoria Mingote, Antonio Miguel, Alfonso Ortega, Eduardo Lleida 3520 | MEMORY LAYERS WITH MULTI-HEAD ATTENTION MECHANISMS FOR TEXT-DEPENDENT SPEAKER VERIFICATION |
3830 MEMORY-EFFICIENT SPEECH RECOGNITION ON SMART DEVICES Ganesh Venkatesh, Alagappan Valliappan, Jay Mahadeokar, Yuan Shangguan, Christian Fuegen, Mike Seltzer, Vikas Chandra 3830 | MEMORY-EFFICIENT SPEECH RECOGNITION ON SMART DEVICES |
3945 MESSAGE TRANSMISSION OVER RAPIDLY TIME-VARYING CHANNELS Alihan Kaplan, Volker Pohl 3945 | MESSAGE TRANSMISSION OVER RAPIDLY TIME-VARYING CHANNELS |
2228 Meta Ordinal Weighting Net For Improving Lung Nodule Classification Yiming Lei, Hongming Shan, Junping Zhang 2228 | Meta Ordinal Weighting Net For Improving Lung Nodule Classification |
5344 META-ADAPTER: EFFICIENT CROSS-LINGUAL ADAPTATION WITH META-LEARNING Wenxin Hou, Yidong Wang, Shengzhou Gao, Takahiro Shinozaki 5344 | META-ADAPTER: EFFICIENT CROSS-LINGUAL ADAPTATION WITH META-LEARNING |
5367 Meta-cognition-based Simple and Effective Approach to Object Detection Sannidhi P Kumar, Chandan Gautam, Suresh Sundaram 5367 | Meta-cognition-based Simple and Effective Approach to Object Detection |
5124 Meta-Learning for 6G Communication Networks with Reconfigurable Intelligent Surfaces Minchae Jung, Walid Saad 5124 | Meta-Learning for 6G Communication Networks with Reconfigurable Intelligent Surfaces |
3622 META-LEARNING FOR CROSS-CHANNEL SPEAKER VERIFICATION Hanyi Zhang, Longbiao Wang, Kong Aik Lee, Meng Liu, Jianwu Dang, Hui Chen 3622 | META-LEARNING FOR CROSS-CHANNEL SPEAKER VERIFICATION |
4780 META-LEARNING FOR IMPROVING RARE WORD RECOGNITION IN END-TO-END ASR Florian Lux, Ngoc Thang Vu 4780 | META-LEARNING FOR IMPROVING RARE WORD RECOGNITION IN END-TO-END ASR |
5196 Meta-learning for Low-Resource Speech Emotion Recognition Suransh Chopra, Puneet Mathur, Ramit Sawhney, Rajiv Ratn Shah 5196 | Meta-learning for Low-Resource Speech Emotion Recognition |
3988 META-LEARNING WITH ATTENTION FOR IMPROVED FEW-SHOT LEARNING Zejiang Hou, Anwar Walid, Sun-Yuan Kung 3988 | META-LEARNING WITH ATTENTION FOR IMPROVED FEW-SHOT LEARNING |
3356 MICAUGMENT: ONE-SHOT MICROPHONE STYLE TRANSFER Zalán Borsos, Yunpeng Li, Beat Gfeller, Marco Tagliasacchi 3356 | MICAUGMENT: ONE-SHOT MICROPHONE STYLE TRANSFER |
4220 MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020 Xiong Xiao, Naoyuki Kanda, Zhuo Chen, Tianyan Zhou, Takuya Yoshioka, Sanyuan Chen, Yong Zhao, Gang Liu, Yu Wu, Jian Wu, Shujie Liu, Jinyu Li, Yifan Gong 4220 | MICROSOFT SPEAKER DIARIZATION SYSTEM FOR THE VOXCELEB SPEAKER RECOGNITION CHALLENGE 2020 |
2681 MILLIMETER WAVE MIMO CHANNEL ESTIMATION WITH 1-BIT SPATIAL SIGMA-DELTA ANALOG-TO-DIGITAL CONVERTERS R. S. Prasobh Sankar, Sundeep Prabhakar Chepuri 2681 | MILLIMETER WAVE MIMO CHANNEL ESTIMATION WITH 1-BIT SPATIAL SIGMA-DELTA ANALOG-TO-DIGITAL CONVERTERS |
4329 MIND THE BEAT: DETECTING AUDIO ONSETS FROM EEG RECORDINGS OF MUSIC LISTENING Ashvala Vinay, Alexander Lerch, Grace Leslie 4329 | MIND THE BEAT: DETECTING AUDIO ONSETS FROM EEG RECORDINGS OF MUSIC LISTENING |
2226 Minimizing Weighted Concave Impurity Partition Under Constraints Thuan Nguyen, Thinh Nguyen 2226 | Minimizing Weighted Concave Impurity Partition Under Constraints |
1553 MINIMUM BAYES RISK TRAINING FOR END-TO-END SPEAKER-ATTRIBUTED ASR Naoyuki Kanda, Zhong Meng, Liang Lu, Yashesh Gaur, Xiaofei Wang, Zhuo Chen, Takuya Yoshioka 1553 | MINIMUM BAYES RISK TRAINING FOR END-TO-END SPEAKER-ATTRIBUTED ASR |
3085 MISALIGNMENT RECOGNITION IN ACOUSTIC SENSOR NETWORKS USING A SEMI-SUPERVISED SOURCE ESTIMATION METHOD AND MARKOV RANDOM FIELDS Gabriel F Miller, Andreas Brendel, Walter Kellermann, Sharon Gannot 3085 | MISALIGNMENT RECOGNITION IN ACOUSTIC SENSOR NETWORKS USING A SEMI-SUPERVISED SOURCE ESTIMATION METHOD AND MARKOV RANDOM FIELDS |
1141 MISPRONUNCIATION DETECTION IN NON-NATIVE (L2) ENGLISH WITH UNCERTAINTY MODELING Daniel Korzekwa, Jaime Lorenzo-Trueba, Szymon Zaporowski, Shira Calamaro, Thomas Drugman, Bozena Kostek 1141 | MISPRONUNCIATION DETECTION IN NON-NATIVE (L2) ENGLISH WITH UNCERTAINTY MODELING |
2799 MITIGATING CLIPPING DISTORTION IN OFDM USING DEEP RESIDUAL LEARNING Muhammad Shahmeer Omar, Xiaoli Ma 2799 | MITIGATING CLIPPING DISTORTION IN OFDM USING DEEP RESIDUAL LEARNING |
3556 MITIGATING INTER-SUBJECT BRAIN SIGNAL VARIABILITY FOR EEG-BASED DRIVER FATIGUE STATE CLASSIFICATION Sunhee Hwang, Sungho Park, Dohyung Kim, Jewook Lee, Hyeran Byun 3556 | MITIGATING INTER-SUBJECT BRAIN SIGNAL VARIABILITY FOR EEG-BASED DRIVER FATIGUE STATE CLASSIFICATION |
5603 MIXED MONOTONIC PROGRAMMING FOR FAST GLOBAL OPTIMIZATION Bho Matthiesen, Christoph Hellings, Eduard Jorswieck, Wolfgang Utschick 5603 | MIXED MONOTONIC PROGRAMMING FOR FAST GLOBAL OPTIMIZATION |
3457 MIXED PRECISION QUANTIZATION OF TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION Junhao Xu, Jianwei Yu, Shoukang Hu, Xunying Liu, Helen Mei-Ling Meng 3457 | MIXED PRECISION QUANTIZATION OF TRANSFORMER LANGUAGE MODELS FOR SPEECH RECOGNITION |
2230 MIXSPEECH: DATA AUGMENTATION FOR LOW-RESOURCE AUTOMATIC SPEECH RECOGNITION Linghui Meng, Jin Xu, Xu Tan, Jindong Wang, Tao Qin, Bo Xu 2230 | MIXSPEECH: DATA AUGMENTATION FOR LOW-RESOURCE AUTOMATIC SPEECH RECOGNITION |
2755 MIXTURE OF INFORMED EXPERTS FOR MULTILINGUAL SPEECH RECOGNITION Neeraj Gaur, Brian Farris, Parisa Haghani, Isabel Leal, Pedro Moreno, Manasa Prasad, Bhuvana Ramabhadran, Yun Zhu 2755 | MIXTURE OF INFORMED EXPERTS FOR MULTILINGUAL SPEECH RECOGNITION |
1788 Mixup Regularized Adversarial Networks for Multi-Domain Text Classification Yuan Wu, Diana Inkpen, Ahmed El-Roby 1788 | Mixup Regularized Adversarial Networks for Multi-Domain Text Classification |
1981 MODELING HOMOPHONE NOISE FOR ROBUST NEURAL MACHINE TRANSLATION Wenjie Qin, Xiang Li, Yuhui Sun, Deyi Xiong, Jianwei Cui, Bin Wang 1981 | MODELING HOMOPHONE NOISE FOR ROBUST NEURAL MACHINE TRANSLATION |
1365 MODEL-INSPIRED DEEP LEARNING FOR LIGHT-FIELD MICROSCOPY WITH APPLICATION TO NEURON LOCALIZATION Pingfan Song, Herman Verinaz Jadan, Carmel Howe, Peter Quicke, Amanda Foust, Pier Luigi Dragotti 1365 | MODEL-INSPIRED DEEP LEARNING FOR LIGHT-FIELD MICROSCOPY WITH APPLICATION TO NEURON LOCALIZATION |
4313 Modelling Paralinguistic Properties in Conversational Speech to Detect Bipolar Disorder and Borderline Personality Disorder Bo Wang, Yue Wu, Nemanja Vaci, Maria Liakata, Terry Lyons, Kate Saunders 4313 | Modelling Paralinguistic Properties in Conversational Speech to Detect Bipolar Disorder and Borderline Personality Disorder |
1440 MODIFIED ARCSINE LAW FOR ONE-BIT SAMPLED STATIONARY SIGNALS WITH TIME-VARYING THRESHOLDS Arian Eamaz, Farhang Yeganegi, Mojtaba Soltanalian 1440 | MODIFIED ARCSINE LAW FOR ONE-BIT SAMPLED STATIONARY SIGNALS WITH TIME-VARYING THRESHOLDS |
1237 Modular Binary Tree Architecture for Distributed Large Intelligent Surface Juan Vidal Alegría, Fredrik Rusek, Jesús Rodríguez Sánchez, Ove Edfors 1237 | Modular Binary Tree Architecture for Distributed Large Intelligent Surface |
1362 MODUREC: RECOMMENDER SYSTEMS WITH FEATURE AND TIME MODULATION Javier Maroto, Clément Vignac, Pascal Frossard 1362 | MODUREC: RECOMMENDER SYSTEMS WITH FEATURE AND TIME MODULATION |
2248 MONAURAL SPEECH ENHANCEMENT WITH COMPLEX CONVOLUTIONAL BLOCK ATTENTION MODULE AND JOINT TIME FREQUENCY LOSSES Shengkui Zhao, Trung Hieu Nguyen, Bin Ma 2248 | MONAURAL SPEECH ENHANCEMENT WITH COMPLEX CONVOLUTIONAL BLOCK ATTENTION MODULE AND JOINT TIME FREQUENCY LOSSES |
3497 MORE: A METRIC LEARNING BASED FRAMEWORK FOR OPEN-DOMAIN RELATION EXTRACTION Yutong Wang, Renze Lou, Kai Zhang, MAO YAN Chen, Yujiu Yang 3497 | MORE: A METRIC LEARNING BASED FRAMEWORK FOR OPEN-DOMAIN RELATION EXTRACTION |
3339 MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK Yichong Leng, Xu Tan, Sheng Zhao, Frank Soong, Xiang-Yang Li, Tao Qin 3339 | MOS PREDICTION FOR SYNTHESIZED SPEECH WITH MEAN-BIAS NETWORK |
3893 MOVEMENT DETECTION USING A RECIPROCAL RECEIVED SIGNAL STRENGTH MODEL Ossi Kaltiokallio, Huseyin Yigitler 3893 | MOVEMENT DETECTION USING A RECIPROCAL RECEIVED SIGNAL STRENGTH MODEL |
3097 MOVING OBJECT CLASSIFICATION WITH A SUB-6 GHZ MASSIVE MIMO ARRAY USING REAL DATA Manoj B. R., Guoda Tian, Sara Gunnarsson, Fredrik Tufvesson, Erik Larsson 3097 | MOVING OBJECT CLASSIFICATION WITH A SUB-6 GHZ MASSIVE MIMO ARRAY USING REAL DATA |
1249 MPDNet: A 3D Missing Part Detection Network Based on Point Cloud Segmentation Zhaoxin Fan, Hongyan Liu, Jun He, Min Zhang, Xiaoyong Du 1249 | MPDNet: A 3D Missing Part Detection Network Based on Point Cloud Segmentation |
3966 MRI IMAGE RECOVERY USING DAMPED DENOISING VECTOR AMP Subrata Sarkar, Rizwan Ahmad, Philip Schniter 3966 | MRI IMAGE RECOVERY USING DAMPED DENOISING VECTOR AMP |
4422 MS-CSPN: MULTI-SCALE CASCADE SPATIAL PYRAMID NETWORK FOR OBJECT DETECTION Tianyuan Wang, Can Ma, Haoshan Su, Weiping Wang 4422 | MS-CSPN: MULTI-SCALE CASCADE SPATIAL PYRAMID NETWORK FOR OBJECT DETECTION |
4466 MSR-GAN: Multi-Segment Reconstruction via Adversarial Learning Mona Zehni, Zhizhen Zhao 4466 | MSR-GAN: Multi-Segment Reconstruction via Adversarial Learning |
3599 MUG : A MULTIPATH-EXPLOITED AND GRID-FREE LOCALISATION METHOD Hengyan Liu, Wei Dai, Yuan Shen 3599 | MUG : A MULTIPATH-EXPLOITED AND GRID-FREE LOCALISATION METHOD |
1641 MULTI PATH TRAINING FRAMEWORK FOR DATA-DRIVEN OPEN-DOMAIN CONVERSATION SYSTEM Sixing Wu, Dawei Zhang, Ying Li, Zhonghai Wu 1641 | MULTI PATH TRAINING FRAMEWORK FOR DATA-DRIVEN OPEN-DOMAIN CONVERSATION SYSTEM |
2699 MULTI-BRANCH TOMLINSON-HARASHIMA PRECODING FOR RATE SPLITTING BASED SYSTEMS WITH MULTIPLE ANTENNAS Andre Robert Flores, Rodrigo de Lamare, Bruno Clerckx 2699 | MULTI-BRANCH TOMLINSON-HARASHIMA PRECODING FOR RATE SPLITTING BASED SYSTEMS WITH MULTIPLE ANTENNAS |
1367 MULTICHANNEL OVERLAPPING SPEAKER SEGMENTATION USING MULTIPLE HYPOTHESIS TRACKING OF ACOUSTIC AND SPATIAL FEATURES Aidan Hogg, Christine Evers, Patrick Naylor 1367 | MULTICHANNEL OVERLAPPING SPEAKER SEGMENTATION USING MULTIPLE HYPOTHESIS TRACKING OF ACOUSTIC AND SPATIAL FEATURES |
4287 Multi-Channel Speech Enhancement using Graph Neural Networks Panagiotis Tzirakis, Anurag Kumar, Jacob Donley 4287 | Multi-Channel Speech Enhancement using Graph Neural Networks |
1448 MULTI-CHANNEL TARGET SPEECH EXTRACTION WITH CHANNEL DECORRELATION AND TARGET SPEAKER ADAPTATION Jiangyu Han, Xinyuan Zhou, Yanhua Long, Yijie Li 1448 | MULTI-CHANNEL TARGET SPEECH EXTRACTION WITH CHANNEL DECORRELATION AND TARGET SPEAKER ADAPTATION |
5156 Multichannel-based learning for audio object extraction Daniel Arteaga, Jordi Pons 5156 | Multichannel-based learning for audio object extraction |
4510 MULTI-DECODER DPRNN: HIGH ACCURACY SOURCE COUNTING AND SEPARATION Junzhe Zhu, Raymond Yeh, Mark Hasegawa-Johnson 4510 | MULTI-DECODER DPRNN: HIGH ACCURACY SOURCE COUNTING AND SEPARATION |
5592 Multi-Delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation Satoru Emura, Hiroshi Sawada, Shoko Araki, Noboru Harada 5592 | Multi-Delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation |
3736 MULTI-DIALECT SPEECH RECOGNITION IN ENGLISH USING ATTENTION ON ENSEMBLE OF EXPERTS Amit Das, Kshitiz Kumar, Jian Wu 3736 | MULTI-DIALECT SPEECH RECOGNITION IN ENGLISH USING ATTENTION ON ENSEMBLE OF EXPERTS |
5152 MULTI-DIRECTIONAL CONVOLUTION NETWORKS WITH SPATIAL-TEMPORAL FEATURE PYRAMID MODULE FOR ACTION RECOGNITION Bohong Yang, Zijian Wang, Wu Ran, Hong Lu, Yi-Ping Phoebe Chen 5152 | MULTI-DIRECTIONAL CONVOLUTION NETWORKS WITH SPATIAL-TEMPORAL FEATURE PYRAMID MODULE FOR ACTION RECOGNITION |
2560 Multi-Entity Collaborative Relation Extraction Haozhuang Liu, Ziran Li, Dongming Sheng, Hai-Tao Zheng, Ying Shen 2560 | Multi-Entity Collaborative Relation Extraction |
1984 MULTI-GRANULARITY FEATURE INTERACTION AND RELATION REASONING FOR 3D DENSE ALIGNMENT AND FACE RECONSTRUCTION Lei Li, Xiangzheng Li, Kangbo Wu, Kui Lin, Suping Wu 1984 | MULTI-GRANULARITY FEATURE INTERACTION AND RELATION REASONING FOR 3D DENSE ALIGNMENT AND FACE RECONSTRUCTION |
3041 MULTI-GRANULARITY HETEROGENEOUS GRAPH FOR DOCUMENT-LEVEL RELATION EXTRACTION Hengzhu Tang, Yanan Cao, Zhenyu Zhang, Ruipeng Jia, Fang Fang, Shi Wang 3041 | MULTI-GRANULARITY HETEROGENEOUS GRAPH FOR DOCUMENT-LEVEL RELATION EXTRACTION |
4749 MULTI-INITIALIZATION META-LEARNING WITH DOMAIN ADAPTATION Zhengyu Chen, Donglin Wang 4749 | MULTI-INITIALIZATION META-LEARNING WITH DOMAIN ADAPTATION |
1543 MULTILABEL 12-LEAD ELECTROCARDIOGRAM CLASSIFICATION USING BEAT TO SEQUENCE AUTOENCODERS Alexander William Wong, Amir Salimi, Abram Hindle, Sunil Vasu Kalmady, Padma Kaul 1543 | MULTILABEL 12-LEAD ELECTROCARDIOGRAM CLASSIFICATION USING BEAT TO SEQUENCE AUTOENCODERS |
5145 MULTI-LEVEL ADAPTIVE REGION OF INTEREST AND GRAPH LEARNING FOR FACIAL ACTION UNIT RECOGNITION Jingwei Yan, Boyuan Jiang, Jingjing Wang, Qiang Li, Chunmao Wang, Shiliang Pu 5145 | MULTI-LEVEL ADAPTIVE REGION OF INTEREST AND GRAPH LEARNING FOR FACIAL ACTION UNIT RECOGNITION |
1235 Multi-Level Group Testing with Application to One-Shot Pooled COVID-19 Tests Alejandro Cohen, Nir Shlezinger, Amit Solomon, Yonina C. Eldar, Muriel Medard 1235 | Multi-Level Group Testing with Application to One-Shot Pooled COVID-19 Tests |
3805 MULTI-LEVEL REVERSIBLE ENCRYPTION FOR ECG SIGNALS USING COMPRESSIVE SENSING Mikko Impiö, Mehmet Yamaç, Jenni Raitoharju 3805 | MULTI-LEVEL REVERSIBLE ENCRYPTION FOR ECG SIGNALS USING COMPRESSIVE SENSING |
4237 MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION Xinjian Li, David Mortensen, Florian Metze, Alan Black 4237 | MULTILINGUAL PHONETIC DATASET FOR LOW RESOURCE SPEECH RECOGNITION |
5358 MULTIMODAL CROSS- AND SELF-ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION Licai Sun, Bin Liu, Jianhua Tao, Zheng Lian 5358 | MULTIMODAL CROSS- AND SELF-ATTENTION NETWORK FOR SPEECH EMOTION RECOGNITION |
3472 MULTIMODAL EMOTION RECOGNITION WITH CAPSULE GRAPH CONVOLUTIONAL BASED REPRESENTATION FUSION Jiaxing Liu, Sen Chen, Longbiao Wang, Zhilei Liu, Yahui Fu, Lili Guo, Jianwu Dang 3472 | MULTIMODAL EMOTION RECOGNITION WITH CAPSULE GRAPH CONVOLUTIONAL BASED REPRESENTATION FUSION |
5159 MULTI-MODAL LABEL DEQUANTIZED GAUSSIAN PROCESS LATENT VARIABLE MODEL FOR ORDINAL LABEL ESTIMATION Masanao Matsumoto, Keisuke Maeda, Naoki Saito, Takahiro Ogawa, Miki Haseyama 5159 | MULTI-MODAL LABEL DEQUANTIZED GAUSSIAN PROCESS LATENT VARIABLE MODEL FOR ORDINAL LABEL ESTIMATION |
3334 MULTIMODAL METRIC LEARNING FOR TAG-BASED MUSIC RETRIEVAL Minz Won, Sergio Oramas, Oriol Nieto, Fabien Gouyon, Xavier Serra 3334 | MULTIMODAL METRIC LEARNING FOR TAG-BASED MUSIC RETRIEVAL |
4088 MULTIMODAL PUNCTUATION PREDICTION WITH CONTEXTUAL DROPOUT Andrew Silva, Barry-John Theobald, Nicholas Apostoloff 4088 | MULTIMODAL PUNCTUATION PREDICTION WITH CONTEXTUAL DROPOUT |
3284 MULTI-MODELS FUSION FOR LIGHT FIELD ANGULAR SUPER-RESOLUTION FengYin Cao, Ping An, Xinpeng Huang, Chao Yang, Qiang Wu 3284 | MULTI-MODELS FUSION FOR LIGHT FIELD ANGULAR SUPER-RESOLUTION |
2837 MULTI-OBJECT TRACKING USING POISSON MULTI-BERNOULLI MIXTURE FILTERING FOR AUTONOMOUS VEHICLES Su Pang, Hayder Radha 2837 | MULTI-OBJECT TRACKING USING POISSON MULTI-BERNOULLI MIXTURE FILTERING FOR AUTONOMOUS VEHICLES |
4538 MULTI-ORDER ADVERSARIAL REPRESENTATION LEARNING FOR COMPOSED QUERY IMAGE RETRIEVAL Zhixiao Fu, Xinyuan Chen, Jianfeng Dong, Shouling Ji 4538 | MULTI-ORDER ADVERSARIAL REPRESENTATION LEARNING FOR COMPOSED QUERY IMAGE RETRIEVAL |
1865 MULTIPHISH:MULTI-MODAL FEATURES FUSION NETWORKS FOR PHISHING DETECTION Lei Zhang, Peng Zhang, Luchen Liu, Jianlong Tan 1865 | MULTIPHISH:MULTI-MODAL FEATURES FUSION NETWORKS FOR PHISHING DETECTION |
1817 Multiple Auxiliary Networks for Single Blind Image Deblurring Chen Li, Qi Wang, Shaoteng Liu, Xuelong Li 1817 | Multiple Auxiliary Networks for Single Blind Image Deblurring |
4771 MULTIPLE HUMAN TRACKING IN NON-SPECIFIC COVERAGE WITH WEARABLE CAMERAS Sibo Wang, Ruize Han, Wei Feng, Song Wang 4771 | MULTIPLE HUMAN TRACKING IN NON-SPECIFIC COVERAGE WITH WEARABLE CAMERAS |
2742 MULTIPLE-HYPOTHESIS CTC-BASED SEMI-SUPERVISED ADAPTATION OF END-TO-END SPEECH RECOGNITION Cong-Thanh Do, Rama Doddipatla, Thomas Hain 2742 | MULTIPLE-HYPOTHESIS CTC-BASED SEMI-SUPERVISED ADAPTATION OF END-TO-END SPEECH RECOGNITION |
1299 MULTIPLE-INPUT MULTIPLE-OUTPUT FUSION NETWORK FOR GENERALIZED ZERO-SHOT LEARNING Fangming Zhong, Guangze Wang, Zhikui Chen, Xu Yuan, Feng Xia 1299 | MULTIPLE-INPUT MULTIPLE-OUTPUT FUSION NETWORK FOR GENERALIZED ZERO-SHOT LEARNING |
4240 MULTI-RATE ATTENTION ARCHITECTURE FOR FAST STREAMABLE TEXT-TO-SPEECH SPECTRUM MODELING Qing He, Zhiping Xiu, Thilo Koehler, Jilong Wu 4240 | MULTI-RATE ATTENTION ARCHITECTURE FOR FAST STREAMABLE TEXT-TO-SPEECH SPECTRUM MODELING |
1198 MULTI-SAMPLE ONLINE LEARNING FOR SPIKING NEURAL NETWORKS BASED ON GENERALIZED EXPECTATION MAXIMIZATION Hyeryung Jang, Osvaldo Simeone 1198 | MULTI-SAMPLE ONLINE LEARNING FOR SPIKING NEURAL NETWORKS BASED ON GENERALIZED EXPECTATION MAXIMIZATION |
5368 MULTI-SCALE AND MULTI-REGION FACIAL DISCRIMINATIVE REPRESENTATION FOR AUTOMATIC DEPRESSION LEVEL DETECTION Mingyue Niu, Jianhua tao, Bin Liu 5368 | MULTI-SCALE AND MULTI-REGION FACIAL DISCRIMINATIVE REPRESENTATION FOR AUTOMATIC DEPRESSION LEVEL DETECTION |
2061 MULTI-SCALE CASCADE DISPARITY REFINEMENT STEREO NETWORK Xiaogang Jia, Wei Chen, Zhengfa Liang, Mingfei Wu, Xuehui Wang 2061 | MULTI-SCALE CASCADE DISPARITY REFINEMENT STEREO NETWORK |
4727 MULTI-SCALE FEATURE-GUIDED STEREOSCOPIC VIDEO QUALITY ASSESSMENT BASED ON 3D CONVOLUTIONAL NEURAL NETWORK Yingjie Feng, Sumei Li, Yongli Chang 4727 | MULTI-SCALE FEATURE-GUIDED STEREOSCOPIC VIDEO QUALITY ASSESSMENT BASED ON 3D CONVOLUTIONAL NEURAL NETWORK |
3219 MULTI-SCALE SAMPLE SELECTION BASED ON STATISTICAL CHARACTERISTICS FOR OBJECT DETECTION Zhiguo Li, Yuan Yuan, Dandan Ma 3219 | MULTI-SCALE SAMPLE SELECTION BASED ON STATISTICAL CHARACTERISTICS FOR OBJECT DETECTION |
4225 MULTI-SCALE SPEAKER DIARIZATION WITH NEURAL AFFINITY SCORE FUSION Taejin Park, Manoj Kumar, Shrikanth Narayanan 4225 | MULTI-SCALE SPEAKER DIARIZATION WITH NEURAL AFFINITY SCORE FUSION |
3450 MULTI-SPEAKER EMOTIONAL SPEECH SYNTHESIS WITH FINE-GRAINED PROSODY MODELING Chunhui Lu, Xue Wen, Ruolan Liu, Xiao Chen 3450 | MULTI-SPEAKER EMOTIONAL SPEECH SYNTHESIS WITH FINE-GRAINED PROSODY MODELING |
3440 Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals Meng Ge, Chenglin Xu, Longbiao Wang, Eng Siong Chng, Jianwu Dang, Haizhou Li 3440 | Multi-stage Speaker Extraction with Utterance and Frame-Level Reference Signals |
4065 Multi-Step Spoken Language Understanding System based on Adversarial Learning Yu Wang, Yilin Shen, Hongxia Jin 4065 | Multi-Step Spoken Language Understanding System based on Adversarial Learning |
5555 MULTISTREAM CNN FOR ROBUST ACOUSTIC MODELING Kyu Han, Jing Pan, Venkata Tadala, Tao Ma, Dan Povey 5555 | MULTISTREAM CNN FOR ROBUST ACOUSTIC MODELING |
1809 Multi-target DoA estimation with an audio-visual fusion mechanism xinyuan qian, maulik Madhavi,, zexu pan, jiadong wang, haizhou li 1809 | Multi-target DoA estimation with an audio-visual fusion mechanism |
3699 Multi-task Estimation of Age and Cognitive Decline from Speech Yilin Pan, Venkata Srikanth Nallanthighal, Daniel Blackburn, Heidi Christensen, Aki Harma 3699 | Multi-task Estimation of Age and Cognitive Decline from Speech |
3075 MULTITASK LEARNING AND JOINT OPTIMIZATION FOR TRANSFORMER-RNN-TRANSDUCER SPEECH RECOGNITION Jae-Jin Jeon, Eesung Kim 3075 | MULTITASK LEARNING AND JOINT OPTIMIZATION FOR TRANSFORMER-RNN-TRANSDUCER SPEECH RECOGNITION |
3318 Multi-Task Learning via Sharing Inexact Low-Rank Subspace Xiaoqian Wang, Feiping Nie, Heng Huang 3318 | Multi-Task Learning via Sharing Inexact Low-Rank Subspace |
2075 MULTI-TASK SELF-SUPERVISED PRE-TRAINING FOR MUSIC CLASSIFICATION Ho-Hsiang Wu, Chieh-Chi Kao, Qingming Tang, Ming Sun, Brian McFee, Juan Bello, Chao Wang 2075 | MULTI-TASK SELF-SUPERVISED PRE-TRAINING FOR MUSIC CLASSIFICATION |
1747 MULTI-TASK TRANSFORMER WITH INPUT FEATURE RECONSTRUCTION FOR DYSARTHRIC SPEECH RECOGNITION Chaoyue Ding, Shiliang Sun, Jing Zhao 1747 | MULTI-TASK TRANSFORMER WITH INPUT FEATURE RECONSTRUCTION FOR DYSARTHRIC SPEECH RECOGNITION |
5611 Multi-Task WaveRNN with an Integrated Architecture for Cross-lingual Voice Conversion Yi Zhou, Xiaohai Tian, Haizhou Li 5611 | Multi-Task WaveRNN with an Integrated Architecture for Cross-lingual Voice Conversion |
3804 MULTI-TIER FEDERATED LEARNING FOR VERTICALLY PARTITIONED DATA Anirban Das, Stacy Patterson 3804 | MULTI-TIER FEDERATED LEARNING FOR VERTICALLY PARTITIONED DATA |
4087 MULTIVARIATE NON-NEGATIVE MATRIX FACTORIZATION WITH APPLICATION TO ENERGY DISAGGREGATION Pascal Alexander Schirmer, Iosif Mporas 4087 | MULTIVARIATE NON-NEGATIVE MATRIX FACTORIZATION WITH APPLICATION TO ENERGY DISAGGREGATION |
1210 Multi-Vehicle Velocity Estimation Using IEEE 802.11ad Waveform Geonho Han, Sucheol Kim, Junil Choi 1210 | Multi-Vehicle Velocity Estimation Using IEEE 802.11ad Waveform |
4105 MULTI-VIEW AUDIO AND MUSIC CLASSIFICATION Huy Phan, Huy Le Nguyen, Oliver Chén, Lam Pham, Philipp Koch, Ian McLoughlin, Alfred Mertins 4105 | MULTI-VIEW AUDIO AND MUSIC CLASSIFICATION |
1095 MULTI-VIEW CONTRASTIVE LEARNING FOR ONLINE KNOWLEDGE DISTILLATION Chuanguang Yang, Zhulin An, Yongjun Xu 1095 | MULTI-VIEW CONTRASTIVE LEARNING FOR ONLINE KNOWLEDGE DISTILLATION |
2631 MULTIVIEW SENSING WITH UNKNOWN PERMUTATIONS: AN OPTIMAL TRANSPORT APPROACH Yanting Ma, Petros Boufounos, Hassan Mansour, Shuchin Aeron 2631 | MULTIVIEW SENSING WITH UNKNOWN PERMUTATIONS: AN OPTIMAL TRANSPORT APPROACH |
4039 MULTIVIEW VARIATIONAL GRAPH AUTOENCODERS FOR CANONICAL CORRELATION ANALYSIS Yacouba Kaloga, Pierre Borgnat, Sundeep Prabhakar Chepuri, Patrice Abry, Amaury Habrard 4039 | MULTIVIEW VARIATIONAL GRAPH AUTOENCODERS FOR CANONICAL CORRELATION ANALYSIS |
2238 Muse: Multi-modal target speaker extraction with visual cues Zexu Pan, Ruijie Tao, Chenglin Xu, Haizhou Li 2238 | Muse: Multi-modal target speaker extraction with visual cues |
3304 MUTUAL INFORMATION FLOWS IN A BIVARIATE POINT PROCESS Syed Ahmed Pasha, Victor Solo 3304 | MUTUAL INFORMATION FLOWS IN A BIVARIATE POINT PROCESS |
5446 MUTUALLY-CONSTRAINED MONOTONIC MULTIHEAD ATTENTION FOR ONLINE ASR Jaeyun Song, Hajin Shim, Eunho Yang 5446 | MUTUALLY-CONSTRAINED MONOTONIC MULTIHEAD ATTENTION FOR ONLINE ASR |
2973 NASA: A Noise-Adaptive and Structure-Aware Learning Framework for Image Deblurring Xiaokun Liu, Long Ma, Risheng Liu, Wei Zhong, Xin Fan, Zhongxuan Luo 2973 | NASA: A Noise-Adaptive and Structure-Aware Learning Framework for Image Deblurring |
1133 NEAR-OPTIMAL ALGORITHMS FOR PIECEWISE-STATIONARY CASCADING BANDITS Lingda Wang, Huozhi Zhou, Bingcong Li, Lav R. Varshney, Zhizhen Zhao 1133 | NEAR-OPTIMAL ALGORITHMS FOR PIECEWISE-STATIONARY CASCADING BANDITS |
3742 NEAR-OPTIMAL RESAMPLING IN PARTICLE FILTERS USING THE ISING ENERGY MODEL Muhammed Tahsin Rahman, Mohammad Javad-Kalbasi, Shahrokh Valaee 3742 | NEAR-OPTIMAL RESAMPLING IN PARTICLE FILTERS USING THE ISING ENERGY MODEL |
2494 NESTED ERROR MAP GENERATION NETWORK FOR NO-REFERENCE IMAGE QUALITY ASSESSMENT Junming Chen, Haiqiang Wang, Ge Li, Shan Liu 2494 | NESTED ERROR MAP GENERATION NETWORK FOR NO-REFERENCE IMAGE QUALITY ASSESSMENT |
1462 NESTED LEARNING FOR MULTI-LEVEL CLASSIFICATION Raphaël Achddou, J.Matias di Martino, Guillermo Sapiro 1462 | NESTED LEARNING FOR MULTI-LEVEL CLASSIFICATION |
4589 NETWORK AND CONTENT-DEPENDENT BITRATE LADDER ESTIMATION FOR ADAPTIVE BITRATE VIDEO STREAMING Pierre Lebreton, Kazuhisa Yamagishi 4589 | NETWORK AND CONTENT-DEPENDENT BITRATE LADDER ESTIMATION FOR ADAPTIVE BITRATE VIDEO STREAMING |
2407 NETWORK CLASSIFIERS BASED ON SOCIAL LEARNING Virginia Bordignon, Stefan Vlaski, Vincenzo Matta, Ali H. Sayed 2407 | NETWORK CLASSIFIERS BASED ON SOCIAL LEARNING |
5605 NETWORK INFERENCE FROM CONSENSUS DYNAMICS WITH UNKNOWN PARAMETERS Yu Zhu, Michael T. Schaub, Ali Jadbabaie, Santiago Segarra 5605 | NETWORK INFERENCE FROM CONSENSUS DYNAMICS WITH UNKNOWN PARAMETERS |
3133 NETWORK PRUNING USING LINEAR DEPENDENCY ANALYSIS ON FEATURE MAPS Hao Pan, Zhongdi Chao, Jiang Qian, Bojin Zhuang, Shaojun Wang, Jing Xiao 3133 | NETWORK PRUNING USING LINEAR DEPENDENCY ANALYSIS ON FEATURE MAPS |
3897 NETWORK TOPOLOGY CHANGE-POINT DETECTION FROM GRAPH SIGNALS WITH PRIOR SPECTRAL SIGNATURES Chiraag Kaushik, T. Mitchell Roddenberry, Santiago Segarra 3897 | NETWORK TOPOLOGY CHANGE-POINT DETECTION FROM GRAPH SIGNALS WITH PRIOR SPECTRAL SIGNATURES |
1548 NETWORK TOPOLOGY INFERENCE WITH GRAPHON SPECTRAL PENALTIES T. Mitchell Roddenberry, Madeline Navarro, Santiago Segarra 1548 | NETWORK TOPOLOGY INFERENCE WITH GRAPHON SPECTRAL PENALTIES |
3607 NETWORK-AWARE OPTIMAL MICROPHONE CHANNEL SELECTION IN WIRELESS ACOUSTIC SENSOR NETWORKS Michael Günther, Haitham Afifi, Andreas Brendel, Holger Karl, Walter Kellermann 3607 | NETWORK-AWARE OPTIMAL MICROPHONE CHANNEL SELECTION IN WIRELESS ACOUSTIC SENSOR NETWORKS |
3056 Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks Shoukang Hu, Xurong Xie, Shansong Liu, Mingyu Cui, Mengzhe Geng, Xunying Liu, Helen Meng 3056 | Neural Architecture Search For LF-MMI Trained Time Delay Neural Networks |
4978 NEURAL AUDIO FINGERPRINT FOR HIGH-SPECIFIC AUDIO RETRIEVAL BASED ON CONTRASTIVE LEARNING Sungkyun Chang, Donmoon Lee, Jeongsoo Park, Hyungui Lim, Kyogu Lee, Karam Ko, Yoonchang Han 4978 | NEURAL AUDIO FINGERPRINT FOR HIGH-SPECIFIC AUDIO RETRIEVAL BASED ON CONTRASTIVE LEARNING |
4370 NEURAL INVERSE TEXT NORMALIZATION Monica Sunkara, Chaitanya Shivade, sravan bodapati, Katrin Kirchhoff 4370 | NEURAL INVERSE TEXT NORMALIZATION |
3262 Neural Kalman Filtering for Speech Enhancement Wei Xue, Gang Quan, Chao Zhang, Guohong Ding, Xiaodong He, Bowen Zhou 3262 | Neural Kalman Filtering for Speech Enhancement |
3305 NEURAL LAYERED MIN-SUM DECODING FOR PROTOGRAPH LDPC CODES Dexin Zhang, Jincheng Dai, Kailin Tan, Kai Niu, Mingzhe Chen, H. Vincent Poor, Shuguang Cui 3305 | NEURAL LAYERED MIN-SUM DECODING FOR PROTOGRAPH LDPC CODES |
2453 NEURAL NETWORK-BASED VIRTUAL MICROPHONE ESTIMATOR Tsubasa Ochiai, Marc Delcroix, Tomohiro Nakatani, Rintaro Ikeshita, Keisuke Kinoshita, Shoko Araki 2453 | NEURAL NETWORK-BASED VIRTUAL MICROPHONE ESTIMATOR |
3668 NEURAL NOISE EMBEDDING FOR END-TO-END SPEECH ENHANCEMENT WITH CONDITIONAL LAYER NORMALIZATION Zhihui Zhang, Xiaoqi Li, Yaxing Li, Yuanjie Dong, Dan Wang, Shengwu Xiong 3668 | NEURAL NOISE EMBEDDING FOR END-TO-END SPEECH ENHANCEMENT WITH CONDITIONAL LAYER NORMALIZATION |
5400 NEURAL UTTERANCE CONFIDENCE MEASURE FOR RNN-TRANSDUCERS AND TWO PASS MODELS Ashutosh Gupta, Ankur Kumar, Dhananjaya Gowda, Kwangyoun Kim, Sachin Singh, Shatrughan Singh, Chanwoo kim 5400 | NEURAL UTTERANCE CONFIDENCE MEASURE FOR RNN-TRANSDUCERS AND TWO PASS MODELS |
1820 NEURO-STEERED MUSIC SOURCE SEPARATION WITH EEG-BASED AUDITORY ATTENTION DECODING AND CONTRASTIVE-NMF Giorgia Cantisani, Slim Essid, Gaël Richard 1820 | NEURO-STEERED MUSIC SOURCE SEPARATION WITH EEG-BASED AUDITORY ATTENTION DECODING AND CONTRASTIVE-NMF |
3561 NEW VARIANTS OF DFA BASED ON LOESS AND LOWESS METHODS: GENERALIZATION OF THE DETRENDING MOVING AVERAGE Bastien Berthelot, Éric Grivel, Pierrick Legrand 3561 | NEW VARIANTS OF DFA BASED ON LOESS AND LOWESS METHODS: GENERALIZATION OF THE DETRENDING MOVING AVERAGE |
3417 NISP: A MULTI-LINGUAL MULTI-ACCENT DATASET FOR SPEAKER PROFILING Shareef Babu Kalluri, Deepu Vijayasenan, Sriram Ganapathy, Ragesh Rajan M, Prashant Krishnan 3417 | NISP: A MULTI-LINGUAL MULTI-ACCENT DATASET FOR SPEAKER PROFILING |
4893 NLKD: using coarse annotations for semantic segmentation based on knowledge distillation Dong Liang, Yun Du 4893 | NLKD: using coarse annotations for semantic segmentation based on knowledge distillation |
1627 NMF-SAE: AN INTERPRETABLE SPARSE AUTOENCODER FOR HYPERSPECTRAL UNMIXING Fengchao Xiong, Jun Zhou, Minchao Ye, Jianfeng Lu, Yuntao Qian 1627 | NMF-SAE: AN INTERPRETABLE SPARSE AUTOENCODER FOR HYPERSPECTRAL UNMIXING |
5045 NNAKF: A NEURAL NETWORK ADAPTED KALMAN FILTER FOR TARGET TRACKING Sami Jouaber, Silvère Bonnabel, Santiago Velasco-Forero, Marion Pilté 5045 | NNAKF: A NEURAL NETWORK ADAPTED KALMAN FILTER FOR TARGET TRACKING |
5348 NN-KOG2P: A NOVEL GRAPHEME-TO-PHONEME MODEL FOR KOREAN LANGUAGE Hwa-Yeon Kim, Jong-Hwan Kim, Jae-Min Kim 5348 | NN-KOG2P: A NOVEL GRAPHEME-TO-PHONEME MODEL FOR KOREAN LANGUAGE |
4758 NO RELAXATION: GUARANTEED RECOVERY OF FINITE-VALUED SIGNALS FROM UNDERSAMPLED MEASUREMENTS Pulak Sarangi, Piya Pal 4758 | NO RELAXATION: GUARANTEED RECOVERY OF FINITE-VALUED SIGNALS FROM UNDERSAMPLED MEASUREMENTS |
4911 NODE ATTRIBUTE COMPLETION IN KNOWLEDGE GRAPHS WITH MULTI-RELATIONAL PROPAGATION Eda Bayram, Alberto Garcia-Duran, Robert West 4911 | NODE ATTRIBUTE COMPLETION IN KNOWLEDGE GRAPHS WITH MULTI-RELATIONAL PROPAGATION |
2791 NOISE LEVEL LIMITED SUB-MODELING FOR DIFFUSION PROBABILISTIC VOCODERS Takuma Okamoto, Tomoki Toda, Yoshinori Shiga, Hisashi Kawai 2791 | NOISE LEVEL LIMITED SUB-MODELING FOR DIFFUSION PROBABILISTIC VOCODERS |
5091 NOISE-ASSISTED MULTIVARIATE VARIATIONAL MODE DECOMPOSITION Charilaos Zisou, Georgios Apostolidis, Leontios Hadjileontiadis 5091 | NOISE-ASSISTED MULTIVARIATE VARIATIONAL MODE DECOMPOSITION |
3500 NOISE-ROBUST ADAPTATION CONTROL FOR SUPERVISED ACOUSTIC SYSTEM IDENTIFICATION EXPLOITING A NOISE DICTIONARY Thomas Haubner, Andreas Brendel, Mohamed Elminshawi, Walter Kellermann 3500 | NOISE-ROBUST ADAPTATION CONTROL FOR SUPERVISED ACOUSTIC SYSTEM IDENTIFICATION EXPLOITING A NOISE DICTIONARY |
1744 NON-AUTOREGRESSIVE SEQUENCE-TO-SEQUENCE VOICE CONVERSION Tomoki Hayashi, Wen-Chin Huang, Kazuhiro Kobayashi, Tomoki Toda 1744 | NON-AUTOREGRESSIVE SEQUENCE-TO-SEQUENCE VOICE CONVERSION |
1599 NON-AUTOREGRESSIVE TRANSFORMER ASR WITH CTC-ENHANCED DECODER INPUT Xingchen Song, Zhiyong Wu, Yiheng Huang, Chao Weng, Dan Su, Helen Meng 1599 | NON-AUTOREGRESSIVE TRANSFORMER ASR WITH CTC-ENHANCED DECODER INPUT |
3886 NON-COHERENT DOA ESTIMATION OF OFF-GRID SIGNALS WITH UNIFORM CIRCULAR ARRAYS Zhengyu Wan, Wei Liu 3886 | NON-COHERENT DOA ESTIMATION OF OFF-GRID SIGNALS WITH UNIFORM CIRCULAR ARRAYS |
1879 Noncontact Heartbeat Detection by Viterbi Algorithm with Fusion of Beat-Beat Interval and Deep Learning-driven Branch Metrics Kohei Yamamoto, Tomoaki Ohtsuki 1879 | Noncontact Heartbeat Detection by Viterbi Algorithm with Fusion of Beat-Beat Interval and Deep Learning-driven Branch Metrics |
2477 NON-CONVEX SPARSE DEVIATION MODELING VIA GENERATIVE MODELS Yaxi Yang, Hailin Wang, Haiquan Qiu, Jianjun Wang, Yao Wang 2477 | NON-CONVEX SPARSE DEVIATION MODELING VIA GENERATIVE MODELS |
3541 NON-INTRUSIVE BINAURAL PREDICTION OF SPEECH INTELLIGIBILITY BASED ON PHONEME CLASSIFICATION Jana Roßbach, Saskia Röttges, Christopher F. Hauth, Thomas Brand, Bernd T. Meyer 3541 | NON-INTRUSIVE BINAURAL PREDICTION OF SPEECH INTELLIGIBILITY BASED ON PHONEME CLASSIFICATION |
1471 NON-ITERATIVE BLIND CALIBRATION OF NESTED ARRAYS WITH ASYMPTOTICALLY OPTIMAL WEIGHTING Amir Weiss, Arie Yeredor 1471 | NON-ITERATIVE BLIND CALIBRATION OF NESTED ARRAYS WITH ASYMPTOTICALLY OPTIMAL WEIGHTING |
5227 NONLINEAR STATE-SPACE GENERALIZATIONS OF GRAPH CONVOLUTIONAL NEURAL NETWORKS Luana Ruiz, Fernando Gama, Alejandro Ribeiro, Elvin Isufi 5227 | NONLINEAR STATE-SPACE GENERALIZATIONS OF GRAPH CONVOLUTIONAL NEURAL NETWORKS |
1018 NON-LOCAL SINGLE IMAGE DE-RAINING WITHOUT DECOMPOSITION Chaobing Zheng, Zhengguo Li, Yuwen Li, Shiqian Wu 1018 | NON-LOCAL SINGLE IMAGE DE-RAINING WITHOUT DECOMPOSITION |
1234 Nonnegative Unimodal Matrix Factorization Andersen Man Shun Ang, Nicolas Gillis, Arnaud Vandaele, Hans De Sterck 1234 | Nonnegative Unimodal Matrix Factorization |
5305 NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRANSFER FROM A TEXT-TO-SPEECH MODEL Xinyuan Yu, Brian Mak 5305 | NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRANSFER FROM A TEXT-TO-SPEECH MODEL |
1111 NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING LOCAL LINGUISTIC TOKENS Chao Wang, Yibiao Yu 1111 | NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING LOCAL LINGUISTIC TOKENS |
4844 NON-RECURSIVE GRAPH CONVOLUTIONAL NETWORKS Hao Chen, Zengde Deng, Yue Xu, Zhoujun Li 4844 | NON-RECURSIVE GRAPH CONVOLUTIONAL NETWORKS |
4108 NON-SINGULAR ADVERSARIAL ROBUSTNESS OF NEURAL NETWORKS Yu-Lin Tsai, Chia-Yi Hsu, Chia-Mu Yu, Pin-Yu Chen 4108 | NON-SINGULAR ADVERSARIAL ROBUSTNESS OF NEURAL NETWORKS |
2373 NONSTATIONARY PORTFOLIOS: DIVERSIFICATION IN THE SPECTRAL DOMAIN Bruno Scalzo, Alvaro Arroyo, Ljubisa Stankovic, Anthony G. Constantinides, Danilo P. Mandic 2373 | NONSTATIONARY PORTFOLIOS: DIVERSIFICATION IN THE SPECTRAL DOMAIN |
4755 NO-REFERENCE STEREOSCOPIC IMAGE QUALITY ASSESSMENT BASED ON THE HUMAN VISUAL SYSTEM Fan Meng, Sumei Li, Yongli Chang 4755 | NO-REFERENCE STEREOSCOPIC IMAGE QUALITY ASSESSMENT BASED ON THE HUMAN VISUAL SYSTEM |
5613 NOVEL ARCHITECTURES FOR UNSUPERVISED INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION OF MEETINGS Nauman Dawalatabad, Srikanth Madikeri, C Chandra Sekhar, Hema A Murthy 5613 | NOVEL ARCHITECTURES FOR UNSUPERVISED INFORMATION BOTTLENECK BASED SPEAKER DIARIZATION OF MEETINGS |
1165 NUMERICAL SOLUTION OF STOCHASTIC DIFFERENTIAL EQUATIONS IN STIEFEL MANIFOLDS VIA TANGENT SPACE PARAMETRIZATION Victor Solo, Zhichao Wang 1165 | NUMERICAL SOLUTION OF STOCHASTIC DIFFERENTIAL EQUATIONS IN STIEFEL MANIFOLDS VIA TANGENT SPACE PARAMETRIZATION |
1070 OAS-NET: OCCLUSION AWARE SAMPLING NETWORK FOR ACCURATE OPTICAL FLOW Lingtong Kong, Xiaohang Yang, Jie Yang 1070 | OAS-NET: OCCLUSION AWARE SAMPLING NETWORK FOR ACCURATE OPTICAL FLOW |
3124 Object-Oriented Relational Distillation for Object Detection Shuyu Miao, Rui Feng 3124 | Object-Oriented Relational Distillation for Object Detection |
4206 On a Guided Nonnegative Matrix Factorization Joshua Vendrow, Jamie Haddock, Elizaveta Rebrova, Deanna Needell 4206 | On a Guided Nonnegative Matrix Factorization |
2955 ON DISTRIBUTED COMPOSITE TESTS WITH DEPENDENT OBSERVATIONS IN WSN Juan Augusto Maya, Leonardo Rey Vega 2955 | ON DISTRIBUTED COMPOSITE TESTS WITH DEPENDENT OBSERVATIONS IN WSN |
3290 ON INFORMATION ASYMMETRY IN ONLINE REINFORCEMENT LEARNING Ezra Tampubolon, Haris Ceribasic, Holger Boche 3290 | ON INFORMATION ASYMMETRY IN ONLINE REINFORCEMENT LEARNING |
4419 ON LOSS FUNCTIONS FOR DEEP-LEARNING BASED T60 ESTIMATION Yuying Li, Yuchen Liu, Donald S. Williamson 4419 | ON LOSS FUNCTIONS FOR DEEP-LEARNING BASED T60 ESTIMATION |
1660 ON MINIMUM WORD ERROR RATE TRAINING OF THE HYBRID AUTOREGRESSIVE TRANSDUCER Liang Lu, Zhong Meng, Naoyuki Kanda, Jinyu Li, Yifan Gong 1660 | ON MINIMUM WORD ERROR RATE TRAINING OF THE HYBRID AUTOREGRESSIVE TRANSDUCER |
2213 ON OVERFITTING IN DISCRETE SUPER-RESOLUTION RECOVERY Wenzhe Lu, Heng Qiao 2213 | ON OVERFITTING IN DISCRETE SUPER-RESOLUTION RECOVERY |
3140 ON PERMUTATION INVARIANT TRAINING FOR SPEECH SOURCE SEPARATION Xiaoyu Liu, Jordi Pons 3140 | ON PERMUTATION INVARIANT TRAINING FOR SPEECH SOURCE SEPARATION |
5252 ON SCALING CONTRASTIVE REPRESENTATIONS FOR LOW-RESOURCE SPEECH RECOGNITION Lasse Borgholt, Tycho M. S. Tax, Jakob D. Havtorn, Lars Maaløe, Christian Igel 5252 | ON SCALING CONTRASTIVE REPRESENTATIONS FOR LOW-RESOURCE SPEECH RECOGNITION |
3774 On Strategic Jamming in Distributed Detection Networks Chen Quan, Baocheng Geng, Pramod K. Varshney 3774 | On Strategic Jamming in Distributed Detection Networks |
1607 ON THE ACCURACY LIMIT OF JOINT TIME-DELAY/DOPPLER/ACCELERATION ESTIMATION WITH A BAND-LIMITED SIGNAL Hamish Mcphee, Lorenzo Ortega, Jordi Vilà-Valls, Eric Chaumette 1607 | ON THE ACCURACY LIMIT OF JOINT TIME-DELAY/DOPPLER/ACCELERATION ESTIMATION WITH A BAND-LIMITED SIGNAL |
1076 ON THE ADVERSARIAL ROBUSTNESS OF PRINCIPAL COMPONENT ANALYSIS Ying Li, Fuwei Li, Lifeng Lai, Jun Wu 1076 | ON THE ADVERSARIAL ROBUSTNESS OF PRINCIPAL COMPONENT ANALYSIS |
5427 On the Asymptotic Performance of One-Bit Co-Array-Based MUSIC Saeid Sedighi, Bhavani Shankar, Mojtaba Soltanalian, Bjorn Ottersten 5427 | On the Asymptotic Performance of One-Bit Co-Array-Based MUSIC |
1167 ON THE CAMERA POSITION DITHERING IN VISUAL 3D RECONSTRUCTION Qier An, Yuan Shen 1167 | ON THE CAMERA POSITION DITHERING IN VISUAL 3D RECONSTRUCTION |
5538 ON THE CONVERGENCE OF RANDOMIZED BREGMAN COORDINATE DESCENT FOR NON-LIPSCHITZ COMPOSITE PROBLEMS Tianxiang Gao, Songtao Lu, Jia Liu, Chris Chu 5538 | ON THE CONVERGENCE OF RANDOMIZED BREGMAN COORDINATE DESCENT FOR NON-LIPSCHITZ COMPOSITE PROBLEMS |
1290 ON THE DESIGN OF SQUARE DIFFERENTIAL MICROPHONE ARRAYS WITH A MULTISTAGE STRUCTURE Xudong Zhao, Gongping Huang, Jacob Benesty, Jingdong Chen, Israel Cohen 1290 | ON THE DESIGN OF SQUARE DIFFERENTIAL MICROPHONE ARRAYS WITH A MULTISTAGE STRUCTURE |
3954 ON THE DETECTION OF PITCH-SHIFTED VOICE: MACHINES AND HUMAN LISTENERS David Looney, Nikolay D. Gaubitch 3954 | ON THE DETECTION OF PITCH-SHIFTED VOICE: MACHINES AND HUMAN LISTENERS |
4257 ON THE EFFECT OF SPATIAL CORRELATION ON DISTRIBUTED ENERGY DETECTION OF A STOCHASTIC PROCESS Juan Augusto Maya, Leonardo Rey Vega 4257 | ON THE EFFECT OF SPATIAL CORRELATION ON DISTRIBUTED ENERGY DETECTION OF A STOCHASTIC PROCESS |
5599 On the Identifiability of Transform Learning for Non-Negative Matrix Factorization Sixin Zhang, Emmanuel Soubies, Cédric Févotte 5599 | On the Identifiability of Transform Learning for Non-Negative Matrix Factorization |
4796 ON THE MARGINAL BENEFIT OF ACTIVE LEARNING: DOES SELF-SUPERVISION EAT ITS CAKE? Yao-Chun Chan, Mingchen Li, Samet Oymak 4796 | ON THE MARGINAL BENEFIT OF ACTIVE LEARNING: DOES SELF-SUPERVISION EAT ITS CAKE? |
3760 On the Optimality of Backward Regression: Sparse Recovery and Subset Selection Sebastian Ament, Carla Gomes 3760 | On the Optimality of Backward Regression: Sparse Recovery and Subset Selection |
4658 On the Performance-Complexity Tradeoff in Stochastic Greedy Weak Submodular Optimization Abolfazl Hashemi, Haris Vikalo, Gustavo de Veciana 4658 | On the Performance-Complexity Tradeoff in Stochastic Greedy Weak Submodular Optimization |
3854 ON THE POWER OF DEEP BUT NAIVE PARTIAL LABEL LEARNING Junghoon Seo, Joon Suk Huh 3854 | ON THE POWER OF DEEP BUT NAIVE PARTIAL LABEL LEARNING |
2876 ON THE PREDICTABILITY OF HRTFS FROM EAR SHAPES USING DEEP NETWORKS Yaxuan Zhou, Hao Jiang, Vamsi Krishna Ithapu 2876 | ON THE PREDICTABILITY OF HRTFS FROM EAR SHAPES USING DEEP NETWORKS |
3519 ON THE PREPARATION AND VALIDATION OF A LARGE-SCALE DATASET OF SINGING TRANSCRIPTION Jun-You Wang, Jyh-Shing Roger Jang 3519 | ON THE PREPARATION AND VALIDATION OF A LARGE-SCALE DATASET OF SINGING TRANSCRIPTION |
5187 ON THE RELATIONSHIP BETWEEN SPEECH-BASED BREATHING SIGNAL PREDICTION EVALUATION MEASURES AND BREATHING PARAMETERS ESTIMATION Zohreh Mostaani, Venkata Srikanth Nallanthighal, Aki Harma, Helmer Strik, Mathew Magimai-Doss 5187 | ON THE RELATIONSHIP BETWEEN SPEECH-BASED BREATHING SIGNAL PREDICTION EVALUATION MEASURES AND BREATHING PARAMETERS ESTIMATION |
2814 ON THE ROLE OF VISUAL CUES IN AUDIOVISUAL SPEECH ENHANCEMENT Zakaria Aldeneh, Anushree Prasanna Kumar, Barry-John Theobald, Erik Marchi, Sachin Kajarekar, Devang Naik, Ahmed Hussen Abdelaziz 2814 | ON THE ROLE OF VISUAL CUES IN AUDIOVISUAL SPEECH ENHANCEMENT |
3601 ON THE STABILITY OF GRAPH CONVOLUTIONAL NEURAL NETWORKS UNDER EDGE REWIRING Henry Kenlay, Dorina Thanou, Xiaowen Dong 3601 | ON THE STABILITY OF GRAPH CONVOLUTIONAL NEURAL NETWORKS UNDER EDGE REWIRING |
4836 ONE SHOT LEARNING FOR SPEECH SEPARATION Yuan-Kuei Wu, Kuan-Po Huang, Yu Tsao, Hung-yi Lee 4836 | ONE SHOT LEARNING FOR SPEECH SEPARATION |
4311 ONE-BIT AUTOCORRELATION ESTIMATION WITH NON-ZERO THRESHOLDS Chun-Lin Liu, Zi-Min Lin 4311 | ONE-BIT AUTOCORRELATION ESTIMATION WITH NON-ZERO THRESHOLDS |
3983 ONE-BIT COMPRESSED SENSING USING UNTRAINED NETWORK PRIOR Swatantra Kafle, Geethu Joseph, Pramod K. Varshney 3983 | ONE-BIT COMPRESSED SENSING USING UNTRAINED NETWORK PRIOR |
3816 ONE-SHOT CONDITIONAL AUDIO FILTERING OF ARBITRARY SOUNDS Beat Gfeller, Dominik Roblek, Marco Tagliasacchi 3816 | ONE-SHOT CONDITIONAL AUDIO FILTERING OF ARBITRARY SOUNDS |
2217 ONE-SHOT VOICE CONVERSION BASED ON SPEAKER AWARE MODULE Ying Zhang, Hao Che, Jie LiChenxing Li, Xiaorui Wang, Zhongyuan Wang 2217 | ONE-SHOT VOICE CONVERSION BASED ON SPEAKER AWARE MODULE |
5061 Online Antenna Selection for Enhanced DOA Estimation Elias Aboutanios, Hamed Nosrati, Xiangrong Wang 5061 | Online Antenna Selection for Enhanced DOA Estimation |
5581 Online Automatic Speech Recognition With Listen, Attend and Spell Model Roger Hsiao, Dogan Can, Tim Ng, Ruchir Travadi, Arnab Ghoshal 5581 | Online Automatic Speech Recognition With Listen, Attend and Spell Model |
3571 ONLINE CLASSIFICATION OF DYNAMIC MULTILAYER-NETWORK TIME SERIES IN RIEMANNIAN MANIFOLDS Cong Ye, Konstantinos Slavakis, Johan Nakuci, Sarah Muldoon, John Medaglia 3571 | ONLINE CLASSIFICATION OF DYNAMIC MULTILAYER-NETWORK TIME SERIES IN RIEMANNIAN MANIFOLDS |
5338 Online Dynamic Window (ODW) Assisted 2-stage LSTM Indoor Localization for Smart Phones Mohammadamin Atashi, Arash Mohammadi 5338 | Online Dynamic Window (ODW) Assisted 2-stage LSTM Indoor Localization for Smart Phones |
3838 ONLINE LEARNING OF TIME-VARYING SIGNALS AND GRAPHS Stefania Sardellitti, Sergio Barbarossa, Paolo Di Lorenzo 3838 | ONLINE LEARNING OF TIME-VARYING SIGNALS AND GRAPHS |
3951 Online Multi-hop Information based Kernel Learning over Graphs Zixiao Zong, Yanning Shen 3951 | Online Multi-hop Information based Kernel Learning over Graphs |
5598 ONLINE SPECTROGRAM INVERSION FOR LOW-LATENCY AUDIO SOURCE SEPARATION Paul Magron, Tuomas Virtanen 5598 | ONLINE SPECTROGRAM INVERSION FOR LOW-LATENCY AUDIO SOURCE SEPARATION |
3970 ONLINE TIME-VARYING TOPOLOGY IDENTIFICATION VIA PREDICTION-CORRECTION ALGORITHMS Alberto Natali, Mario Coutino, Elvin Isufi, Geert Leus 3970 | ONLINE TIME-VARYING TOPOLOGY IDENTIFICATION VIA PREDICTION-CORRECTION ALGORITHMS |
4494 ONLINE UNSUPERVISED LEARNING USING ENSEMBLE GAUSSIAN PROCESSES WITH RANDOM FEATURES Georgios V. Karanikolas, Qin Lu, Georgios B. Giannakis 4494 | ONLINE UNSUPERVISED LEARNING USING ENSEMBLE GAUSSIAN PROCESSES WITH RANDOM FEATURES |
3932 OPTIMAL ATTACKING STRATEGY AGAINST ONLINE REPUTATION SYSTEMS WITH CONSIDERATION OF THE MESSAGE-BASED PERSUASION PHENOMENON Zhanjiang Chen, H. Vicky Zhao 3932 | OPTIMAL ATTACKING STRATEGY AGAINST ONLINE REPUTATION SYSTEMS WITH CONSIDERATION OF THE MESSAGE-BASED PERSUASION PHENOMENON |
1848 Optimal Detection in the Presence of Non-Gaussian Jamming Khalid Almahorg, Ramy Gohary 1848 | Optimal Detection in the Presence of Non-Gaussian Jamming |
3108 OPTIMAL IMPORTANCE SAMPLING FOR FEDERATED LEARNING Elsa Rizk, Stefan Vlaski, Ali H. Sayed 3108 | OPTIMAL IMPORTANCE SAMPLING FOR FEDERATED LEARNING |
5278 OPTIMAL QUESTIONNAIRES FOR SCREENING OF STRATEGIC AGENTS Anuj Vora, Ankur Kulkarni 5278 | OPTIMAL QUESTIONNAIRES FOR SCREENING OF STRATEGIC AGENTS |
2726 OPTIMAL SELECTION OF MATRIX SHAPE AND DECOMPOSITION SCHEME FOR NEURAL NETWORK COMPRESSION Yerlan Idelbayev, Miguel Carreira-Perpiñán 2726 | OPTIMAL SELECTION OF MATRIX SHAPE AND DECOMPOSITION SCHEME FOR NEURAL NETWORK COMPRESSION |
1055 OPTIMAL TOA LOCALIZATION FOR MOVING SENSOR IN ASYMMETRIC NETWORK Sihao Zhao, Xiao-Ping Zhang, Xiaowei Cui, Mingquan Lu 1055 | OPTIMAL TOA LOCALIZATION FOR MOVING SENSOR IN ASYMMETRIC NETWORK |
2721 OPTIMIZE WHAT MATTERS: TRAINING DNN-HMM KEYWORD SPOTTING MODEL USING END METRIC Ashish Shrivastava, Arnav Kundu, Chandra Dhir, Devang Naik, Oncel Tuzel 2721 | OPTIMIZE WHAT MATTERS: TRAINING DNN-HMM KEYWORD SPOTTING MODEL USING END METRIC |
3665 OPTIMIZING COVERAGE AND CAPACITY IN CELLULAR NETWORKS USING MACHINE LEARNING Ryan Dreifuerst, Samuel Daulton, Yuchen Qian, Paul Varkey, Maximilian Balandat, Sanjay Kasturia, Anoop Tomar, Ali Yazdan, Vish Ponnampalam, Robert Heath 3665 | OPTIMIZING COVERAGE AND CAPACITY IN CELLULAR NETWORKS USING MACHINE LEARNING |
4666 OPTIMIZING SHORT-TIME FOURIER TRANSFORM PARAMETERS VIA GRADIENT DESCENT An Zhao, Krishna Subramani, Paris Smaragdis 4666 | OPTIMIZING SHORT-TIME FOURIER TRANSFORM PARAMETERS VIA GRADIENT DESCENT |
1536 OPTIMUM FEATURE ORDERING FOR DYNAMIC INSTANCE–WISE JOINT FEATURE SELECTION AND CLASSIFICATION Yasitha Warahena Liyanage, Daphney-Stavroula Zois 1536 | OPTIMUM FEATURE ORDERING FOR DYNAMIC INSTANCE–WISE JOINT FEATURE SELECTION AND CLASSIFICATION |
1153 ORDERED RELIABILITY BITS GUESSING RANDOM ADDITIVE NOISE DECODING Ken Duffy 1153 | ORDERED RELIABILITY BITS GUESSING RANDOM ADDITIVE NOISE DECODING |
5034 ORTHOGONALITY AND ZERO DC TRADEOFFS IN BIORTHOGONAL GRAPH FILTERBANKS Dion E.O. Tzamarias, Eduardo Pavez, Benjamin Girault, Antonio Ortega, Ian Blanes, Joan Serra-Sagristà 5034 | ORTHOGONALITY AND ZERO DC TRADEOFFS IN BIORTHOGONAL GRAPH FILTERBANKS |
4098 Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder Hirofumi Inaguma, Yosuke Higuchi, Kevin Duh, Tatsuya Kawahara, Shinji Watanabe 4098 | Orthros: Non-autoregressive End-to-end Speech Translation with Dual-decoder |
2079 OUTLIER-ROBUST KERNEL HIERARCHICAL-OPTIMIZATION RLS ON A BUDGET WITH AFFINE CONSTRAINTS Konstantinos Slavakis, Masahiro Yukawa 2079 | OUTLIER-ROBUST KERNEL HIERARCHICAL-OPTIMIZATION RLS ON A BUDGET WITH AFFINE CONSTRAINTS |
5170 OVERCOMING MEASUREMENT INCONSISTENCY IN DEEP LEARNING FOR LINEAR INVERSE PROBLEMS: APPLICATIONS IN MEDICAL IMAGING Marija Vella, Joao Mota 5170 | OVERCOMING MEASUREMENT INCONSISTENCY IN DEEP LEARNING FOR LINEAR INVERSE PROBLEMS: APPLICATIONS IN MEDICAL IMAGING |
5000 PARAGRAPH LEVEL MULTI-PERSPECTIVE CONTEXT MODELING FOR QUESTION GENERATION Jun Bai, Wenge Rong, Feiyu Xia, Yanmeng Wang, Yuanxin Ouyang, Zhang Xiong 5000 | PARAGRAPH LEVEL MULTI-PERSPECTIVE CONTEXT MODELING FOR QUESTION GENERATION |
3827 PARALLEL ITERATED EXTENDED AND SIGMA-POINT KALMAN SMOOTHERS Fatemeh Yaghoobi, Adrien Corenflos, Sakira Hassan, Simo Särkkä 3827 | PARALLEL ITERATED EXTENDED AND SIGMA-POINT KALMAN SMOOTHERS |
3566 PARALLEL TACOTRON: NON-AUTOREGRESSIVE AND CONTROLLABLE TTS Isaac Elias, Heiga Zen, Jonathan Shen, Yu Zhang, Ye Jia, Ron Weiss, Yonghui Wu 3566 | PARALLEL TACOTRON: NON-AUTOREGRESSIVE AND CONTROLLABLE TTS |
3306 PARALLEL WAVEFORM SYNTHESIS BASED ON GENERATIVE ADVERSARIAL NETWORKS WITH VOICING-AWARE CONDITIONAL DISCRIMINATORS Ryuichi Yamamoto, Eunwoo Song, Min-Jae Hwang, Jae-Min Kim 3306 | PARALLEL WAVEFORM SYNTHESIS BASED ON GENERATIVE ADVERSARIAL NETWORKS WITH VOICING-AWARE CONDITIONAL DISCRIMINATORS |
2329 Parameter Estimation for Coherent Passive MIMO Radar with Unknown Signals under Direct Path Influence Zhen Wang, Qian He 2329 | Parameter Estimation for Coherent Passive MIMO Radar with Unknown Signals under Direct Path Influence |
1616 PARAMETER ESTIMATION FOR STUDENT'S t VAR MODEL WITH MISSING DATA Rui Zhou, Junyan Liu, Sandeep Kumar, Daniel Palomar 1616 | PARAMETER ESTIMATION FOR STUDENT'S t VAR MODEL WITH MISSING DATA |
4650 Parameter Identifiability of Spatial-Smoothing-Based Bistatic MIMO Radar Junpeng Shi, Fangqing Wen, Yongxiang Liu, Qinmu Shen, Zhihui Li 4650 | Parameter Identifiability of Spatial-Smoothing-Based Bistatic MIMO Radar |
4473 Parametric Spectral Filters for Fast Converging, Scalable Convolutional Neural Networks Luke Wood, Eric Larson 4473 | Parametric Spectral Filters for Fast Converging, Scalable Convolutional Neural Networks |
1979 PART-ALIGNED NETWORK WITH BACKGROUND FOR MISALIGNED PERSON SEARCH Xian Zhong, Yiting Liu, Wenxin Huang, Xiao Wang, Bo Ma, Jingling Yuan 1979 | PART-ALIGNED NETWORK WITH BACKGROUND FOR MISALIGNED PERSON SEARCH |
1341 PARTIAL FEATURE AGGREGATION NETWORK FOR REAL-TIME OBJECT COUNTING Houshun Yu, Li Zhang 1341 | PARTIAL FEATURE AGGREGATION NETWORK FOR REAL-TIME OBJECT COUNTING |
1495 PARTIALLY OVERLAPPED INFERENCE FOR LONG-FORM SPEECH RECOGNITION Tae Gyoon Kang, Ho-Gyeong Kim, Min-Joong Lee, Jihyun Lee, Hoshik Lee 1495 | PARTIALLY OVERLAPPED INFERENCE FOR LONG-FORM SPEECH RECOGNITION |
3773 Particle Gibbs Sampling for Regime-Switching State-Space Models Yousef El-Laham, Liu Yang, Heather Lynch, Petar Djuric, Monica Bugallo 3773 | Particle Gibbs Sampling for Regime-Switching State-Space Models |
4973 PATCH DECODER-SIDE DEPTH ESTIMATION IN MPEG IMMERSIVE VIDEO Marta Milovanovic, Felix Henry, Marco CAGNAZZO, Joel Jung 4973 | PATCH DECODER-SIDE DEPTH ESTIMATION IN MPEG IMMERSIVE VIDEO |
2984 PATNET : A PHONEME-LEVEL AUTOREGRESSIVE TRANSFORMER NETWORK FOR SPEECH SYNTHESIS Shiming Wang, Zhenhua Ling, Ruibo Fu, Jiangyan Yi, Jianhua Tao 2984 | PATNET : A PHONEME-LEVEL AUTOREGRESSIVE TRANSFORMER NETWORK FOR SPEECH SYNTHESIS |
4227 PAUSE-ENCODED LANGUAGE MODELS FOR RECOGNITION OF ALZHEIMER'S DISEASE AND EMOTION Jiahong Yuan, Xingyu Cai, Kenneth Church 4227 | PAUSE-ENCODED LANGUAGE MODELS FOR RECOGNITION OF ALZHEIMER'S DISEASE AND EMOTION |
3484 PD-GAN: PERCEPTUAL-DETAILS GAN FOR EXTREMELY NOISY LOW LIGHT IMAGE ENHANCEMENT Yijun Liu, Zhengning Wang, Yi Zeng, Hao Zeng, Deming Zhao 3484 | PD-GAN: PERCEPTUAL-DETAILS GAN FOR EXTREMELY NOISY LOW LIGHT IMAGE ENHANCEMENT |
3931 PERCEPTUAL LOSS BASED SPEECH DENOISING WITH AN ENSEMBLE OF AUDIO PATTERN RECOGNITION AND SELF-SUPERVISED MODELS Saurabh Kataria, Jesús Villalba, Najim Dehak 3931 | PERCEPTUAL LOSS BASED SPEECH DENOISING WITH AN ENSEMBLE OF AUDIO PATTERN RECOGNITION AND SELF-SUPERVISED MODELS |
4502 Perceptual Quality Assessment for Recognizing True and Pseudo 4K Content Wenhan Zhu, Guangtao Zhai, Xiongkuo Min, Xiaokang Yang, Xiaoping Zhang 4502 | Perceptual Quality Assessment for Recognizing True and Pseudo 4K Content |
4432 PERFORMANCE ANALYSIS OF SPATIAL AND FREQUENCY DOMAIN INDEX-MODULATED RECONFIGURABLE INTELLIGENT METASURFACES John Hodge, Kumar Vijay Mishra, Brian Sadler, Amir Zaghloul 4432 | PERFORMANCE ANALYSIS OF SPATIAL AND FREQUENCY DOMAIN INDEX-MODULATED RECONFIGURABLE INTELLIGENT METASURFACES |
1645 PERIODIC SIGNAL DENOISING: AN ANALYSIS-SYNTHESIS FRAMEWORK BASED ON RAMANUJAN FILTER BANKS AND DICTIONARIES Pranav Kulkarni, P. P. Vaidyanathan 1645 | PERIODIC SIGNAL DENOISING: AN ANALYSIS-SYNTHESIS FRAMEWORK BASED ON RAMANUJAN FILTER BANKS AND DICTIONARIES |
5255 PERIODNET: A NON-AUTOREGRESSIVE WAVEFORM GENERATION MODEL WITH A STRUCTURE SEPARATING PERIODIC AND APERIODIC COMPONENTS Yukiya Hono, Shinji Takaki, Kei Hashimoto, Keiichiro Oura, Yoshihiko Nankaku, Keiichi Tokuda 5255 | PERIODNET: A NON-AUTOREGRESSIVE WAVEFORM GENERATION MODEL WITH A STRUCTURE SEPARATING PERIODIC AND APERIODIC COMPONENTS |
2784 PERSONALIZATION STRATEGIES FOR END-TO-END SPEECH RECOGNITION SYSTEMS Aditya Gourav, Linda Liu, Ankur Gandhe, Yile Gu, Guitang Lan, Xiangyang Huang, Shashank Kalmane, Gautam Tiwari, Denis Filimonov, Ariya Rastrow, Andreas Stolcke, Ivan Bulyko 2784 | PERSONALIZATION STRATEGIES FOR END-TO-END SPEECH RECOGNITION SYSTEMS |
3228 PERSONALIZED HRTF MODELING USING DNN-AUGMENTED BEM Mengfan Zhang, Jui-Hsien Wang, Doug James 3228 | PERSONALIZED HRTF MODELING USING DNN-AUGMENTED BEM |
2371 PHASE RECOVERY WITH BREGMAN DIVERGENCES FOR AUDIO SOURCE SEPARATION Paul Magron, Pierre-Hugo Vial, Thomas Oberlin, Cédric Févotte 2371 | PHASE RECOVERY WITH BREGMAN DIVERGENCES FOR AUDIO SOURCE SEPARATION |
5326 PHASE TRANSITIONS FOR ONE-VS-ONE AND ONE-VS-ALL LINEAR SEPARABILITY IN MULTICLASS GAUSSIAN MIXTURES Ganesh Ramachandra Kini, Christos Thrampoulidis 5326 | PHASE TRANSITIONS FOR ONE-VS-ONE AND ONE-VS-ALL LINEAR SEPARABILITY IN MULTICLASS GAUSSIAN MIXTURES |
3888 PHONE DISTRIBUTION ESTIMATION FOR LOW RESOURCE LANGUAGES Xinjian Li, Juncheng Li, Jiali Yao, Alan Black, Florian Metze 3888 | PHONE DISTRIBUTION ESTIMATION FOR LOW RESOURCE LANGUAGES |
4200 Phoneme based Neural Transducer for Large Vocabulary Speech Recognition Wei Zhou, Simon Berger, Ralf Schlüter, Hermann Ney 4200 | Phoneme based Neural Transducer for Large Vocabulary Speech Recognition |
4559 Phoneme-Based Distribution Regularization for Speech Enhancement Yajing Liu, Xiulian Peng, Zhiwei Xiong, Yan Lu 4559 | Phoneme-Based Distribution Regularization for Speech Enhancement |
4030 PHYSICAL-LAYER SECURITY VIA DISTRIBUTED BEAMFORMING IN THE PRESENCE OF ADVERSARIES WITH UNKNOWN LOCATIONS Yagiz Savas, Abolfazl Hashemi, Abraham P. Vinod, Brian M. Sadler, Ufuk Topcu 4030 | PHYSICAL-LAYER SECURITY VIA DISTRIBUTED BEAMFORMING IN THE PRESENCE OF ADVERSARIES WITH UNKNOWN LOCATIONS |
1504 PIPELINE SAFETY EARLY WARNING METHOD FOR DISTRIBUTED SIGNAL USING BILINEAR CNN AND LIGHTGBM Yiyuan Yang, Yi Li, Haifeng Zhang 1504 | PIPELINE SAFETY EARLY WARNING METHOD FOR DISTRIBUTED SIGNAL USING BILINEAR CNN AND LIGHTGBM |
5290 PITCH-TIMBRE DISENTANGLEMENT OF MUSICAL INSTRUMENT SOUNDS BASED ON VAE-BASED METRIC LEARNING Keitaro Tanaka, Ryo Nishikimi, Yoshiaki Bando, Kazuyoshi Yoshii, Shigeo Morishima 5290 | PITCH-TIMBRE DISENTANGLEMENT OF MUSICAL INSTRUMENT SOUNDS BASED ON VAE-BASED METRIC LEARNING |
2395 PLANAR ARRAY GEOMETRY OPTIMIZATION FOR REGION SOUND ACQUISITION Xi Chen, Chao Pan, Jingdong Chen, Jacob Benesty 2395 | PLANAR ARRAY GEOMETRY OPTIMIZATION FOR REGION SOUND ACQUISITION |
3883 PLAYING A PART: SPEAKER VERIFICATION AT THE MOVIES Andrew Brown, Jaesung Huh, Arsha Nagrani, Joon Son Chung, Andrew Zisserman 3883 | PLAYING A PART: SPEAKER VERIFICATION AT THE MOVIES |
4048 Plug-And-Play Learned Gaussian-mixture Approximate Message Passing Osman Musa, Peter Jung, Giuseppe Caire 4048 | Plug-And-Play Learned Gaussian-mixture Approximate Message Passing |
3315 POINT OF CARE IMAGE ANALYSIS FOR COVID19 Daniel Yaron, Daphna Keidar, Elisha Goldstein, Yair Shachar, Ayelet Blass, Oz Frank, Nir Schipper, Nogah Shabshin, Ahuva Grubstein, Dror Suhami, Naama R. Bogot, Eyal Sela, Amiel A. Dror, Mordehay Vaturi, Federico Mento, Elena Torri, Riccardo Inchingolo, Andrea Smargiassi, Gino Soldati, Tiziano Perrone, Libertario Demi, Meirav Galun, Shai Bagon, Yishai M. Elyada, Yonina C. Eldar 3315 | POINT OF CARE IMAGE ANALYSIS FOR COVID19 |
1284 POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING Yi Zhang, Wei Yang, Zhenbo Xu, Yingjie Li, Zhi Chen, Liusheng Huang 1284 | POINTER NETWORKS FOR ARBITRARY-SHAPED TEXT SPOTTING |
3130 POLA: ONLINE TIME SERIES PREDICTION BY ADAPTIVE LEARNING RATES Wenyu Zhang 3130 | POLA: ONLINE TIME SERIES PREDICTION BY ADAPTIVE LEARNING RATES |
4528 POLICY AUGMENTATION: AN EXPLORATION STRATEGY FOR FASTER CONVERGENCE OF DEEP REINFORCEMENT LEARNING ALGORITHMS Arash Golibagh Mahyari 4528 | POLICY AUGMENTATION: AN EXPLORATION STRATEGY FOR FASTER CONVERGENCE OF DEEP REINFORCEMENT LEARNING ALGORITHMS |
1218 POLYNOMIAL MATRIX EIGENVALUE DECOMPOSITION OF SPHERICAL HARMONICS FOR SPEECH ENHANCEMENT Vincent W. Neo, Christine Evers, Patrick A. Naylor 1218 | POLYNOMIAL MATRIX EIGENVALUE DECOMPOSITION OF SPHERICAL HARMONICS FOR SPEECH ENHANCEMENT |
5614 POPS: POLICY PRUNING AND SHRINKING FOR DEEP REINFORCEMENT LEARNING Dor Livne, Kobi Cohen 5614 | POPS: POLICY PRUNING AND SHRINKING FOR DEEP REINFORCEMENT LEARNING |
5463 PORTABLE PHOTOGLOTTOGRAPHY FOR MONITORING VOCAL FOLD VIBRATIONS IN SPEECH PRODUCTION Yujie Chi, Kiyoshi Honda, Jianguo Wei 5463 | PORTABLE PHOTOGLOTTOGRAPHY FOR MONITORING VOCAL FOLD VIBRATIONS IN SPEECH PRODUCTION |
4201 POSITNN: TRAINING DEEP NEURAL NETWORKS WITH MIXED LOW-PRECISION POSIT Gonçalo Raposo, Pedro Tomás, Nuno Roma 4201 | POSITNN: TRAINING DEEP NEURAL NETWORKS WITH MIXED LOW-PRECISION POSIT |
3470 PPG-BASED SINGING VOICE CONVERSION WITH ADVERSARIAL REPRESENTATION LEARNING Zhonghao Li, Benlai Tang, Xiang Yin, Yuan Wan, Ling Xu, Chen Shen, Zejun Ma 3470 | PPG-BASED SINGING VOICE CONVERSION WITH ADVERSARIAL REPRESENTATION LEARNING |
4390 Prediction of EGFR Mutation Status in Lung Adenocarcinoma using Multi-source Feature Representations Jianhong Cheng, Jin Liu, Meilin Jiang, Hailin Yue, Lin Wu, Jianxin Wang 4390 | Prediction of EGFR Mutation Status in Lung Adenocarcinoma using Multi-source Feature Representations |
2677 PREDICTION OF OBJECT GEOMETRY FROM ACOUSTIC SCATTERING USING CONVOLUTIONAL NEURAL NETWORKS Ziqi Fan, Vibhav Vineet, Chenshen Lu, Kyla McMullen 2677 | PREDICTION OF OBJECT GEOMETRY FROM ACOUSTIC SCATTERING USING CONVOLUTIONAL NEURAL NETWORKS |
3992 PREDICTIVE CODING FOR LOSSLESS DATASET COMPRESSION Madeleine Barowsky, Alexander Mariona, Flavio P. Calmon 3992 | PREDICTIVE CODING FOR LOSSLESS DATASET COMPRESSION |
3286 PRE-TRAINING TRANSFORMER DECODER FOR END-TO-END ASR MODEL WITH UNPAIRED TEXT DATA Changfeng Gao, Gaofeng Cheng, Runyan Yang, Han Zhu, Pengyuan Zhang, Yonghong Yan 3286 | PRE-TRAINING TRANSFORMER DECODER FOR END-TO-END ASR MODEL WITH UNPAIRED TEXT DATA |
3564 PREVENTING EARLY ENDPOINTING FOR ONLINE AUTOMATIC SPEECH RECOGNITION Yingzhu Zhao, Chongjia Ni, Cheung-Chi Leung, Shafiq Joty, Eng Siong Chng, Bin Ma 3564 | PREVENTING EARLY ENDPOINTING FOR ONLINE AUTOMATIC SPEECH RECOGNITION |
2581 PRIVACY-ACCURACY TRADE-OFF OF INFERENCE AS SERVICE Yulu Jin, Lifeng Lai 2581 | PRIVACY-ACCURACY TRADE-OFF OF INFERENCE AS SERVICE |
3185 Privacy-Preserving Cloud-based DNN Inference Shangyu Xie, Bingyu Liu, Yuan Hong 3185 | Privacy-Preserving Cloud-based DNN Inference |
1818 PRIVACY-PRESERVING NEAR NEIGHBOR SEARCH VIA SPARSE CODING WITH AMBIGUATION Behrooz Razeghi, Sohrab Ferdowsi, Dimche Kostadinov, Flavio P. Clamon, Slava Voloshynovskiy 1818 | PRIVACY-PRESERVING NEAR NEIGHBOR SEARCH VIA SPARSE CODING WITH AMBIGUATION |
2400 PRIVACY-PRESERVING OPTIMAL INSULIN DOSING DECISION Zuobin Ying, Shuanglong Cao, Shengmin Xu, Ximeng Liu, Lingjuan Lyu 2400 | PRIVACY-PRESERVING OPTIMAL INSULIN DOSING DECISION |
5392 PRIVATE WIRELESS FEDERATED LEARNING WITH ANONYMOUS OVER-THE-AIR COMPUTATION Burak Hasırcıoğlu, Deniz Gündüz 5392 | PRIVATE WIRELESS FEDERATED LEARNING WITH ANONYMOUS OVER-THE-AIR COMPUTATION |
2187 PROBABILISTIC GRAPH NEURAL NETWORKS FOR TRAFFIC SIGNAL CONTROL Ting Zhong, Zheyang Xu, Fan Zhou 2187 | PROBABILISTIC GRAPH NEURAL NETWORKS FOR TRAFFIC SIGNAL CONTROL |
3981 Probabilistic Massive MIMO Channel Estimation with Built-in Parameter Estimation Shuai Huang, Deqiang Qiu, Trac D. Tran 3981 | Probabilistic Massive MIMO Channel Estimation with Built-in Parameter Estimation |
2695 Probability of Resolution of g-MUSIC: An Asymptotic Approach David Schenck, Xavier Mestre, Marius Pesavento 2695 | Probability of Resolution of g-MUSIC: An Asymptotic Approach |
4414 PROBING ACOUSTIC REPRESENTATIONS FOR PHONETIC PROPERTIES Danni Ma, Neville Ryant, Mark Liberman 4414 | PROBING ACOUSTIC REPRESENTATIONS FOR PHONETIC PROPERTIES |
4007 PROCESSING PIPELINES FOR EFFICIENT, PHYSICALLY-ACCURATE SIMULATION OF MICROPHONE ARRAY SIGNALS IN DYNAMIC SOUND SCENES Alastair H. Moore, Rebecca R. Vos, Patrick A. Naylor, Mike Brookes 4007 | PROCESSING PIPELINES FOR EFFICIENT, PHYSICALLY-ACCURATE SIMULATION OF MICROPHONE ARRAY SIGNALS IN DYNAMIC SOUND SCENES |
1766 Progressive Co-teaching for Ambiguous Speech Emotion Recognition Yifei Yin, Yu Gu, Longshan Yao, Ying Zhou, Xuefeng Liang, He Zhang 1766 | Progressive Co-teaching for Ambiguous Speech Emotion Recognition |
2250 PROGRESSIVE DIALOGUE STATE TRACKING FOR MULTI-DOMAIN DIALOGUE SYSTEMS Jiahao Wang, Minqian Liu, Xiaojun Quan 2250 | PROGRESSIVE DIALOGUE STATE TRACKING FOR MULTI-DOMAIN DIALOGUE SYSTEMS |
4304 PROGRESSIVE MULTI-STAGE FEATURE MIX FOR PERSON RE-IDENTIFICATION Yan Zhang, Binyu He, Li Sun, Qingli Li 4304 | PROGRESSIVE MULTI-STAGE FEATURE MIX FOR PERSON RE-IDENTIFICATION |
4965 PROGRESSIVE SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION Negar Heidari, Alexandros Iosifidis 4965 | PROGRESSIVE SPATIO-TEMPORAL GRAPH CONVOLUTIONAL NETWORK FOR SKELETON-BASED HUMAN ACTION RECOGNITION |
3104 PROGRESSIVE VOICE TRIGGER DETECTION: ACCURACY VS LATENCY Siddharth Sigtia, John Bridle, Hywel Richards, Pascal Clark, Erik Marchi, Vineet Garg 3104 | PROGRESSIVE VOICE TRIGGER DETECTION: ACCURACY VS LATENCY |
3464 PROSODIC CLUSTERING FOR PHONEME-LEVEL PROSODY CONTROL IN END-TO-END SPEECH SYNTHESIS Alexandra Vioni, Myrsini Christidou, Nikolaos Ellinas, Georgios Vamvoukakis, Panos Kakoulidis, Taehoon Kim, June Sig Sung, Hyoungmin Park, Aimilios Chalamandaris, Pirros Tsiakoulis 3464 | PROSODIC CLUSTERING FOR PHONEME-LEVEL PROSODY CONTROL IN END-TO-END SPEECH SYNTHESIS |
3119 PROSODIC REPRESENTATION LEARNING AND CONTEXTUAL SAMPLING FOR NEURAL TEXT-TO-SPEECH Sri Karlapati, Ammar Abbas, Zack Hodari, Alexis Moinet, Arnaud Joly, Penny Karanasou, Thomas Drugman 3119 | PROSODIC REPRESENTATION LEARNING AND CONTEXTUAL SAMPLING FOR NEURAL TEXT-TO-SPEECH |
4010 PROTOTYPE-BASED PERSONALIZED PRUNING Jangho Kim, Simyung Chang, Sungrack Yun, Nojun Kwak 4010 | PROTOTYPE-BASED PERSONALIZED PRUNING |
1812 PROTOTYPICAL NETWORKS FOR DOMAIN ADAPTATION IN ACOUSTIC SCENE CLASSIFICATION Shubhr Singh, Helen L. Bear, Emmanouil Benetos 1812 | PROTOTYPICAL NETWORKS FOR DOMAIN ADAPTATION IN ACOUSTIC SCENE CLASSIFICATION |
5098 PROVABLY FAST ASYNCHRONOUS AND DISTRIBUTED ALGORITHMS FOR PAGERANK CENTRALITY COMPUTATION Yiran HE, Hoi-To WAI 5098 | PROVABLY FAST ASYNCHRONOUS AND DISTRIBUTED ALGORITHMS FOR PAGERANK CENTRALITY COMPUTATION |
2259 PRUNING OF CONVOLUTIONAL NEURAL NETWORKS USING ISING ENERGY MODEL Hojjat Salehinejad, Shahrokh Valaee 2259 | PRUNING OF CONVOLUTIONAL NEURAL NETWORKS USING ISING ENERGY MODEL |
1667 PUSHING THE LIMIT OF PHASE OFFSET FOR CONTACTLESS SENSING USING COMMODITY WIFI Dongheng Zhang, Xiong Li, Yan Chen 1667 | PUSHING THE LIMIT OF PHASE OFFSET FOR CONTACTLESS SENSING USING COMMODITY WIFI |
4449 PUSHING THE LIMIT OF TYPE I CODEBOOK FOR FDD MASSIVE MIMO BEAMFORMING: A CHANNEL COVARIANCE RECONSTRUCTION APPROACH Kai Li, Ying Li, Lei Cheng, Qingjiang Shi, Zhi-Quan Luo 4449 | PUSHING THE LIMIT OF TYPE I CODEBOOK FOR FDD MASSIVE MIMO BEAMFORMING: A CHANNEL COVARIANCE RECONSTRUCTION APPROACH |
1703 Pyramid U-Net for Retinal Vessel Segmentation Jiawei Zhang, Yanchun Zhang, Xiaowei Xu 1703 | Pyramid U-Net for Retinal Vessel Segmentation |
4932 QOE-DRIVEN AND TILE-BASED ADAPTIVE STREAMING FOR POINT CLOUDS Lisha Wang, Chenglin Li, Wenrui Dai, Junni Zou, Hongkai Xiong 4932 | QOE-DRIVEN AND TILE-BASED ADAPTIVE STREAMING FOR POINT CLOUDS |
4071 QUATERNION-VALUED VARIATIONAL AUTOENCODER Eleonora Grassucci, Danilo Comminiello, Aurelio Uncini 4071 | QUATERNION-VALUED VARIATIONAL AUTOENCODER |
5089 QUERY-BY-EXAMPLE KEYWORD SPOTTING SYSTEM USING MULTI-HEAD ATTENTION AND SOFTTRIPLE LOSS Jinmiao Huang, Waseem Gharbieh, Han Suk Shim, Eugene Kim 5089 | QUERY-BY-EXAMPLE KEYWORD SPOTTING SYSTEM USING MULTI-HEAD ATTENTION AND SOFTTRIPLE LOSS |
5201 QUERYD: A VIDEO DATASET WITH HIGH-QUALITY TEXTUAL AND AUDIO NARRATIONS Andreea-Maria Oncescu, Jõao F. Henriques, Yang Liu, Andrew Zisserman, Samuel Albanie 5201 | QUERYD: A VIDEO DATASET WITH HIGH-QUALITY TEXTUAL AND AUDIO NARRATIONS |
4017 QUICKEST CHANGE DETECTION WITH TIME INCONSISTENT ANTICIPATORY AGENTS IN CYBER-PHYSICAL SYSTEMS Vikram Krishnamurthy 4017 | QUICKEST CHANGE DETECTION WITH TIME INCONSISTENT ANTICIPATORY AGENTS IN CYBER-PHYSICAL SYSTEMS |
4603 Quickest Joint Detection and Classification of Faults In Statistically Periodic Processes Taposh Banerjee, Smruti Padhy, Ahmad Taha, Eugene John 4603 | Quickest Joint Detection and Classification of Faults In Statistically Periodic Processes |
1262 RADAR CLUTTER CLASSIFICATION USING EXPECTATION-MAXIMIZATION METHOD Sudan Han, Pia Addabbo, Danilo Orlando, Giuseppe Ricci 1262 | RADAR CLUTTER CLASSIFICATION USING EXPECTATION-MAXIMIZATION METHOD |
1522 RADIO FREQUENCY BASED HEART RATE VARIABILITY MONITORING Fengyu Wang, Xiaolu Zeng, Chenshu Wu, Beibei Wang, K. J. Ray Liu 1522 | RADIO FREQUENCY BASED HEART RATE VARIABILITY MONITORING |
3149 RANDOM PROJECTION STREAMS FOR (WEIGHTED) NONNEGATIVE MATRIX FACTORIZATION Farouk Yahaya, Matthieu Puigt, Gilles Delmaire, Gilles Roussel 3149 | RANDOM PROJECTION STREAMS FOR (WEIGHTED) NONNEGATIVE MATRIX FACTORIZATION |
2640 RANGE GUIDED DEPTH REFINEMENT AND UNCERTAINTY-AWARE AGGREGATION FOR VIEW SYNTHESIS Yuan Chang, Yisong Chen, Guoping Wang 2640 | RANGE GUIDED DEPTH REFINEMENT AND UNCERTAINTY-AWARE AGGREGATION FOR VIEW SYNTHESIS |
5317 RANK-REVEALING BLOCK-TERM DECOMPOSITION FOR TENSOR COMPLETION Athanasios Rontogiannis, Paris Giampouras, Eleftherios Kofidis 5317 | RANK-REVEALING BLOCK-TERM DECOMPOSITION FOR TENSOR COMPLETION |
3930 RATE 1 QUASI ORTHOGONAL UNIVERSAL TRANSMISSION AND COMBINING FOR MIMO SYSTEMS ACHIEVING FULL DIVERSITY Barak Avraham, Uri Erez, Elad Domanovitz 3930 | RATE 1 QUASI ORTHOGONAL UNIVERSAL TRANSMISSION AND COMBINING FOR MIMO SYSTEMS ACHIEVING FULL DIVERSITY |
3643 Rate-distortion optimized motion estimation for on-the-sphere compression of 360 videos Alban Marie, Navid Mahmoudian Bidgoli, Thomas Maugey, Aline Roumy 3643 | Rate-distortion optimized motion estimation for on-the-sphere compression of 360 videos |
3783 RAW DATA PROCESSING FOR PRACTICAL TIME-OF-FLIGHT SUPER-RESOLUTION Miguel Heredia Conde 3783 | RAW DATA PROCESSING FOR PRACTICAL TIME-OF-FLIGHT SUPER-RESOLUTION |
2703 Real Image Super-Resolution using Token Based Contextual Attention Zhihong Pan, Baopu Li 2703 | Real Image Super-Resolution using Token Based Contextual Attention |
3994 REAL NUMBER SIGNAL PROCESSING CAN DETECT DENIAL-OF-SERVICE ATTACKS Holger Boche, Rafael F. Schaefer, H. Vincent Poor 3994 | REAL NUMBER SIGNAL PROCESSING CAN DETECT DENIAL-OF-SERVICE ATTACKS |
1206 Real Time Synchronization in Neural Networks for Multivariate Time series Anomaly Detection Ahmed Abdulaal, Tomer Lancewicki 1206 | Real Time Synchronization in Neural Networks for Multivariate Time series Anomaly Detection |
2812 REAL VERSUS FAKE 4K - AUTHENTIC RESOLUTION ASSESSMENT Rishi Rajesh Shah, Vyas Anirudh Akundy, Zhou Wang 2812 | REAL VERSUS FAKE 4K - AUTHENTIC RESOLUTION ASSESSMENT |
1886 REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET Hyeong-Seok Choi, Sungjin Park, Jie Hwan Lee, Hoon Heo, Dongsuk Jeon, Kyogu Lee 1886 | REAL-TIME DENOISING AND DEREVERBERATION WTIH TINY RECURRENT U-NET |
2656 REAL-TIME INTERAURAL TIME DELAY ESTIMATION VIA ONSET DETECTION Elizabeth Ren, Gustavo Cid Ornelas, Hans-Andrea Loeliger 2656 | REAL-TIME INTERAURAL TIME DELAY ESTIMATION VIA ONSET DETECTION |
4484 Real-Time Radio Modulation Classification with an LSTM Auto-Encoder Ziqi Ke, Haris Vikalo 4484 | Real-Time Radio Modulation Classification with an LSTM Auto-Encoder |
3496 REAL-TIME SPEECH ENHANCEMENT FOR MOBILE COMMUNICATION BASED ON DUAL-CHANNEL COMPLEX SPECTRAL MAPPING Ke Tan, Xueliang Zhang, DeLiang Wang 3496 | REAL-TIME SPEECH ENHANCEMENT FOR MOBILE COMMUNICATION BASED ON DUAL-CHANNEL COMPLEX SPECTRAL MAPPING |
3186 Real-time Speech Frequency Bandwidth Extension Yunpeng Li, Marco Tagliasacchi, Oleg Rybakov, Victor Ungureanu, Dominik Roblek 3186 | Real-time Speech Frequency Bandwidth Extension |
1271 Recent Advances in Arabic Syntactic Diacritics Restoration Yasser Hifny 1271 | Recent Advances in Arabic Syntactic Diacritics Restoration |
3904 RECENT DEVELOPMENTS ON ESPNET TOOLKIT BOOSTED BY CONFORMER Pengcheng Guo, Florian Boyer, Xuankai Chang, Tomoki Hayashi, Yosuke Higuchi, Hirofumi Inaguma, Naoyuki Kamo, Chenda Li, Daniel Garcia-Romero, Jiatong Shi, Jing Shi, Shinji Watanabe, Kun Wei, Wangyou Zhang, Yuekai Zhang 3904 | RECENT DEVELOPMENTS ON ESPNET TOOLKIT BOOSTED BY CONFORMER |
3096 RECOGNITION OF DYNAMIC HAND GESTURE BASED ON MM-WAVE FMCW RADAR MICRO-DOPPLER SIGNATURES Wen Jiang, Yihui Ren, Ying Liu, Ziao Wang 3096 | RECOGNITION OF DYNAMIC HAND GESTURE BASED ON MM-WAVE FMCW RADAR MICRO-DOPPLER SIGNATURES |
3451 RECURRENT PHASE RECONSTRUCTION USING ESTIMATED PHASE DERIVATIVES FROM DEEP NEURAL NETWORKS Lars Thieling, Daniel Wilhelm, Peter Jax 3451 | RECURRENT PHASE RECONSTRUCTION USING ESTIMATED PHASE DERIVATIVES FROM DEEP NEURAL NETWORKS |
3111 RECURSIVE INPUT AND STATE ESTIMATION: A GENERAL FRAMEWORK FOR LEARNING FROM TIME SERIES WITH MISSING DATA Alberto Garcia-Duran, Robert West 3111 | RECURSIVE INPUT AND STATE ESTIMATION: A GENERAL FRAMEWORK FOR LEARNING FROM TIME SERIES WITH MISSING DATA |
2760 REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling Hu Hu, Xuesong Yang, Zeynab Raeesy, Jinxi Guo, Gokce Keskin, Harish Arsikere, Ariya Rastrow, Andreas Stolcke, Roland Maas 2760 | REDAT: Accent-Invariant Representation for End-to-End ASR by Domain Adversarial Training with Relabeling |
3775 REDUCED-COMPLEXITY CHANNEL ESTIMATION BY HIERARCHICAL INTERPOLATION EXPLOITING SPARSITY FOR MASSIVE MIMO SYSTEMS WITH UNIFORM RECTANGULAR ARRAY Chi-Shiang Wang, Pei-Yun Tsai 3775 | REDUCED-COMPLEXITY CHANNEL ESTIMATION BY HIERARCHICAL INTERPOLATION EXPLOITING SPARSITY FOR MASSIVE MIMO SYSTEMS WITH UNIFORM RECTANGULAR ARRAY |
2724 REDUCED-COMPLEXITY MODULAR POLYNOMIAL MULTIPLICATION FOR R-LWE CRYPTOSYSTEMS Xinmiao Zhang, Keshab Parhi 2724 | REDUCED-COMPLEXITY MODULAR POLYNOMIAL MULTIPLICATION FOR R-LWE CRYPTOSYSTEMS |
2914 REDUCING MODAL ERROR PROPAGATION THROUGH CORRECTING MISMATCHED MICROPHONE GAINS USING RAPID Noman Akbar, Glenn Dickins, Mark R. P. Thomas, Prasanga Samarasinghe, Thushara Abhayapala 2914 | REDUCING MODAL ERROR PROPAGATION THROUGH CORRECTING MISMATCHED MICROPHONE GAINS USING RAPID |
3196 REDUCING SPELLING INCONSISTENCIES IN CODE-SWITCHING ASR USING CONTEXTUALIZED CTC LOSS Burin Naowarat, Thananchai Kongthaworn, Korrawe Karunratanakul, Sheng Hui Wu, Ekapol Chuangsuwanich 3196 | REDUCING SPELLING INCONSISTENCIES IN CODE-SWITCHING ASR USING CONTEXTUALIZED CTC LOSS |
1496 REFINEMENT OF DIRECTION OF ARRIVAL ESTIMATORS BY MAJORIZATION-MINIMIZATION OPTIMIZATION ON THE ARRAY MANIFOLD Robin Scheibler, Masahito Togami 1496 | REFINEMENT OF DIRECTION OF ARRIVAL ESTIMATORS BY MAJORIZATION-MINIMIZATION OPTIMIZATION ON THE ARRAY MANIFOLD |
2767 REFINING AUTOMATIC SPEECH RECOGNITION SYSTEM FOR OLDER ADULTS Liu Chen, Meysam Asgari 2767 | REFINING AUTOMATIC SPEECH RECOGNITION SYSTEM FOR OLDER ADULTS |
1421 REFLECTANCE-ORIENTED PROBABILISTIC EQUALIZATION FOR IMAGE ENHANCEMENT Xiaomeng Wu, Yongqing Sun, Akisato Kimura, Kunio Kashino 1421 | REFLECTANCE-ORIENTED PROBABILISTIC EQUALIZATION FOR IMAGE ENHANCEMENT |
4416 REGRESSION OR CLASSIFICATION? NEW METHODS TO EVALUATE NO-REFERENCE PICTURE AND VIDEO QUALITY MODELS Zhengzhong Tu, Chia-Ju Chen, Li-Heng Chen, Yilin Wang, Neil Birkbeck, Balu Adsumilli, Alan Bovik 4416 | REGRESSION OR CLASSIFICATION? NEW METHODS TO EVALUATE NO-REFERENCE PICTURE AND VIDEO QUALITY MODELS |
2672 REGULARIZED RECOVERY BY MULTI-ORDER PARTIAL HYPERGRAPH TOTAL VARIATION Ruyuan Qu, Jiaqi He, Hui Feng, Chongbin Xu, Bo Hu 2672 | REGULARIZED RECOVERY BY MULTI-ORDER PARTIAL HYPERGRAPH TOTAL VARIATION |
5012 Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering Xinyu Xiao, Chunxia Zhang, Shiming Xiang, Chunhong Pan 5012 | Reinforcement Stacked Learning with Semantic-Associated Attention for Visual Question Answering |
2200 Relaxed Wasserstein with Applications to GANs Xin Guo, Johnny Hong, Tianyi Lin, Nan Yang 2200 | Relaxed Wasserstein with Applications to GANs |
3209 RELIABILITY ASSESSMENT OF SINGING VOICE F0-ESTIMATES USING MULTIPLE ALGORITHMS Sebastian Rosenzweig, Frank Scherbaum, Meinard Müller 3209 | RELIABILITY ASSESSMENT OF SINGING VOICE F0-ESTIMATES USING MULTIPLE ALGORITHMS |
3609 RELYING ON A RATE CONSTRAINT TO REDUCE MOTION ESTIMATION COMPLEXITY Gabriel B. Sant'Anna, Luiz Henrique Cancellier, Ismael Seidel, Mateus Grellert, José Luís Güntzel 3609 | RELYING ON A RATE CONSTRAINT TO REDUCE MOTION ESTIMATION COMPLEXITY |
2154 REPAC: RELIABLE ESTIMATION OF PHASE-AMPLITUDE COUPLING IN BRAIN NETWORKS Giulia Cisotto 2154 | REPAC: RELIABLE ESTIMATION OF PHASE-AMPLITUDE COUPLING IN BRAIN NETWORKS |
2446 REPLACING HUMAN AUDIO WITH SYNTHETIC AUDIO FOR ON-DEVICE UNSPOKEN PUNCTUATION PREDICTION Daria Soboleva, Ondrej Skopek, Márius Šajgalík, Victor Cărbune, Felix Weissenberger, Julia Proskurnia, Bogdan Prisacari, Daniel Valcarce, Justin Lu, Rohit Prabhavalkar, Balint Miklos 2446 | REPLACING HUMAN AUDIO WITH SYNTHETIC AUDIO FOR ON-DEVICE UNSPOKEN PUNCTUATION PREDICTION |
2972 REPLAY AND SYNTHETIC SPEECH DETECTION WITH RES2NET ARCHITECTURE Xu Li, Na Li, Chao Weng, Xunying Liu, Dan Su, Dong Yu, Helen Meng 2972 | REPLAY AND SYNTHETIC SPEECH DETECTION WITH RES2NET ARCHITECTURE |
4939 Replay-Attack Detection using Features with Adaptive Spectro-Temporal Resolution Meng Liu, Longbiao Wang, Kong Aik Lee, Xuanda Chen, Jianwu Dang 4939 | Replay-Attack Detection using Features with Adaptive Spectro-Temporal Resolution |
4883 Representation Learning For Speech Recognition Using Feedback Based Relevance Weighting Purvi Agrawal, Sriram Ganapathy 4883 | Representation Learning For Speech Recognition Using Feedback Based Relevance Weighting |
3346 REPRESENTATION LEARNING WITH SPECTRO-TEMPORAL-CHANNEL ATTENTION FOR SPEECH EMOTION RECOGNITION Lili Guo, Longbiao Wang, Chenglin Xu, Jianwu Dang, Eng Siong Chng, Haizhou Li 3346 | REPRESENTATION LEARNING WITH SPECTRO-TEMPORAL-CHANNEL ATTENTION FOR SPEECH EMOTION RECOGNITION |
1633 REPRESENTATIVE LOCAL FEATURE MINING FOR FEW-SHOT LEARNING Kun Yan, Lingbo Liu, Jun Hou, Ping Wang 1633 | REPRESENTATIVE LOCAL FEATURE MINING FOR FEW-SHOT LEARNING |
1204 Resolution Limits of 20 Questions Search Strategies for Moving Targets Lin Zhou, Alfred Hero 1204 | Resolution Limits of 20 Questions Search Strategies for Moving Targets |
4832 RESPIPE: RESILIENT MODEL-DISTRIBUTED DNN TRAINING AT EDGE NETWORKS Pengzhen Li, Erdem Koyuncu, Hulya Seferoglu 4832 | RESPIPE: RESILIENT MODEL-DISTRIBUTED DNN TRAINING AT EDGE NETWORKS |
3502 REST: Robust Learned Shrinkage-Thresholding unrolled network Wei Pu, Chao Zhou, Yonina Eldar, Miguel Rodrigues 3502 | REST: Robust Learned Shrinkage-Thresholding unrolled network |
2475 RETHINKING THE SEPARATION LAYERS IN SPEECH SEPARATION NETWORKS Yi Luo, Zhuo Chen, Cong Han, Chenda Li, Tianyan Zhou, Nima Mesgarani 2475 | RETHINKING THE SEPARATION LAYERS IN SPEECH SEPARATION NETWORKS |
4908 REVERB CONVERSION OF MIXED VOCAL TRACKS USING AN END-TO-END CONVOLUTIONAL DEEP NEURAL NETWORK Junghyun Koo, Seungryeol Paik, Kyogu Lee 4908 | REVERB CONVERSION OF MIXED VOCAL TRACKS USING AN END-TO-END CONVOLUTIONAL DEEP NEURAL NETWORK |
3129 REVERSIBLE DATA HIDING IN JPEG IMAGES FOR PRIVACY PROTECTION Yuxuan Huang, Xin Cao, Hao-Tian Wu, Yiu-ming Cheung 3129 | REVERSIBLE DATA HIDING IN JPEG IMAGES FOR PRIVACY PROTECTION |
5228 REWEIGHTED DYNAMIC GROUP CONVOLUTION Weiwei Chen, Chong Wang, Zhehao Zhang, Zheng Huo, Linlin Gao 5228 | REWEIGHTED DYNAMIC GROUP CONVOLUTION |
1960 RGLN: Robust Residual Graph Learning Networks via Similarity-Preserving Mapping on Graphs Jiaxiang Tang, Xiang Gao, Wei Hu 1960 | RGLN: Robust Residual Graph Learning Networks via Similarity-Preserving Mapping on Graphs |
4326 Riemannian Geometric Optimization Methods for Joint Design of Transmit Sequence and Receive Filter of MIMO Radar Jie Li, Guisheng Liao, Yan Huang, Arye Nehorai 4326 | Riemannian Geometric Optimization Methods for Joint Design of Transmit Sequence and Receive Filter of MIMO Radar |
1374 RIEMANNIAN GEOMETRY ON CONNECTIVITY FOR CLINICAL BCI Marie-Constance Corsi, Florian Yger, Sylvain Chevallier, Camille Noûs 1374 | RIEMANNIAN GEOMETRY ON CONNECTIVITY FOR CLINICAL BCI |
1457 RIEMANNIAN GEOMETRY-BASED DECODING OF THE DIRECTIONAL FOCUS OF AUDITORY ATTENTION USING EEG Simon Geirnaert, Tom Francart, Alexander Bertrand 1457 | RIEMANNIAN GEOMETRY-BASED DECODING OF THE DIRECTIONAL FOCUS OF AUDITORY ATTENTION USING EEG |
3958 RIS-AIDED JOINT LOCALIZATION AND SYNCHRONIZATION WITH A SINGLE-ANTENNA MMWAVE RECEIVER Alessio Fascista, Angelo Coluccia, Henk Wymeersch, Gonzalo Seco-Granados 3958 | RIS-AIDED JOINT LOCALIZATION AND SYNCHRONIZATION WITH A SINGLE-ANTENNA MMWAVE RECEIVER |
2636 RNN TRANSDUCER MODELS FOR SPOKEN LANGUAGE UNDERSTANDING Samuel Thomas, Hong-Kwang Kuo, George Saon, Zoltan Tuske, Brian Kingsbury, Gakuto Kurata, Zvi Kons, Ron Hoory 2636 | RNN TRANSDUCER MODELS FOR SPOKEN LANGUAGE UNDERSTANDING |
4728 RNN-T BASED OPEN-VOCABULARY KEYWORD SPOTTING IN MANDARIN WITH MULTI-LEVEL DETECTION Zuozhen Liu, Ta Li, Pengyuan Zhang 4728 | RNN-T BASED OPEN-VOCABULARY KEYWORD SPOTTING IN MANDARIN WITH MULTI-LEVEL DETECTION |
4851 Robust Binary Loss for Multi-category Classification with Label Noise Defu Liu, Guowu Yang, Jinzhao Wu, Jiayi Zhao, Fengmao Lv 4851 | Robust Binary Loss for Multi-category Classification with Label Noise |
1436 ROBUST DEEP REINFORCEMENT LEARNING FOR UNDERWATER NAVIGATION WITH UNKNOWN DISTURBANCES Juan Parras, Santiago Zazo 1436 | ROBUST DEEP REINFORCEMENT LEARNING FOR UNDERWATER NAVIGATION WITH UNKNOWN DISTURBANCES |
2688 ROBUST DEVICE-FREE PROXIMITY DETECTION USING WIFI Yuqian Hu, Muhammed Zahid Ozturk, Feng Zhang, Beibei Wang, K. J. Ray Liu 2688 | ROBUST DEVICE-FREE PROXIMITY DETECTION USING WIFI |
3188 ROBUST DOMAIN-FREE DOMAIN GENERALIZATION WITH CLASS-AWARE ALIGNMENT Wenyu Zhang, Mohamed Ragab, Ramon Sagarna 3188 | ROBUST DOMAIN-FREE DOMAIN GENERALIZATION WITH CLASS-AWARE ALIGNMENT |
1784 Robust estimation of high-order phase dynamics using Variational Bayes inference Fabio Fabozzi, Stéphanie Bidon, Sébastien Roche 1784 | Robust estimation of high-order phase dynamics using Variational Bayes inference |
4680 ROBUST GRAPH AUTOENCODER FOR HYPERSPECTRAL ANOMALY DETECTION Ganghui Fan, Yong Ma, Jun Huang, Xiaoguang Mei, Jiayi Ma 4680 | ROBUST GRAPH AUTOENCODER FOR HYPERSPECTRAL ANOMALY DETECTION |
5036 ROBUST GRAPH-FILTER IDENTIFICATION WITH GRAPH DENOISING REGULARIZATION Samuel Rey, Antonio Marques 5036 | ROBUST GRAPH-FILTER IDENTIFICATION WITH GRAPH DENOISING REGULARIZATION |
4194 ROBUST LATENT REPRESENTATIONS VIA CROSS-MODAL TRANSLATION AND ALIGNMENT Vandana Rajan, Alessio Brutti, Andrea Cavallaro 4194 | ROBUST LATENT REPRESENTATIONS VIA CROSS-MODAL TRANSLATION AND ALIGNMENT |
5349 ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING Thanh Nguyen, Tung Luu, Sanzhar Rakhimkul, Trung Pham, Chang Dong Yoo 5349 | ROBUST MAML: PRIORITIZATION TASK BUFFER WITH ADAPTIVE LEARNING PROCESS FOR MODEL-AGNOSTIC META-LEARNING |
2380 Robust PCA through Maximum Correntropy Power Iterations Jean Chereau, Bruno Scalzo, Danilo P. Mandic 2380 | Robust PCA through Maximum Correntropy Power Iterations |
1990 Robust Recursive Least M-estimate Adaptive Filter for the Identification of Low-Rank Acoustic Systems Hongsen He, Jingdong Chen, Jacob Benesty, Yi Yu 1990 | Robust Recursive Least M-estimate Adaptive Filter for the Identification of Low-Rank Acoustic Systems |
2419 ROBUST SPATIAL-TEMPORAL CORRELATION MODEL FOR BACKGROUND INITIALIZATION IN SEVERE SCENE Yuheng Deng, Wenjun Zhou, Bo Peng, Dong Liang, Shun'ichi Kaneko 2419 | ROBUST SPATIAL-TEMPORAL CORRELATION MODEL FOR BACKGROUND INITIALIZATION IN SEVERE SCENE |
1506 ROBUST STEERABLE DIFFERENTIAL BEAMFORMERS WITH NULL CONSTRAINTS FOR CONCENTRIC CIRCULAR MICROPHONE ARRAYS Xuehan Wang, Gongping Huang, Israel Cohen, Jacob Benesty, Jingdong Chen 1506 | ROBUST STEERABLE DIFFERENTIAL BEAMFORMERS WITH NULL CONSTRAINTS FOR CONCENTRIC CIRCULAR MICROPHONE ARRAYS |
2076 ROBUST STFT DOMAIN MULTI-CHANNEL ACOUSTIC ECHO CANCELLATION WITH ADAPTIVE DECORRELATION OF THE REFERENCE SIGNALS Saeed Bagheri Sereshki, Daniele Giacobello 2076 | ROBUST STFT DOMAIN MULTI-CHANNEL ACOUSTIC ECHO CANCELLATION WITH ADAPTIVE DECORRELATION OF THE REFERENCE SIGNALS |
1258 Robust Voice Activity Detection Using A Masked Auditory Encoder Based Convolutional Neural Network Nan LI, Longbiao Wang, Unoki Masashi, Sheng LI, Rui Wang, Meng Ge, Jianwu Dang 1258 | Robust Voice Activity Detection Using A Masked Auditory Encoder Based Convolutional Neural Network |
4353 ROBUSTNESS AND DIVERSITY SEEKING DATA-FREE KNOWLEDGE DISTILLATION Pengchao Han, Jihong Park, Shiqiang Wang, Yejun Liu 4353 | ROBUSTNESS AND DIVERSITY SEEKING DATA-FREE KNOWLEDGE DISTILLATION |
4583 ROLE AWARE MULTI-PARTY DIALOGUE QUESTION ANSWERING Jui-Heng Hsu, Po-Wei Shen, Hung-Ting Su, Chen-Hsi Chang, Jia-Fong Yeh, Winston H. Hsu 4583 | ROLE AWARE MULTI-PARTY DIALOGUE QUESTION ANSWERING |
2573 Room adaptive conditioning method for sound event classification in reverberant environments Jaejun Lee, Donmoon Lee, Hyeong-Seok Choi, Kyogu Lee 2573 | Room adaptive conditioning method for sound event classification in reverberant environments |
1473 ROOM IMPULSE RESPONSE INTERPOLATION FROM A SPARSE SET OF MEASUREMENTS USING A MODAL ARCHITECTURE Orchisama Das, Paul Calamia, Sebastia Gari 1473 | ROOM IMPULSE RESPONSE INTERPOLATION FROM A SPARSE SET OF MEASUREMENTS USING A MODAL ARCHITECTURE |
4785 Rotation Invariance Analysis of Local Convolutional Features in Image Retrieval Longjiao Zhao, Yu Wang, Jien Kato 4785 | Rotation Invariance Analysis of Local Convolutional Features in Image Retrieval |
4523 ROTATION-ROBUST BEAMFORMING BASED ON SOUND FIELD INTERPOLATION WITH REGULARLY CIRCULAR MICROPHONE ARRAY Yukoh Wakabayashi, Kouei Yamaoka, Nobutaka Ono 4523 | ROTATION-ROBUST BEAMFORMING BASED ON SOUND FIELD INTERPOLATION WITH REGULARLY CIRCULAR MICROPHONE ARRAY |
2557 RoutingGAN: Routing Age Progression and Regression with Disentangled Learning Zhizhong Huang, Hongming Shan, Junping Zhang 2557 | RoutingGAN: Routing Age Progression and Regression with Disentangled Learning |
4590 Rule-embedded network for audio-visual voice activity detection in live musical video streams Yuanbo Hou, Yi Deng, Bilei Zhu, Zejun Ma, Dick Botteldooren 4590 | Rule-embedded network for audio-visual voice activity detection in live musical video streams |
3350 SAFE SCREENING FOR SPARSE REGRESSION WITH THE KULLBACK-LEIBLER DIVERGENCE Cassio Dantas, Emmanuel Soubies, Cédric Févotte 3350 | SAFE SCREENING FOR SPARSE REGRESSION WITH THE KULLBACK-LEIBLER DIVERGENCE |
1160 SAGA: SPARSE ADVERSARIAL ATTACK ON EEG-BASED BRAIN COMPUTER INTERFACE Boyuan Feng, Yuke Wang, Yufei Ding 1160 | SAGA: SPARSE ADVERSARIAL ATTACK ON EEG-BASED BRAIN COMPUTER INTERFACE |
3144 SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION Kristian Fischer, Felix Fleckenstein, Christian Herglotz, André Kaup 3144 | SALIENCY-DRIVEN VERSATILE VIDEO CODING FOR NEURAL OBJECT DETECTION |
5132 SAMPLE EFFICIENT SUBSPACE-BASED REPRESENTATIONS FOR NONLINEAR META-LEARNING Ibrahim Gulluk, Yue Sun, Samet Oymak, Maryam Fazel 5132 | SAMPLE EFFICIENT SUBSPACE-BASED REPRESENTATIONS FOR NONLINEAR META-LEARNING |
1336 SANDGLASSET: A LIGHT MULTI-GRANULARITY SELF-ATTENTIVE NETWORK FOR TIME-DOMAIN SPEECH SEPARATION Max W. Y. Lam, Jun Wang, Dan Su, Dong Yu 1336 | SANDGLASSET: A LIGHT MULTI-GRANULARITY SELF-ATTENTIVE NETWORK FOR TIME-DOMAIN SPEECH SEPARATION |
1733 SA-NET: SHUFFLE ATTENTION FOR DEEP CONVOLUTIONAL NEURAL NETWORKS QINGLONG Zhang, Yu-Bin Yang 1733 | SA-NET: SHUFFLE ATTENTION FOR DEEP CONVOLUTIONAL NEURAL NETWORKS |
3845 SANET++: ENHANCED SCALE AGGREGATION WITH DENSELY CONNECTED FEATURE FUSION FOR CROWD COUNTING Siyang Pan, Yanyun Zhao, Fei Su, Zhicheng Zhao 3845 | SANET++: ENHANCED SCALE AGGREGATION WITH DENSELY CONNECTED FEATURE FUSION FOR CROWD COUNTING |
2773 SAPAUGMENT: LEARNING A SAMPLE ADAPTIVE POLICY FOR DATA AUGMENTATION Ting-Yao Hu, Ashish Shrivastava, Rick Chang, Hema Koppula, Stefan Braun, Kyuyeon Hwang, Ozlem Kalini, Oncel Tuzel 2773 | SAPAUGMENT: LEARNING A SAMPLE ADAPTIVE POLICY FOR DATA AUGMENTATION |
2575 SAR IMAGE AUTOFOCUSING USING WIRTINGER CALCULUS AND CAUCHY REGULARIZATION Zi-Yao Zhang, Odysseas Pappas, Alin Achim 2575 | SAR IMAGE AUTOFOCUSING USING WIRTINGER CALCULUS AND CAUCHY REGULARIZATION |
4028 SCALABLE AND DISTRIBUTED MMSE ALGORITHMS FOR UPLINK RECEIVE COMBINING IN CELL-FREE MASSIVE MIMO SYSTEMS Robbe Van Rompaey, Marc Moonen 4028 | SCALABLE AND DISTRIBUTED MMSE ALGORITHMS FOR UPLINK RECEIVE COMBINING IN CELL-FREE MASSIVE MIMO SYSTEMS |
2240 SCALABLE DISCRIMINATIVE DISCRETE HASHING FOR LARGE-SCALE CROSS-MODAL RETRIEVAL Jianyang Qin, Lunke Fei, Jian Zhu, Jie Wen, Chunwei Tian, Shuai Wu 2240 | SCALABLE DISCRIMINATIVE DISCRETE HASHING FOR LARGE-SCALE CROSS-MODAL RETRIEVAL |
1247 SCALABLE MULTILEVEL QUANTIZATION FOR DISTRIBUTED DETECTION Gökhan Gül, Michael Bassler 1247 | SCALABLE MULTILEVEL QUANTIZATION FOR DISTRIBUTED DETECTION |
3755 Scalable Privacy-Preserving Distributed Extremely Randomized Trees for Structured Data with Multiple Colluding Parties Amin Aminifar, Fazle Rabbi, Yngve Lamo 3755 | Scalable Privacy-Preserving Distributed Extremely Randomized Trees for Structured Data with Multiple Colluding Parties |
1223 SCALABLE REINFORCEMENT LEARNING FOR ROUTING IN AD-HOC NETWORKS BASED ON PHYSICAL-LAYER ATTRIBUTES Wei Cui, Wei Yu 1223 | SCALABLE REINFORCEMENT LEARNING FOR ROUTING IN AD-HOC NETWORKS BASED ON PHYSICAL-LAYER ATTRIBUTES |
1119 SCALED FAST NESTED KEY EQUATION SOLVER FOR GENERALIZED INTEGRATED INTERLEAVED BCH DECODERS Zhenshan Xie, Xinmiao Zhang 1119 | SCALED FAST NESTED KEY EQUATION SOLVER FOR GENERALIZED INTEGRATED INTERLEAVED BCH DECODERS |
2152 Scene Completeness-Aware Lidar Depth Completion for Driving Scenario Cho-Ying Wu, Ulrich Neumann 2152 | Scene Completeness-Aware Lidar Depth Completion for Driving Scenario |
3733 SCORE-BASED CHANGE DETECTION FOR GRADIENT-BASED LEARNING MACHINES Lang Liu, Joseph Salmon, Zaid Harchaoui 3733 | SCORE-BASED CHANGE DETECTION FOR GRADIENT-BASED LEARNING MACHINES |
1433 SEARCHING FOR ANOMALIES WITH MULTIPLE PLAYS UNDER DELAY AND SWITCHING COSTS Tidhar Lambez, Kobi Cohen 1433 | SEARCHING FOR ANOMALIES WITH MULTIPLE PLAYS UNDER DELAY AND SWITCHING COSTS |
3136 SECRET KEY GENERATION OVER WIRELESS CHANNELS USING SHORT BLOCKLENGTH MULTILEVEL SOURCE POLAR CODING Henri Hentilä, Yanina Shkel, Visa Koivunen 3136 | SECRET KEY GENERATION OVER WIRELESS CHANNELS USING SHORT BLOCKLENGTH MULTILEVEL SOURCE POLAR CODING |
3033 SECURE UAV COMMUNICATIONS UNDER UNCERTAIN EAVESDROPPERS LOCATIONS Silei Wang, Fanxiang Kong, Qiang Li 3033 | SECURE UAV COMMUNICATIONS UNDER UNCERTAIN EAVESDROPPERS LOCATIONS |
4857 SEEHEAR: SIGNER DIARISATION AND A NEW DATASET Samuel Albanie, Gül Varol, Liliane Momeni, Triantafyllos Afouras, Andrew Brown, Chuhan Zhang, Ernesto Coto, Necati Cihan Camgöz, Ben Saunders, Abhishek Dutta, Neil Fox, Richard Bowden, Bencie Woll, Andrew Zisserman 4857 | SEEHEAR: SIGNER DIARISATION AND A NEW DATASET |
4278 SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET Kun Zhou, Berrak Sisman, Rui Liu, Haizhou Li 4278 | SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET |
4493 SEGMENTAL DTW: A PARALLELIZABLE ALTERNATIVE TO DTW TJ Tsai 4493 | SEGMENTAL DTW: A PARALLELIZABLE ALTERNATIVE TO DTW |
4398 SEGREGATION IN SOCIAL NETWORKS: MARKOV BRIDGE MODELS AND ESTIMATION Vikram Krishnamurthy, Rui Luo, Buddhika Nettasinghe 4398 | SEGREGATION IN SOCIAL NETWORKS: MARKOV BRIDGE MODELS AND ESTIMATION |
3975 SEIZURE DETECTION USING POWER SPECTRAL DENSITY VIA HYPERDIMENSIONAL COMPUTING Lulu Ge, Keshab K. Parhi 3975 | SEIZURE DETECTION USING POWER SPECTRAL DENSITY VIA HYPERDIMENSIONAL COMPUTING |
1832 SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR SPEECH ENHANCEMENT Huy Phan, Huy Le Nguyen, Oliver Chén, Philipp Koch, Ngoc Q. K. Duong, Ian McLoughlin, Alfred Mertins 1832 | SELF-ATTENTION GENERATIVE ADVERSARIAL NETWORK FOR SPEECH ENHANCEMENT |
2570 SELF-ATTENTIVE VAD: CONTEXT-AWARE DETECTION OF VOICE FROM NOISE Yong Rae Jo, Young Ki Moon, Won Ik Cho, Geun Sik Jo 2570 | SELF-ATTENTIVE VAD: CONTEXT-AWARE DETECTION OF VOICE FROM NOISE |
2377 SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING Shinnosuke Matsuo, Seiichi Uchida, Brian Kenji Iwana 2377 | SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING |
1698 Self-Convolution: A Highly-Efficient Operator for Non-Local Image Restoration Lanqing Guo, Zhiyuan Zha, Saiprasad Ravishankar, Bihan Wen 1698 | Self-Convolution: A Highly-Efficient Operator for Non-Local Image Restoration |
4056 SELFGAIT: A SPATIOTEMPORAL REPRESENTATION LEARNING METHOD FOR SELF-SUPERVISED GAIT RECOGNITION Yiqun Liu, Yi Zeng, Jian Pu, Hongming Shan, Peiyang He, Junping Zhang 4056 | SELFGAIT: A SPATIOTEMPORAL REPRESENTATION LEARNING METHOD FOR SELF-SUPERVISED GAIT RECOGNITION |
3392 SELF-INFERENCE OF OTHERS' POLICIES FOR HOMOGENEOUS AGENTS IN COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING Qifeng Lin, Qing Ling 3392 | SELF-INFERENCE OF OTHERS' POLICIES FOR HOMOGENEOUS AGENTS IN COOPERATIVE MULTI-AGENT REINFORCEMENT LEARNING |
2214 SELF-SUPERVISED DEPTH ESTIMATION VIA IMPLICIT CUES FROM VIDEOS Jianrong Wang, Ge Zhang, Zhenyu Wu, Xuewei Li, Li Liu 2214 | SELF-SUPERVISED DEPTH ESTIMATION VIA IMPLICIT CUES FROM VIDEOS |
4821 SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION Zhengyang Chen, Shuai Wang, Yanmin Qian 4821 | SELF-SUPERVISED LEARNING BASED DOMAIN ADAPTATION FOR ROBUST SPEAKER VERIFICATION |
3313 SELF-SUPERVISED LEARNING FOR FEW-SHOT IMAGE CLASSIFICATION Da Chen, Yuefeng Chen, Yuhong Li, Feng Mao, Yuan He, Hui Xue 3313 | SELF-SUPERVISED LEARNING FOR FEW-SHOT IMAGE CLASSIFICATION |
2284 SELF-SUPERVISED LEARNING FOR SLEEP STAGE CLASSIFICATION WITH PREDICTIVE AND DISCRIMINATIVE CONTRASTIVE CODING Qinfeng Xiao, Jing Wang, Jianan Ye, Hongjun Zhang, Yuyan Bu, Yiqiong Zhang, Hao Wu 2284 | SELF-SUPERVISED LEARNING FOR SLEEP STAGE CLASSIFICATION WITH PREDICTIVE AND DISCRIMINATIVE CONTRASTIVE CODING |
2762 Self-supervised text-independent speaker verification using prototypical momentum contrastive learning Wei Xia, Chunlei Zhang, Chao Weng, Meng Yu, Dong Yu 2762 | Self-supervised text-independent speaker verification using prototypical momentum contrastive learning |
3348 Self-Supervised VQ-VAE For One-Shot Music Style Transfer Ondřej Cífka, Alexey Ozerov, Umut Şimşekli, Gaël Richard 3348 | Self-Supervised VQ-VAE For One-Shot Music Style Transfer |
3894 Self-training and Pre-training are complementary for Speech Recognition Qiantong Xu, Alexei Baevski, Tatiana Likhomanenko, Paden Tomasello, Alexis Conneau, Ronan Collobert, Gabriel Synnaeve, Michael Auli 3894 | Self-training and Pre-training are complementary for Speech Recognition |
3798 Self-Training for Sound Event Detection in Audio Mixtures Sangwook Park, Ashwin Bellur, David K. Han, Mounya Elhilali 3798 | Self-Training for Sound Event Detection in Audio Mixtures |
2239 SEMANTIC IMAGE SYNTHESIS FROM INACCURATE AND COARSE MASKS Kai Katsumata, Hideki Nakayama 2239 | SEMANTIC IMAGE SYNTHESIS FROM INACCURATE AND COARSE MASKS |
1925 SEMANTIC-AWARE CONTEXT AGGREGATION FOR IMAGE INPAINTING Zhilin Huang, Chujun Qin, Ruixin Liu, Zhenyu Weng, Yuesheng Zhu 1925 | SEMANTIC-AWARE CONTEXT AGGREGATION FOR IMAGE INPAINTING |
3473 SEMANTIC-AWARE UNPAIRED IMAGE-TO-IMAGE TRANSLATION FOR URBAN SCENE IMAGES Zongyao Li, Ren Togo, Takahiro Ogawa, Miki Haseyama 3473 | SEMANTIC-AWARE UNPAIRED IMAGE-TO-IMAGE TRANSLATION FOR URBAN SCENE IMAGES |
4171 SEMI-BLIND JOINT CHANNEL, DATA, AND PHASE-NOISE ESTIMATION IN MIMO-OFDM SYSTEMS Bruno Sokal, Paulo Gomes, André de Almeida, Martin Haardt 4171 | SEMI-BLIND JOINT CHANNEL, DATA, AND PHASE-NOISE ESTIMATION IN MIMO-OFDM SYSTEMS |
5593 SEMIDEFINITE PROGRAMMING METHODS FOR ALLEVIATING CLOCK SYNCHRONIZATION BIAS AND SENSOR POSITION ERRORS IN TDOA LOCALIZATION Yanbin Zou, Huaping Liu 5593 | SEMIDEFINITE PROGRAMMING METHODS FOR ALLEVIATING CLOCK SYNCHRONIZATION BIAS AND SENSOR POSITION ERRORS IN TDOA LOCALIZATION |
3437 SEMI-SUPERVISED BATCH ACTIVE LEARNING VIA BILEVEL OPTIMIZATION Zalán Borsos, Marco Tagliasacchi, Andreas Krause 3437 | SEMI-SUPERVISED BATCH ACTIVE LEARNING VIA BILEVEL OPTIMIZATION |
1546 SEMI-SUPERVISED FEATURE EMBEDDING FOR DATA SANITIZATION IN REAL-WORLD EVENTS Bahram Lavi, Jose Nascimento, Anderson Rocha 1546 | SEMI-SUPERVISED FEATURE EMBEDDING FOR DATA SANITIZATION IN REAL-WORLD EVENTS |
5243 SEMI-SUPERVISED LEARNING FOR SINGING SYNTHESIS TIMBRE Jordi Bonada, Merlijn Blaauw 5243 | SEMI-SUPERVISED LEARNING FOR SINGING SYNTHESIS TIMBRE |
4573 SEMI-SUPERVISED MULTIMODAL IMAGE TRANSLATION FOR MISSING MODALITY IMPUTATION Wangbin Sun, Fei Ma, Yang Li, Shao-Lun Huang, Shiguang Ni, Lin Zhang 4573 | SEMI-SUPERVISED MULTIMODAL IMAGE TRANSLATION FOR MISSING MODALITY IMPUTATION |
2942 SEMI-SUPERVISED SINGING VOICE SEPARATION WITH NOISY SELF-TRAINING Zhepei Wang, Ritwik Giri, Umut Isik, Jean-Marc Valin, Arvindh Krishnaswamy 2942 | SEMI-SUPERVISED SINGING VOICE SEPARATION WITH NOISY SELF-TRAINING |
1405 SEMI-SUPERVISED SKIN LESION SEGMENTATION WITH LEARNING MODEL CONFIDENCE Zhiqiang Xie, Enmei Tu, Hao Zheng, Yun Gu, Jie Yang 1405 | SEMI-SUPERVISED SKIN LESION SEGMENTATION WITH LEARNING MODEL CONFIDENCE |
4488 SEMI-SUPERVISED SPEECH RECOGNITION VIA GRAPH-BASED TEMPORAL CLASSIFICATION Niko Moritz, Takaaki Hori, Jonathan Le Roux 4488 | SEMI-SUPERVISED SPEECH RECOGNITION VIA GRAPH-BASED TEMPORAL CLASSIFICATION |
3543 SEMI-SUPERVISED SPOKEN LANGUAGE UNDERSTANDING VIA SELF-SUPERVISED SPEECH AND LANGUAGE MODEL PRETRAINING Cheng-I Lai, Yung-Sung Chuang, Hung-Yi Lee, Shang-Wen Li, James Glass 3543 | SEMI-SUPERVISED SPOKEN LANGUAGE UNDERSTANDING VIA SELF-SUPERVISED SPEECH AND LANGUAGE MODEL PRETRAINING |
2020 Semi-supervised Time Series Classification by Temporal Relation Prediction Haoyi Fan, Fengbin Zhang, Ruidong Wang, Xunhua Huang, Zuoyong Li 2020 | Semi-supervised Time Series Classification by Temporal Relation Prediction |
3485 SENONE-AWARE ADVERSARIAL MULTI-TASK TRAINING FOR UNSUPERVISED CHILD TO ADULT SPEECH ADAPTATION Richeng Duan, Nancy Chen 3485 | SENONE-AWARE ADVERSARIAL MULTI-TASK TRAINING FOR UNSUPERVISED CHILD TO ADULT SPEECH ADAPTATION |
3667 SENSOR NETWORKS TDOA SELF-CALIBRATION: 2D COMPLEXITY ANALYSIS AND SOLUTIONS Luca Ferranti, Kalle Åström, Magnus Oskarsson, Jani Boutellier, Juho Kannala 3667 | SENSOR NETWORKS TDOA SELF-CALIBRATION: 2D COMPLEXITY ANALYSIS AND SOLUTIONS |
3625 SENTENCE BOUNDARY AUGMENTATION FOR NEURAL MACHINE TRANSLATION ROBUSTNESS Daniel Li, Te I, Naveen Arivazhagan, Colin Cherry, Dirk Padfield 3625 | SENTENCE BOUNDARY AUGMENTATION FOR NEURAL MACHINE TRANSLATION ROBUSTNESS |
4800 SENTIMENT INJECTED ITERATIVELY CO-INTERACTIVE NETWORK FOR SPOKEN LANGUAGE UNDERSTANDING Zhiqi Huang, Fenglin Liu, Peilin Zhou, Yuexian Zou 4800 | SENTIMENT INJECTED ITERATIVELY CO-INTERACTIVE NETWORK FOR SPOKEN LANGUAGE UNDERSTANDING |
4718 SEPNET: A DEEP SEPARATION MATRIX PREDICTION NETWORK FOR MULTICHANNEL AUDIO SOURCE SEPARATION Shota Inoue, Hirokazu Kameoka, Li Li, Shoji Makino 4718 | SEPNET: A DEEP SEPARATION MATRIX PREDICTION NETWORK FOR MULTICHANNEL AUDIO SOURCE SEPARATION |
4913 SEQ-CPC : SEQUENTIAL CONTRASTIVE PREDICTIVE CODING FOR AUTOMATIC SPEECH RECOGNITION Yulong Chen, Jianping Zhao, Weiqi Wang, Ming Fang, Haimei Kang, Lu Wang, Tao Wei, Jun Ma, Shaojun Wang, Jing Xiao 4913 | SEQ-CPC : SEQUENTIAL CONTRASTIVE PREDICTIVE CODING FOR AUTOMATIC SPEECH RECOGNITION |
4122 SEQUENCE-LEVEL SELF-TEACHING REGULARIZATION eric sun, liang lu, zhong meng, yifan Gong 4122 | SEQUENCE-LEVEL SELF-TEACHING REGULARIZATION |
4187 SEQUENCE-TO-SEQUENCE SINGING VOICE SYNTHESIS WITH PERCEPTUAL ENTROPY LOSS Jiatong Shi, Shuai Guo, Nan Huo, Yuekai Zhang, Qin Jin 4187 | SEQUENCE-TO-SEQUENCE SINGING VOICE SYNTHESIS WITH PERCEPTUAL ENTROPY LOSS |
3616 Sequential Adversarial Anomaly Detection with Deep Fourier Kernel Shixiang Zhu, Henry Yuchi, Minghe Zhang, Yao Xie 3616 | Sequential Adversarial Anomaly Detection with Deep Fourier Kernel |
1890 SERN: STANCE EXTRACTION AND REASONING NETWORK FOR FAKE NEWS DETECTION Jianhui Xie, Song Liu, Ruixin Liu, Yinghong Zhang, Yuesheng Zhu 1890 | SERN: STANCE EXTRACTION AND REASONING NETWORK FOR FAKE NEWS DETECTION |
1158 SESQA: SEMI-SUPERVISED LEARNING FOR SPEECH QUALITY ASSESSMENT Joan Serrà, Jordi Pons, Santiago Pascual 1158 | SESQA: SEMI-SUPERVISED LEARNING FOR SPEECH QUALITY ASSESSMENT |
4129 SHAPELET BASED VISUAL ASSESSMENT OF CLUSTER TENDENCY IN ANALYZING COMPLEX UPPER LIMB MOTION Shreyasi Datta, Chandan Karmakar, Punit Rathore, Marimuthu Palaniswami 4129 | SHAPELET BASED VISUAL ASSESSMENT OF CLUSTER TENDENCY IN ANALYZING COMPLEX UPPER LIMB MOTION |
5261 SHORT-TIME SPECTRAL AGGREGATION FOR SPEAKER EMBEDDING Youzhi Tu, Man-Wai Mak 5261 | SHORT-TIME SPECTRAL AGGREGATION FOR SPEAKER EMBEDDING |
3969 SHOW AND SPEAK: DIRECTLY SYNTHESIZE SPOKEN DESCRIPTION OF IMAGES Xinsheng Wang, Siyuan Feng, Jihua Zhu, Mark Hasegawa-Johnson, Odette Scharenborg 3969 | SHOW AND SPEAK: DIRECTLY SYNTHESIZE SPOKEN DESCRIPTION OF IMAGES |
1107 SIAMESE CAPSULE NETWORK FOR END-TO-END SPEAKER RECOGNITION IN THE WILD Amirhossein Hajavi, Ali Etemad 1107 | SIAMESE CAPSULE NETWORK FOR END-TO-END SPEAKER RECOGNITION IN THE WILD |
1908 SIG2SIG : SIGNAL TRANSLATION NETWORKS TO TAKE THE REMAINS OF THE PAST SangYeon Kim, Hyunwoo Lee 1908 | SIG2SIG : SIGNAL TRANSLATION NETWORKS TO TAKE THE REMAINS OF THE PAST |
1464 Sign Language Segmentation with Temporal Convolutional Networks Katrin Renz, Nicolaj Stache, Samuel Albanie, Gül Varol 1464 | Sign Language Segmentation with Temporal Convolutional Networks |
3193 SIGNATURE FEATURE MARKING ENHANCED IRM FRAMEWORK FOR DRONE IMAGE ANALYSIS IN PRECISION AGRICULTURE Atharva Kadethankar, Neelam Sinha, Vinayaka Hegde, Abhishek Burman 3193 | SIGNATURE FEATURE MARKING ENHANCED IRM FRAMEWORK FOR DRONE IMAGE ANALYSIS IN PRECISION AGRICULTURE |
4380 SIMILARITY ANALYSIS OF SELF-SUPERVISED SPEECH REPRESENTATIONS Yu-An Chung, Yonatan Belinkov, James Glass 4380 | SIMILARITY ANALYSIS OF SELF-SUPERVISED SPEECH REPRESENTATIONS |
2465 SIML: SIEVED MAXIMUM LIKELIHOOD FOR ARRAY SIGNAL PROCESSING Matthieu SIMEONI, Paul Hurley 2465 | SIML: SIEVED MAXIMUM LIKELIHOOD FOR ARRAY SIGNAL PROCESSING |
3403 SIMPLEFLAT: A SIMPLE WHOLE-NETWORK PRE-TRAINING APPROACH FOR RNN TRANSDUCER-BASED END-TO-END SPEECH RECOGNITION Takafumi Moriya, Takanori Ashihara, Tomohiro Tanaka, Tsubasa Ochiai, Hiroshi Sato, Atsushi Ando, Yusuke Ijima, Ryo Masumura, Yusuke Shinohara 3403 | SIMPLEFLAT: A SIMPLE WHOLE-NETWORK PRE-TRAINING APPROACH FOR RNN TRANSDUCER-BASED END-TO-END SPEECH RECOGNITION |
1959 SINGER IDENTIFICATION USING DEEP TIMBRE FEATURE LEARNING WITH KNN-NET Xulong Zhang, Jiale Qian, Yi Yu, Yifu Sun, Wei Li 1959 | SINGER IDENTIFICATION USING DEEP TIMBRE FEATURE LEARNING WITH KNN-NET |
4075 SINGING LANGUAGE IDENTIFICATION USING A DEEP PHONOTACTIC APPROACH Lenny Renault, Andrea Vaglio, Romain Hennequin 4075 | SINGING LANGUAGE IDENTIFICATION USING A DEEP PHONOTACTIC APPROACH |
3682 SINGING MELODY EXTRACTION FROM POLYPHONIC MUSIC BASED ON SPECTRAL CORRELATION MODELING Xingjian Du, Bilei Zhu, Qiuqiang Kong, Zejun Ma 3682 | SINGING MELODY EXTRACTION FROM POLYPHONIC MUSIC BASED ON SPECTRAL CORRELATION MODELING |
3227 SINGLE CHANNEL VOICE SEPARATION FOR UNKNOWN NUMBER OF SPEAKERS UNDER REVERBERANT AND NOISY SETTINGS Shlomo E. Chazan, Lior Wolf, Eliya Nachmani, Yossi Adi 3227 | SINGLE CHANNEL VOICE SEPARATION FOR UNKNOWN NUMBER OF SPEAKERS UNDER REVERBERANT AND NOISY SETTINGS |
1597 SINGLE-POINT ARRAY RESPONSE CONTROL WITH MINIMUM PATTERN DEVIATION Xiaoyu Ai, Lu Gan 1597 | SINGLE-POINT ARRAY RESPONSE CONTROL WITH MINIMUM PATTERN DEVIATION |
1897 SKIP ATTENTION GAN FOR REMOTE SENSING IMAGE SYNTHESIS Kai Deng, Kun Zhang, Ping Yao, Siyuan Cheng, Peng He 1897 | SKIP ATTENTION GAN FOR REMOTE SENSING IMAGE SYNTHESIS |
1524 SLAP: A Split Latency Adaptive VLIW Pipeline Architecture which enables on-the-fly variable SIMD vector-length Ashish Shrivastava, Alan Gatherer, Tong Sun, Sushma Wokhlu, Alex Chandra 1524 | SLAP: A Split Latency Adaptive VLIW Pipeline Architecture which enables on-the-fly variable SIMD vector-length |
1184 SLIDING-CAPON BASED CONVOLUTIONAL BEAMSPACE FOR LINEAR ARRAYS Po-Chih Chen, P. P. Vaidyanathan 1184 | SLIDING-CAPON BASED CONVOLUTIONAL BEAMSPACE FOR LINEAR ARRAYS |
3912 SLOW-FAST AUDITORY STREAMS FOR AUDIO RECOGNITION Evangelos Kazakos, Arsha Nagrani, Andrew Zisserman, Dima Damen 3912 | SLOW-FAST AUDITORY STREAMS FOR AUDIO RECOGNITION |
3013 SM+: Refined Scale Match for Tiny Person Detection Nan Jiang, Xuehui Yu, Xiaoke Peng, Yuqi Gong, Zhenjun Han 3013 | SM+: Refined Scale Match for Tiny Person Detection |
2576 SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION FOR EMBEDDED SYSTEMS Julien Balian, Raffaele Tavarone, Mathieu Poumeyrol, Alice Coucke 2576 | SMALL FOOTPRINT TEXT-INDEPENDENT SPEAKER VERIFICATION FOR EMBEDDED SYSTEMS |
4626 SNR-ADAPTIVE DEEP JOINT SOURCE-CHANNEL CODING FOR WIRELESS IMAGE TRANSMISSION Mingze Ding, Jiahui Li, Mengyao Ma, Xiaopeng Fan 4626 | SNR-ADAPTIVE DEEP JOINT SOURCE-CHANNEL CODING FOR WIRELESS IMAGE TRANSMISSION |
3249 SOCIAL LEARNING UNDER INFERENTIAL ATTACKS Konstantinos Ntemos, Virginia Bordignon, Stefan Vlaski, Ali H. Sayed 3249 | SOCIAL LEARNING UNDER INFERENTIAL ATTACKS |
2246 Social Sensitive Reinforcement Learning for Interactive Recommendation System Qihan Du, Li Yu, Huiyuan Li, Boyan Yue 2246 | Social Sensitive Reinforcement Learning for Interactive Recommendation System |
1741 SOLVING A CLASS OF NON-CONVEX MIN-MAX GAMES USING ADAPTIVE MOMENTUM METHODS Babak Barazandeh, Davoud Ataee Tarzanagh, George Michailidis 1741 | SOLVING A CLASS OF NON-CONVEX MIN-MAX GAMES USING ADAPTIVE MOMENTUM METHODS |
3801 SOUND EVENT DETECTION AND SEPARATION: A BENCHMARK ON DESED SYNTHETIC SOUNDSCAPES Nicolas Turpault, Romain Serizel, Scott Wisdom, Hakan Erdogan, John R. Hershey, Eduardo Fonseca, Prem Seetharaman, Justin Salamon 3801 | SOUND EVENT DETECTION AND SEPARATION: A BENCHMARK ON DESED SYNTHETIC SOUNDSCAPES |
2952 SOUND EVENT DETECTION BASED ON CURRICULUM LEARNING CONSIDERING LEARNING DIFFICULTY OF EVENTS Noriyuki Tonami, Keisuke Imoto, Yuki Okamoto, Takahiro Fukumori, Yoichi Yamashita 2952 | SOUND EVENT DETECTION BASED ON CURRICULUM LEARNING CONSIDERING LEARNING DIFFICULTY OF EVENTS |
2286 SOUND EVENT DETECTION BY CONSISTENCY TRAINING AND PSEUDO-LABELING WITH FEATURE-PYRAMID CONVOLUTIONAL RECURRENT NEURAL NETWORKS Chih-Yuan Koh, You-Siang Chen, Yi-Wen Liu, Mingsian Bai 2286 | SOUND EVENT DETECTION BY CONSISTENCY TRAINING AND PSEUDO-LABELING WITH FEATURE-PYRAMID CONVOLUTIONAL RECURRENT NEURAL NETWORKS |
4099 SOUND EVENT DETECTION IN URBAN AUDIO WITH SINGLE AND MULTI-RATE PCEN Christopher Ick, Brian McFee 4099 | SOUND EVENT DETECTION IN URBAN AUDIO WITH SINGLE AND MULTI-RATE PCEN |
2678 SOUND RECOVERY FROM RADIO SIGNALS Muhammed Zahid Ozturk, Chenshu Wu, Beibei Wang, K.J. Ray Liu 2678 | SOUND RECOVERY FROM RADIO SIGNALS |
4023 SOURCE-AWARE NEURAL SPEECH CODING FOR NOISY SPEECH COMPRESSION Haici Yang, Kai Zhen, Seungkwon Beack, Minje Kim 4023 | SOURCE-AWARE NEURAL SPEECH CODING FOR NOISY SPEECH COMPRESSION |
3396 SPARSE ARRAY TRANSCEIVER DESIGN FOR ENHANCED ADAPTIVE BEAMFORMING IN MIMO RADAR Syed A. Hamza, Weitong Zhai, Xiangrong Wang, Moeness G. Amin 3396 | SPARSE ARRAY TRANSCEIVER DESIGN FOR ENHANCED ADAPTIVE BEAMFORMING IN MIMO RADAR |
2947 SPARSE BAYESIAN LEARNING FOR ACOUSTIC SOURCE LOCALIZATION Ruchi Pandey, Santosh Nannuru, Aditya Siripuram 2947 | SPARSE BAYESIAN LEARNING FOR ACOUSTIC SOURCE LOCALIZATION |
4181 SPARSE FACTORIZATION-BASED DETECTION OF OFF-THE-GRID MOVING TARGETS USING FMCW RADARS Gilles Monnoyer de Galland, Thomas Feuillen, Luc Vandendorpe, Laurent Jacques 4181 | SPARSE FACTORIZATION-BASED DETECTION OF OFF-THE-GRID MOVING TARGETS USING FMCW RADARS |
3020 SPARSE FLOW ADVERSARIAL MODEL FOR ROBUST IMAGE COMPRESSION Shihui Zhao, Shuyuan Yang, Zhi Liu, Zhixi Feng, Xu Liu 3020 | SPARSE FLOW ADVERSARIAL MODEL FOR ROBUST IMAGE COMPRESSION |
4448 SPARSE GRAPH BASED SKETCHING FOR FAST NUMERICAL LINEAR ALGEBRA Dong Hu, Shashanka Ubaru, Alex Gittens, Kenneth Clarkson, Lior Horesh, Vassilis Kalantzis 4448 | SPARSE GRAPH BASED SKETCHING FOR FAST NUMERICAL LINEAR ALGEBRA |
2280 SPARSE HIGH-ORDER PORTFOLIOS VIA PROXIMAL DCA AND SCA Jinxin Wang, Zengde Deng, Taoli Zheng, Anthony Man-Cho So 2280 | SPARSE HIGH-ORDER PORTFOLIOS VIA PROXIMAL DCA AND SCA |
1640 SPARSE PARAMETER ESTIMATION FOR PMCW MIMO RADAR USING FEW-BIT ADCS Chao-Yi Wu, Jian Li, Tan F. Wong 1640 | SPARSE PARAMETER ESTIMATION FOR PMCW MIMO RADAR USING FEW-BIT ADCS |
4025 SPARSE RECOVERY BEAMFORMING AND UPSCALING IN THE RAY SPACE Shiduo Yu, Craig Jin, Fabio Antonacci, Augusto Sarti 4025 | SPARSE RECOVERY BEAMFORMING AND UPSCALING IN THE RAY SPACE |
3248 SPARSE REPRESENTATION OF COMPLEX-VALUED FMRI DATA BASED ON HARD THRESHOLDING OF SPATIAL SOURCE PHASE Jia-Yang Song, Miao-Ying Qi, Dun-Pei Lv, Chao-Ying Zhang, Qiu-Hua Lin, Vince Calhoun 3248 | SPARSE REPRESENTATION OF COMPLEX-VALUED FMRI DATA BASED ON HARD THRESHOLDING OF SPATIAL SOURCE PHASE |
3192 SPARSE TIME-FREQUENCY REPRESENTATION VIA ATOMIC NORM MINIMIZATION Tsubasa Kusano, Kohei Yatabe, Yasuhiro Oikawa 3192 | SPARSE TIME-FREQUENCY REPRESENTATION VIA ATOMIC NORM MINIMIZATION |
5416 SPARSE-CODED DYNAMIC MODE DECOMPOSITION ON GRAPH FOR PREDICTION OF RIVER WATER LEVEL DISTRIBUTION Yusuke Arai, Shogo Muramatsu, Hiroyasu Yasuda, Kiyoshi Hayasaka, Yu Otake 5416 | SPARSE-CODED DYNAMIC MODE DECOMPOSITION ON GRAPH FOR PREDICTION OF RIVER WATER LEVEL DISTRIBUTION |
2540 SPARSIFICATION VIA COMPRESSED SENSING FOR AUTOMATIC SPEECH RECOGNITION Kai Zhen, Hieu Nguyen, Feng-Ju Chang, Athanasios Mouchtaris, Ariya Rastrow 2540 | SPARSIFICATION VIA COMPRESSED SENSING FOR AUTOMATIC SPEECH RECOGNITION |
4897 SPARSITY AND NONNEGATIVITY CONSTRAINED KRYLOV APPROACH FOR DIRECTION OF ARRIVAL ESTIMATION Hamza Baali, Abdesselam Bouzerdoum, Abdelkrim Khelif 4897 | SPARSITY AND NONNEGATIVITY CONSTRAINED KRYLOV APPROACH FOR DIRECTION OF ARRIVAL ESTIMATION |
5466 SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING Vinayak Killedar, Praveen Kumar Pokala, Chandra Sekhar Seelamantula 5466 | SPARSITY DRIVEN LATENT SPACE SAMPLING FOR GENERATIVE PRIOR BASED COMPRESSIVE SENSING |
3965 SPARSITY IN MAX-PLUS ALGEBRA AND APPLICATIONS IN MULTIVARIATE CONVEX REGRESSION Nikos Tsilivis, Anastasios Tsiamis, Petros Maragos 3965 | SPARSITY IN MAX-PLUS ALGEBRA AND APPLICATIONS IN MULTIVARIATE CONVEX REGRESSION |
1519 SPATIAL EQUALIZATION BEFORE RECEPTION: RECONFIGURABLE INTELLIGENT SURFACES FOR MULTI-PATH MITIGATION Hongliang Zhang, Lingyang Song, Zhu Han, H. Vincent Poor 1519 | SPATIAL EQUALIZATION BEFORE RECEPTION: RECONFIGURABLE INTELLIGENT SURFACES FOR MULTI-PATH MITIGATION |
4146 SPATIOTEMPORAL ATTENTION FOR MULTIVARIATE TIME SERIES PREDICTION AND INTERPRETATION Tryambak Gangopadhyay, Sin Yong Tan, Zhanhong Jiang, Rui Meng, Soumik Sarkar 4146 | SPATIOTEMPORAL ATTENTION FOR MULTIVARIATE TIME SERIES PREDICTION AND INTERPRETATION |
3427 SPEAKER ACTIVITY DRIVEN NEURAL SPEECH EXTRACTION Marc Delcroix, Katerina Zmolikova, Tsubasa Ochiai, Keisuke Kinoshita, Tomohiro Nakatani 3427 | SPEAKER ACTIVITY DRIVEN NEURAL SPEECH EXTRACTION |
3359 SPEAKER AND DIRECTION INFERRED DUAL-CHANNEL SPEECH SEPARATION Chenxing Li, Jiaming Xu, Nima Mesgarani, Bo Xu 3359 | SPEAKER AND DIRECTION INFERRED DUAL-CHANNEL SPEECH SEPARATION |
3122 Speaker embeddings for diarization of broadcast data in the ALLIES challenge Anthony Larcher, Ambuj Mehrish, Marie Tahon, Sylvain Meignier, Jean Carrive, David Doukhan, Olivier Galibert, Nicholas Evans 3122 | Speaker embeddings for diarization of broadcast data in the ALLIES challenge |
3843 SPEAKER-INDEPENDENT BRAIN ENHANCED SPEECH DENOISING Maryam Hosseini, Luca Celotti, Éric Plourde 3843 | SPEAKER-INDEPENDENT BRAIN ENHANCED SPEECH DENOISING |
4465 SPEAKING RATE AND TONAL REALIZATION IN MANDARIN CHINESE: WHAT CAN WE LEARN FROM LARGE SPEECH CORPORA? Jiahong Yuan, Kenneth Church 4465 | SPEAKING RATE AND TONAL REALIZATION IN MANDARIN CHINESE: WHAT CAN WE LEARN FROM LARGE SPEECH CORPORA? |
3967 SPECIALIZED EMBEDDING APPROXIMATION FOR EDGE INTELLIGENCE: A CASE STUDY IN URBAN SOUND CLASSIFICATION Sangeeta Srivastava, Dhrubojyoti Roy, Mark Cartwright, Juan Bello, Anish Arora 3967 | SPECIALIZED EMBEDDING APPROXIMATION FOR EDGE INTELLIGENCE: A CASE STUDY IN URBAN SOUND CLASSIFICATION |
1272 SPECTRAL DOMAIN CONVOLUTIONAL NEURAL NETWORK Bochen Guan, Jinnian Zhang, William A. Sethares, Richard Kijowski, Fang Liu 1272 | SPECTRAL DOMAIN CONVOLUTIONAL NEURAL NETWORK |
2736 Spectral folding and two-channel filter-banks on arbitrary graphs Eduardo Pavez, Benjamin Girault, Antonio Ortega, Philip A. Chou 2736 | Spectral folding and two-channel filter-banks on arbitrary graphs |
1081 SPECTRUM ENHANCEMENT NETWORK FOR ACOUSTIC SCENE CLASSIFICATION Yang Liu, Alexandros Neophytou, Sunando Sengupta, Eric Sommerlade 1081 | SPECTRUM ENHANCEMENT NETWORK FOR ACOUSTIC SCENE CLASSIFICATION |
1035 Speech Acoustic Modelling from Raw Phase Spectrum Erfan Loweimi, Zoran Cvetkovic, Peter Bell, Steve Renals 1035 | Speech Acoustic Modelling from Raw Phase Spectrum |
1815 SPEECH BASED DEPRESSION PREDICTION USING ENCODER-WEIGHT-ONLY TRANSFER LEARNING AND A LARGE CORPUS Amir Harati, Elizabeth Shriberg, Tomasz Rutowski, Piotr Chlebek, Yang Lu, Ricardo Oliveira 1815 | SPEECH BASED DEPRESSION PREDICTION USING ENCODER-WEIGHT-ONLY TRANSFER LEARNING AND A LARGE CORPUS |
4385 SPEECH BERT EMBEDDING FOR IMPROVING PROSODY IN NEURAL TTS Liping Chen, Yan Deng, Xi Wang, Frank K. Soong, Lei He 4385 | SPEECH BERT EMBEDDING FOR IMPROVING PROSODY IN NEURAL TTS |
1606 SPEECH DEREVERBERATION USING VARIATIONAL AUTOENCODERS Deepak Baby, Herve Bourlard 1606 | SPEECH DEREVERBERATION USING VARIATIONAL AUTOENCODERS |
4128 Speech Emotion Recognition based on Listener Adaptive Models Atsushi Ando, Ryo Masumura, Hiroshi Sato, Takafumi Moriya, Takanori Ashihara, Yusuke Ijima, Tomoki Toda 4128 | Speech Emotion Recognition based on Listener Adaptive Models |
3927 SPEECH EMOTION RECOGNITION USING QUATERNION CONVOLUTIONAL NEURAL NETWORKS Aneesh Muppidi, Martin Radfar 3927 | SPEECH EMOTION RECOGNITION USING QUATERNION CONVOLUTIONAL NEURAL NETWORKS |
4963 Speech Emotion Recognition using Semantic Information Panagiotis Tzirakis, Anh Nguyen, Stefanos Zafeiriou, Björn Schuller 4963 | Speech Emotion Recognition using Semantic Information |
2869 Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation Mingke Xu, Fan Zhang, Xiaodong Cui, Wei Zhang 2869 | Speech Emotion Recognition with Multiscale Area Attention and Data Augmentation |
5236 Speech enhancement aided end-to-end multi-task learning for voice activity detection Xu Tan, Xiao-Lei Zhang 5236 | Speech enhancement aided end-to-end multi-task learning for voice activity detection |
3390 SPEECH ENHANCEMENT AUTOENCODER WITH HIERARCHICAL LATENT STRUCTURE Koen Oostermeijer, Jun Du, Qing Wang, Chin-Hui Lee 3390 | SPEECH ENHANCEMENT AUTOENCODER WITH HIERARCHICAL LATENT STRUCTURE |
5606 Speech Enhancement Using Masking for Binaural Reproduction of Ambisonics Signals Moti Lugasi, Boaz Rafaely 5606 | Speech Enhancement Using Masking for Binaural Reproduction of Ambisonics Signals |
3278 SPEECH ENHANCEMENT WITH MIXTURE OF DEEP EXPERTS WITH CLEAN CLUSTERING PRE-TRAINING Shlomo E. Chazan, Jacob Goldberger, Sharon Gannot 3278 | SPEECH ENHANCEMENT WITH MIXTURE OF DEEP EXPERTS WITH CLEAN CLUSTERING PRE-TRAINING |
2717 SPEECH PREDICTION IN SILENT VIDEOS USING VARIATIONAL AUTOENCODERS Ravindra Yadav, Ashish Sardana, Vinay P Namboodiri, Rajesh M Hegde 2717 | SPEECH PREDICTION IN SILENT VIDEOS USING VARIATIONAL AUTOENCODERS |
2378 SPEECH RECOGNITION BY SIMPLY FINE-TUNING BERT Wen-Chin Huang, Chia-Hua Wu, Shang-Bao Luo, Kuan-Yu Chen, Hsin-Min Wang, Tomoki Toda 2378 | SPEECH RECOGNITION BY SIMPLY FINE-TUNING BERT |
4034 SPEECH-LANGUAGE PRE-TRAINING FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING Yao Qian, Ximo Bian, Yu Shi, Naoyuki Kanda, Leo Shen, Zhen Xiao, Michael Zeng 4034 | SPEECH-LANGUAGE PRE-TRAINING FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING |
3353 SPEEDING UP OF KERNEL-BASED LEARNING FOR HIGH-ORDER TENSORS Ouafae Karmouda, Jeremie Boulanger, Remy Boyer 3353 | SPEEDING UP OF KERNEL-BASED LEARNING FOR HIGH-ORDER TENSORS |
4930 SPHERICAL HARMONIC REPRESENTATION FOR DYNAMIC SOUND-FIELD MEASUREMENTS Fabrice Katzberg, Marco Maass, Alfred Mertins 4930 | SPHERICAL HARMONIC REPRESENTATION FOR DYNAMIC SOUND-FIELD MEASUREMENTS |
3705 SPOKEN LANGUAGE IDENTIFICATION IN UNSEEN TARGET DOMAIN USING WITHIN-SAMPLE SIMILARITY LOSS Muralikrishna H, Shantanu Kapoor, Dileep Aroor Dinesh, Padmanabhan Rajan 3705 | SPOKEN LANGUAGE IDENTIFICATION IN UNSEEN TARGET DOMAIN USING WITHIN-SAMPLE SIMILARITY LOSS |
4593 SQUEEZING VALUE OF CROSS-DOMAIN LABELS: A DECOUPLED SCORING APPROACH FOR SPEAKER VERIFICATION Lantian Li, Yang Zhang, Jiawen Kang, Thomas Fang Zheng, Dong Wang 4593 | SQUEEZING VALUE OF CROSS-DOMAIN LABELS: A DECOUPLED SCORING APPROACH FOR SPEAKER VERIFICATION |
2796 SQWA: STOCHASTIC QUANTIZED WEIGHT AVERAGING FOR IMPROVING THE GENERALIZATION CAPABILITY OF LOW-PRECISION DEEP NEURAL NETWORKS Sungho Shin, Yoonho Boo, Wonyong Sung 2796 | SQWA: STOCHASTIC QUANTIZED WEIGHT AVERAGING FOR IMPROVING THE GENERALIZATION CAPABILITY OF LOW-PRECISION DEEP NEURAL NETWORKS |
1770 SRF-NET: SELECTIVE RECEPTIVE FIELD NETWORK FOR ANCHOR-FREE TEMPORAL ACTION DETECTION Ranyu Ning, Can Zhang, Yuexian Zou 1770 | SRF-NET: SELECTIVE RECEPTIVE FIELD NETWORK FOR ANCHOR-FREE TEMPORAL ACTION DETECTION |
1224 SSFENET: SPATIAL AND SEMANTIC FEATURE ENHANCEMENT NETWORK FOR OBJECT DETECTION Tianyuan Wang, Can Ma, Haoshan Su, Weiping Wang 1224 | SSFENET: SPATIAL AND SEMANTIC FEATURE ENHANCEMENT NETWORK FOR OBJECT DETECTION |
2374 STABILITY ANALYSIS OF THE RC-PLMS ADAPTIVE BEAMFORMER USING A SIMPLE TRANSFER FUNCTION APPROXIMATION Ghattas Akkad, Ali Mansour, Bachar ElHassan, Elie Inaty 2374 | STABILITY ANALYSIS OF THE RC-PLMS ADAPTIVE BEAMFORMER USING A SIMPLE TRANSFER FUNCTION APPROXIMATION |
4125 STABILITY OF ALGEBRAIC NEURAL NETWORKS TO SMALL PERTURBATIONS Alejandro Parada-Mayorga, Alejandro Ribeiro 4125 | STABILITY OF ALGEBRAIC NEURAL NETWORKS TO SMALL PERTURBATIONS |
2825 STABLE AND EFFECTIVE ONE-STEP METHOD FOR PERSON SEARCH Ning Lv, Xuezhi Xiang, Xinyao Wang, Jie Yang, Rokia Abdeen, Abdulmotaleb El Saddik 2825 | STABLE AND EFFECTIVE ONE-STEP METHOD FOR PERSON SEARCH |
3782 STABLE CHECKPOINT SELECTION AND EVALUATION IN SEQUENCE TO SEQUENCE SPEECH SYNTHESIS Slava Shechtman, David Haws, Raul Fernandez 3782 | STABLE CHECKPOINT SELECTION AND EVALUATION IN SEQUENCE TO SEQUENCE SPEECH SYNTHESIS |
3285 STATISTICAL CORRECTION OF TRANSCRIBED MELODY NOTES BASED ON PROBABILISTIC INTEGRATION OF A MUSIC LANGUAGE MODEL AND A TRANSCRIPTION ERROR MODEL Yuki Hiramatsu, Go Shibata, Ryo Nishikimi, Eita Nakamura, Kazuyoshi Yoshii 3285 | STATISTICAL CORRECTION OF TRANSCRIBED MELODY NOTES BASED ON PROBABILISTIC INTEGRATION OF A MUSIC LANGUAGE MODEL AND A TRANSCRIPTION ERROR MODEL |
2900 STATISTICAL DISTANCE METRIC LEARNING FOR IMAGE SET RETRIEVAL Ting-Yao Hu, Alexander G Hauptmann 2900 | STATISTICAL DISTANCE METRIC LEARNING FOR IMAGE SET RETRIEVAL |
2728 STATISTICAL PROPERTIES OF A MODIFIED WELCH METHOD THAT USES SAMPLE PERCENTILES Felix Schwock, Shima Abadi 2728 | STATISTICAL PROPERTIES OF A MODIFIED WELCH METHOD THAT USES SAMPLE PERCENTILES |
3009 ST-BERT: CROSS-MODAL LANGUAGE MODEL PRE-TRAINING FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING Minjeong Kim, Gyuwan Kim, Sang-Woo Lee, Jung-Woo Ha 3009 | ST-BERT: CROSS-MODAL LANGUAGE MODEL PRE-TRAINING FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING |
2124 STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security Mohammad Adiban, Arash Safari, Giampiero Salvi 2124 | STEP-GAN: A One-Class Anomaly Detection Model with Applications to Power System Security |
4899 STEREO RECTIFICATION BASED ON EPIPOLAR CONSTRAINED NEURAL NETWORK Yuxing Wang, Yawen Lu, Guoyu Lu 4899 | STEREO RECTIFICATION BASED ON EPIPOLAR CONSTRAINED NEURAL NETWORK |
1176 Stochastic Deep Unfolding for Imaging Inverse Problems Jiaming Liu, Yu Sun, Weijie Gan, Xiaojian Xu, Brendt Wohlberg, Ulugbek Kamilov 1176 | Stochastic Deep Unfolding for Imaging Inverse Problems |
2991 STOCHASTIC SUCCESSIVE WEIGHTED SUM-RATE MAXIMIZATION FOR MULTIUSER MIMO SYSTEMS WITH FINITE-ALPHABET INPUTS Xin Guan, Xiaotong Zhao, Qingjiang Shi 2991 | STOCHASTIC SUCCESSIVE WEIGHTED SUM-RATE MAXIMIZATION FOR MULTIUSER MIMO SYSTEMS WITH FINITE-ALPHABET INPUTS |
5421 STOCK MOVEMENT PREDICTION AND PORTFOLIO MANAGEMENT VIA MULTIMODAL LEARNING WITH TRANSFORMER Divyanshu Daiya, Che Lin 5421 | STOCK MOVEMENT PREDICTION AND PORTFOLIO MANAGEMENT VIA MULTIMODAL LEARNING WITH TRANSFORMER |
5302 STREAMING END-TO-END SPEECH RECOGNITION WITH JOINTLY TRAINED NEURAL FEATURE ENHANCEMENT Chanwoo Kim, Abhinav Garg, Dhananjaya Gowda, Seongkyu Mun, Changwoo Han 5302 | STREAMING END-TO-END SPEECH RECOGNITION WITH JOINTLY TRAINED NEURAL FEATURE ENHANCEMENT |
2057 STREAMING MULTI-SPEAKER ASR WITH RNN-T Ilya Sklyar, Anna Piunova, Yulan Liu 2057 | STREAMING MULTI-SPEAKER ASR WITH RNN-T |
4827 STREAMING SIMULTANEOUS SPEECH TRANSLATION WITH AUGMENTED MEMORY TRANSFORMER Xutai Ma, Yongqiang Wang, Mohammad Dousti, Philipp Koehn, Juan Pino 4827 | STREAMING SIMULTANEOUS SPEECH TRANSLATION WITH AUGMENTED MEMORY TRANSFORMER |
3022 STRUCTURE-AWARE AUDIO-TO-SCORE ALIGNMENT USING PROGRESSIVELY DILATED CONVOLUTIONAL NEURAL NETWORKS Ruchit Agrawal, Daniel Wolff, Simon Dixon 3022 | STRUCTURE-AWARE AUDIO-TO-SCORE ALIGNMENT USING PROGRESSIVELY DILATED CONVOLUTIONAL NEURAL NETWORKS |
1552 STRUCTURED SUPPORT EXPLORATION FOR MULTILAYER SPARSE MATRIX FACTORIZATION QUOC-TUNG LE, Rémi Gribonval 1552 | STRUCTURED SUPPORT EXPLORATION FOR MULTILAYER SPARSE MATRIX FACTORIZATION |
4639 STRUCTURE-ENHANCED ATTENTIVE LEARNING FOR SPINE SEGMENTATION FROM ULTRASOUND VOLUME PROJECTION IMAGES Rui Zhao, Zixun Huang, Tianshan Liu, Frank H.F. Leung, Sai Ho Ling, De Yang, Timothy Tin-Yan Lee, Daniel P.K. Lun, Yong-Ping Zheng, Kin-Man Lam 4639 | STRUCTURE-ENHANCED ATTENTIVE LEARNING FOR SPINE SEGMENTATION FROM ULTRASOUND VOLUME PROJECTION IMAGES |
1274 STYLEMELGAN: AN EFFICIENT HIGH-FIDELITY ADVERSARIAL VOCODER WITH TEMPORAL ADAPTIVE NORMALIZATION Ahmed Mustafa, Nicola Pia, Guillaume Fuchs 1274 | STYLEMELGAN: AN EFFICIENT HIGH-FIDELITY ADVERSARIAL VOCODER WITH TEMPORAL ADAPTIVE NORMALIZATION |
4519 SUB-BAND GROUPING SPECTRAL FEATURE-ATTENTION BLOCK FOR HYPERSPECTRAL IMAGE CLASSIFICATION Weilian Zhou, Sei-ichiro Kamata 4519 | SUB-BAND GROUPING SPECTRAL FEATURE-ATTENTION BLOCK FOR HYPERSPECTRAL IMAGE CLASSIFICATION |
5189 Subject Independent EEG Representation Learning for Emotion Recognition Soheil Rayatdoost, Yufeng Yin, David Rudrauf, Mohammad Soleymani 5189 | Subject Independent EEG Representation Learning for Emotion Recognition |
3955 Subjective and objective evaluation of deepfake videos Pavel Korshunov, Sebastien Marcel 3955 | Subjective and objective evaluation of deepfake videos |
1266 SUB-NYQUIST MULTICHANNEL BLIND DECONVOLUTION Satish Mulleti, Kiryung Lee, Yonina C. Eldar 1266 | SUB-NYQUIST MULTICHANNEL BLIND DECONVOLUTION |
2464 SUBSPACE ODDITY - OPTIMIZATION ON PRODUCT OF STIEFEL MANIFOLDS FOR EEG DATA Maria Sayu Yamamoto, Florian Yger, Sylvain Chevallier 2464 | SUBSPACE ODDITY - OPTIMIZATION ON PRODUCT OF STIEFEL MANIFOLDS FOR EEG DATA |
3394 SubSpectral Normalization for Neural Audio Data Processing Simyung Chang, Hyoungwoo Park, Janghoon Cho, Hyunsin Park, Sungrack Yun, Kyuwoong Hwang 3394 | SubSpectral Normalization for Neural Audio Data Processing |
1730 SUPER-RESOLUTION AND INFECTION EDGE DETECTION CO-GUIDED LEARNING FOR COVID-19 CT SEGMENTATION Yu Sang, Jinguang Sun, Simiao Wang, Heng Qi, Keqiu Li 1730 | SUPER-RESOLUTION AND INFECTION EDGE DETECTION CO-GUIDED LEARNING FOR COVID-19 CT SEGMENTATION |
3480 SUPER-RESOLUTION OF PERIODIC SIGNALS FROM SHORT SEQUENCES OF SAMPLES Marek Rupniewski 3480 | SUPER-RESOLUTION OF PERIODIC SIGNALS FROM SHORT SEQUENCES OF SAMPLES |
2731 Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-task Learning Ju-Chiang Wang, Jordan B. L. Smith, Jitong Chen, Xuchen Song, Yuxuan Wang 2731 | Supervised Chorus Detection for Popular Music Using Convolutional Neural Network and Multi-task Learning |
4763 Supervised direct-path relative transfer function learning for binaural sound source localization Bing Yang, Xiaofei Li, Hong Liu 4763 | Supervised direct-path relative transfer function learning for binaural sound source localization |
4467 SUREmap: Predicting Uncertainty in CNN-based Image Reconstructions using Stein's Unbiased Risk Estimate Ruangrawee Kitichotkul, Christopher Metzler, Frank Ong, Gordon Wetzstein 4467 | SUREmap: Predicting Uncertainty in CNN-based Image Reconstructions using Stein's Unbiased Risk Estimate |
3674 SURROGATE SOURCE MODEL LEARNING FOR DETERMINED SOURCE SEPARATION Robin Scheibler, Masahito Togami 3674 | SURROGATE SOURCE MODEL LEARNING FOR DETERMINED SOURCE SEPARATION |
4877 SWITCHED HAWKES PROCESSES Namrata Nadagouda, Mark Davenport 4877 | SWITCHED HAWKES PROCESSES |
3517 Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement Mostafa Sadeghi, Xavier Alameda-Pineda 3517 | Switching Variational Auto-Encoders for Noise-Agnostic Audio-visual Speech Enhancement |
4389 Symmetric Sub-graph Spatio-Temporal Graph Convolution and its application in Complex Activity Recognition Pratyusha Das, Antonio Ortega 4389 | Symmetric Sub-graph Spatio-Temporal Graph Convolution and its application in Complex Activity Recognition |
5322 SYNAUG: SYNTHESIS-BASED DATA AUGMENTATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION Chenpeng Du, Bing Han, Shuai Wang, Yanmin Qian, Kai Yu 5322 | SYNAUG: SYNTHESIS-BASED DATA AUGMENTATION FOR TEXT-DEPENDENT SPEAKER VERIFICATION |
1287 SYNCHRONOUS MULTI-BIT AUDIO WATERMARKING BASED ON PHASE SHIFTING Shengbei Wang, Weitao Yuan, Zhen Zhang, Jianming Wang, Masashi Unoki 1287 | SYNCHRONOUS MULTI-BIT AUDIO WATERMARKING BASED ON PHASE SHIFTING |
1799 SYNERGIC FEATURE ATTENTION FOR IMAGE RESTORATION Chong Mou, Jian Zhang 1799 | SYNERGIC FEATURE ATTENTION FOR IMAGE RESTORATION |
3080 SYNTACTIC REPRESENTATION LEARNING FOR NEURAL NETWORK BASED TTS WITH SYNTACTIC PARSE TREE TRAVERSAL Changhe Song, Jingbei Li, Yixuan Zhou, Zhiyong Wu, Helen Meng 3080 | SYNTACTIC REPRESENTATION LEARNING FOR NEURAL NETWORK BASED TTS WITH SYNTACTIC PARSE TREE TRAVERSAL |
4609 SYNTHESIS OF NEW WORDS FOR IMPROVED DYSARTHRIC SPEECH RECOGNITION ON AN EXPANDED VOCABULARY John Harvill, Dias Issa, Mark Hasegawa-Johnson, Changdong Yoo 4609 | SYNTHESIS OF NEW WORDS FOR IMPROVED DYSARTHRIC SPEECH RECOGNITION ON AN EXPANDED VOCABULARY |
4321 SYNTHESIZE & LEARN: JOINTLY OPTIMIZING GENERATIVE AND CLASSIFIER NETWORKS FOR IMPROVED DROWSINESS DETECTION Sandipan Banerjee, Ajjen Joshi, Ahmed Ghoneim, Survi Kyal, Taniya Mishra 4321 | SYNTHESIZE & LEARN: JOINTLY OPTIMIZING GENERATIVE AND CLASSIFIER NETWORKS FOR IMPROVED DROWSINESS DETECTION |
4086 SYNTHETIC APERTURE ACOUSTIC IMAGING WITH DEEP GENERATIVE MODEL BASED SOURCE DISTRIBUTION PRIOR Boqiang Fan, Samarjit Das 4086 | SYNTHETIC APERTURE ACOUSTIC IMAGING WITH DEEP GENERATIVE MODEL BASED SOURCE DISTRIBUTION PRIOR |
1499 SYNTHETIC DATA FOR DNN-BASED DOA ESTIMATION OF INDOOR SPEECH Femke B. Gelderblom, Yi Liu, Johannes Kvam, Tor Andre Myrvoll 1499 | SYNTHETIC DATA FOR DNN-BASED DOA ESTIMATION OF INDOOR SPEECH |
3573 TABULAR TRANSFORMERS FOR MODELING MULTIVARIATE TIME SERIES Inkit Padhi, Yair Schiff, Igor Melnyk, Mattia Rigotti, Youssef Mroueh, Pierre Dognin, Jerret Ross, Ravi Nair, Erik Altman 3573 | TABULAR TRANSFORMERS FOR MODELING MULTIVARIATE TIME SERIES |
1446 TAKING A CLOSER LOOK AT SYNTHESIS: FINE-GRAINED ATTRIBUTE ANALYSIS FOR PERSON RE-IDENTIFICATION Suncheng Xiang, Yuzhuo Fu, Guanjie You, Ting Liu 1446 | TAKING A CLOSER LOOK AT SYNTHESIS: FINE-GRAINED ATTRIBUTE ANALYSIS FOR PERSON RE-IDENTIFICATION |
3752 TAMING VOTING ALGORITHMS ON GPUS FOR AN EFFICIENT CONNECTED COMPONENT ANALYSIS ALGORITHM Florian Lemaitre, Arthur Hennequin, Lionel Lacassagne 3752 | TAMING VOTING ALGORITHMS ON GPUS FOR AN EFFICIENT CONNECTED COMPONENT ANALYSIS ALGORITHM |
5527 TARGET DETECTION FROM DISTRIBUTED PASSIVE SENSORS: SEMI-LABELED DATA QUANTIZATION Zachariah Sutton, Peter Willett, Stefano Marano 5527 | TARGET DETECTION FROM DISTRIBUTED PASSIVE SENSORS: SEMI-LABELED DATA QUANTIZATION |
3924 Target Detection in Frequency Hopping MIMO Dual-Function Radar-Communication Systems INDU PRIYA EEDARA, Moeness G. Amin, Giuseppe A. Fabrizio 3924 | Target Detection in Frequency Hopping MIMO Dual-Function Radar-Communication Systems |
5355 Task Aware Multi-Task Learning for Speech to Text Tasks Sathish Indurthi, Mohd Abbas Zaidi, Nikhil Kumar Lakumarapu, Beomseok Lee, Hyojung Han, Seokchan Ahn, Sangha Kim, Chanwoo Kim, Inchul Hwang 5355 | Task Aware Multi-Task Learning for Speech to Text Tasks |
4568 TASK-AWARE NEURAL ARCHITECTURE SEARCH Cat Le, Mohammadreza Soltani, Robert Ravier, Vahid Tarokh 4568 | TASK-AWARE NEURAL ARCHITECTURE SEARCH |
5058 TASK-RELATED SELF-SUPERVISED LEARNING FOR REMOTE SENSING IMAGE CHANGE DETECTION Zhinan Cai, Zhiyu Jiang, Yuan Yuan 5058 | TASK-RELATED SELF-SUPERVISED LEARNING FOR REMOTE SENSING IMAGE CHANGE DETECTION |
2335 TCLA ARRAY: A NEW SPARSE ARRAY DESIGN WITH LESS MUTUAL COUPLING Ahmed M. A. Shaalan, Jun Du, Yanhui Tu 2335 | TCLA ARRAY: A NEW SPARSE ARRAY DESIGN WITH LESS MUTUAL COUPLING |
2244 Teacher-Assisted Mini-Batch Sampling for Blind Distillation using Metric Learning Nakamasa Inoue 2244 | Teacher-Assisted Mini-Batch Sampling for Blind Distillation using Metric Learning |
3991 TEACHER-STUDENT LEARNING FOR LOW-LATENCY ONLINE SPEECH ENHANCEMENT USING WAVE-U-NET Sotaro Nakaoka, Li Li, Shota Inoue, Shoji Makino 3991 | TEACHER-STUDENT LEARNING FOR LOW-LATENCY ONLINE SPEECH ENHANCEMENT USING WAVE-U-NET |
5226 TEACHER-STUDENT LEARNING WITH MULTI-GRANULARITY CONSTRAINT TOWARDS COMPACT FACIAL FEATURE REPRESENTATION Shurun WANG, Shiqi WANG, Wenhan YANG, Xinfeng Zhang, Shanshe WANG, Siwei MA 5226 | TEACHER-STUDENT LEARNING WITH MULTI-GRANULARITY CONSTRAINT TOWARDS COMPACT FACIAL FEATURE REPRESENTATION |
2702 Temporal Exemplar Channels in High-Multipath Environments Mohamed Kashef, Peter Vouras, Robert Jones, Richard Candell, Kate Remley 2702 | Temporal Exemplar Channels in High-Multipath Environments |
1716 TEMPORAL LINK PREDICTION VIA REINFORCEMENT LEARNING Ye Tao, Ying Li, Zhonghai Wu 1716 | TEMPORAL LINK PREDICTION VIA REINFORCEMENT LEARNING |
2209 TEMPORAL RAIN DECOMPOSITION WITH SPATIAL STRUCTURE GUIDANCE FOR VIDEO DERAINING Xinwei Xue, Ying Ding, Long Ma, Yi Wang, Risheng Liu, Xin Fan 2209 | TEMPORAL RAIN DECOMPOSITION WITH SPATIAL STRUCTURE GUIDANCE FOR VIDEO DERAINING |
1187 TENSOR DECOMPOSITION VIA CORE TENSOR NETWORKS Jianfu Zhang, Zerui Tao, Liqing Zhang, Qibin Zhao 1187 | TENSOR DECOMPOSITION VIA CORE TENSOR NETWORKS |
3758 TENSOR REORDERING FOR CNN COMPRESSION Matej Ulicny, Vladimir A. Krylov, Rozenn Dahyot 3758 | TENSOR REORDERING FOR CNN COMPRESSION |
3333 Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events Xuenan Xu, Heinrich Dinkel, Mengyue Wu, Yu Kai 3333 | Text-to-Audio Grounding: Building Correspondence Between Captions and Sound Events |
5008 THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE 2020: OPEN DATASETS, TRACKS, BASELINES, RESULTS AND METHODS Xian Shi, Fan Yu, Yizhou Lu, Yuhao Liang, Qiangze Feng, Daliang Wang, Yanmin Qian, Lei Xie 5008 | THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE 2020: OPEN DATASETS, TRACKS, BASELINES, RESULTS AND METHODS |
5587 THE AUTOMATIC DETECTION OF SPEECH DISORDER IN CHILDREN: CHALLENGES, OPPORTUNITIES, AND PRELIMINARY RESULTS Mostafa Shahin, Usman Zafar, Beena Ahmed 5587 | THE AUTOMATIC DETECTION OF SPEECH DISORDER IN CHILDREN: CHALLENGES, OPPORTUNITIES, AND PRELIMINARY RESULTS |
4045 THE BENEFIT OF TEMPORALLY-STRONG LABELS IN AUDIO EVENT CLASSIFICATION Shawn Hershey, Daniel P W Ellis, Eduardo Fonseca, Aren Jansen, Caroline Liu, R Channing Moore, Manoj Plakal 4045 | THE BENEFIT OF TEMPORALLY-STRONG LABELS IN AUDIO EVENT CLASSIFICATION |
2441 THE FAR-FIELD EQUATORIAL ARRAY FOR BINAURAL RENDERING Jens Ahrens, Hannes Helmholz, David Alon, Sebastià Amengual Garí 2441 | THE FAR-FIELD EQUATORIAL ARRAY FOR BINAURAL RENDERING |
3865 THE IDLAB VOXSRC-20 SUBMISSION: LARGE MARGIN FINE-TUNING AND QUALITY-AWARE SCORE CALIBRATION IN DNN BASED SPEAKER VERIFICATION Jenthe Thienpondt, Brecht Desplanques, Kris Demuynck 3865 | THE IDLAB VOXSRC-20 SUBMISSION: LARGE MARGIN FINE-TUNING AND QUALITY-AWARE SCORE CALIBRATION IN DNN BASED SPEAKER VERIFICATION |
4341 The ins and outs of speaker recognition: lessons from VoxSRC 2020 Yoohwan Kwon, Hee-Soo Heo, Bong-Jin Lee, Joon Son Chung 4341 | The ins and outs of speaker recognition: lessons from VoxSRC 2020 |
3944 THE IN-THE-WILD SPEECH MEDICAL CORPUS Joana Correia, Francisco Teixeira, Catarina Botelho, Isabel Trancoso, Bhiksha Raj 3944 | THE IN-THE-WILD SPEECH MEDICAL CORPUS |
3814 THE ROLE OF TASK AND ACOUSTIC SIMILARITY IN AUDIO TRANSFER LEARNING: INSIGHTS FROM THE SPEECH EMOTION RECOGNITION CASE Andreas Triantafyllopoulos, Björn Schuller 3814 | THE ROLE OF TASK AND ACOUSTIC SIMILARITY IN AUDIO TRANSFER LEARNING: INSIGHTS FROM THE SPEECH EMOTION RECOGNITION CASE |
3256 THE USE OF VOICE SOURCE FEATURES FOR SUNG SPEECH RECOGNITION Gerardo Roa Dabike, Jon Barker 3256 | THE USE OF VOICE SOURCE FEATURES FOR SUNG SPEECH RECOGNITION |
3800 TIME-DOMAIN CONCENTRATION AND APPROXIMATION OF COMPUTABLE BANDLIMITED SIGNALS Holger Boche, Ullrich Mönich 3800 | TIME-DOMAIN CONCENTRATION AND APPROXIMATION OF COMPUTABLE BANDLIMITED SIGNALS |
1723 TIME-DOMAIN LOSS MODULATION BASED ON OVERLAP RATIO FOR MONAURAL CONVERSATIONAL SPEAKER SEPARATION Hassan Taherian, DeLiang Wang 1723 | TIME-DOMAIN LOSS MODULATION BASED ON OVERLAP RATIO FOR MONAURAL CONVERSATIONAL SPEAKER SEPARATION |
3431 Time-domain speaker verification using temporal convolutional networks Sangwook Han, Jaeuk Byun, Jong Won Shin 3431 | Time-domain speaker verification using temporal convolutional networks |
3923 TIME-DOMAIN SPEECH EXTRACTION WITH SPATIAL INFORMATION AND MULTI SPEAKER CONDITIONING MECHANISM Jisi Zhang, Cătălin Zorilă, Rama Doddipatla, Jon Barker 3923 | TIME-DOMAIN SPEECH EXTRACTION WITH SPATIAL INFORMATION AND MULTI SPEAKER CONDITIONING MECHANISM |
1555 Time-varying graph signal inpainting via unrolling networks Siheng Chen, Yonina Eldar 1555 | Time-varying graph signal inpainting via unrolling networks |
3055 TINY TRANSDUCER: A HIGHLY-EFFICIENT SPEECH RECOGNITION MODEL ON EDGE DEVICES Yuekai Zhang, Sining Sun, Long Ma 3055 | TINY TRANSDUCER: A HIGHLY-EFFICIENT SPEECH RECOGNITION MODEL ON EDGE DEVICES |
1009 t-k-means: A ROBUST AND STABLE k-means VARIANT Yiming Li, Yang Zhang, Qingtao Tang, Weipeng Huang, Yong Jiang, Shu-Tao Xia 1009 | t-k-means: A ROBUST AND STABLE k-means VARIANT |
4770 To Supervise or Not To Supervise: How to Effectively Learn Wireless Interference Management Models? Bingqing Song, Haoran Sun, Wenqiang Pu, Sijia Liu, Mingyi Hong 4770 | To Supervise or Not To Supervise: How to Effectively Learn Wireless Interference Management Models? |
3806 TOP-DOWN ATTENTION IN END-TO-END SPOKEN LANGUAGE UNDERSTANDING Yixin Chen, Weiyi Lu, Alejandro Mottini, Li Erran Li, Jasha Droppo, Zheng Du, Belinda Zeng 3806 | TOP-DOWN ATTENTION IN END-TO-END SPOKEN LANGUAGE UNDERSTANDING |
2186 Topic Sequence Embedding for User Identity Linkage from Heterogeneous Behavior Data Jinzhu Yang, Wei Zhou, Wanhui Qian, Jizhong Han, Songlin Hu 2186 | Topic Sequence Embedding for User Identity Linkage from Heterogeneous Behavior Data |
3597 TOPIC-AWARE DIALOGUE GENERATION WITH TWO-HOP BASED GRAPH ATTENTION Shijie Zhou, Wenge Rong, Jianfei Zhang, Yanmeng Wang, Libin Shi, Zhang Xiong 3597 | TOPIC-AWARE DIALOGUE GENERATION WITH TWO-HOP BASED GRAPH ATTENTION |
5608 Topological IIR Filters Over Simplicial Topologies via Sheaves Georg Essl 5608 | Topological IIR Filters Over Simplicial Topologies via Sheaves |
4505 TOPOLOGICAL VOLTERRA FILTERS Geert Leus, Maosheng Yang, Mario Coutino, Elvin Isufi 4505 | TOPOLOGICAL VOLTERRA FILTERS |
2122 Toward Skills Dialog Orchestration with Online Learning Djallel Bouneffouf, Raphael Feraud, Sohini Upadhyay, Mayank Agarwal, Yasaman Khazaeni, Irina Rish 2122 | Toward Skills Dialog Orchestration with Online Learning |
2822 Towards Adversarial Robustness via Compact Feature Representations Muhammad Shah, Raphael Olivier, Bhiksha Raj 2822 | Towards Adversarial Robustness via Compact Feature Representations |
4501 TOWARDS AN ASR APPROACH FOR SPEECH ENHANCEMENT TO GENERATE MORE REALISTIC SPECTRA ACROSS TIME AND FREQUENCY KHANDOKAR MD. NAYEM, DONALD S. WILLIAMSON 4501 | TOWARDS AN ASR APPROACH FOR SPEECH ENHANCEMENT TO GENERATE MORE REALISTIC SPECTRA ACROSS TIME AND FREQUENCY |
2301 TOWARDS AN INTRINSIC DEFINITION OF ROBUSTNESS FOR A CLASSIFIER Théo Giraudon, Vincent Gripon, Matthias Löwe, Franck Vermet 2301 | TOWARDS AN INTRINSIC DEFINITION OF ROBUSTNESS FOR A CLASSIFIER |
5002 TOWARDS DATA SELECTION ON TTS DATA FOR CHILDREN'S SPEECH RECOGNITION Wei Wang, Zhikai Zhou, Yizhou Lu, Hongji Wang, Chenpeng Du, Yanmin Qian 5002 | TOWARDS DATA SELECTION ON TTS DATA FOR CHILDREN'S SPEECH RECOGNITION |
2221 TOWARDS EFFICIENT AGE ESTIMATION BY EMBEDDING POTENTIAL GENDER FEATURES Yulan Deng, Lunke Fei, Shaohua Teng, Wei Zhang, Dongning Liu, Yan Hou 2221 | TOWARDS EFFICIENT AGE ESTIMATION BY EMBEDDING POTENTIAL GENDER FEATURES |
1630 Towards efficient models for real-time deep noise suppression Sebastian Braun, Hannes Gamper, Chandan K. A. Reddy, Ivan Tashev 1630 | Towards efficient models for real-time deep noise suppression |
4927 Towards Efficiently Diversifying Dialogue Generation via Embedding Augmentation Yu Cao, Liang Ding, Zhiliang Tian, Meng Fang 4927 | Towards Efficiently Diversifying Dialogue Generation via Embedding Augmentation |
5062 TOWARDS EXPLAINING EXPRESSIVE QUALITIES IN PIANO RECORDINGS: TRANSFER OF EXPLANATORY FEATURES VIA ACOUSTIC DOMAIN ADAPTATION Shreyan Chowdhury, Gerhard Widmer 5062 | TOWARDS EXPLAINING EXPRESSIVE QUALITIES IN PIANO RECORDINGS: TRANSFER OF EXPLANATORY FEATURES VIA ACOUSTIC DOMAIN ADAPTATION |
4376 TOWARDS IMMEDIATE BACKCHANNEL GENERATION USING ATTENTION-BASED EARLY PREDICTION MODEL Amalia Istiqlali Adiba, Takeshi Homma, Toshinori Miyoshi 4376 | TOWARDS IMMEDIATE BACKCHANNEL GENERATION USING ATTENTION-BASED EARLY PREDICTION MODEL |
1244 TOWARDS LISTENING TO 10 PEOPLE SIMULTANEOUSLY: AN EFFICIENT PERMUTATION INVARIANT TRAINING OF AUDIO SOURCE SEPARATION USING SINKHORN’S ALGORITHM Hideyuki Tachibana 1244 | TOWARDS LISTENING TO 10 PEOPLE SIMULTANEOUSLY: AN EFFICIENT PERMUTATION INVARIANT TRAINING OF AUDIO SOURCE SEPARATION USING SINKHORN’S ALGORITHM |
2107 TOWARDS LOW-RESOURCE STARGAN VOICE CONVERSION USING WEIGHT ADAPTIVE INSTANCE NORMALIZATION Mingjie Chen, Yanpei Shi, Thomas Hain 2107 | TOWARDS LOW-RESOURCE STARGAN VOICE CONVERSION USING WEIGHT ADAPTIVE INSTANCE NORMALIZATION |
2243 TOWARDS NATURAL AND CONTROLLABLE CROSS-LINGUAL VOICE CONVERSION BASED ON NEURAL TTS MODEL AND PHONETIC POSTERIORGRAM Shengkui Zhao, Hao Wang, Trung Hieu Nguyen, Bin Ma 2243 | TOWARDS NATURAL AND CONTROLLABLE CROSS-LINGUAL VOICE CONVERSION BASED ON NEURAL TTS MODEL AND PHONETIC POSTERIORGRAM |
5294 TOWARDS PARKINSON’S DISEASE PROGNOSIS USING SELF-SUPERVISED LEARNING AND ANOMALY DETECTION Hongchao Jiang, Wei Yang Bryan Lim, Jer Shyuan Ng, Yu Wang, Ying Chi, Chunyan Miao 5294 | TOWARDS PARKINSON’S DISEASE PROGNOSIS USING SELF-SUPERVISED LEARNING AND ANOMALY DETECTION |
1476 TOWARDS PRACTICAL LIPREADING WITH DISTILLED AND EFFICIENT MODELS Pingchuan Ma, Brais Martinez, Stavros Petridis, Maja Pantic 1476 | TOWARDS PRACTICAL LIPREADING WITH DISTILLED AND EFFICIENT MODELS |
4137 TOWARDS PRACTICAL NEAR-MAXIMUM-LIKELIHOOD DECODING OF ERROR-CORRECTING CODES: AN OVERVIEW Thibaud Tonnellier, Marzieh Hashemipour, Nghia Doan, Warren Gross, Alexios Balatsoukas-Stimming 4137 | TOWARDS PRACTICAL NEAR-MAXIMUM-LIKELIHOOD DECODING OF ERROR-CORRECTING CODES: AN OVERVIEW |
2759 Towards Robust Speaker Verification with Target Speaker Enhancement Chunlei Zhang, Meng Yu, Chao Weng, Dong Yu 2759 | Towards Robust Speaker Verification with Target Speaker Enhancement |
5325 TOWARDS ROBUST TRAINING OF MULTI-SENSOR DATA FUSION NETWORK AGAINST ADVERSARIAL EXAMPLES IN SEMANTIC SEGMENTATION Youngjoon Yu, Hong Joo Lee, Byeong Cheon Kim, Jung Uk Kim, Yong Man Ro 5325 | TOWARDS ROBUST TRAINING OF MULTI-SENSOR DATA FUSION NETWORK AGAINST ADVERSARIAL EXAMPLES IN SEMANTIC SEGMENTATION |
2597 TOWARDS THE DEVELOPMENT OF SUBJECT-INDEPENDENT INVERSE METABOLIC MODELS Seyedhooman Sajjadi, Bobak Mortazavi, Anurag Das, Theodora Chaspari, Projna Paromita, Laura Ruebush, Nicolaas Deutz, Ricardo Gutierrez-Osuna 2597 | TOWARDS THE DEVELOPMENT OF SUBJECT-INDEPENDENT INVERSE METABOLIC MODELS |
4766 TRAFFIC SPEED FORECASTING VIA SPATIO-TEMPORAL ATTENTIVE GRAPH ISOMORPHISM NETWORK Qing Yang, Ting Zhong, Fan Zhou 4766 | TRAFFIC SPEED FORECASTING VIA SPATIO-TEMPORAL ATTENTIVE GRAPH ISOMORPHISM NETWORK |
3697 TRAIN YOUR CLASSIFIER FIRST: CASCADE NEURAL NETWORKS TRAINING FROM UPPER LAYERS TO LOWER LAYERS Shucong Zhang, Cong-Thanh Do, Rama Doddipatla, Erfan Loweimi, Peter Bell, Steve Renals 3697 | TRAIN YOUR CLASSIFIER FIRST: CASCADE NEURAL NETWORKS TRAINING FROM UPPER LAYERS TO LOWER LAYERS |
1470 TRAINING A BANK OF WIENER MODELS WITH A NOVEL QUADRATIC MUTUAL INFORMATION COST FUNCTION Bo Hu, Jose C. Principe 1470 | TRAINING A BANK OF WIENER MODELS WITH A NOVEL QUADRATIC MUTUAL INFORMATION COST FUNCTION |
3929 TRAINING LOGICAL NEURAL NETWORKS BY PRIMAL–DUAL METHODS FOR NEURO-SYMBOLIC REASONING Songtao Lu, Naweed Khan, Ismail Akhalwaya, Ryan Riegel, Lior Horesh, Alexander Gray 3929 | TRAINING LOGICAL NEURAL NETWORKS BY PRIMAL–DUAL METHODS FOR NEURO-SYMBOLIC REASONING |
2369 Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging Taeheon Lee, Jeonghwan Hwang, Honggu Lee 2369 | Training Neural Networks with Domain Pattern-Aware Auxiliary Task for Sleep Staging |
4453 TRAINING NOISY SINGLE-CHANNEL SPEECH SEPARATION WITH NOISY ORACLE SOURCES: A LARGE GAP AND A SMALL STEP Matthew Maciejewski, Jing Shi, Shinji Watanabe, Sanjeev Khudanpur 4453 | TRAINING NOISY SINGLE-CHANNEL SPEECH SEPARATION WITH NOISY ORACLE SOURCES: A LARGE GAP AND A SMALL STEP |
1870 TRAINING REAL-TIME PANORAMIC OBJECT DETECTORS WITH VIRTUAL DATASET Qing-Yang Shen, Tian-Guo Huang, Peng-Xin Ding, Jia He 1870 | TRAINING REAL-TIME PANORAMIC OBJECT DETECTORS WITH VIRTUAL DATASET |
2089 TRAINING SPEECH RECOGNITION MODELS WITH FEDERATED LEARNING: A QUALITY/COST FRAMEWORK Dhruv Guliani, Francoise Beaufays, Giovanni Motta 2089 | TRAINING SPEECH RECOGNITION MODELS WITH FEDERATED LEARNING: A QUALITY/COST FRAMEWORK |
2698 TRANSCRIPTION IS ALL YOU NEED: LEARNING TO SEPARATE MUSICAL MIXTURES WITH SCORE AS SUPERVISION Yun-Ning Hung, Gordon Wichern, Jonathan Le Roux 2698 | TRANSCRIPTION IS ALL YOU NEED: LEARNING TO SEPARATE MUSICAL MIXTURES WITH SCORE AS SUPERVISION |
4250 TRANSFER LEARNING FOR INPUT ESTIMATION OF VEHICLE SYSTEMS Liam Cronin, Soheil Sadeghi Eshkevari, Debarshi Sen, Shamim Pakzad 4250 | TRANSFER LEARNING FOR INPUT ESTIMATION OF VEHICLE SYSTEMS |
5333 TRANSFORMER BASED UNSUPERVISED PRE-TRAINING FOR ACOUSTIC REPRESENTATION LEARNING Ruixiong Zhang, Haiwei Wu, Wubo Li, Dongwei Jiang, Wei Zou, Xiangang Li 5333 | TRANSFORMER BASED UNSUPERVISED PRE-TRAINING FOR ACOUSTIC REPRESENTATION LEARNING |
4127 Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications Yongqiang Wang, Yangyang Shi, Frank Zhang, Chunyang Wu, Julian Chan, Ching-Feng Yeh, Alex Xiao 4127 | Transformer in action: a comparative study of transformer-based acoustic models for large scale speech recognition applications |
3907 Transformer Language Models with LSTM-based Cross-utterance Information Representation Guangzhi Sun, Chao Zhang, Phil Woodland 3907 | Transformer Language Models with LSTM-based Cross-utterance Information Representation |
1847 Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention Menglong Xu, Shengqiang Li, Xiao-Lei Zhang 1847 | Transformer-based End-to-End Speech Recognition with Local Dense Synthesizer Attention |
2168 Transformer-Transducers for Code-Switched Speech Recognition Siddharth Dalmia, Yuzong Liu, Srikanth Ronanki, Katrin Kirchhoff 2168 | Transformer-Transducers for Code-Switched Speech Recognition |
1619 TRANSITIVE TRANSFER SPARSE CODING FOR DISTANT DOMAIN Lingtian Feng, Feng Qian, Xin He, Yuqi Fan, Hanpeng Cai, Guangmin Hu 1619 | TRANSITIVE TRANSFER SPARSE CODING FOR DISTANT DOMAIN |
5080 TRANSMASK: A COMPACT AND FAST SPEECH SEPARATION MODEL BASED ON TRANSFORMER zining zhang, bingsheng he, zhenjie zhang 5080 | TRANSMASK: A COMPACT AND FAST SPEECH SEPARATION MODEL BASED ON TRANSFORMER |
3274 TRANSMITTANCE REGULARIZER FOR BINARY CODED APERTURE DESIGN IN A COMPUTATIONAL IMAGING END-TO-END APPROACH Jorge Bacca, Tatiana Gelvez, Henry Arguello 3274 | TRANSMITTANCE REGULARIZER FOR BINARY CODED APERTURE DESIGN IN A COMPUTATIONAL IMAGING END-TO-END APPROACH |
3623 TRIPLE SEQUENCE GENERATIVE ADVERSARIAL NETS FOR UNSUPERVISED IMAGE CAPTIONING Yucheng Zhou, Wei Tao, Wenqiang Zhang 3623 | TRIPLE SEQUENCE GENERATIVE ADVERSARIAL NETS FOR UNSUPERVISED IMAGE CAPTIONING |
4707 TSTNN: TWO-STAGE TRANSFORMER BASED NEURAL NETWORK FOR SPEECH ENHANCEMENT IN THE TIME DOMAIN Kai Wang, Bengbeng He, Wei-Ping Zhu 4707 | TSTNN: TWO-STAGE TRANSFORMER BASED NEURAL NETWORK FOR SPEECH ENHANCEMENT IN THE TIME DOMAIN |
3277 TTS-BY-TTS: TTS-DRIVEN DATA AUGMENTATION FOR FAST AND HIGH-QUALITY SPEECH SYNTHESIS Min-Jae Hwang, Ryuichi Yamamoto, Eunwoo Song, Jae-Min Kim 3277 | TTS-BY-TTS: TTS-DRIVEN DATA AUGMENTATION FOR FAST AND HIGH-QUALITY SPEECH SYNTHESIS |
3145 TUCKER DECOMPOSITION FOR EXTRACTING SHARED AND INDIVIDUAL SPATIAL MAPS FROM MULTI-SUBJECT RESTING-STATE FMRI DATA Yue Han, Qiu-Hua Lin, Li-Dan Kuang, Xiao-Feng Gong, Fengyu Cong, Vince Calhoun 3145 | TUCKER DECOMPOSITION FOR EXTRACTING SHARED AND INDIVIDUAL SPATIAL MAPS FROM MULTI-SUBJECT RESTING-STATE FMRI DATA |
2489 Two-Stage Adaptive Pooling with RT-qPCR for COVID-19 Screening Anoosheh Heidarzadeh, Krishna Narayanan 2489 | Two-Stage Adaptive Pooling with RT-qPCR for COVID-19 Screening |
2888 Two-Stage Framework for Seasonal Time Series Forecasting Qingyang Xu, Qingsong Wen, Liang Sun 2888 | Two-Stage Framework for Seasonal Time Series Forecasting |
4426 Two-stage Graph-constrained Group Testing: Theory and Application Saurabh Sihag, Ali Tajer, Urbashi Mitra 4426 | Two-stage Graph-constrained Group Testing: Theory and Application |
4641 TWO-STAGE TEXTUAL KNOWLEDGE DISTILLATION FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING Seongbin Kim, Gyuwan Kim, Seongjin Shin, Sangmin Lee 4641 | TWO-STAGE TEXTUAL KNOWLEDGE DISTILLATION FOR END-TO-END SPOKEN LANGUAGE UNDERSTANDING |
2006 TYPINGWRISTBAND: A HUMAN SLIGHT MOTION SENSING SYSTEM BASED ON VIBRATION DETECTION Siyao Cheng, Jialiang Yan, Jianzhong Li, Jie Liu 2006 | TYPINGWRISTBAND: A HUMAN SLIGHT MOTION SENSING SYSTEM BASED ON VIBRATION DETECTION |
5099 U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS EUISUNG KIM, Jae-Jin Jeon, Hyeji Seo 5099 | U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS |
2457 ULTRA-LIGHTWEIGHT SPEECH SEPARATION VIA GROUP COMMUNICATION Yi Luo, Cong Han, Nima Mesgarani 2457 | ULTRA-LIGHTWEIGHT SPEECH SEPARATION VIA GROUP COMMUNICATION |
4858 ULTRA-LOW BITRATE VIDEO CONFERENCING USING DEEP IMAGE ANIMATION Goluck Konuko, Giuseppe Valenzise, Stéphane Lathuilière 4858 | ULTRA-LOW BITRATE VIDEO CONFERENCING USING DEEP IMAGE ANIMATION |
4261 ULTRASOUND ELASTICITY IMAGING USING PHYSICS-BASED MODELS AND LEARNING-BASED PLUG-AND-PLAY PRIORS Narges Mohammadi, Marvin M. Doyley, Mujdat Cetin 4261 | ULTRASOUND ELASTICITY IMAGING USING PHYSICS-BASED MODELS AND LEARNING-BASED PLUG-AND-PLAY PRIORS |
2131 UNCERTAINTY-BASED BIOLOGICAL AGE ESTIMATION OF BRAIN MRI SCANS Karim Armanious, Sherif Abdulatif, Wenbin Shi, Tobias Hepp, Sergios Gatidis, Bin Yang 2131 | UNCERTAINTY-BASED BIOLOGICAL AGE ESTIMATION OF BRAIN MRI SCANS |
3563 Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution Bahareh Tolooshams, Satish Mulleti, Demba Ba, Yonina C. Eldar 3563 | Unfolding Neural Networks for Compressive Multichannel Blind Deconvolution |
3053 UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION Jian Luo, Jianzong Wang, Ning Cheng, Jing Xiao 3053 | UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION |
4328 Unified Clustering and Outlier Detection on Specialized Hardware Eldan Cohen, Hayato Ushijima-Mwesigwa, Avradip Mandal, Arnab Roy 4328 | Unified Clustering and Outlier Detection on Specialized Hardware |
4757 UNIFIED GRADIENT REWEIGHTING FOR MODEL BIASING WITH APPLICATIONS TO SOURCE SEPARATION Efthymios Tzinis, Dimitrios Bralios, Paris Smaragdis 4757 | UNIFIED GRADIENT REWEIGHTING FOR MODEL BIASING WITH APPLICATIONS TO SOURCE SEPARATION |
4556 UNIT SELECTION SYNTHESIS BASED DATA AUGMENTATION FOR FIXED PHRASE SPEAKER VERIFICATION Houjun Huang, Xu Xiang, Fei Zhao, Shuai Wang, Yanmin Qian 4556 | UNIT SELECTION SYNTHESIS BASED DATA AUGMENTATION FOR FIXED PHRASE SPEAKER VERIFICATION |
2386 UNIVERSAL NEURAL VOCODING WITH PARALLEL WAVENET Yunlong Jiao, Adam Gabrys, Georgi Tinchev, Bartosz Putrycz, Daniel Korzekwa, Viacheslav Klimkov 2386 | UNIVERSAL NEURAL VOCODING WITH PARALLEL WAVENET |
2974 UNROLLING OF DEEP GRAPH TOTAL VARIATION FOR IMAGE DENOISING Huy Vu, Gene Cheung, Yonina C. Eldar 2974 | UNROLLING OF DEEP GRAPH TOTAL VARIATION FOR IMAGE DENOISING |
4219 UNSUPERVISED AND SEMI-SUPERVISED FEW-SHOT ACOUSTIC EVENT CLASSIFICATION Hsin-Ping Huang, Krishna Puvvada, Ming Sun, Chao Wang 4219 | UNSUPERVISED AND SEMI-SUPERVISED FEW-SHOT ACOUSTIC EVENT CLASSIFICATION |
5059 UNSUPERVISED AUDIO-VISUAL SUBSPACE ALIGNMENT FOR HIGH-STAKES DECEPTION DETECTION Leena Mathur, Maja Matarić 5059 | UNSUPERVISED AUDIO-VISUAL SUBSPACE ALIGNMENT FOR HIGH-STAKES DECEPTION DETECTION |
3979 Unsupervised Clustering of Time Series Signals using Neuromorphic Energy-Efficient Temporal Neural Networks Shreyas Chaudhari, Harideep Nair, Jose Moura, John Shen 3979 | Unsupervised Clustering of Time Series Signals using Neuromorphic Energy-Efficient Temporal Neural Networks |
1480 UNSUPERVISED COMMON PARTICULAR OBJECT DISCOVERY AND LOCALIZATION BY ANALYZING A MATCH GRAPH Makoto Okuda, Shin'ichi Satoh, Yoichi Sato, Yutaka Kidawara 1480 | UNSUPERVISED COMMON PARTICULAR OBJECT DISCOVERY AND LOCALIZATION BY ANALYZING A MATCH GRAPH |
4255 Unsupervised Contrastive Learning of Sound Event Representations Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra 4255 | Unsupervised Contrastive Learning of Sound Event Representations |
4356 UNSUPERVISED DISCRIMINATIVE LEARNING OF SOUNDS FOR AUDIO EVENT CLASSIFICATION Sascha Hornauer, Ke Li, Stella Yu, Shabnam Ghaffarzadegan, Liu Ren 4356 | UNSUPERVISED DISCRIMINATIVE LEARNING OF SOUNDS FOR AUDIO EVENT CLASSIFICATION |
4976 Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training Sameer Khurana, Niko Moritz, Takaaki Hori, Jonathan Le Roux 4976 | Unsupervised Domain Adaptation for Speech Recognition via Uncertainty Driven Self-Training |
4550 UNSUPERVISED HEART ABNORMALITY DETECTION BASED ON PHONOCARDIOGRAM ANALYSIS WITH BETA VARIATIONAL AUTO-ENCODERS Shengchen Li, Ke Tian, Rui Wang 4550 | UNSUPERVISED HEART ABNORMALITY DETECTION BASED ON PHONOCARDIOGRAM ANALYSIS WITH BETA VARIATIONAL AUTO-ENCODERS |
3629 UNSUPERVISED IMAGE SEGMENTATION WITH SPATIAL TRIPLET MARKOV TREES Hugo Gangloff, Jean-Baptiste Courbot, Emmanuel Monfrini, Christophe Collet 3629 | UNSUPERVISED IMAGE SEGMENTATION WITH SPATIAL TRIPLET MARKOV TREES |
4269 UNSUPERVISED LEARNING FOR ASYNCHRONOUS RESOURCE ALLOCATION IN AD-HOC WIRELESS NETWORKS Zhiyang Wang, Mark Eisen, Alejandro Ribeiro 4269 | UNSUPERVISED LEARNING FOR ASYNCHRONOUS RESOURCE ALLOCATION IN AD-HOC WIRELESS NETWORKS |
3222 UNSUPERVISED LEARNING FOR MULTI-STYLE SPEECH SYNTHESIS WITH LIMITED DATA Shuang Liang, Chenfeng Miao, Minchuan Chen, Jun Ma, Shaojun Wang, Jing Xiao 3222 | UNSUPERVISED LEARNING FOR MULTI-STYLE SPEECH SYNTHESIS WITH LIMITED DATA |
4982 UNSUPERVISED MOTION REPRESENTATION ENHANCED NETWORK FOR ACTION RECOGNITION Xiaohang Yang, Lingtong Kong, Jie Yang 4982 | UNSUPERVISED MOTION REPRESENTATION ENHANCED NETWORK FOR ACTION RECOGNITION |
1243 Unsupervised Multimodal Image Registration with Adaptative Gradient Guidance Zhe Xu, Jiangpeng Yan, Jie Luo, Xiu Li, Jagadeesan Jayender 1243 | Unsupervised Multimodal Image Registration with Adaptative Gradient Guidance |
2705 UNSUPERVISED MUSICAL TIMBRE TRANSFER FOR NOTIFICATION SOUNDS Jing Yang, Tristan Cinquin, Gábor Sörös 2705 | UNSUPERVISED MUSICAL TIMBRE TRANSFER FOR NOTIFICATION SOUNDS |
2868 Unsupervised neural adaptation model based on optimal transport for spoken language identification Xugang Lu, Peng Shen, Yu Tsao, Hisashi Kawai 2868 | Unsupervised neural adaptation model based on optimal transport for spoken language identification |
5390 UNSUPERVISED RECONSTRUCTION OF SEA SURFACE CURRENTS FROM AIS MARITIME TRAFFIC DATA USING LEARNABLE VARIATIONAL MODELS Simon Benaïchouche, Clement Legoff, Yann Guichoux, François Rousseau, Ronan Fablet 5390 | UNSUPERVISED RECONSTRUCTION OF SEA SURFACE CURRENTS FROM AIS MARITIME TRAFFIC DATA USING LEARNABLE VARIATIONAL MODELS |
4738 UNSUPERVISED STACKED CAPSULE AUTOENCODER FOR HYPERSPECTRAL IMAGE CLASSIFICATION Erting Pan, Yong Ma, Xiaoguang Mei, Fan Fan, Jiayi Ma 4738 | UNSUPERVISED STACKED CAPSULE AUTOENCODER FOR HYPERSPECTRAL IMAGE CLASSIFICATION |
1171 Unveiling anomalous nodes via random sampling and consensus on graphs Vassilis N. Ioannidis, Dimitris Berberidis, Georgios B. Giannakis 1171 | Unveiling anomalous nodes via random sampling and consensus on graphs |
1819 UPSAMPLING ARTIFACTS IN NEURAL AUDIO SYNTHESIS Jordi Pons, Santiago Pascual, Giulio Cengarle, Joan Serrà 1819 | UPSAMPLING ARTIFACTS IN NEURAL AUDIO SYNTHESIS |
2136 USERREG: A SIMPLE BUT STRONG MODEL FOR RATING PREDICTION Haiyang Zhang, Ivan Ganchev, Nikola S. Nikolov, Mark Stevenson 2136 | USERREG: A SIMPLE BUT STRONG MODEL FOR RATING PREDICTION |
4606 USING DEEP IMAGE PRIORS TO GENERATE COUNTERFACTUAL EXPLANATIONS Vivek Narayanaswamy, Jayaraman Thiagarajan, Andreas Spanias 4606 | USING DEEP IMAGE PRIORS TO GENERATE COUNTERFACTUAL EXPLANATIONS |
1461 USING SYNTHETIC AUDIO TO IMPROVE THE RECOGNITION OF OUT-OF-VOCABULARY WORDS IN END-TO-END ASR SYSTEMS Xianrui Zheng, Yulan Liu, Deniz Gunceler, Daniel Willett 1461 | USING SYNTHETIC AUDIO TO IMPROVE THE RECOGNITION OF OUT-OF-VOCABULARY WORDS IN END-TO-END ASR SYSTEMS |
4418 uTDN: An Unsupervised Two-Stream Dirichlet-Net for Hyperspectral Unmixing Qiwen Jin, Yong Ma, Xiaoguang Mei, Hao Li, Jiayi Ma 4418 | uTDN: An Unsupervised Two-Stream Dirichlet-Net for Hyperspectral Unmixing |
2561 VALIDATING THE INSPIRED SINEWAVE TECHNIQUE TO MEASURE LUNG HETEROGENEITY COMPARED TO ATELECTASIS & OVER-DISTENDED VOLUME IN COMPUTED TOMOGRAPHY IMAGES Minh Tran, PhiAnh Phan, Douglas Crockett, Federico Formenti, John Cronin, Stephen Payne, Andrew Farmery 2561 | VALIDATING THE INSPIRED SINEWAVE TECHNIQUE TO MEASURE LUNG HETEROGENEITY COMPARED TO ATELECTASIS & OVER-DISTENDED VOLUME IN COMPUTED TOMOGRAPHY IMAGES |
3908 VARIANCE-CONSTRAINED LEARNING FOR STOCHASTIC GRAPH NEURAL NETWORKS Zhan Gao, Elvin Isufi, Alejandro Ribeiro 3908 | VARIANCE-CONSTRAINED LEARNING FOR STOCHASTIC GRAPH NEURAL NETWORKS |
3684 VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT WITH A NOISE-AWARE ENCODER Huajian Fang, Guillaume Carbajal, Stefan Wermter, Timo Gerkmann 3684 | VARIATIONAL AUTOENCODER FOR SPEECH ENHANCEMENT WITH A NOISE-AWARE ENCODER |
4085 VARIATIONAL AUTOENCODERS FOR HYPERSPECTRAL UNMIXING WITH ENDMEMBER VARIABILITY Shuaikai Shi, Min Zhao, Lijun Zhang, Jie Chen 4085 | VARIATIONAL AUTOENCODERS FOR HYPERSPECTRAL UNMIXING WITH ENDMEMBER VARIABILITY |
5600 VARIATIONAL DENOISING AUTOENCODERS AND LEAST-SQUARES POLICY ITERATION FOR STATISTICAL DIALOGUE MANAGERS Vassilios Diakoloukas, Fotios Lygerakis, Michail Lagoudakis, Margarita Kotti 5600 | VARIATIONAL DENOISING AUTOENCODERS AND LEAST-SQUARES POLICY ITERATION FOR STATISTICAL DIALOGUE MANAGERS |
3501 VARIATIONAL DIALOGUE GENERATION WITH NORMALIZING FLOWS Tien-Ching Luo, Jen-Tzung Chien 3501 | VARIATIONAL DIALOGUE GENERATION WITH NORMALIZING FLOWS |
5321 VARIATIONAL PARAMETER LEARNING IN SEQUENTIAL STATE-SPACE MODEL VIA PARTICLE FILTERING Chenhao Li, Simon Godsill 5321 | VARIATIONAL PARAMETER LEARNING IN SEQUENTIAL STATE-SPACE MODEL VIA PARTICLE FILTERING |
2918 VARIATION-STABLE FUSION FOR PPG-BASED BIOMETRIC SYSTEM Dae Yon Hwang, Bilal Taha, Dimitrios Hatzinakos 2918 | VARIATION-STABLE FUSION FOR PPG-BASED BIOMETRIC SYSTEM |
1939 VEHICLE 3D LOCALIZATION IN ROAD SCENES VIA A MONOCULAR MOVING CAMERA Yanting Zhang, Aotian Zheng, Ke Han, Yizhou Wang, Jenq-Neng Hwang 1939 | VEHICLE 3D LOCALIZATION IN ROAD SCENES VIA A MONOCULAR MOVING CAMERA |
4126 VGAI: END-TO-END LEARNING OF VISION-BASED DECENTRALIZED CONTROLLERS FOR ROBOT SWARMS Ting-Kuei Hu, Fernando Gama, Tianlong Chen, Zhangyang Wang, Alejandro Ribeiro, Brian M. Sadler 4126 | VGAI: END-TO-END LEARNING OF VISION-BASED DECENTRALIZED CONTROLLERS FOR ROBOT SWARMS |
3807 VIDEO QUALITY PREDICTION USING VOXEL-WISE fMRI MODELS OF THE VISUAL CORTEX Naga Sailaja Mahankali, Sumohana S Channappayya 3807 | VIDEO QUALITY PREDICTION USING VOXEL-WISE fMRI MODELS OF THE VISUAL CORTEX |
3716 VIOLENCE DETECTION IN VIDEOS BASED ON FUSING VISUAL AND AUDIO INFORMATION Wenfeng Pang, Qianhua He, Yongjian Hu, Yanxiong Li 3716 | VIOLENCE DETECTION IN VIDEOS BASED ON FUSING VISUAL AND AUDIO INFORMATION |
1011 VISUAL PRIVACY PROTECTION VIA MAPPING DISTORTION Yiming Li, Peidong Liu, Yong Jiang, Shu-Tao Xia 1011 | VISUAL PRIVACY PROTECTION VIA MAPPING DISTORTION |
5198 VISUALIZING ASSOCIATION IN EXEMPLAR-BASED CLASSIFICATION Taiga Kashima, Ryuichiro Hataya, Hideki Nakayama 5198 | VISUALIZING ASSOCIATION IN EXEMPLAR-BASED CLASSIFICATION |
3642 VK-Net: Category-level Point Cloud Registration with Unsupervised Rotation Invariant Keypoints Zhi Chen, Wei Yang, Zhenbo Xu, Zhenbo Shi, Liusheng Huang 3642 | VK-Net: Category-level Point Cloud Registration with Unsupervised Rotation Invariant Keypoints |
3362 VOWEL NON-VOWEL BASED SPECTRAL WARPING AND TIME SCALE MODIFICATION FOR IMPROVEMENT IN CHILDREN’S ASR Hemant Kathania, Avinash Kumar, Mikko Kurimo 3362 | VOWEL NON-VOWEL BASED SPECTRAL WARPING AND TIME SCALE MODIFICATION FOR IMPROVEMENT IN CHILDREN’S ASR |
4118 VSET: A Multimodal Transformer for Visual Speech Enhancement Karthik Ramesh, Wupeng Wang, Chao Xing, Dong Wang, Xiao Chen 4118 | VSET: A Multimodal Transformer for Visual Speech Enhancement |
3001 WAKE WORD DETECTION WITH STREAMING TRANSFORMERS Yiming Wang, Hang Lv, Daniel Povey, Lei Xie, Sanjeev Khudanpur 3001 | WAKE WORD DETECTION WITH STREAMING TRANSFORMERS |
3207 WARP-Q: QUALITY PREDICTION FOR GENERATIVE NEURAL SPEECH CODECS Wissam Jassim, Jan Skoglund, Michael Chinen, Andrew Hines 3207 | WARP-Q: QUALITY PREDICTION FOR GENERATIVE NEURAL SPEECH CODECS |
5480 WASE: LEARNING WHEN TO ATTEND FOR SPEAKER EXTRACTION IN COCKTAIL PARTY ENVIRONMENTS Yunzhe Hao, Jiaming Xu, Peng Zhang, Bo Xu 5480 | WASE: LEARNING WHEN TO ATTEND FOR SPEAKER EXTRACTION IN COCKTAIL PARTY ENVIRONMENTS |
2670 WASSERSTEIN BARYCENTER TRANSPORT FOR ACOUSTIC ADAPTATION Eduardo Fernandes Montesuma, Fred-Maurice Ngolè Mboula 2670 | WASSERSTEIN BARYCENTER TRANSPORT FOR ACOUSTIC ADAPTATION |
5350 WAVE-DOMAIN OPTIMIZATION OF SECONDARY SOURCE PLACEMENT FREE FROM INFORMATION OF ERROR SENSOR POSITIONS Jian Xu, Kean Chen, Yunhe Li 5350 | WAVE-DOMAIN OPTIMIZATION OF SECONDARY SOURCE PLACEMENT FREE FROM INFORMATION OF ERROR SENSOR POSITIONS |
5370 Waveform Design for the Joint MIMO Radar and Communications With Low Integrated Sidelobe Levels and Accurate Information Embedding Yongzhe Li, Xinyu Wu, Ran Tao 5370 | Waveform Design for the Joint MIMO Radar and Communications With Low Integrated Sidelobe Levels and Accurate Information Embedding |
2729 WAVE-TACOTRON: SPECTROGRAM-FREE END-TO-END TEXT-TO-SPEECH SYNTHESIS Ron Weiss, RJ Skerry-Ryan, Eric Battenberg, Soroosh Mariooryad, Diederik Kingma 2729 | WAVE-TACOTRON: SPECTROGRAM-FREE END-TO-END TEXT-TO-SPEECH SYNTHESIS |
2479 Weakly Supervised Patch Label Inference Network with Image Pyramid for Pavement Diseases Recognition in the Wild Guixin Huang, Sheng Huang, Luwen Huangfu, Dan Yang 2479 | Weakly Supervised Patch Label Inference Network with Image Pyramid for Pavement Diseases Recognition in the Wild |
5343 Wearing a MASK: Compressed Representations of Variable-Length Sequences Using Recurrent Neural Tangent Kernels Sina Alemohammad, Hossein Babaei, Randall Balestriero, Matt Y. Cheung, Ahmed Imtiaz Humayun, Daniel LeJeune, Naiming Liu, Lorenzo Luzi, Jasper Tan, Zichao Wang, Richard Baraniuk 5343 | Wearing a MASK: Compressed Representations of Variable-Length Sequences Using Recurrent Neural Tangent Kernels |
2046 WEBLY SUPERVISED DEEP ATTENTIVE QUANTIZATION Jinpeng Wang, Bin Chen, Tao Dai, Shutao Xia 2046 | WEBLY SUPERVISED DEEP ATTENTIVE QUANTIZATION |
3638 WEIGHT IDENTIFICATION THROUGH GLOBAL OPTIMIZATION IN A NEW HYSTERETIC NEURAL NETWORK MODEL Elie Leroy, Arthur Marmin, Marc Castella, Laurent Duval 3638 | WEIGHT IDENTIFICATION THROUGH GLOBAL OPTIMIZATION IN A NEW HYSTERETIC NEURAL NETWORK MODEL |
3813 WEIGHTED MAGNITUDE-PHASE LOSS FOR SPEECH DEREVERBERATION Jingshu Zhang, Mark Plumbley, Wenwu Wang 3813 | WEIGHTED MAGNITUDE-PHASE LOSS FOR SPEECH DEREVERBERATION |
3225 Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge ziteng wang, yueyue na, zhang liu, biao tian, qiang fu 3225 | Weighted Recursive Least Square Filter and Neural Network based Residual Echo Suppression for the AEC-Challenge |
1310 WHAT AND WHERE TO FOCUS IN PERSON SEARCH Tong Zhou, Kun Tian 1310 | WHAT AND WHERE TO FOCUS IN PERSON SEARCH |
3618 WHAT'S ALL THE FUSS ABOUT FREE UNIVERSAL SOUND SEPARATION DATA? Scott Wisdom, Hakan Erdogan, Daniel P. W. Ellis, Romain Serizel, Nicolas Turpault, Eduardo Fonseca, Justin Salamon, Prem Seetharaman, John R. Hershey 3618 | WHAT'S ALL THE FUSS ABOUT FREE UNIVERSAL SOUND SEPARATION DATA? |
5336 When Face Recognition Meets Occlusion: A New Benchmark Baojin Huang, Zhongyuan Wang, Guangcheng Wang, Kui Jiang, Kangli Zeng, Zhen Han, Xin Tian, Yuhong Yang 5336 | When Face Recognition Meets Occlusion: A New Benchmark |
3530 WIDE AND DEEP GRAPH NEURAL NETWORKS WITH DISTRIBUTED ONLINE LEARNING Zhan Gao, Fernando Gama, Alejandro Ribeiro 3530 | WIDE AND DEEP GRAPH NEURAL NETWORKS WITH DISTRIBUTED ONLINE LEARNING |
3338 WIENER FILTER ON MEET/JOIN LATTICES Bastian Seifert, Chris Wendler, Markus Püschel 3338 | WIENER FILTER ON MEET/JOIN LATTICES |
2626 WIFI-BASED DEVICE-FREE GESTURE RECOGNITION THROUGH-THE-WALL Sai Deepika Regani, Beibei Wang, K.J. Ray Liu 2626 | WIFI-BASED DEVICE-FREE GESTURE RECOGNITION THROUGH-THE-WALL |
1103 WINDOW BEAMFORMER FOR SPARSE CONCENTRIC CIRCULAR ARRAY RAJIB SHARMA, ISRAEL COHEN, BARUCH BERDUGO 1103 | WINDOW BEAMFORMER FOR SPARSE CONCENTRIC CIRCULAR ARRAY |
4468 Word-Level ASL Recognition and Trigger Sign Detection with RF Sensors Mohammed Rahman, Emre Kurtoglu, Robiulhossain Mdrafi, Ali Gurbuz, Evie Malaia, Chris Crawford, Darrin Griffin, Sevgi Gurbuz 4468 | Word-Level ASL Recognition and Trigger Sign Detection with RF Sensors |
2459 YAPA: ACCELERATED PROXIMAL ALGORITHM FOR CONVEX COMPOSITE PROBLEMS Giovanni Chierchia, Mireille El Gheche 2459 | YAPA: ACCELERATED PROXIMAL ALGORITHM FOR CONVEX COMPOSITE PROBLEMS |
5439 ZERO-GRADIENT CONSTRAINTS FOR DESTRIPING OF REMOTE-SENSING DATA Kazuki Naganuma, Saori Takeyama, Shunsuke Ono 5439 | ZERO-GRADIENT CONSTRAINTS FOR DESTRIPING OF REMOTE-SENSING DATA |
3766 ZERO-SHOT AUDIO CLASSIFICATION WITH FACTORED LINEAR AND NONLINEAR ACOUSTIC-SEMANTIC PROJECTIONS Huang Xie, Okko Räsänen, Tuomas Virtanen 3766 | ZERO-SHOT AUDIO CLASSIFICATION WITH FACTORED LINEAR AND NONLINEAR ACOUSTIC-SEMANTIC PROJECTIONS |
4849 ZERO-SHOT VOICE CONVERSION WITH ADJUSTED SPEAKER EMBEDDINGS AND SIMPLE ACOUSTIC FEATURES Zhiyuan Tan, Jianguo Wei, Junhai Xu, Yuqing He, Wenhuan Lu 4849 | ZERO-SHOT VOICE CONVERSION WITH ADJUSTED SPEAKER EMBEDDINGS AND SIMPLE ACOUSTIC FEATURES |