研飞ivySCI (青藤学术）

IEEE/ACM Transactions on Audio Speech and Language Processing

短名	IEEE/ACM Trans. Audio Speech Lang. Process.
Journal Impact	4.09
国际分区	ENGINEERING, ELECTRICAL & ELECTRONIC(Q2)
期刊索引	SCI Q1中科院 2 区

ISSN	2329-9290, 2329-9304
h-index	81
国内分区	计算机科学(2区)计算机科学声学(1区)计算机科学工程电子与电气(2区)
Top期刊	是

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 涵盖了音频、语音和语言处理以及支持它们的科学。在音频处理方面：换能器、室内声学、有源声音控制、人类听觉、音乐分析/合成/编码，以及消费类音频。在语音处理方面：语音分析、合成、编码、语音和说话人识别、语音产生和感知以及语音增强等领域。在语言处理方面：语音和文本分析、理解、生成、对话管理、翻译、摘要、问答和文档索引和检索，以及通用语言建模。

涉及主题	计算机科学人工智能工程类语音识别数学物理电气工程电信哲学声学计算机视觉统计心理学机器学习语言学量子力学生物程序设计语言算法自然语言处理认知心理学
出版信息	出版商: IEEE Advancing Technology for Humanity，出版周期: ，期刊类型: journal
基本数据	创刊年份: 2014，原创研究文献占比: 99.20%，自引率:14.60%， Gold OA占比: 18.71%

期刊引文格式

这些示例是对学术期刊文章的引用，以及它们应该如何出现在您的参考文献中。

并非所有期刊都按卷和期组织其已发表的文章，因此这些字段是可选的。有些电子期刊不提供页面范围，而是列出文章标识符。在这种情况下，使用文章标识符而不是页面范围是安全的。

只有1位作者的期刊

有2位作者的期刊

有3位作者的期刊

有5位以上作者的期刊

书籍引用格式

以下是创作和编辑的书籍的参考文献的示例。

学位论文引用格式

网页引用格式

这些示例是对网页的引用，以及它们应该如何出现在您的参考文献中。

专利引用格式

Sequence Labeling as Non-autoregressive Dual-Query Set Generation

2024-1-1

R 2: A Novel Recall & Ranking Framework for Legal Judgment Prediction

2024-1-1

Detecting the Presence of Sperm Whales' Echolocation Clicks in Noisy Environments

2024-1-1

Chinese NER Using Multi-View Transformer

2024-1-1

Adjustable Coherent-to-Diffuse Power Estimator for Binaural Speech Enhancement in Multi-Talker Environments

2024-1-1

Large-scale unsupervised audio pre-training for video-to-speech synthesis

2024-1-1

Constant Elevation-Beamwidth Beamforming with Concentric Ring Arrays

2024-1-1

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix

2024-1-1

APCodec: A Neural Audio Codec With Parallel Amplitude and Phase Spectrum Encoding and Decoding

2024-1-1

Towards Generating Diverse Audio Captions Via Adversarial Training

2024-1-1

Written Term Detection Improves Spoken Term Detection

2024-1-1

Unsupervised Face-Mask Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics

2024-1-1

A Prompt-based Hierarchical Pipeline for Cross-domain Slot Filling

2024-1-1

Multi-channel Conversational Speaker Separation via Neural Diarization

2024-1-1

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

2024-1-1

Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness

2024-1-1

Design of Fully Steerable Differential Beamformers with Linear Superarrays

2024-1-1

On Semi-blind Source Separation-based Approaches to Nonlinear Echo Cancellation Based on Bilinear Alternating Optimization

2024-1-1

N-Gram Nearest Neighbor Machine Translation

2024-1-1

Multi-Agent Deep Learning for the Detection of Multiple Speech Steganography Methods

2024-1-1

MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

2024-1-1

A Two-Stage Approach to Quality Restoration of Bone-Conducted Speech

2024-1-1

Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation

2024-1-1

Multi-Channel Speech Separation Using Spatially Selective Deep Non-Linear Filters

2024-1-1

Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-Hour Scale

2024-1-1

Partitioning Attention Weight: Mitigating Adverse Effect of Incorrect Pseudo-Labels for Self-Supervised ASR

2024-1-1

Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction

2024-1-1

Learning Label-Adaptive Representation for Large-Scale Multi-Label Text Classification

2024-1-1

A Novel Multi-Head Self-Organized Operational Neural Network Architecture for Chronic Obstructive Pulmonary Disease Detection Using Lung Sounds

2024-1-1

Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition

2024-1-1

Harmonic Detection from Noisy Speech with Auditory Frame Gain for Intelligibility Enhancement

2024-1-1

A Compressive Sensing Approach for the Reconstruction of the Soundfield Produced by Directive Sources in Reverberant Rooms

2024-1-1

End-to-End Speech Recognition: A Survey

2024-1-1

Attention-Based Speech Enhancement Using Human Quality Perception Modeling

2024-1-1

Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis

2024-1-1

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion

2024-1-1

Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors

2024-1-1

Generalizable Speech Spoofing Detection Against Silence Trimming with Data Augmentation and Multi-task Meta-Learning

2024-1-1

Artist Similarity based on Heterogeneous Graph Neural Networks

2024-1-1

Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks

2024-1-1

End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation

2024-1-1

Representation Learning With Hidden Unit Clustering for Low Resource Speech Applications

2024-1-1

Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering

2024-1-1

Fast and Accurate Incomplete Utterance Rewriting

2024-1-1

Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models

2024-1-1

Learning with an Open Horizon in Ever-Changing Dialogue Circumstances

2024-1-1

Multi-resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation

2024-1-1

Learning to Improve Out-of-Distribution Generalization via Self-adaptive Language Masking

2024-1-1

KDPG-Enhanced MRC Framework for Scientific Entity Recognition in Survey Papers

2024-1-1

A Variance-Preserving Interpolation Approach for Diffusion Models With Applications to Single Channel Speech Enhancement and Recognition

2024-1-1

帮你贴心管理全部的文献
研飞ivySCI，高效的论文管理

投稿经验分享

分享我的经验，帮你走得更远

IEEE/ACM Transactions on Audio Speech and Language Processing

期刊引文格式

只有1位作者的期刊

有2位作者的期刊

有3位作者的期刊

有5位以上作者的期刊

书籍引用格式

学位论文引用格式

网页引用格式

专利引用格式

最新文章

Sequence Labeling as Non-autoregressive Dual-Query Set Generation

<b> <i>R</i> </b> <sup>2</sup>: A Novel Recall &amp; Ranking Framework for Legal Judgment Prediction

Detecting the Presence of Sperm Whales' Echolocation Clicks in Noisy Environments

Chinese NER Using Multi-View Transformer

Adjustable Coherent-to-Diffuse Power Estimator for Binaural Speech Enhancement in Multi-Talker Environments

Large-scale unsupervised audio pre-training for video-to-speech synthesis

Constant Elevation-Beamwidth Beamforming with Concentric Ring Arrays

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix

APCodec: A Neural Audio Codec With Parallel Amplitude and Phase Spectrum Encoding and Decoding

Towards Generating Diverse Audio Captions Via Adversarial Training

Written Term Detection Improves Spoken Term Detection

Unsupervised Face-Mask Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics

A Prompt-based Hierarchical Pipeline for Cross-domain Slot Filling

Multi-channel Conversational Speaker Separation via Neural Diarization

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness

Design of Fully Steerable Differential Beamformers with Linear Superarrays

On Semi-blind Source Separation-based Approaches to Nonlinear Echo Cancellation Based on Bilinear Alternating Optimization

N-Gram Nearest Neighbor Machine Translation

Multi-Agent Deep Learning for the Detection of Multiple Speech Steganography Methods

MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

A Two-Stage Approach to Quality Restoration of Bone-Conducted Speech

Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation

Multi-Channel Speech Separation Using Spatially Selective Deep Non-Linear Filters

Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-Hour Scale

Partitioning Attention Weight: Mitigating Adverse Effect of Incorrect Pseudo-Labels for Self-Supervised ASR

Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction

Learning Label-Adaptive Representation for Large-Scale Multi-Label Text Classification

A Novel Multi-Head Self-Organized Operational Neural Network Architecture for Chronic Obstructive Pulmonary Disease Detection Using Lung Sounds

Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition

Harmonic Detection from Noisy Speech with Auditory Frame Gain for Intelligibility Enhancement

A Compressive Sensing Approach for the Reconstruction of the Soundfield Produced by Directive Sources in Reverberant Rooms

End-to-End Speech Recognition: A Survey

Attention-Based Speech Enhancement Using Human Quality Perception Modeling

Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion

Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors

Generalizable Speech Spoofing Detection Against Silence Trimming with Data Augmentation and Multi-task Meta-Learning

Artist Similarity based on Heterogeneous Graph Neural Networks

Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks

End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation

Representation Learning With Hidden Unit Clustering for Low Resource Speech Applications

Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering

Fast and Accurate Incomplete Utterance Rewriting

Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models

Learning with an Open Horizon in Ever-Changing Dialogue Circumstances

Multi-resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation

Learning to Improve Out-of-Distribution Generalization via Self-adaptive Language Masking

KDPG-Enhanced MRC Framework for Scientific Entity Recognition in Survey Papers

A Variance-Preserving Interpolation Approach for Diffusion Models With Applications to Single Channel Speech Enhancement and Recognition

帮你贴心管理全部的文献.css-5myebc{font-size:14px;font-weight:var(--chakra-fontWeights-normal);margin-top:var(--chakra-space-1);margin-bottom:var(--chakra-space-1);}研飞ivySCI，高效的论文管理

<b> <i>R</i> </b> <sup>2</sup>: A Novel Recall & Ranking Framework for Legal Judgment Prediction

帮你贴心管理全部的文献
研飞ivySCI，高效的论文管理