IEEE/ACM Transactions on Audio Speech and Language Processing

短名IEEE/ACM Trans. Audio Speech Lang. Process.
Journal Impact4.09
国际分区ENGINEERING, ELECTRICAL & ELECTRONIC(Q2)
期刊索引SCI Q1中科院 2 区
ISSN2329-9290, 2329-9304
h-index81
国内分区计算机科学(2区)计算机科学声学(1区)计算机科学工程电子与电气(2区)
Top期刊

IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING 涵盖了音频、语音和语言处理以及支持它们的科学。在音频处理方面:换能器、室内声学、有源声音控制、人类听觉、音乐分析/合成/编码,以及消费类音频。在语音处理方面:语音分析、合成、编码、语音和说话人识别、语音产生和感知以及语音增​​强等领域。在语言处理方面:语音和文本分析、理解、生成、对话管理、翻译、摘要、问答和文档索引和检索,以及通用语言建模。

涉及主题计算机科学人工智能工程类语音识别数学物理电气工程电信哲学声学计算机视觉统计心理学机器学习语言学量子力学生物程序设计语言算法自然语言处理认知心理学
出版信息出版商: IEEE Advancing Technology for Humanity出版周期: 期刊类型: journal
基本数据创刊年份: 2014原创研究文献占比99.20%自引率:14.60%Gold OA占比: 18.71%

期刊引文格式

这些示例是对学术期刊文章的引用,以及它们应该如何出现在您的参考文献中。

并非所有期刊都按卷和期组织其已发表的文章,因此这些字段是可选的。有些电子期刊不提供页面范围,而是列出文章标识符。在这种情况下,使用文章标识符而不是页面范围是安全的。

只有1位作者的期刊

有2位作者的期刊

有3位作者的期刊

有5位以上作者的期刊

书籍引用格式

以下是创作和编辑的书籍的参考文献的示例。

学位论文引用格式

网页引用格式

这些示例是对网页的引用,以及它们应该如何出现在您的参考文献中。

专利引用格式

最新文章

Sequence Labeling as Non-autoregressive Dual-Query Set Generation

2024-1-1

<b> <i>R</i> </b> <sup>2</sup>: A Novel Recall &amp; Ranking Framework for Legal Judgment Prediction

2024-1-1

Detecting the Presence of Sperm Whales' Echolocation Clicks in Noisy Environments

2024-1-1

Chinese NER Using Multi-View Transformer

2024-1-1

Adjustable Coherent-to-Diffuse Power Estimator for Binaural Speech Enhancement in Multi-Talker Environments

2024-1-1

Large-scale unsupervised audio pre-training for video-to-speech synthesis

2024-1-1

Constant Elevation-Beamwidth Beamforming with Concentric Ring Arrays

2024-1-1

Distinctive and Natural Speaker Anonymization via Singular Value Transformation-Assisted Matrix

2024-1-1

APCodec: A Neural Audio Codec With Parallel Amplitude and Phase Spectrum Encoding and Decoding

2024-1-1

Towards Generating Diverse Audio Captions Via Adversarial Training

2024-1-1

Written Term Detection Improves Spoken Term Detection

2024-1-1

Unsupervised Face-Mask Speech Enhancement Using Generative Adversarial Networks with Human-in-the-Loop Assessment Metrics

2024-1-1

A Prompt-based Hierarchical Pipeline for Cross-domain Slot Filling

2024-1-1

Multi-channel Conversational Speaker Separation via Neural Diarization

2024-1-1

CMGAN: Conformer-Based Metric-GAN for Monaural Speech Enhancement

2024-1-1

Disentangled Text Representation Learning With Information-Theoretic Perspective for Adversarial Robustness

2024-1-1

Design of Fully Steerable Differential Beamformers with Linear Superarrays

2024-1-1

On Semi-blind Source Separation-based Approaches to Nonlinear Echo Cancellation Based on Bilinear Alternating Optimization

2024-1-1

N-Gram Nearest Neighbor Machine Translation

2024-1-1

Multi-Agent Deep Learning for the Detection of Multiple Speech Steganography Methods

2024-1-1

MOSA: Music Motion with Semantic Annotation Dataset for Cross-Modal Music Processing

2024-1-1

A Two-Stage Approach to Quality Restoration of Bone-Conducted Speech

2024-1-1

Improving Speech Translation Accuracy and Time Efficiency with Fine-tuned wav2vec 2.0-based Speech Segmentation

2024-1-1

Multi-Channel Speech Separation Using Spatially Selective Deep Non-Linear Filters

2024-1-1

Principled Comparisons for End-to-End Speech Recognition: Attention vs Hybrid at the 1000-Hour Scale

2024-1-1

Partitioning Attention Weight: Mitigating Adverse Effect of Incorrect Pseudo-Labels for Self-Supervised ASR

2024-1-1

Blind and Spatially-Regularized Online Joint Optimization of Source Separation, Dereverberation, and Noise Reduction

2024-1-1

Learning Label-Adaptive Representation for Large-Scale Multi-Label Text Classification

2024-1-1

A Novel Multi-Head Self-Organized Operational Neural Network Architecture for Chronic Obstructive Pulmonary Disease Detection Using Lung Sounds

2024-1-1

Optimizing Audio-Visual Speech Enhancement Using Multi-Level Distortion Measures for Audio-Visual Speech Recognition

2024-1-1

Harmonic Detection from Noisy Speech with Auditory Frame Gain for Intelligibility Enhancement

2024-1-1

A Compressive Sensing Approach for the Reconstruction of the Soundfield Produced by Directive Sources in Reverberant Rooms

2024-1-1

End-to-End Speech Recognition: A Survey

2024-1-1

Attention-Based Speech Enhancement Using Human Quality Perception Modeling

2024-1-1

Text-Inductive Graphone-Based Language Adaptation for Low-Resource Speech Synthesis

2024-1-1

EfficientTTS 2: Variational End-to-End Text-to-Speech Synthesis and Voice Conversion

2024-1-1

Multichannel Linear Prediction-Based Speech Dereverberation Considering Sparse and Low-Rank Priors

2024-1-1

Generalizable Speech Spoofing Detection Against Silence Trimming with Data Augmentation and Multi-task Meta-Learning

2024-1-1

Artist Similarity based on Heterogeneous Graph Neural Networks

2024-1-1

Unifying Structure Reasoning and Language Pre-Training for Complex Reasoning Tasks

2024-1-1

End-to-End Deep Learning-Based Adaptation Control for Linear Acoustic Echo Cancellation

2024-1-1

Representation Learning With Hidden Unit Clustering for Low Resource Speech Applications

2024-1-1

Controllable Accented Text-to-Speech Synthesis With Fine and Coarse-Grained Intensity Rendering

2024-1-1

Fast and Accurate Incomplete Utterance Rewriting

2024-1-1

Dynamic Convolutional Neural Networks as Efficient Pre-trained Audio Models

2024-1-1

Learning with an Open Horizon in Ever-Changing Dialogue Circumstances

2024-1-1

Multi-resolution Convolutional Residual Neural Networks for Monaural Speech Dereverberation

2024-1-1

Learning to Improve Out-of-Distribution Generalization via Self-adaptive Language Masking

2024-1-1

KDPG-Enhanced MRC Framework for Scientific Entity Recognition in Survey Papers

2024-1-1

A Variance-Preserving Interpolation Approach for Diffusion Models With Applications to Single Channel Speech Enhancement and Recognition

2024-1-1

帮你贴心管理全部的文献

研飞ivySCI,高效的论文管理

投稿经验分享

分享我的经验,帮你走得更远

Built withby Ivy Science
Copyright © 2020-2024
版权所有:南京青藤格致信息科技有限公司