Arabic-Whisper-CodeSwitching-Edition - 针对阿拉伯语和英语混合语音的优化识别模型

Arabic-Whisper-CodeSwitching-Edition项目介绍

项目概述

Arabic-Whisper-CodeSwitching-Edition是一款基于OpenAI的Whisper Large v2模型进行微调开发的语音识别模型。它专为处理包含英语词汇的阿拉伯语音频而设计，提升了Whisper Large v2在阿拉伯语和英语混合语言（即“代码切换”）语音识别中的性能。

开发者: العبد لله
模型类型: 语音识别
使用语言: 阿拉伯语和英语（在阿拉伯语环境中）
许可证: GPL-3.0

数据来源

Arabic-Whisper-CodeSwitching-Edition模型的训练使用了阿拉伯语-英语代码切换数据集，这个数据集可以在数据收集库中找到。此外，感兴趣的用户可以访问在线演示试用此模型。

使用方法

直接应用

这个模型可以直接用于转录包含英语词汇的阿拉伯语语音，特别适合在多语言环境中使用，当人们频繁进行语言代码切换时，这个模型能够处理这些复杂的语言转换场景。

不适用场景

当面对非阿拉伯语或英语的单语言语音，或实施包含阿拉伯语和英语之外的语言代码切换时，该模型可能表现欠佳。

偏见、风险和局限性

在使用该模型时，用户需要注意模型可能存在的偏见、风险和局限性。目前建议用户充分评估这些因素，以便做出更为知情的决策。

使用指导

以下是使用该模型的简单代码示例，方便用户快速上手：

from transformers import WhisperForConditionalGeneration, WhisperProcessor

processor = WhisperProcessor.from_pretrained("MohamedRashad/Arabic-Whisper-CodeSwitching-Edition")
model = WhisperForConditionalGeneration.from_pretrained("MohamedRashad/Arabic-Whisper-CodeSwitching-Edition")

# 示例使用
inputs = processor("path_to_audio_file.wav", return_tensors="pt")
generated_ids = model.generate(inputs["input_features"])
transcription = processor.batch_decode(generated_ids, skip_special_tokens=True)
print(transcription)

引用格式

如果在研究中使用了Arabic-Whisper-CodeSwitching-Edition模型，请引用以下文献：

BibTeX格式:

@misc{rashad2024arabicwhisper,
  title={Arabic-Whisper-CodeSwitching-Edition},
  author={Mohamed Rashad},
  year={2024},
  url={https://huggingface.co/spaces/MohamedRashad/Arabic-Whisper-CodeSwitching-Edition},
}

APA格式: Rashad, M. (2024). Arabic-Whisper-CodeSwitching-Edition. Retrieved from https://huggingface.co/spaces/MohamedRashad/Arabic-Whisper-CodeSwitching-Edition

通过以上介绍，希望更多的用户能对Arabic-Whisper-CodeSwitching-Edition项目有更深入的了解，并在合适的场景中充分利用这个工具。