avenstack/ComfyUI-AV-FunASR

ComfyUI FunAsr Nodes

Speech recognition with FunASR, with output as plain text or subtitle files.

workflow

Updates

  • [ 1.0.1 ] 1. Faster execution; 2. Improved workflow; 3. Subtitles are now saved with UTF-8 encoding by default

Features

  • Speech recognition
  • Speech timestamp prediction
  • Speech-to-subtitle conversion
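
To illustrate the speech-to-subtitle feature, here is a minimal sketch of turning timestamped segments into an SRT file saved as UTF-8. It is not the node's actual implementation: the function names `ms_to_srt_time`, `segments_to_srt` and `save_srt` are hypothetical, and the input shape (a list of `(start_ms, end_ms, text)` tuples with times in milliseconds) is an assumption about what the recognition and timestamp steps produce.

```python
from pathlib import Path


def ms_to_srt_time(ms):
    """Format a millisecond offset as an SRT timestamp (HH:MM:SS,mmm)."""
    hours, rem = divmod(int(ms), 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    seconds, millis = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def segments_to_srt(segments):
    """Build SRT text from (start_ms, end_ms, text) tuples (assumed input shape)."""
    blocks = []
    for index, (start_ms, end_ms, text) in enumerate(segments, start=1):
        time_range = f"{ms_to_srt_time(start_ms)} --> {ms_to_srt_time(end_ms)}"
        blocks.append(f"{index}\n{time_range}\n{text.strip()}\n")
    # Cues are separated by one blank line.
    return "\n".join(blocks)


def save_srt(segments, path):
    """Write the subtitles as UTF-8, matching the default noted in 1.0.1."""
    Path(path).write_text(segments_to_srt(segments), encoding="utf-8")
```

For example, `save_srt([(0, 1500, "你好"), (1500, 3000, "世界")], "out.srt")` would write two numbered cues to `out.srt`.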

Installation

cd ComfyUI/custom_nodes
git clone https://github.com/avenstack/ComfyUI-AV-FunASR.git
cd ComfyUI-AV-FunASR
pip install -r requirements.txt

Model downloads

  1. Speech recognition
  2. Voice activity detection (VAD)
  3. Speech timestamp prediction

Model directory: models/ASR/FunASR/iic

 iic
    ├── speech_fsmn_vad_zh-cn-16k-common-pytorch
    │   ├── README.md
    │   ├── am.mvn
    │   ├── config.yaml
    │   ├── configuration.json
    │   ├── example
    │   ├── fig
    │   └── model.pt
    ├── speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch
    │   ├── README.md
    │   ├── am.mvn
    │   ├── asr_example_hotword.wav
    │   ├── config.yaml
    │   ├── configuration.json
    │   ├── example
    │   ├── fig
    │   ├── model.pt
    │   ├── seg_dict
    │   └── tokens.json
    └── speech_timestamp_prediction-v1-16k-offline
        ├── README.md
        ├── am.mvn
        ├── config.yaml
        ├── configuration.json
        ├── example
        ├── model.pt
        ├── seg_dict
        └── tokens.json
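
Before running a workflow, it can help to confirm that the downloaded models match the layout above. The following is a small sketch, not part of this repository: `EXPECTED_MODELS` and `find_missing_files` are hypothetical names, and treating only the weight, config and vocabulary files as required (ignoring `README.md`, `example` and `fig`) is an assumption.

```python
from pathlib import Path

# Files assumed to be required for each model, taken from the tree above.
COMMON_FILES = ["am.mvn", "config.yaml", "configuration.json", "model.pt"]
EXPECTED_MODELS = {
    "speech_fsmn_vad_zh-cn-16k-common-pytorch": COMMON_FILES,
    "speech_seaco_paraformer_large_asr_nat-zh-cn-16k-common-vocab8404-pytorch":
        COMMON_FILES + ["seg_dict", "tokens.json"],
    "speech_timestamp_prediction-v1-16k-offline":
        COMMON_FILES + ["seg_dict", "tokens.json"],
}


def find_missing_files(model_root):
    """Return 'model_dir/file' entries that do not exist under model_root."""
    root = Path(model_root)
    missing = []
    for model_name, file_names in EXPECTED_MODELS.items():
        for file_name in file_names:
            if not (root / model_name / file_name).exists():
                missing.append(f"{model_name}/{file_name}")
    return missing


if __name__ == "__main__":
    # Assumes the script is run from the ComfyUI root directory.
    for entry in find_missing_files("models/ASR/FunASR/iic"):
        print("missing:", entry)
```

An empty result means every listed file was found; any printed entry points to a model that still needs to be downloaded or moved into place.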

Acknowledgements