SPEECH FEATURES

Automating Whisper training and inference along with emotion recognition

Setup

Install Docker.

$ git clone https://github.com/VladimirKalajcidi/Speech-Features.git
$ cd Speech-Features
$ docker build -t stt .
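
If the build succeeds, the image appears in the local image list under the tag chosen above (stt):

$ docker images stt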

Usage

Extracting Speech Features

Execute the following commands in the Speech-Features directory.

$ docker run -it -d -v $(pwd):/app/ --net host --name stt stt
$ docker exec -it stt bash
root@hostname:/workspace# ./scripts/installation.sh
root@hostname:/workspace# python app.py -i audio.mp3 -o output.json -m base -t token

Arguments for app.py:

    -i: path to the input .mp3 file
    -o: path to the output .json file
    -m: model path, default = "base"; use "username/repo" for a custom model on the Hugging Face Hub
    -t: Hugging Face access token (a string of the form "hf_...")
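
For example, to transcribe a recording with a fine-tuned model hosted on the Hugging Face Hub (the repository name and token below are placeholders, not real values):

root@hostname:/workspace# python app.py -i audio.mp3 -o output.json -m username/whisper-finetuned -t hf_xxxxxxxxxxxx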

Model training

Execute the following commands in the Speech-Features directory.

$ docker run -it -d -v $(pwd):/app/ --net host --name stt stt
$ docker exec -it stt bash
root@hostname:/workspace# ./scripts/installation.sh
root@hostname:/workspace# dvc repro
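
dvc repro reruns the stages defined in dvc.yaml whose dependencies have changed. Before running the full pipeline, the standard DVC commands below can be used to inspect it (the stage name "training" is an assumption; check dvc.yaml for the actual stage names):

root@hostname:/workspace# dvc dag              # show the pipeline graph
root@hostname:/workspace# dvc status           # list stages with changed dependencies
root@hostname:/workspace# dvc repro training   # rerun a single stage by name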

Converting a trained OpenAI Whisper model to a CTranslate2 (CT2) model

Execute the following commands in the Speech-Features directory.

$ docker run -it -d -v $(pwd):/app/ --net host --name stt stt
$ docker exec -it stt bash
root@hostname:/workspace# ./scripts/installation.sh
root@hostname:/workspace# ./scripts/convert.sh artifacts/training/model whisper-ct2 username/repo

Arguments for convert.sh:

    1: path to the trained model
    2: local path for the converted model
    3: Hugging Face repository to upload to (username/repo)
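
The script itself is not reproduced here, but a minimal sketch of what convert.sh might do, assuming it wraps CTranslate2's ct2-transformers-converter and the Hugging Face CLI (the actual script under scripts/ may differ):

#!/bin/bash
# Hypothetical sketch, not the repository's actual convert.sh
MODEL_PATH="$1"   # 1: path to the trained model
CT2_PATH="$2"     # 2: local path for the converted model
HF_REPO="$3"      # 3: Hugging Face repository to upload to

# Convert the Transformers checkpoint to CTranslate2 format
ct2-transformers-converter --model "$MODEL_PATH" --output_dir "$CT2_PATH" --quantization float16

# Upload the converted model (requires prior authentication, e.g. huggingface-cli login)
huggingface-cli upload "$HF_REPO" "$CT2_PATH" .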
