Sequence Labeling

Description

use TIMIT dataset to predict phoneme sequences using provided mfcc or fbank features

Project Link

Requirements

keras
tensorflow
python3
h5py
sklearn

Dataset

TIMIT Dataset
Features: mfcc and fbank
Labels: 48 kinds of phones

Pre-Processing

Label Preprocessing

phone mapping 48 -> 39
converting sequences to one hot encodings
padding

Features Preprocessing

standardization
padding

Post-Processing

convert phoneme to alphabet
remove consecutive duplicates using a threshold
trim the 'sil' character

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
README.md		README.md
comparison.png		comparison.png
main.py		main.py
model.py		model.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Sequence Labeling

Description

Requirements

Dataset

Pre-Processing

Post-Processing

Results

About

Uh oh!

Releases

Packages

Languages

dchenam/sequence-labeling

Folders and files

Latest commit

History

Repository files navigation

Sequence Labeling

Description

Requirements

Dataset

Pre-Processing

Post-Processing

Results

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages