Signalrepresentasjoner for automatisk talegjenkjenning (Signal representations for automatic speech recognition)

FFI-Report 2005

This publication is only available in Norwegian

About the publication

Report number

2005/01053

ISBN

82-464-0936-0

Format

PDF-document

Size

843.4 KB

Language

Norwegian

Speech recognition

Marius Gamborg, Frode Lillevold

In this report we give an overview of methods described in the literature for front-end processing of speech signals for automatic speech recognition (ASR). The most common representation of speech in this context seems to be mel-frequency cepstral coefficients (MFCCs) with delta and double-delta coefficients, usually combined with cepstral mean normalization (CMN). Other representations include perceptual linear prediction (PLP) and linear prediction cepstral coefficients (LPCC).
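
As a rough illustration of the post-processing named in the abstract, the sketch below applies cepstral mean normalization to a sequence of cepstral frames and then appends delta and double-delta coefficients. It assumes the cepstral coefficients have already been computed, and it uses the widely used regression formula for deltas with repeated edge frames; the report's own definitions may differ. All function names and the `width` parameter are hypothetical and chosen only for this example.

```python
from typing import List

# One inner list of cepstral coefficients per analysis frame (assumed input).
Frames = List[List[float]]


def cepstral_mean_normalize(frames: Frames) -> Frames:
    """Subtract each coefficient's mean over all frames (CMN)."""
    n_frames = len(frames)
    n_coeffs = len(frames[0])
    means = [sum(f[k] for f in frames) / n_frames for k in range(n_coeffs)]
    return [[f[k] - means[k] for k in range(n_coeffs)] for f in frames]


def delta(frames: Frames, width: int = 2) -> Frames:
    """Regression-based time derivative over +/- `width` frames.

    Frames beyond the start or end are replaced by the first or last frame.
    """
    n_frames = len(frames)
    denom = 2 * sum(n * n for n in range(1, width + 1))
    out = []
    for t in range(n_frames):
        row = []
        for k in range(len(frames[t])):
            acc = 0.0
            for n in range(1, width + 1):
                nxt = frames[min(t + n, n_frames - 1)][k]
                prv = frames[max(t - n, 0)][k]
                acc += n * (nxt - prv)
            row.append(acc / denom)
        out.append(row)
    return out


def add_dynamic_features(frames: Frames) -> Frames:
    """Return CMN-normalized static, delta and double-delta coefficients per frame."""
    normalized = cepstral_mean_normalize(frames)
    d1 = delta(normalized)
    d2 = delta(d1)  # double-delta: the same operator applied to the deltas
    return [s + a + b for s, a, b in zip(normalized, d1, d2)]
```

With 13 static coefficients per frame, `add_dynamic_features` would return 39 values per frame, the feature vector size often seen with this representation.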
