Language:
English
簡体中文
繁體中文
Help
Login
Create an account
Back
Switch To:
Labeled
|
MARC Mode
|
ISBD
Articulation and Intelligibility
Record Type:
Electronic resources : Monograph/item
Title/Author:
Articulation and Intelligibility/ by Jont B. Allen.
Author:
Allen, Jont B.
Description:
XIII, 124 p.online resource. :
Contained By:
Springer Nature eBook
Subject:
Electrical engineering. -
Online resource:
Fulltext (查閱電子書全文)
ISBN:
9783031025549
Articulation and Intelligibility
Allen, Jont B.
Articulation and Intelligibility
[electronic resource] /by Jont B. Allen. - 1st ed. 2005. - XIII, 124 p.online resource. - Synthesis Lectures on Speech and Audio Processing,1932-1678. - Synthesis Lectures on Speech and Audio Processing,.
Introduction -- Articulation -- Intelligibility -- Discussion with Historical Context.
Immediately following the Second World War, between 1947 and 1955, several classic papers quantified the fundamentals of human speech information processing and recognition. In 1947 French and Steinberg published their classic study on the articulation index. In 1948 Claude Shannon published his famous work on the theory of information. In 1950 Fletcher and Galt published their theory of the articulation index, a theory that Fletcher had worked on for 30 years, which integrated his classic works on loudness and speech perception with models of speech intelligibility. In 1951 George Miller then wrote the first book Language and Communication, analyzing human speech communication with Claude Shannon's just published theory of information. Finally in 1955 George Miller published the first extensive analysis of phone decoding, in the form of confusion matrices, as a function of the speech-to-noise ratio. This work extended the Bell Labs' speech articulation studies with ideas from Shannon's Information theory. Both Miller and Fletcher showed that speech, as a code, is incredibly robust to mangling distortions of filtering and noise. Regrettably much of this early work was forgotten. While the key science of information theory blossomed, other than the work of George Miller, it was rarely applied to aural speech research. The robustness of speech, which is the most amazing thing about the speech code, has rarely been studied. It is my belief (i.e., assumption) that we can analyze speech intelligibility with the scientific method. The quantitative analysis of speech intelligibility requires both science and art. The scientific component requires an error analysis of spoken communication, which depends critically on the use of statistics, information theory, and psychophysical methods. The artistic component depends on knowing how to restrict the problem in such a way that progress may be made. It is critical to tease out the relevant from the irrelevant and dig for the key issues. This will focus us on the decoding of nonsense phonemes with no visual component, which have been mangled by filtering and noise. This monograph is a summary and theory of human speech recognition. It builds on and integrates the work of Fletcher, Miller, and Shannon. The long-term goal is to develop a quantitative theory for predicting the recognition of speech sounds. In Chapter 2 the theory is developed for maximum entropy (MaxEnt) speech sounds, also called nonsense speech. In Chapter 3, context is factored in. The book is largely reflective, and quantitative, with a secondary goal of providing an historical context, along with the many deep insights found in these early works.
ISBN: 9783031025549
Standard No.: 10.1007/978-3-031-02554-9doiSubjects--Topical Terms:
423914
Electrical engineering.
LC Class. No.: TK1-9971
Dewey Class. No.: 621.3
Articulation and Intelligibility
LDR
:04074nmm a22003735i 4500
001
349683
003
DE-He213
005
20220601133058.0
007
cr nn 008mamaa
008
230512s2005 sz | s |||| 0|eng d
020
$a
9783031025549
$9
978-3-031-02554-9
024
7
$a
10.1007/978-3-031-02554-9
$2
doi
035
$a
978-3-031-02554-9
050
4
$a
TK1-9971
072
7
$a
THR
$2
bicssc
072
7
$a
TEC007000
$2
bisacsh
072
7
$a
THR
$2
thema
082
0 4
$a
621.3
$2
23
100
1
$a
Allen, Jont B.
$e
author.
$4
aut
$4
http://id.loc.gov/vocabulary/relators/aut
$3
424099
245
1 0
$a
Articulation and Intelligibility
$h
[electronic resource] /
$c
by Jont B. Allen.
250
$a
1st ed. 2005.
264
1
$a
Cham :
$b
Springer International Publishing :
$b
Imprint: Springer,
$c
2005.
300
$a
XIII, 124 p.
$b
online resource.
336
$a
text
$b
txt
$2
rdacontent
337
$a
computer
$b
c
$2
rdamedia
338
$a
online resource
$b
cr
$2
rdacarrier
347
$a
text file
$b
PDF
$2
rda
490
1
$a
Synthesis Lectures on Speech and Audio Processing,
$x
1932-1678
505
0
$a
Introduction -- Articulation -- Intelligibility -- Discussion with Historical Context.
520
$a
Immediately following the Second World War, between 1947 and 1955, several classic papers quantified the fundamentals of human speech information processing and recognition. In 1947 French and Steinberg published their classic study on the articulation index. In 1948 Claude Shannon published his famous work on the theory of information. In 1950 Fletcher and Galt published their theory of the articulation index, a theory that Fletcher had worked on for 30 years, which integrated his classic works on loudness and speech perception with models of speech intelligibility. In 1951 George Miller then wrote the first book Language and Communication, analyzing human speech communication with Claude Shannon's just published theory of information. Finally in 1955 George Miller published the first extensive analysis of phone decoding, in the form of confusion matrices, as a function of the speech-to-noise ratio. This work extended the Bell Labs' speech articulation studies with ideas from Shannon's Information theory. Both Miller and Fletcher showed that speech, as a code, is incredibly robust to mangling distortions of filtering and noise. Regrettably much of this early work was forgotten. While the key science of information theory blossomed, other than the work of George Miller, it was rarely applied to aural speech research. The robustness of speech, which is the most amazing thing about the speech code, has rarely been studied. It is my belief (i.e., assumption) that we can analyze speech intelligibility with the scientific method. The quantitative analysis of speech intelligibility requires both science and art. The scientific component requires an error analysis of spoken communication, which depends critically on the use of statistics, information theory, and psychophysical methods. The artistic component depends on knowing how to restrict the problem in such a way that progress may be made. It is critical to tease out the relevant from the irrelevant and dig for the key issues. This will focus us on the decoding of nonsense phonemes with no visual component, which have been mangled by filtering and noise. This monograph is a summary and theory of human speech recognition. It builds on and integrates the work of Fletcher, Miller, and Shannon. The long-term goal is to develop a quantitative theory for predicting the recognition of speech sounds. In Chapter 2 the theory is developed for maximum entropy (MaxEnt) speech sounds, also called nonsense speech. In Chapter 3, context is factored in. The book is largely reflective, and quantitative, with a secondary goal of providing an historical context, along with the many deep insights found in these early works.
650
0
$a
Electrical engineering.
$3
423914
650
0
$a
Signal processing.
$3
423975
650
0
$a
Acoustical engineering.
$3
424101
650
1 4
$a
Electrical and Electronic Engineering.
$3
423916
650
2 4
$a
Signal, Speech and Image Processing .
$3
423976
650
2 4
$a
Engineering Acoustics.
$3
424102
710
2
$a
SpringerLink (Online service)
$3
423502
773
0
$t
Springer Nature eBook
776
0 8
$i
Printed edition:
$z
9783031014260
776
0 8
$i
Printed edition:
$z
9783031036828
830
0
$a
Synthesis Lectures on Speech and Audio Processing,
$x
1932-1678
$3
424100
856
4 0
$u
https://doi.org/10.1007/978-3-031-02554-9
$z
Fulltext (查閱電子書全文)
912
$a
ZDB-2-SXSC
950
$a
Synthesis Collection of Technology (R0) (SpringerNature-85007)
based on 0 review(s)
Reviews
Add a review
and share your thoughts with other readers
Export
pickup library
Processing
...
Change password
Login