Audio Signals (音频信号)
Audio signals are not limited to the sounds we can hear.

Audible Signal (可闻声)

Sound | Frequency (Hz)
Audible to Fish | 1~25
Audible to Human | 16~20000
Made by Human | 64~1300
Sensitive to Human | 1000~3000
Flute | 5000~8000
Bee Buzz with Full Load | 220
Bee Buzz without Load | 440
Audible to Dog | 38000
Thunder | <100
Most Significant in Thunder | 0.25~2
Sound Threshold of Pain

Description
Frequency: the number of sound periods (vibration cycles) per second, in Hz.
Period: the duration of one vibration, in seconds.
Speed:
$V_{\text{dry air}} = 331.5 + 0.6\,T_c$ (m/s), where $T_c$ is the air temperature in °C
$V_{\text{water}} = 1482$ (m/s), 20 °C
$V_{\text{sea water}} = 1522$ (m/s), 20 °C
$V_{\text{glass}} = 5640$ (m/s)
$V_{\text{steel}} = 5960$ (m/s)
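
The speed values above can be turned into a small calculation. The following is a minimal sketch, assuming the linear dry-air formula as given and treating the other media as fixed constants; the names `speed_of_sound_dry_air`, `SPEED_M_PER_S`, and `travel_time` are hypothetical and chosen only for illustration.

```python
def speed_of_sound_dry_air(temp_c: float) -> float:
    """Approximate speed of sound in dry air (m/s) at temp_c degrees Celsius."""
    return 331.5 + 0.6 * temp_c


# Fixed speeds from the list above (m/s); water and sea water are quoted at 20 °C.
SPEED_M_PER_S = {
    "water": 1482.0,
    "sea water": 1522.0,
    "glass": 5640.0,
    "steel": 5960.0,
}


def travel_time(distance_m: float, speed_m_per_s: float) -> float:
    """Time in seconds for sound to cover distance_m at a constant speed."""
    return distance_m / speed_m_per_s


v_air = speed_of_sound_dry_air(20.0)
print(f"dry air at 20 C: {v_air:.1f} m/s")  # 343.5 m/s
for medium, speed in SPEED_M_PER_S.items():
    print(f"{medium}: {travel_time(1000.0, speed) * 1000:.1f} ms per km")
```

Sound covers the same distance several times faster in water, glass, or steel than in air, which is why the medium has to be stated alongside any speed figure.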
Power (声功率): the energy emitted by a sound source per second (W).
Intensity (声强): the energy flowing through a unit area per second (W/m²).
Pressure (声压): the force the sound wave exerts on a unit area (Pa).
Power Level (声功率级): $L = 10 \log_{10}(W/W_0)$
Intensity Level (声强级): $L = 10 \log_{10}(I/I_0)$
Pressure Level (声压级): $L = 20 \log_{10}(P/P_0)$ ($P^2 = I \rho c$)
Reference Power: $W_0 = 10^{-12}$ (Watt)
Reference Intensity: $I_0 = 10^{-12}$ (Watt/m²)
Reference Pressure: $P_0 = 2 \times 10^{-5}$ (Pa)
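
The level formulas and reference values above translate directly into code, with the results expressed in decibels. This is a minimal sketch: the function names are hypothetical, and `RHO_C = 400` is an assumed round value for the product of air density and sound speed, picked so that `I0 * RHO_C` equals `P0**2`. Because intensity is proportional to pressure squared, the factor 10 for intensity becomes 20 for pressure, and both formulas give the same level for the same sound.

```python
import math

W0 = 1e-12  # reference power, W
I0 = 1e-12  # reference intensity, W/m^2
P0 = 2e-5   # reference pressure, Pa
RHO_C = 400.0  # ASSUMPTION: rounded rho * c for air, so that I0 * RHO_C == P0**2


def power_level(w: float) -> float:
    """Sound power level in dB relative to W0."""
    return 10.0 * math.log10(w / W0)


def intensity_level(i: float) -> float:
    """Sound intensity level in dB relative to I0."""
    return 10.0 * math.log10(i / I0)


def pressure_level(p: float) -> float:
    """Sound pressure level in dB relative to P0."""
    return 20.0 * math.log10(p / P0)


intensity = 1e-6                         # W/m^2
pressure = math.sqrt(intensity * RHO_C)  # from P^2 = I * rho * c
print(f"{intensity_level(intensity):.1f} dB")  # 60.0 dB
print(f"{pressure_level(pressure):.1f} dB")    # 60.0 dB
```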
Sound Superposition
$W_{\text{Total}} = W_1 + W_2$
$I_{\text{Total}} = I_1 + I_2$
$P_{\text{Total}} = \sqrt{P_1^2 + P_2^2}$
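
A practical consequence of the superposition rules above is that decibel values cannot be added directly: the underlying powers or intensities add, and the total is converted back to a level. The sketch below assumes the sources are incoherent (uncorrelated), which is the condition under which pressures combine as a root sum of squares; the function names are hypothetical.

```python
import math


def combine_levels_db(levels_db):
    """Total level (dB) of several incoherent sources given their levels in dB."""
    # Convert each level back to a power ratio, add the ratios, convert back.
    total_ratio = sum(10.0 ** (level / 10.0) for level in levels_db)
    return 10.0 * math.log10(total_ratio)


def combine_pressures(p1: float, p2: float) -> float:
    """Total pressure (Pa) of two incoherent sources: sqrt(p1^2 + p2^2)."""
    return math.hypot(p1, p2)


print(f"{combine_levels_db([60.0, 60.0]):.2f} dB")  # 63.01 dB, not 120 dB
print(combine_pressures(3.0, 4.0))                  # 5.0
```

Doubling the number of equal sources raises the level by about 3 dB, whichever of the three quantities is used to compute it.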
Music Acoustics

Pitch (音调): the frequency of a sound as perceived by the human ear.
Timbre (音色): the distinct characteristics that describe the perceptual quality of a sound.
Brightness (明亮度): a quality of sound related to the proportion of its energy distributed over high frequencies.
Rhythm/Tempo (节奏): the pattern in which beats are combined.
Melody (旋律): a set of tones structured together.
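
The brightness definition above can be made concrete as an energy ratio. The following is a minimal sketch rather than a standard measure: the 1000 Hz cutoff, the function names `dft_power` and `brightness`, and the two test tones are all assumptions made for illustration, and a naive DFT is used only to keep the example free of external libraries.

```python
import cmath
import math


def dft_power(samples):
    """Power of each non-negative-frequency bin, using a naive O(N^2) DFT."""
    n = len(samples)
    powers = []
    for k in range(n // 2 + 1):
        acc = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                  for t, x in enumerate(samples))
        powers.append(abs(acc) ** 2)
    return powers


def brightness(samples, sample_rate, cutoff_hz):
    """Fraction (0.0 to 1.0) of spectral energy at or above cutoff_hz."""
    powers = dft_power(samples)
    total = sum(powers)
    if total == 0:
        return 0.0
    n = len(samples)
    # Bin k corresponds to the frequency k * sample_rate / n.
    high = sum(p for k, p in enumerate(powers)
               if k * sample_rate / n >= cutoff_hz)
    return high / total


RATE, N = 8000, 256  # assumed sample rate (Hz) and signal length
low = [math.sin(2 * math.pi * 250 * t / RATE) for t in range(N)]
high = [math.sin(2 * math.pi * 2000 * t / RATE) for t in range(N)]
mix = [a + b for a, b in zip(low, high)]

print(f"low tone only: {brightness(low, RATE, 1000):.2f}")
print(f"equal mix:     {brightness(mix, RATE, 1000):.2f}")
```

A signal with all its energy below the cutoff scores near 0, and adding an equally strong high tone moves the score to about one half.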
Speech


Vocal Cords (声带): two small bands of muscle within the larynx (喉) that vibrate to produce the voice.
Vocal Tract (声道): the path the voice travels before it leaves the mouth.
Formant (共振峰): a characteristic acoustic resonance of the vocal tract.
Processing
Analysis: extract the identifying features (ID) of a sound.
Computational Auditory Scene Analysis
Instrument Modeling
Vocal Tract Modeling
Room Acoustics (Sound Propagation + Psychoacoustics)
Recognition: understand a sound by matching it against known IDs.
Automatic Music Transcription
Voice Recognition
Synthesis: make a machine sing or talk.
MIDI
Compression: raw audio data files are too large for secure and efficient transmission, whether online or wireless.
CD, MP3, RM, WMA, AIFF
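
To see how large uncompressed audio actually is, the sketch below computes the data rate of CD audio (44.1 kHz, 16 bits per sample, two channels) and compares a four-minute track with an MP3 of the same length. The 128 kbit/s MP3 rate and the four-minute duration are assumptions chosen for illustration, and the function names are hypothetical.

```python
def pcm_bit_rate(sample_rate_hz: int, bits_per_sample: int, channels: int) -> int:
    """Bit rate (bit/s) of uncompressed PCM audio."""
    return sample_rate_hz * bits_per_sample * channels


def size_bytes(bit_rate_bps: float, seconds: float) -> float:
    """Size in bytes of a stream with a constant bit rate."""
    return bit_rate_bps * seconds / 8


CD_RATE = pcm_bit_rate(44_100, 16, 2)  # 1_411_200 bit/s
MP3_RATE = 128_000                     # ASSUMPTION: a typical MP3 bit rate
DURATION_S = 4 * 60                    # ASSUMPTION: a four-minute track

cd_size = size_bytes(CD_RATE, DURATION_S)
mp3_size = size_bytes(MP3_RATE, DURATION_S)
print(f"CD audio: {cd_size / 1e6:.1f} MB")
print(f"MP3:      {mp3_size / 1e6:.1f} MB")
print(f"ratio:    {cd_size / mp3_size:.1f} : 1")
```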
Audio Device:
Microphone (Dynamic, Condenser, Ribbon)
Recorder
Storage Media (Tape, MD, LD, CD...)
Player
Speaker