Digital Speech Processing Course (Winter 2009)

No Cheating Policy:

        No cheating declaration: (No Cheating Policy.pdf)

Class Schedule: (Course Outline (Winter 2009).pdf)

Monograph on Digital Speech Processing: (Introduction to Digital Speech processing.pdf)

Lectures:

Introductory Material: (basic course material_winter_2009.pdf) : 6-to-a-page: (basic course material_winter_2009_6tp.pdf)

Lecture 1: Introduction to Digital Speech Processing: (Lecture 1.pdf) : 6-to-a-page: (Lecture 1_6tp.pdf)

        Chapter 1: (chapter1_1_9_2009.pdf)

Lecture 2: Review of DSP Fundamentals: (Lecture 2.pdf) : 6-to-a-page: (Lecture 2_6tp.pdf)

        Chapter 2: (chapter2_1_9_2009.pdf)

Lecture 3: Acoustic Theory of Speech Production: (Lecture 3.pdf) : 6-to-a-page: (Lecture 3_6tp.pdf)

        Chapter 3: (chapter3_1_9_2009.pdf)

Lecture 4: Speech Perception--Auditory Models, Sound Perception Models, MOS Methods: (Lecture 4.pdf) : 6-to-a-page: (Lecture 4_6tp.pdf)

        Chapter 4: (chapter4_1_9_2009.pdf)

Lectures 5-6: Sound Propagation in the Vocal Tract: (Lectures 5-6.pdf) : 6-to-a-page: (Lectures 5-6_6tp.pdf)

        Chapter 5: (chapter5_1_9_2009.pdf)

Lectures 7-8: Time Domain Methods in Speech Processing: (Lectures 7-8.pdf) : 6-to-a-page: (Lectures 7-8_6tp.pdf)

Methods of Pitch Period Estimation:

        Chapter 6: (chapter6_1_9_2009.pdf)

Lecture 9: Short-Time Fourier Transform (STFT) Concepts: (Lecture9.pdf) : 6-to-a-page: (Lecture9_6tp.pdf)

Lecture 10: Short-Time Fourier Analysis Methods--Filter Bank Summation and Overlap Add: (Lecture10.pdf) : 6-to-a-page: (Lecture10_6tp.pdf)

Lecture 11: Speech Representations Based on STFT Analysis-Synthesis Methods: (Lecture11.pdf) : 6-to-a-page: (Lecture11_6tp.pdf)

        Chapter 7: (chapter7_1_9_2009.pdf)

Lecture 12: Homomorphic Speech Processing: (Lecture 12.pdf) : 6-to-a-page: (Lecture 12_6tp.pdf)

        Chapter 8: (chapter8_1_9_2009.pdf)

Lecture 13: Linear Predictive Coding (LPC) Methods: (Lecture 13.pdf) : 6-to-a-page: (Lecture 13_6tp.pdf)

Lecture 14: LPC--Frequency Domain Interpretations, Methods for Synthesis and Vocoding: (Lecture 14.pdf) : 6-to-a-page: (Lecture 14_6tp.pdf)

        Chapter 9: (chapter9_1_9_2009.pdf)

Lecture Algorithms: (Lecture_algorithms.pdf) : 6-to-a-page: (Lecture_algorithms_6tp.pdf)

        Chapter 10: (chapter10_1_9_2009.pdf)

Lecture 15: Speech Waveform Coding--Uniform and Non-Uniform Quantization: (Lecture 15_extensions.pdf) : 6-to-a-page: (Lecture 15_extensions_6tp.pdf)

        Chapter 11: (chapter11_1_9_2009.pdf)

Lecture 16: Speech Waveform Coding--Adaptive and Differential Quantization: (Lecture 16_extensions.pdf) : 6-to-a-page: (Lecture 16_extensions_6tp.pdf)

Lecture 17: Speech Coding Methods--Model-Based Approaches: (Lecture 17_extensions.pdf) : 6-to-a-page: (Lecture 17_extensions_6tp.pdf)

        Chapter 12: (chapter12_1_9_2009.pdf)

        Chapter 13: (chapter13_1_9_2009.pdf)

        Chapter 14: (chapter14_1_9_2009.pdf)

 


Homework:

 

Problem Set 1: (ps1.pdf); PS1 solution: (ps1_soln.pdf)

Problem Set 2: (ps2.pdf); PS2 solution: (ps2_soln.pdf)

Problem Set 3: (ps3.pdf); PS3 solution: (ps3_soln.pdf)

Problem Set 4: (ps4.pdf); PS4 solution: (ps4_soln.pdf)

Problem Set 5: (ps5.pdf); PS5 solution: (ps5_soln.pdf)

Problem Set 6: (ps6.pdf); PS6 solution: (ps6_soln.pdf)

Problem Set 7: (ps7.pdf); PS7 solution: (ps7_soln.pdf)

Problem Set 8: (ps8.pdf); PS8 solution: (ps8_soln.pdf)


Speech Files:

 

test_16k.wav: (test_16k.wav)

ah.wav: (ah.wav)

should.wav: (should.wav)

s1.wav: (s1.wav); pitch period contour for s1.wav: (pp1.mat)

s2.wav: (s2.wav); pitch period contour for s2.wav: (pp2.mat)

s3.wav: (s3.wav); pitch period contour for s3.wav: (pp3.mat)

s4.wav: (s4.wav); pitch period contour for s4.wav: (pp4.mat)

s5.wav: (s5.wav); pitch period contour for s5.wav: (pp5.mat)

s6.wav: (s6.wav); pitch period contour for s6.wav: (pp6.mat)

we_were: (we were away a year ago_lrr.wav)

isolated digit training files: (digits_train.zip)

isolated digit testing files: (digits_test.zip)

isolated digit training files (raw, no endpoints marked): (digits_train_raw.zip)

isolated digit testing files (raw, no endpoints marked): (digits_test_raw.zip)

 


MATLAB Files:

 

loadwav.m: (loadwav.m)

savewav.m: (savewav.m)

loadraw.m: (loadraw.m)

saveraw.m: (saveraw.m)

grayscale.m: (grayscale.m)

fxquant.m: (fxquant.m)

pspect.m: (pspect.m)

spectgr.m: (spectgr.m)

LPC solutions: (cholesky_full.m), (durbin.m), (lattice.m)

 


 

Project Suggestions:

 

            General Project Suggestions: (Digital Speech Processing Projects.pdf)

            LPC Vocoder Project Details: (LPC Vocoder Project.pdf)

            Project Schedule (UCSB-2009):

            User Interface Example (Sound Spectrograms): (GUI_plot_spectrogram_ucsb.m), (select_dir.m)