Lecture "Pattern Recognition and Machine Learning"

 

Basic Information
Lecturers: Gerhard Schmidt (lecture) and Viktoriia Boichenko (exercise)
Room: KS2/Geb.F - SR-III
E-mail: This email address is being protected from spambots. You need JavaScript enabled to view it.
Language: English
Target group: Students in electrical engineering and computer engineering
Prerequisites: Basics in system theory
Contents:

In this lecture the basics of speech, audio, and music signal processing are treated. Often schemes that are based on statistical optimization are utilized for these applications. The involved cost function are matched to the human audio perception.

Topic overview:

  • Preprocessing to reduce signal distortions
    • Noise reduction
    • Beamforming
  • Feature extraction and data compression
  • Pattern recognition and data regression
    • Learning types
      • Unsupervised learning
      • Supervised learning
      • Reinformcement learning
    • Codebooks
    • Object-to-vector conversion
    • Gaussian mixture models (GMMs)
    • Artificial Neural Networks (ANNs)
      • Multilayer perceptron
      • Convolutional neuronal networks (CNNs)
      • Recurrent neural networks (RNNs)
      • Autoencoder networks
      • Generative adversarial networks (GANs)
    • Hidden Markov models (HMMs)
    • Explainable artificial intelligence (EAI)

 

News

No news yet.

 

Schedule

The following schedule regarding lectures and excercises is preliminary and may be adapted during the semester. The lecture will usually take place from 8:15 h - 10:45 h.

Date Lecture Exercise
23.10.2024 Introduction -
30.10.2024 Noise Suppression + Beamforming Noise Suppression (video only)
06.11.2024 Beamforming + Feature Extraction Noise Suppression (video and exercise in the lecture room)
13.11.2024 Feature Extraction + Codebook Training Beamforming (video and exercise in the lecture room)
20.11.2024 Codebook Training + Object-to-vector Conversion + Bandwidth Extension Feature Extraction + Codebook Training (video and exercise in the lecture room)
27.11.2024 Bandwidth Extension Codebook Training (video and exercise in the lecture room)
04.12.2024 Gaussian Mixture Models Bandwidth Extension (video and exercise in the lecture room)
11.12.2024 Neural Networks - Part 1 Gaussian Mixture Models (video and exercise in the lecture room)
18.12.2024 Neural Networks - Part 2 Neural Networks (video and exercise in the lecture room)
08.01.2025 Student talks Student talks
15.01.2025 Hidden Markov Models Hidden Markov Models (video and exercise in the lecture room)
22.01.2025 Explainable artificial intelligence Speaker and Speech Recognition (video and exercise in the lecture room)

 

Lecture Slides

Link Content
Slides of the lecture "Introduction"
(Introduction, boundary conditions of the lecture, applications)
Slides of the lecture "Noise Suppression"
(Noise suppression, dereverberation, speech reconstruction)
Slides of the lecture "Beamforming"
(Fixed and adaptive beamforming, postfiltering)
Slides of the lecture "Feature Extraction"
(Linear prediction, cepstrum, mel-filtered cepstral coefficients)
Slides of the lecture "Codebook Training"
(K-means algorithm, LBG algorithms)
Slides of the lecture "Object-to-vector Conversion (slides not available yet)"
(Word2Vec)
Slides of the lecture "Bandwidth Extension"
(Model-bases approaches, evaluation)
Slides of the lecture "Gaussian Mixture Models (GMMs)"
(Training with the EM algorithm, applications)
Slides of the lecture "Neural networks"
(Network types, training procecures)
Slides of the lecture "Hidden Markov Models (HMMs)"
(Efficient probability calculation, training of HMMs)
Slides of the lecture "Explainable artificial intelligence"
(Understanding what had contributed mainly to the decision of neural networks, etc.)

 

Matlab Demos

Link Content
Matlab demo (GUI based) for adaptive noise suppression
Matlab demo (GUI based) for linear prediction

 

Exercises

For each lecture topic on-demand video will be provided. There will be an in-presence exercises to discuss your questions and requested topics. You may send in questions (if you want the answer to be supported by slides) or exercise topic suggestions to This email address is being protected from spambots. You need JavaScript enabled to view it. at any time.

Video Content Material

Noise suppression:

  • Wiener Filter summary
  • comprehension questions
  • python demo

Beamforming:

  • beamforming summary
  • comprehension questions
  • python demo

Feature extraction:

  • linear prediction
  • LPC, LPCC, MFCC features
  • comprehension questions
  • python demo

Codebook training:

  • codebooks
  • codebook training
  • comprehension questions
  • python demo

Bandwidth extension:

  • bandwidth extension summary
  • comprehension questions
  • python demo

Gaussian Mixture Models:

  • Gaussian Mixture Models
  • EM algorithm
  • comprehension questions
  • python demo

Neural Networks:

  • neural networks summary
  • comprehension questions
  • python demo

Hidden Markov Models:

  • hidden markov models summary
  • comprehension questions
  • python demo

Speaker and Speech Recognition:

  • speaker and speech recognition summary
  • comprehension questions
  • python demo

 

Talks

Each student will give a talk about a certain topic. The aim is both to give you the chance to work on a pattern recognition-related topic that interests you, and to improve your presentational skills. The talk is also a prerequisite for your admission to the exam. The talks should be held in English and should take ten minutes, plus 2.5 minutes of discussion and 2.5 minutes of feedback. Please write an email to This email address is being protected from spambots. You need JavaScript enabled to view it. to reserve your topic. The talk registration deadline is 18.12.2024.

Date Room Time Topic Presenter(s)
08.01.2025 KS2/Geb.F - SR-III 08:15 h Opening Gerhard Schmidt
08.01.2025 KS2/Geb.F - SR-III 08:25 h --- ---
08.01.2025 KS2/Geb.F - SR-III 08:40 h --- ---

 

Evaluation

Current evaluation Completed evaluations