Lecture: Audio Processing for the Internet of Things (AIoT) (Summer Term 2024)

Source https://tenor.com/view/embedded-security-for-internet-of-things-gif-25502017
  • Instructor: Prof. Dr. Nils Peters

  • Credits: 2.5 ECTS

  • Term: Summer 2024

  • Time: Friday 12:00 - 14:15 (3 x 45 min) (1st Lecture: 19.04.2024)

  • Format: hybrid lecture, in English

  • Location: Am Wolfsmantel 33, Erlangen-Tennenlohe, Room 3R4.04 and via ZOOM. Link and access information for ZOOM meetings can be found at StudOn (see below).

  • Dates:

    • Fri 19.04.2024, Fri 26.04.2024,
    • Fri 03.05.2024, Fri 17.05.2024, Fri 24.05.2024
    • Fri 07.06.2024, Fri 14.06.2024, Fri 21.06.2024
  • Office Hours: Thu 13:00 – 13:30 (virtual, use lecture ZOOM link)

  • Exam (graded): Examination at the end of term. Students must register for examination via Campo

  • Examination Dates and Location: To be announced, contact me.


  • No lecture on Fri 10.05.2023
  • No lecture on Fri 31.05.2023

Hybrid Format

  • The lecture will be offered as a hybrid course via ZOOM (i.e. in-person and virtual via Zoom at the same time).

  • To access the Zoom sessions, you must register via StudOn prior to the first lecture. In StudOn, you will then find Zoom access information.

  • Virtual participants must have access to a computer capable of running the ZOOM video conferencing software, including a stable internet connection for audio and video transmission.

  • Recordings of the lectures may not be provided.

Course Content

The course focuses on audio and speech processing algorithms within the context of the Internet of Things (IoT). Reading material recommendations are provided during the lectures.

  • Foundation: history, components, current challenges
  • Overview of Relevant Wireless Protocols: bandwidth, range, latency, spectrum
  • Audio Device Synchronization: NTP, PTP, device orchestration, wireless acoustic sensor networks, asynchronous and event-driven audio sampling
  • Acoustic Sensing for Voice User Interfaces: keyword spotting, speech recognition, speaker verification, anti-spoofing
  • Acoustic Scene Detection: event detection, scene classification, anomaly detection, sound tagging
  • Sound Creation: text-to-speech, sound generative networks
  • Data-over-sound: sound-beacon, watermarking, acoustic fingerprint
  • Privacy in IoT: edge vs. cloud processing, secure signal processing, federated learning, differential privacy, audio encryption


Before starting this lecture, it is recommended to complete the following FAU courses (or have equivalent knowledge):

  • Signals and Systems I & II
  • Digital Signal Processing
  • Deep Learning, Machine Learning in Signal Processing