admin管理员组

文章数量:1612827

音频筑基:基音、基频和共振峰

    • 是什么
    • 深入理解
    • 相关参考

音频信号分析中,经常遇到基音、周期、共振峰的概念,这里总结下自己的粗浅理解,不当之处,还望各位大佬提出指正。

是什么


基音,是复音中强度最大、频率最低的音。表征着声音的音高,即人们所感知到的音调高低。通常是以频率维度衡量,所以也叫基音频率,简称基频F0。有时粗略讲,又叫音高,英文名Pitch。

  • 基音周期Period:气流经声门的周期间隔,体现声源激励信息,设时间为T(s),物理含义为是时域波形最小重复的时间。
    • 周期Period是指声音波形中一个完整的周期所需要的时间。
    • 周期Period一般可以通过分析波形的重复模式来确定,比如:计算波形中相邻两个峰值(或谷值)之间的时间间隔来获得。
  • 基音频率Fundamental Frequency,是基音周期的倒数F0=1/T

共振峰,英文名Formant,表现为频谱上由基频衍生的次谐波(能量最强的一根根横线)。

  • 共振峰产生原理:F1、F2,经声门后来到声道处与发声环境产生共振的频率,体现声道环境信息。

深入理解


周期Period和基音Pitch如何转换?

Pitch指的是音频信号的基波周期,用周期或者频率来表示,如周期为4.3ms,对应频率就是230Hz,跟频谱图的频率概念是有别的。语音信号场景,基音指标十分重要。

转换公式为:Pitch = 1 / Period。换句话说,基音Pitch的值是周期Period的倒数。这意味着,周期越短,基音Pitch就越高;周期越长,基音Pitch就越低。

严格意义上讲,音高Pitch和基频F0有何区别?

严格说,音高的概念更广,音高主要由基频决定。通常认为,音高增高八度音程相当于基频翻倍。

  • 基频F0是指声音波形中最低频率的周期性振动频率,也就是声音的基频。在语音信号中,基频F0通常对应于说话人的声调(tone),也就是说话人说话时的基本音高。基频F0的单位是赫兹(Hz)。
  • 基音Pitch是指声音的音高,通常用赫兹(Hz)或半音(semitone)来表示。在音乐领域中,Pitch通常指的是乐器演奏的音高,而在语音领域中,Pitch通常指的是说话人的音高。

男声和女声的基频范围是?

  • 男声基频:60-150Hz
  • 女声基频:200-400Hz

pitch period和gain之间的关系是?

  • pitch period指元音发音的周期,其倒数为基音频率
  • pitch gain指基音频率对应的能量强度,越强则较于噪声信号听感更显性

what is the difference between the pitch gain and the pitch period in audio coding?

Copilot:
In audio coding, pitch gain and pitch period are two distinct concepts used to describe characteristics of voiced sounds, particularly in speech processing:

  1. Pitch Period:
    • The pitch period refers to the duration of one complete cycle of a periodic waveform, typically measured in milliseconds. In the context of speech, it represents the time interval between successive glottal pulses during voiced speech sounds, such as vowels1.
    • Mathematically, if the pitch period is ( T_p ), the pitch frequency ( f_p ) is given by ( f_p = \frac{1}{T_p} )2.
  2. Pitch Gain:
    • Pitch gain, on the other hand, is a measure of the strength or amplitude of the periodic component in the speech signal. It indicates how much of the signal’s energy is concentrated at the pitch frequency1.
    • Higher pitch gain means that the periodic component (voiced sound) is more dominant compared to the noise or unvoiced components in the signal1.

In summary, while the pitch period is about the timing of the periodicity in the signal, the pitch gain is about the amplitude or strength of that periodicity.

f p = 1 T p f_p = \frac{1}{T_p} fp=Tp1

相关参考


  1. 声音基频f0_音高和基频(Pitch and F0),link
  2. 音频(二) 基音周期与共振峰, link
  3. 基音周期和共振峰,link
  4. Wikipedia:共振峰,link
  5. 口语元音中的共振峰和谐波,link

本文标签: 基频基音音频