%matplotlib inline
import numpy, scipy, matplotlib.pyplot as plt, IPython.display as ipd
import librosa, librosa.display
import stanford_mir; stanford_mir.init()
The zero crossing rate indicates the number of times that a signal crosses the horizontal axis.
Let's load a signal:
x, sr = librosa.load('audio/simple_loop.wav')
Listen to the signal:
ipd.Audio(x, rate=sr)
Plot the signal:
plt.figure(figsize=(14, 5))
librosa.display.waveplot(x, sr=sr)
Let's zoom in:
n0 = 6500
n1 = 7500
plt.figure(figsize=(14, 5))
plt.plot(x[n0:n1])
I count five zero crossings. Let's compute the zero crossings using librosa.
zero_crossings = librosa.zero_crossings(x[n0:n1], pad=False)
zero_crossings.shape
That computed a binary mask where True
indicates the presence of a zero crossing. To find the total number of zero crossings, use sum
:
print(sum(zero_crossings))
To find the zero-crossing rate over time, use zero_crossing_rate
:
zcrs = librosa.feature.zero_crossing_rate(x)
print(zcrs.shape)
Plot the zero-crossing rate:
plt.figure(figsize=(14, 5))
plt.plot(zcrs[0])
Note how the high zero-crossing rate corresponds to the presence of the snare drum.
The reason for the high rate near the beginning is because the silence oscillates quietly around zero:
plt.figure(figsize=(14, 5))
plt.plot(x[:1000])
plt.ylim(-0.0001, 0.0001)
A simple hack around this is to add a small constant before computing the zero crossing rate:
zcrs = librosa.feature.zero_crossing_rate(x + 0.0001)
plt.figure(figsize=(14, 5))
plt.plot(zcrs[0])
Try for other audio files. Does the zero-crossing rate still return something useful in polyphonic mixtures?
ls audio