Just a voice analysis/synthesis algorithm pack. On the way of approaching it.
Tuku [email protected]
Hikaria [email protected]
- Ugly and simple viterbi decoder(sparsehmm)
- LPC(lpc)
- Yin F0 estimator(yin)
- PYin F0 estimator(pyin)
- Viterbi F0 tracker(monopitch)
- Viterbi note tracker(mononote)
- Adaptive STFT(adaptivestft)
- Cheaptrick Envelope(cheaptrick)
- MFI Envelope(mfienvelope)
- HNM analyzer framework and synther(hnm)
- HNM analyzer based on STFT(hnm_qfft)
- HNM analyzer based on QHM-AIR(hnm_qhm)
- HNM analyzer based on signle band DFT(hnm_get)
- LF-Model(lfmodel)
- Magnitude-only Rd estimator(rd_krh)
- Voice Tract Model(vt)
- (Decaparted)Ugly and simple formant tracker(formanttracker)
- Yang style SNR and instantaneous frequency(yang)
- Partial Glottal-Tract model(gvm)
- Hubble F0 detect system(hubble)
- Use log probabilities in sparsehmm
- SRH F0 estimator
- STFChT
- STFChT F0 estimator
- HNM based on STFChT
- DCE-MFA Envelope
- Formant Refinement
- Voice transformation
- Harmonic to noise conversion
- Better Documention
- Create an aliasing-free, non-integer offset and width hanning window via frequency domain method(accurateHann, fdHann)
- Denoise data and keep instantaneous information(applySmoothingFilter)
- Do DFT at any frequency for input signal(calcSpectrumAtFreq)
- Measure difference between two spectrum magnitude(calcItakuraSaitoDistance)
- Measure similarity between two time domain signal(calcSRER)
- Mauch, Matthias, and Simon Dixon. "pYIN: A fundamental frequency estimator using probabilistic threshold distributions." 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). IEEE, 2014.
- Morise, Masanori. "CheapTrick, a spectral envelope estimator for high-quality speech synthesis." Speech Communication 67 (2015): 1-7.
- Nakano, Tomoyasu, and Masataka Goto. "A spectral envelope estimation method based on F0-adaptive multi-frame integration analysis." SAPA@ INTERSPEECH. 2012.
- Fant, Gunnar. "The LF-model revisited. Transformations and frequency domain analysis." Speech Trans. Lab. Q. Rep., Royal Inst. of Tech. Stockholm 2.3 (1995): 40.
- Bonada, Jordi. "High quality voice transformations based on modeling radiated voice pulses in frequency domain." Proc. Digital Audio Effects (DAFx). Vol. 3. 2004.
- Saratxaga, Ibon, et al. "Simple representation of signal phase for harmonic speech models." Electronics letters 45.7 (2009): 381-383.
- Story, Brad H. "A parametric model of the vocal tract area function for vowel and consonant simulation." The Journal of the Acoustical Society of America 117.5 (2005): 3231-3254.
- Kawahara, H., Y. Agiomyrgiannakis, and H. Zen. "YANG vocoder, Google."
- Hua, Kanru. "Nebula: F0 Estimation and Voicing Detection by Modeling the Statistical Properties of Feature Extractors." arXiv preprint arXiv:1710.11317 (2017).