Figure 1. A representation of the algorithm used to mimic human speech. (IMAGE)
Caption
Processing pipeline of the perceptual matching pursuit algorithm used to derive auditory sparse representations from speech signals. The five main processing steps are shown as gray blocks connected by solid arrows: (1) decompose the signal, (2) apply the masking effect, (3) find the maximum, (4) update, and (5) halt. Information about the kernel selected in the find-max step is used to build the auditory sparse representation, the resynthesized signal, and the residual signal.
Credit
Masashi Unoki from JAIST.
Usage Restrictions
none
License
Original content
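
Illustrative sketch
The five steps named in the caption can be sketched as a generic matching pursuit loop. This is a minimal illustration under stated assumptions, not the researchers' implementation: the windowed-sinusoid kernels from `make_kernels`, the fixed `mask_threshold` standing in for the perceptual masking model, and the `tol` stopping rule are all hypothetical placeholders that do not come from the source.

```python
import numpy as np


def make_kernels(n_kernels=8, length=64, sr=8000.0):
    """Hypothetical unit-norm windowed sinusoids standing in for auditory kernels."""
    t = np.arange(length) / sr
    freqs = np.linspace(200.0, 3000.0, n_kernels)
    window = np.hanning(length)
    kernels = np.array([window * np.sin(2 * np.pi * f * t) for f in freqs])
    return kernels / np.linalg.norm(kernels, axis=1, keepdims=True)


def matching_pursuit(signal, kernels, mask_threshold=0.0, max_iter=50, tol=1e-3):
    """Return (sparse representation, resynthesized signal, residual signal)."""
    residual = np.asarray(signal, dtype=float).copy()
    resynth = np.zeros_like(residual)
    sparse = []  # one (kernel index, time offset, amplitude) entry per selected kernel
    length = kernels.shape[1]

    for _ in range(max_iter):
        # Step 1, decompose: project the residual onto every kernel at every offset.
        corr = np.array([np.correlate(residual, k, mode="valid") for k in kernels])

        # Step 2, masking: ASSUMPTION, a fixed threshold replaces the perceptual
        # masking model, which the caption does not specify.
        masked = np.where(np.abs(corr) >= mask_threshold, corr, 0.0)

        # Step 3, find max: pick the kernel and offset with the largest magnitude.
        idx, offset = np.unravel_index(np.argmax(np.abs(masked)), masked.shape)
        amp = masked[idx, offset]

        # Step 5, halt: stop once the best remaining match is negligible.
        if abs(amp) < tol:
            break

        # Step 4, update: move the selected kernel from the residual
        # into the resynthesized signal and record it.
        atom = amp * kernels[idx]
        residual[offset:offset + length] -= atom
        resynth[offset:offset + length] += atom
        sparse.append((int(idx), int(offset), float(amp)))

    return sparse, resynth, residual
```

In this sketch the resynthesized and residual signals always sum back to the input, so the list of selected kernels is the only information needed to rebuild the approximation.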