Envelope modeling for speech and audio processing using distribution quantization

Envelope modeling for speech and audio processing using distribution quantization. Jähnel, T., Bäckström, T., & Schubert, B. In 2015 23rd European Signal Processing Conference (EUSIPCO), pages 584-588, Aug, 2015.

Paper doi abstract bibtex

Envelope models are common in speech and audio processing: for example, linear prediction is used for modeling the spectral envelope of speech, whereas audio coders use scale factor bands for perceptual masking models. In this work we introduce an envelope model called distribution quantizer (DQ), with the objective of combining the accuracy of linear prediction and the flexibility of scale factor bands. We evaluate the performance of envelope models with respect to their ability to reduce entropy as well as their correlation to the original signal magnitude. The experiments show that in terms of entropy, distribution quantization and linear prediction are comparable, whereas for correlation, distribution quantization is better. Furthermore the coefficients of distribution quantization are independent and thus more flexible and easier to quantize than linear predictive coefficients.

@InProceedings{7362450,
  author = {T. Jähnel and T. Bäckström and B. Schubert},
  booktitle = {2015 23rd European Signal Processing Conference (EUSIPCO)},
  title = {Envelope modeling for speech and audio processing using distribution quantization},
  year = {2015},
  pages = {584-588},
  abstract = {Envelope models are common in speech and audio processing: for example, linear prediction is used for modeling the spectral envelope of speech, whereas audio coders use scale factor bands for perceptual masking models. In this work we introduce an envelope model called distribution quantizer (DQ), with the objective of combining the accuracy of linear prediction and the flexibility of scale factor bands. We evaluate the performance of envelope models with respect to their ability to reduce entropy as well as their correlation to the original signal magnitude. The experiments show that in terms of entropy, distribution quantization and linear prediction are comparable, whereas for correlation, distribution quantization is better. Furthermore the coefficients of distribution quantization are independent and thus more flexible and easier to quantize than linear predictive coefficients.},
  keywords = {audio coding;hearing;quantisation (signal);speech intelligibility;speech processing;speech processing;audio processing;distribution quantization;envelope models;linear prediction;spectral envelope;audio coders;scale factor band;perceptual masking models;Predictive models;Entropy;Correlation;Quantization (signal);Speech;Speech coding;Frequency-domain analysis;Speech coding;linear predictive coding;signal modeling},
  doi = {10.1109/EUSIPCO.2015.7362450},
  issn = {2076-1465},
  month = {Aug},
  url = {https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570096285.pdf},
}

Downloads: 0

{"_id":"xZWNTfXxuh7BojW4j","bibbaseid":"jhnel-bckstrm-schubert-envelopemodelingforspeechandaudioprocessingusingdistributionquantization-2015","authorIDs":[],"author_short":["Jähnel, T.","Bäckström, T.","Schubert, B."],"bibdata":{"bibtype":"inproceedings","type":"inproceedings","author":[{"firstnames":["T."],"propositions":[],"lastnames":["Jähnel"],"suffixes":[]},{"firstnames":["T."],"propositions":[],"lastnames":["Bäckström"],"suffixes":[]},{"firstnames":["B."],"propositions":[],"lastnames":["Schubert"],"suffixes":[]}],"booktitle":"2015 23rd European Signal Processing Conference (EUSIPCO)","title":"Envelope modeling for speech and audio processing using distribution quantization","year":"2015","pages":"584-588","abstract":"Envelope models are common in speech and audio processing: for example, linear prediction is used for modeling the spectral envelope of speech, whereas audio coders use scale factor bands for perceptual masking models. In this work we introduce an envelope model called distribution quantizer (DQ), with the objective of combining the accuracy of linear prediction and the flexibility of scale factor bands. We evaluate the performance of envelope models with respect to their ability to reduce entropy as well as their correlation to the original signal magnitude. The experiments show that in terms of entropy, distribution quantization and linear prediction are comparable, whereas for correlation, distribution quantization is better. Furthermore the coefficients of distribution quantization are independent and thus more flexible and easier to quantize than linear predictive coefficients.","keywords":"audio coding;hearing;quantisation (signal);speech intelligibility;speech processing;speech processing;audio processing;distribution quantization;envelope models;linear prediction;spectral envelope;audio coders;scale factor band;perceptual masking models;Predictive models;Entropy;Correlation;Quantization (signal);Speech;Speech coding;Frequency-domain analysis;Speech coding;linear predictive coding;signal modeling","doi":"10.1109/EUSIPCO.2015.7362450","issn":"2076-1465","month":"Aug","url":"https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570096285.pdf","bibtex":"@InProceedings{7362450,\n author = {T. Jähnel and T. Bäckström and B. Schubert},\n booktitle = {2015 23rd European Signal Processing Conference (EUSIPCO)},\n title = {Envelope modeling for speech and audio processing using distribution quantization},\n year = {2015},\n pages = {584-588},\n abstract = {Envelope models are common in speech and audio processing: for example, linear prediction is used for modeling the spectral envelope of speech, whereas audio coders use scale factor bands for perceptual masking models. In this work we introduce an envelope model called distribution quantizer (DQ), with the objective of combining the accuracy of linear prediction and the flexibility of scale factor bands. We evaluate the performance of envelope models with respect to their ability to reduce entropy as well as their correlation to the original signal magnitude. The experiments show that in terms of entropy, distribution quantization and linear prediction are comparable, whereas for correlation, distribution quantization is better. Furthermore the coefficients of distribution quantization are independent and thus more flexible and easier to quantize than linear predictive coefficients.},\n keywords = {audio coding;hearing;quantisation (signal);speech intelligibility;speech processing;speech processing;audio processing;distribution quantization;envelope models;linear prediction;spectral envelope;audio coders;scale factor band;perceptual masking models;Predictive models;Entropy;Correlation;Quantization (signal);Speech;Speech coding;Frequency-domain analysis;Speech coding;linear predictive coding;signal modeling},\n doi = {10.1109/EUSIPCO.2015.7362450},\n issn = {2076-1465},\n month = {Aug},\n url = {https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570096285.pdf},\n}\n\n","author_short":["Jähnel, T.","Bäckström, T.","Schubert, B."],"key":"7362450","id":"7362450","bibbaseid":"jhnel-bckstrm-schubert-envelopemodelingforspeechandaudioprocessingusingdistributionquantization-2015","role":"author","urls":{"Paper":"https://www.eurasip.org/proceedings/eusipco/eusipco2015/papers/1570096285.pdf"},"keyword":["audio coding;hearing;quantisation (signal);speech intelligibility;speech processing;speech processing;audio processing;distribution quantization;envelope models;linear prediction;spectral envelope;audio coders;scale factor band;perceptual masking models;Predictive models;Entropy;Correlation;Quantization (signal);Speech;Speech coding;Frequency-domain analysis;Speech coding;linear predictive coding;signal modeling"],"metadata":{"authorlinks":{}},"downloads":0},"bibtype":"inproceedings","biburl":"https://raw.githubusercontent.com/Roznn/EUSIPCO/main/eusipco2015url.bib","creationDate":"2021-02-13T17:31:52.338Z","downloads":0,"keywords":["audio coding;hearing;quantisation (signal);speech intelligibility;speech processing;speech processing;audio processing;distribution quantization;envelope models;linear prediction;spectral envelope;audio coders;scale factor band;perceptual masking models;predictive models;entropy;correlation;quantization (signal);speech;speech coding;frequency-domain analysis;speech coding;linear predictive coding;signal modeling"],"search_terms":["envelope","modeling","speech","audio","processing","using","distribution","quantization","jähnel","bäckström","schubert"],"title":"Envelope modeling for speech and audio processing using distribution quantization","year":2015,"dataSources":["eov4vbT6mnAiTpKji","knrZsDjSNHWtA9WNT"]}