A Practitioner’s Look at Speech -to-Text

Posted in News Story  |  Tagged

Mark Dickison, Technical and Modeling Lead for Capital One’s internal speech recognition team – SpeakEasy

Abstract:

This talk covers the basics of Speech-to-Text using Hidden Markov Models and Gaussian Mixture Models (HMM-GMM). Included will be a discussion of the basics of signal processing for speech recognition, acoustic and language models, and how they are jointly maximized to produce text.

​ As time and interest allows, Neural Networks, and how they can be used either in hybrid with HMM-GMM models will also be covered.

Bio:

Mark Dickison started his career as a computational physicist specializing in network science – acquiring his Ph.D. from Boston University. This was followed by a post-doctoral fellowship at Pennsylvania State in their USP program, which supports the US Defense Threat Reduction Agency. Leaving academia, he joined Booz Allen Hamilton as a data scientist, working with a variety of clients across health, finance, and energy. Mark is currently the Technical and Modeling Lead for Capital One’s internal speech recognition team – SpeakEasy.

View “A Practitioner’s Look at Speech-to-Text” Presentation


More Analytics Seminars

DSAN and DataKindDC Partner for a DataDive for Social Impact

November 7th, 2022

On Saturday, October 22, we partnered with DataKindDC for an all-day data dive, giving our students a chance to work in real-time alongside local volunteer data scientists on projects for four non-profits to move the selected organization’s programming forward and benefit the communities they serve.…

Students gather in front of screen displaying our program logo

DSAN Partners with AWS for a Fall Career Fair

October 25th, 2022

Our program places great emphasis on providing our students with opportunities to network with recruiters. We were delighted to partner with Amazon Web Services (AWS) on October 6th, 2022, in the HFSC Social Room for a career fair.…