Open Source AI and speech recognition

A2 | Tue 22 Jan | 2:40 p.m.–3:10 p.m.


Presented by

  • Kathy Reid
    @KathyReid
    https://blog.kathyreid.id.au

    Kathy Reid works at the intersection of open source, emerging technologies and technical communities. She is currently Director of Developer Relations at Mycroft AI, democratising voice with open source speech recognition and language processing technologies. She has held several technical leadership positions, including roles as Digital Platforms and Operations Manager at Deakin University, managing platforms such as WordPress, Squiz Matrix and Atlassian Confluence, technical lead on projects involving digital signage and videoconferencing, and has worked as a web and application developer. More recently, she has run her own technical consulting micro-business. She is the President of Linux Australia, Inc. Kathy holds Arts and Science undergraduate degrees from Deakin University and an MBA (Computing) from Charles Sturt University, as well as several ITIL qualifications.

Abstract

This session will present a history of open source voice projects and provide an overview of the current open source voice landscape. From Star Trek’s computer, to Knight Rider’s K.I.T.T, to Selma in Time Trax, we have long desired to work with computers using the power of our voice. But while attractive, this technology is currently used to gather data, using spoken utterances to personalize advertisements, and provide privacy-impugning ‘nudges. Enter open source, privacy-focussed voice assistants. As voice becomes ubiquitous, those organisations wanting to ensure privacy and prevent vendor lock-in are turning to open source solutions.What's out there? What are the differences? And what sort of things should you consider when you're choosing an open source voice platform? What does the open source voice assistant roadmap look like? What models are there for sustainability and revenue generation in open source voice? This presentation will provide an overview of the voice stack, and the different open source options available at each layer of the stack, as well as their pros and cons, enabling you to make an informed choice about open source voice.