With speakers enrolled, submit an input and have cognitive services identify the speaker in the submitted file.
- [Instructor] Now the process of identifying two users,…as in creating their identification profiles,…and creating enrollments, which is…the process of associating their spoken text…with their identification profiles.…Feel free to play these audio files.…You can find them in the associated source code.…You'll see that they the 1.wav and 2.wav…are spoken by me and Jill's 1.wav is spoken by Jill.…So we've set up our users at this point.…
Now we can perform identification.…So as I mentioned earlier, the way identification works…is that I'm going to upload a new audio file…not matching the enrollment files,…but something brand new.…And I have this inputs folder up here…that contains audio files from both Jill and Sahil.…So I'm going to take these two audio files…and I'm going to make them a part of my source code…so we can parse them in as inputs.…So go ahead and drag-drop them into the source folder.…
Now they're a part of my source code.…And now what I can do, is that I can take these audio files…and these audio files I'll be able to submit them…
Author
Released
4/10/2018- Using the Translate Text API
- Getting supported languages
- Writing code to translate between languages
- Performing text to speech
- Setting up speech to text
- Writing code for speaker identification
Skill Level Intermediate
Duration
Views
Related Courses
-
Microsoft Cognitive Services for Developers: 1 Vision
with Sahil Malik2h 35m Intermediate -
Learning Microsoft Cognitive Services for Developers
with Sahil Malik1h 42m Intermediate -
Creating Bots with the Microsoft Bot Framework, Part 1
with Scott Peterson45m 41s Intermediate
-
Introduction
-
Welcome57s
-
What you should know1m 37s
-
-
1. The Basics
-
Introduction2m 56s
-
Set up a Node.js project4m 34s
-
-
2. Translation Text API
-
Set up a project1m 39s
-
Get language names5m 7s
-
Break apart longer sentences3m 23s
-
Get languages for Speak1m 53s
-
Performing Text-to-Speech3m 46s
-
-
3. Bing Speech API
-
Speech to Text5m 5s
-
Bing Speech Text-to-Speech3m 59s
-
4. Speaker Recognition API
-
Write business objects1m 41s
-
Enrolling the first user6m 3s
-
Enrolling the second user2m 14s
-
Identifying speakers4m 53s
-
Conclusion
-
Next steps1m 9s
-
- Mark as unwatched
- Mark all as unwatched
Are you sure you want to mark all the videos in this course as unwatched?
This will not affect your course history, your reports, or your certificates of completion for this course.
CancelTake notes with your new membership!
Type in the entry box, then click Enter to save your note.
1:30Press on any video thumbnail to jump immediately to the timecode shown.
Notes are saved with you account but can also be exported as plain text, MS Word, PDF, Google Doc, or Evernote.
Share this video
Embed this video
Video: Identifying speakers