Easy-to-follow video tutorials help you learn software, creative, and business skills.Become a member

Turning spoken dialogue into searchable metadata

From: Soundbooth CS5 Essential Training

Video: Turning spoken dialogue into searchable metadata

Soundbooth has a speech recognition module. It can analyze spoken words and convert them with reasonable accuracy into text. It can also differentiate between individuals to associate dialog with each person doing the speaking. This is a fast way to create a transcript, to create text that people can use inside Closed Captioning software or for subtitles. So, let me show you how that works. We have a dialog here with the declaration of independence in it, that just one person speaking me. We are going to have Soundbooth analyze this and try to convert the spoken word into the text.

Turning spoken dialogue into searchable metadata

Soundbooth has a speech recognition module. It can analyze spoken words and convert them with reasonable accuracy into text. It can also differentiate between individuals to associate dialog with each person doing the speaking. This is a fast way to create a transcript, to create text that people can use inside Closed Captioning software or for subtitles. So, let me show you how that works. We have a dialog here with the declaration of independence in it, that just one person speaking me. We are going to have Soundbooth analyze this and try to convert the spoken word into the text.

To do that, we need to go to what's called the Metadata panel, and that's not visible here. It's not visible in the default view. So, you go Window, Metadata. Now Metadata is information that's stored inside the file as text and typically you don't necessarily see unless you can look at the Metadata using a panel like this, and it does not interfere with audio. So, here is an audio file, but inside of this file, it's a little sort capsule of metadata. And right now, there is very little Metadata associated with this because we have not added any. So, what I want to do now is analyze this text.

In the Metadata panel, there is this one thing it's the Speech Analysis and at the bottom there is a little button and whenever you see a button or menu command that is ... after it, just for information. That means another menu is going to open if I click on it, and there is a little dialog box. It says what are options, which language are you going to use. Well actually, three are different English- language versions that it does searching on. We will take the English - U.S. version as oppose to the UK or Canada version. It ask for reference script. If there was one, it would show up here. This is something that you would create inside Premiere Pro.

And then it says what kind of quality do you want to search on? High quality, which takes longer or medium quality, which is faster. Well we will go with high quality because well we want to try to get the best job done here the first time through. There is a little option here to identify speakers. Since there is only one person speaking, we don't need to check that box. If there was more than one person speaking, you would check that so it try to decide that this is person one, and this is person two based upon the analysis of how their waveforms fit. Now I will click OK, when we do this the analysis will begin. Typically, the analysis takes about as long as the clip is and if you know this little segment you are going to say that doesn't look quite right.

I am going to play this, and as a place you will notice that each word will be highlighted as I go through here. So, you actually can search on words like Happiness would be over there, for example. Well let's go to the beginning and play and see how those words compare to the actual words, and you will see that they don't necessarily completely fit. (Audio playing.) It did pretty well, but it missed a few, a little bit of irony here that they are endowed that they aren't owned by their creator, sometimes funny things are stated inadvertently.

But what you can do now is you can fix this, and the thing is you can't, let's say, copy all of this and do a text editing program, fix-it and then paste it back. That's because each word here is connected to time inside this file. So, copying and pasting an entire file back in here won't work. We need to stick with what we have here. So, you have to edit literally one word at a time. So, I am going to show you, how that works. You click a word, and it turns blue. You click again, and it becomes editable. There is one we hold, and I could keep on going like this, but what I want to say is We hold these truths to be self-evident.

So, I am a missing a word here to be actually I guess can turn solved to self, self evident. Hold these truths, and I want to fix these other guys. I want to show you one thing that you can add words ahead of or after one so, and the pursuit here is missing. We need the word the, so I click on that word and then right-click, say Insert Word Before. I can do Before, After, Delete or Merge, but we will do Insert Word Before and the pursuit and the pursuit of happiness.

So, let me edit the rest of these guys so you can see the finished product when I am done. So, now I am almost done. I want to show you one more thing. There is with certain unalienable, and it's one word they made into three slang. Right-click on this one and just delete that one, right-click on this one delete that one and then replace legal with unalienable. And now we are done among these are Life, Liberty and the pursuit of Happiness Now that we are done we can always jump right to a word or if we have a very long dialog or narration, which something that you want to do the text analysis on first, go get a coffee while it's don't the analysis come back, and then you want to be able to jump right to a word or phrase or just type in the word you are looking for like happiness, and it will search the entire set of metadata to define any instances of that word and in this case, it jumps right to happiness.

If I click on it, I will click play. I loop it. It goes over and over again. Enough of that. So you can see how advantages it can be to do speech analysis because if you have a very long dialog or a log interview or something, and you really don't want sit down there and transcribe it or remind yourself what the person said. You just know that may be 15 minutes in or so the person have that pithy sound bite that you want to you as well as one that say 30 seconds and want at 30 minutes.

You just do the whole speech analysis creation, and then later on you can then go find that word that jumped out at you to help you track that segment down right away so you can edit it. Let me just tell you one another thing you can do. You can right-click on any one of these words and say Copy All, and then when you Copy All you can then go to another program let's say like Word and then Paste, which would be Paste or Ctrl+V or Command+V, and you can put in all the text, and this could be used let's say for Closed Captioning or for subtitles.

Then we back to Soundbooth, just have that open. So, that's basically how you can do speech analysis plus you can see, I think, the advantages of speech analysis knowing that it isn't perfect.

Show transcript

This video is part of

Image for Soundbooth CS5 Essential Training
Soundbooth CS5 Essential Training

52 video lessons · 14777 viewers

Jeff Sengstack
Author

 
Expand all | Collapse all
  1. 4m 51s
    1. Welcome
      1m 13s
    2. What is Soundbooth CS5?
      2m 30s
    3. Using the exercise files
      1m 8s
  2. 18m 4s
    1. Taking a look at basic sound waves
      3m 53s
    2. Taking a look at complex sound waves
      6m 43s
    3. Understanding digital audio concepts
      7m 28s
  3. 14m 45s
    1. Understanding the workflow
      5m 35s
    2. Touring the workspace
      3m 44s
    3. Customizing the workspace
      5m 26s
  4. 13m 11s
    1. Opening and importing files
      3m 59s
    2. Setting up recording hardware
      2m 35s
    3. Recording vocals and instruments
      6m 37s
  5. 1h 3m
    1. Playing and monitoring audio
      5m 10s
    2. Viewing audio waveforms and spectral displays
      6m 24s
    3. Selecting audio
      9m 14s
    4. Trimming and deleting audio
      3m 59s
    5. Copying, cutting, and pasting audio and inserting silence
      7m 55s
    6. Adjusting volume
      11m 21s
    7. Using specialized volume techniques
      6m 54s
    8. Creating and using loops
      4m 42s
    9. Stretching time and shifting pitch
      5m 14s
    10. Working with video files
      2m 22s
  6. 26m 50s
    1. Identifying noises: Hums, hisses, clicks, and pops
      5m 45s
    2. Removing background noise: Audio tape hiss
      7m 55s
    3. Removing vinyl record clicks and pops
      3m 57s
    4. Removing individual sounds
      9m 13s
  7. 13m 27s
    1. Previewing Soundbooth effects
      5m 32s
    2. Applying and adjusting standard effects
      4m 58s
    3. Applying and customizing advanced effects
      2m 57s
  8. 48m 25s
    1. Applying reverb and echo: Analog Delay and Convolution Reverb
      8m 46s
    2. Using delay-based effects: Chorus/Flanger and Phaser
      6m 8s
    3. Understanding sound-level effects: Compressor and Dynamics
      7m 0s
    4. Applying equalization effects: Graphic and Parametric
      11m 14s
    5. Exploring other special effects: Distortion and Vocal Enhancer
      7m 58s
    6. Setting the all-in-one effect: Mastering
      7m 19s
  9. 46m 42s
    1. Understanding multitrack concepts
      1m 16s
    2. Building a multitrack file
      7m 5s
    3. Adjusting track and clip volume and panning
      8m 54s
    4. Adding effects to individual tracks
      7m 38s
    5. Using Soundbooth sound effects in your multitrack file
      6m 38s
    6. Using three multitrack editing techniques: Duplicating, splitting, and cross-fading
      6m 15s
    7. Working with video in multitrack
      2m 49s
    8. Using professional production studio mixing techniques
      6m 7s
  10. 17m 13s
    1. Understanding how scores work
      3m 30s
    2. Previewing, downloading, and inserting scores into multitrack files
      4m 45s
    3. Adjusting score duration, intensity and parts
      8m 58s
  11. 10m 42s
    1. Dynamically linking to Premiere Pro and After Effects projects
      4m 31s
    2. Turning spoken dialogue into searchable metadata
      6m 11s
  12. 21m 42s
    1. Saving snapshots
      6m 23s
    2. Saving entire files or selected ranges
      11m 59s
    3. Saving and mixing down multitrack files
      3m 20s
  13. 10s
    1. Goodbye
      10s

Start learning today

Get unlimited access to all courses for just $25/month.

Become a member
Sometimes @lynda teaches me how to use a program and sometimes Lynda.com changes my life forever. @JosefShutter
@lynda lynda.com is an absolute life saver when it comes to learning todays software. Definitely recommend it! #higherlearning @Michael_Caraway
@lynda The best thing online! Your database of courses is great! To the mark and very helpful. Thanks! @ru22more
Got to create something yesterday I never thought I could do. #thanks @lynda @Ngventurella
I really do love @lynda as a learning platform. Never stop learning and developing, it’s probably our greatest gift as a species! @soundslikedavid
@lynda just subscribed to lynda.com all I can say its brilliant join now trust me @ButchSamurai
@lynda is an awesome resource. The membership is priceless if you take advantage of it. @diabetic_techie
One of the best decision I made this year. Buy a 1yr subscription to @lynda @cybercaptive
guys lynda.com (@lynda) is the best. So far I’ve learned Java, principles of OO programming, and now learning about MS project @lucasmitchell
Signed back up to @lynda dot com. I’ve missed it!! Proper geeking out right now! #timetolearn #geek @JayGodbold
Share a link to this course

What are exercise files?

Exercise files are the same files the author uses in the course. Save time by downloading the author's files instead of setting up your own files, and learn by following along with the instructor.

Can I take this course without the exercise files?

Yes! If you decide you would like the exercise files later, you can upgrade to a premium account any time.

Become a member Download sample files See plans and pricing

Please wait... please wait ...
Upgrade to get access to exercise files.

Exercise files video

How to use exercise files.

Learn by watching, listening, and doing, Exercise files are the same files the author uses in the course, so you can download them and follow along Premium memberships include access to all exercise files in the library.


Exercise files

Exercise files video

How to use exercise files.

For additional information on downloading and using exercise files, watch our instructional video or read the instructions in the FAQ.

This course includes free exercise files, so you can practice while you watch the course. To access all the exercise files in our library, become a Premium Member.

Join now "Already a member? Log in

Are you sure you want to mark all the videos in this course as unwatched?

This will not affect your course history, your reports, or your certificates of completion for this course.


Mark all as unwatched Cancel

Congratulations

You have completed Soundbooth CS5 Essential Training.

Return to your organization's learning portal to continue training, or close this page.


OK
Become a member to add this course to a playlist

Join today and get unlimited access to the entire library of video courses—and create as many playlists as you like.

Get started

Already a member?

Become a member to like this course.

Join today and get unlimited access to the entire library of video courses.

Get started

Already a member?

Exercise files

Learn by watching, listening, and doing! Exercise files are the same files the author uses in the course, so you can download them and follow along. Exercise files are available with all Premium memberships. Learn more

Get started

Already a Premium member?

Exercise files video

How to use exercise files.

Ask a question

Thanks for contacting us.
You’ll hear from our Customer Service team within 24 hours.

Please enter the text shown below:

The classic layout automatically defaults to the latest Flash Player.

To choose a different player, hold the cursor over your name at the top right of any lynda.com page and choose Site preferencesfrom the dropdown menu.

Continue to classic layout Stay on new layout
Exercise files

Access exercise files from a button right under the course name.

Mark videos as unwatched

Remove icons showing you already watched videos if you want to start over.

Control your viewing experience

Make the video wide, narrow, full-screen, or pop the player out of the page into its own window.

Interactive transcripts

Click on text in the transcript to jump to that spot in the video. As the video plays, the relevant spot in the transcript will be highlighted.

Are you sure you want to delete this note?

No

Your file was successfully uploaded.

Thanks for signing up.

We’ll send you a confirmation email shortly.


Sign up and receive emails about lynda.com and our online training library:

Here’s our privacy policy with more details about how we handle your information.

Keep up with news, tips, and latest courses with emails from lynda.com.

Sign up and receive emails about lynda.com and our online training library:

Here’s our privacy policy with more details about how we handle your information.

   
submit Lightbox submit clicked
Terms and conditions of use

We've updated our terms and conditions (now called terms of service).Go
Review and accept our updated terms of service.