Turning spoken dialogue into searchable metadata

Soundbooth CS5 Essential Training

with Jeff Sengstack

Turning spoken dialogue into searchable metadata is a video in the Audio + Music category, taught by Jeff Sengstack as part of Soundbooth CS5 Essential Training.
  1. 4m 51s
    1. Welcome
      1m 13s
    2. What is Soundbooth CS5?
      2m 30s
    3. Using the exercise files
      1m 8s
  2. 18m 4s
    1. Taking a look at basic sound waves
      3m 53s
    2. Taking a look at complex sound waves
      6m 43s
    3. Understanding digital audio concepts
      7m 28s
  3. 14m 45s
    1. Understanding the workflow
      5m 35s
    2. Touring the workspace
      3m 44s
    3. Customizing the workspace
      5m 26s
  4. 13m 11s
    1. Opening and importing files
      3m 59s
    2. Setting up recording hardware
      2m 35s
    3. Recording vocals and instruments
      6m 37s
  5. 1h 3m
    1. Playing and monitoring audio
      5m 10s
    2. Viewing audio waveforms and spectral displays
      6m 24s
    3. Selecting audio
      9m 14s
    4. Trimming and deleting audio
      3m 59s
    5. Copying, cutting, and pasting audio and inserting silence
      7m 55s
    6. Adjusting volume
      11m 21s
    7. Using specialized volume techniques
      6m 54s
    8. Creating and using loops
      4m 42s
    9. Stretching time and shifting pitch
      5m 14s
    10. Working with video files
      2m 22s
  6. 26m 50s
    1. Identifying noises: Hums, hisses, clicks, and pops
      5m 45s
    2. Removing background noise: Audio tape hiss
      7m 55s
    3. Removing vinyl record clicks and pops
      3m 57s
    4. Removing individual sounds
      9m 13s
  7. 13m 27s
    1. Previewing Soundbooth effects
      5m 32s
    2. Applying and adjusting standard effects
      4m 58s
    3. Applying and customizing advanced effects
      2m 57s
  8. 48m 25s
    1. Applying reverb and echo: Analog Delay and Convolution Reverb
      8m 46s
    2. Using delay-based effects: Chorus/Flanger and Phaser
      6m 8s
    3. Understanding sound-level effects: Compressor and Dynamics
      7m 0s
    4. Applying equalization effects: Graphic and Parametric
      11m 14s
    5. Exploring other special effects: Distortion and Vocal Enhancer
      7m 58s
    6. Setting the all-in-one effect: Mastering
      7m 19s
  9. 46m 42s
    1. Understanding multitrack concepts
      1m 16s
    2. Building a multitrack file
      7m 5s
    3. Adjusting track and clip volume and panning
      8m 54s
    4. Adding effects to individual tracks
      7m 38s
    5. Using Soundbooth sound effects in your multitrack file
      6m 38s
    6. Using three multitrack editing techniques: Duplicating, splitting, and cross-fading
      6m 15s
    7. Working with video in multitrack
      2m 49s
    8. Using professional production studio mixing techniques
      6m 7s
  10. 17m 13s
    1. Understanding how scores work
      3m 30s
    2. Previewing, downloading, and inserting scores into multitrack files
      4m 45s
    3. Adjusting score duration, intensity and parts
      8m 58s
  11. 10m 42s
    1. Dynamically linking to Premiere Pro and After Effects projects
      4m 31s
    2. Turning spoken dialogue into searchable metadata
      6m 11s
  12. 21m 42s
    1. Saving snapshots
      6m 23s
    2. Saving entire files or selected ranges
      11m 59s
    3. Saving and mixing down multitrack files
      3m 20s
  13. 10s
    1. Goodbye

Video duration: 6m 11s | Course duration: 4h 59m | Level: Beginner



Turning spoken dialogue into searchable metadata

Soundbooth has a speech recognition module. It can analyze spoken words and convert them with reasonable accuracy into text. It can also differentiate between individuals to associate dialog with each person doing the speaking. This is a fast way to create a transcript, or to create text that people can use inside closed captioning software or for subtitles. So, let me show you how that works. We have a dialog here with the Declaration of Independence in it, with just one person speaking: me. We are going to have Soundbooth analyze this and try to convert the spoken word into text.

To do that, we need to go to what's called the Metadata panel, and that's not visible here. It's not visible in the default view. So, you go Window > Metadata. Now, metadata is information that's stored inside the file as text, and typically you don't see it unless you look at it using a panel like this, and it does not interfere with the audio. So, here is an audio file, but inside of this file is a little sort of capsule of metadata. And right now, there is very little metadata associated with this because we have not added any. So, what I want to do now is analyze this text.

In the Metadata panel, there is this one section called Speech Analysis, and at the bottom there is a little button. Whenever you see a button or menu command that has an ellipsis (...) after it, just for information, that means another menu is going to open if I click on it. And there is a little dialog box. It asks what your options are: which language are you going to use? Well, actually, there are three different English-language versions that it does searching on. We will take the English - U.S. version as opposed to the UK or Canada version. It asks for a reference script. If there was one, it would show up here. This is something that you would create inside Premiere Pro.

And then it asks what kind of quality you want to search on: High quality, which takes longer, or Medium quality, which is faster. Well, we will go with High quality because we want to try to get the best job done here the first time through. There is a little option here to identify speakers. Since there is only one person speaking, we don't need to check that box. If there were more than one person speaking, you would check that so it tries to decide that this is person one and this is person two, based upon the analysis of how their waveforms fit. Now I will click OK, and when we do this the analysis will begin. Typically, the analysis takes about as long as the clip is, and if you know this little segment you are going to say that doesn't look quite right.

I am going to play this, and as it plays you will notice that each word will be highlighted as I go through here. So, you actually can search on words; "Happiness" would be over there, for example. Well, let's go to the beginning and play and see how those words compare to the actual words, and you will see that they don't necessarily completely fit. (Audio playing.) It did pretty well, but it missed a few. A little bit of irony here: "that they are endowed" came out as "that they aren't owned" by their creator. Sometimes funny things are stated inadvertently.

But what you can do now is fix this, and the thing is you can't, let's say, copy all of this into a text editing program, fix it, and then paste it back. That's because each word here is connected to a time inside this file. So, copying and pasting an entire file back in here won't work. We need to stick with what we have here. So, you have to edit literally one word at a time. So, I am going to show you how that works. You click a word, and it turns blue. You click again, and it becomes editable. There is one: "We hold". And I could keep on going like this, but what I want it to say is "We hold these truths to be self-evident."

So, I am missing a word here, "to be". Actually, I guess I can turn "solved" into "self": "self-evident". "Hold these truths", and I want to fix these other guys. I want to show you one thing: you can add words ahead of or after one. So, in "and the pursuit" here, something is missing. We need the word "the", so I click on that word and then right-click and say Insert Word Before. I can do Before, After, Delete, or Merge, but we will do Insert Word Before: "and the pursuit", "and the pursuit of happiness".
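The word-by-word editing described above can be pictured as operations on a list of timed words. The following is only a minimal sketch: the `TimedWord` class, its field names, and the four helper functions are hypothetical and are not Soundbooth's actual metadata format or API. The rule that an inserted word takes the first half of its neighbor's time slot is also an assumption made for illustration.

```python
from dataclasses import dataclass


@dataclass
class TimedWord:
    # Hypothetical structure, not Soundbooth's real storage format.
    text: str
    start: float     # seconds from the start of the clip
    duration: float  # seconds


def rename(words, index, new_text):
    """Change a word's text in place; its timing stays untouched."""
    words[index].text = new_text


def insert_before(words, index, new_text):
    """Insert a word ahead of words[index].

    Assumption: the new word takes the first half of the neighbor's slot.
    """
    neighbor = words[index]
    half = neighbor.duration / 2
    new_word = TimedWord(new_text, neighbor.start, half)
    neighbor.start += half
    neighbor.duration -= half
    words.insert(index, new_word)


def delete(words, index):
    """Remove a word; the remaining words keep their own timings."""
    del words[index]


def merge_with_next(words, index):
    """Combine two adjacent words into one entry spanning both time slots."""
    first, second = words[index], words[index + 1]
    end = second.start + second.duration
    words[index] = TimedWord(f"{first.text} {second.text}",
                             first.start, end - first.start)
    del words[index + 1]


if __name__ == "__main__":
    # Made-up timings for the phrase fixed in the video.
    words = [
        TimedWord("and", 10.0, 0.5),
        TimedWord("pursuit", 10.5, 0.5),
        TimedWord("of", 11.0, 0.25),
        TimedWord("happiness", 11.25, 0.75),
    ]
    insert_before(words, 1, "the")
    print(" ".join(w.text for w in words))
```

Because every entry carries its own start time, pasting back a block of plain text would discard the timings, which is consistent with the one-word-at-a-time editing shown in the video.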

So, let me edit the rest of these guys so you can see the finished product when I am done. So, now I am almost done. I want to show you one more thing. There is "with certain unalienable", and it's one word that it made into three. Right-click on this one and just delete that one, right-click on this one and delete that one, and then replace "legal" with "unalienable". And now we are done: "among these are Life, Liberty and the pursuit of Happiness." Now that we are done, we can always jump right to a word. If we have a very long dialog or narration, which is something you would want to run the text analysis on first (go get a coffee while it's doing the analysis, then come back), you want to be able to jump right to a word or phrase. Just type in the word you are looking for, like "happiness", and it will search the entire set of metadata to find any instances of that word, and in this case, it jumps right to "happiness".
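The word search described above amounts to a lookup over time-stamped words. Here is a small sketch of that idea, assuming the transcript is available as a list of (text, start time in seconds) pairs. The function name `find_word` and the sample timings are made up for illustration and do not come from Soundbooth.

```python
import string


def find_word(words, query):
    """Return the start time of every word that matches the query.

    Matching ignores case and any punctuation attached to the word.
    """
    target = query.strip().lower()
    hits = []
    for text, start in words:
        if text.strip(string.punctuation).lower() == target:
            hits.append(start)
    return hits


if __name__ == "__main__":
    # Hypothetical timings, in seconds from the start of the clip.
    transcript = [
        ("Life,", 20.0), ("Liberty", 20.5), ("and", 21.0), ("the", 21.25),
        ("pursuit", 21.5), ("of", 22.0), ("Happiness", 22.25),
    ]
    for start in find_word(transcript, "happiness"):
        print(f"jump to {start} seconds")
```

A player could then move its playhead to each returned time, which is the behavior the video shows when the search jumps to "happiness".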

If I click on it, I will click play. I loop it. It goes over and over again. Enough of that. So you can see how advantageous it can be to do speech analysis, because if you have a very long dialog or a long interview or something, you really don't want to sit down there and transcribe it or remind yourself what the person said. You just know that maybe 15 minutes in or so the person had that pithy sound bite that you want to use, as well as one at, say, 30 seconds and one at 30 minutes.

You just do the whole speech analysis, and then later on you can go find that word that jumped out at you to help you track that segment down right away so you can edit it. Let me just tell you one other thing you can do. You can right-click on any one of these words and say Copy All, and then you can go to another program, let's say Word, and paste, which would be Paste or Ctrl+V or Command+V, and you can put in all the text, and this could be used, let's say, for closed captioning or for subtitles.
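As a rough illustration of how timed words could feed captions or subtitles, the sketch below joins the words into plain text (similar in spirit to Copy All) and also groups them into cues in the SubRip (.srt) subtitle layout. This is not something the video shows Soundbooth doing: the (text, start, end) tuples, the function names, and the fixed words-per-cue grouping are all assumptions.

```python
def srt_timestamp(seconds):
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    millis = round(seconds * 1000)
    hours, millis = divmod(millis, 3_600_000)
    minutes, millis = divmod(millis, 60_000)
    secs, millis = divmod(millis, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def copy_all(words):
    """Join every word into one plain-text string, dropping the timings."""
    return " ".join(text for text, _start, _end in words)


def to_srt(words, words_per_cue=7):
    """Group timed words into numbered SRT cues of a fixed word count."""
    cues = []
    for number, i in enumerate(range(0, len(words), words_per_cue), start=1):
        chunk = words[i:i + words_per_cue]
        start, end = chunk[0][1], chunk[-1][2]
        text = " ".join(word[0] for word in chunk)
        cues.append(
            f"{number}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text}\n"
        )
    return "\n".join(cues)


if __name__ == "__main__":
    # Hypothetical (text, start, end) timings in seconds.
    words = [
        ("We", 0.0, 0.25), ("hold", 0.25, 0.5),
        ("these", 0.5, 0.75), ("truths", 0.75, 1.5),
    ]
    print(copy_all(words))
    print(to_srt(words, words_per_cue=2))
```

Grouping by a fixed word count keeps the sketch short; real captioning tools usually break cues on pauses or line length instead.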

Then we go back to Soundbooth, just to have that open. So, that's basically how you can do speech analysis, plus you can see, I think, the advantages of speech analysis, knowing that it isn't perfect.

Find answers to the most frequently asked questions about Soundbooth CS5 Essential Training.

Q: After making a recording using Soundbooth CS5, I’ve discovered the stereo channels are reversed: left is right and right is left. I can’t seem to figure out how to swap them in Soundbooth. How can I adjust channels?
Also, is there more advanced audio software that might be better for working with recorded audio than Soundbooth CS5?
A: To swap channels in Soundbooth, right-click the file in the Files panel and choose Insert Channels Into New Multitrack File. That will create a multitrack session with the two channels on separate mono tracks. Pan them left and right to create the swapped channels and then choose Export > Multitrack Mixdown.

A more advanced audio recording, editing, and mixing product is Adobe Audition. The current version 3 is for Windows only. Check out the Audition 3 Essential Training in the Online Training Library.




