Breaking News

ASUSTOR at Computex 2026 Exceed the Infinite with New ASRock X870E Taichi White Motherboard Fanatec unveils new products and performance upgrades at Spring Showcase LG Electronics Introduces First UltraGear evo Hyper Mini LED 5K Gaming Monitor CORSAIR Launches ThermalProtect PCIe 5.1 600W 12V-2x6 Cable to Help Protect GPUs from Overheating

logo

  • Share Us
    • Facebook
    • Twitter
  • Home
  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map

Search form

Toshiba's Voice Recognition Technology Can Distinguish Multiple Individual Speakers Without Training

Toshiba's Voice Recognition Technology Can Distinguish Multiple Individual Speakers Without Training

Enterprise & IT Oct 25,2016 0

Toshiba has developed a technology capable of precisely distinguishing the voices of individual speakers in real time, even when multiple persons are speaking at the same time.

Typically, voice recognition reliability falls off when multiple people speak simultaneously. While technologies have been developed for separating simultaneous speech, the acoustic characteristics of locations where conversations take place and recording environment factors, such as the positioning of the speakers, have required provision of dozens of minutes of recordings to realize training for optimal separation.

Toshiba claims that its new technology achieves precise, real-time identification of speakers and separate voice capture, even when many voices are trying to be heard, and delivers high-precision recognition and transcription of each speaker, using a microphone array embedded in a single sound input device.

High-precision transcription alleviates the need for manually keeping minutes in business meetings, and allows an increased focus on analysis of customer opinions and the improvement of staff manuals. Transcriptions of meetings with customers from overseas can also be used for automatic translation systems.

A problem with previous sound-source separation systems is that they require many minutes of pre-recorded speech for system training in order create a sufficiently precise separation filter for each speech source (person). Toshiba's method replaces this time-consuming direct learning for filter creation with learning of the spatial characteristics representing speaker position information from the positioning of the microphones. This achieves high-performance separation supported by continuous filter updates according to the environment, and approximately double the separation precision of previous techniques. Toshiba says that when separating simultaneous speech of two speakers, the amount of suppression of the of the second person's speech was improved from 3 to 9 dB - an approximate doubling.

In operation, the new system rapidly determines the relative positioning of speakers through matching an association table for sound direction to the time difference at which sound arrives from speakers attached to each microphone. This technique allows the capture and separation of each individual's voice, even when there are simultaneous utterances and without any previous recordings at the location.

Toshiba keeps working on the new technology, and plans to include it to its 2017 RECAIUS cloud-based service that supports various human activities for understanding the intentions of humans in audio and visual recordings.

Tags: Toshiba
Previous Post
Emporio Armani Launches A Hybrid Smartwatch
Next Post
Alexa Skill To Lets You Control Your Harmony Hub-based Universal Remotes

Related Posts

  • Toshiba Canvio Flex Portable Hard Drive, Now in Metallic Blue

  • Toshiba Begins Sampling of 30-34 TB SMR Nearline Hard Disk Drives

  • World Backup Day 2026: A Backup Doesn’t Always Need to be in the Cloud

  • Toshiba to Showcase High-Performance AI and Petabyte-Scale Storage Solutions at Cloudfest 2026

  • Asustor AS5404T 4-Bay NAS System

  • Toshiba Storage Trends 2026

  • Toshiba launches S300 AI surveillance HDD for AI-driven video applications

  • Toshiba First in Industry to Verify 12-Disk Stacking Technology for Hard Drives

Latest News

ASUSTOR at Computex 2026
Enterprise & IT

ASUSTOR at Computex 2026

Exceed the Infinite with New ASRock X870E Taichi White Motherboard
PC components

Exceed the Infinite with New ASRock X870E Taichi White Motherboard

Fanatec unveils new products and performance upgrades at Spring Showcase
Gaming

Fanatec unveils new products and performance upgrades at Spring Showcase

LG Electronics Introduces First UltraGear evo Hyper Mini LED 5K Gaming Monitor
Gaming

LG Electronics Introduces First UltraGear evo Hyper Mini LED 5K Gaming Monitor

CORSAIR Launches ThermalProtect PCIe 5.1 600W 12V-2x6 Cable to Help Protect GPUs from Overheating
Enterprise & IT

CORSAIR Launches ThermalProtect PCIe 5.1 600W 12V-2x6 Cable to Help Protect GPUs from Overheating

Popular Reviews

Akaso 360 Action camera

Akaso 360 Action camera

Dragon Touch Digital Calendar

Dragon Touch Digital Calendar

be quiet! Pure Loop 3 280mm

be quiet! Pure Loop 3 280mm

Noctua NF-A12x25 G2 fans

Noctua NF-A12x25 G2 fans

Arctic Liquid Freezer III 360 Pro Argb

Arctic Liquid Freezer III 360 Pro Argb

Soft2bet and the unseen hardware that makes instant play possible

Soft2bet and the unseen hardware that makes instant play possible

Crucial T710 2TB NVME SSD

Crucial T710 2TB NVME SSD

JSAUX 65Wh Rog Ally Battery

JSAUX 65Wh Rog Ally Battery

Main menu

  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map
  • About
  • Privacy
  • Contact Us
  • Promotional Opportunities @ CdrInfo.com
  • Advertise on out site
  • Submit your News to our site
  • RSS Feed