Breaking News

Leica Ultravid 8x32 HD-Plus Special Edition in brown leather BIOSTAR INTRODUCES THE BIRPL-PAT INDUSTRIAL MOTHERBOARD Sony Electronics and The Associated Press complete testing of advanced In-Camera authenticity technology TEAMGROUP Launches T-FORCE SIREN GD120S AIO SSD Cooler - An Exceptional AIO M.2 2280 SSD Liquid Cooler COUGAR Introduces the Hotrod – a motorsports-inspired gaming chair designed to support extreme gaming performance

logo

  • Share Us
    • Facebook
    • Twitter
  • Home
  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map

Search form

Microsoft Advances Conversation Transcription Using Virtual Microphone Arrays

Microsoft Advances Conversation Transcription Using Virtual Microphone Arrays

Enterprise & IT May 11,2019 0

Microsoft Research's 'Project Denmark' Technology you to use the microphones in phones and laptops to create a virtual array that can handle real-time conversation transcription and more.

Announced at Build 2019, the new Conversation Transcription capability, part of Microsoft's Azure Speech Service., allows real-time transcription of multi-user conversations with automatic speaker attribution.

While smart speakers are commercially available today, most of them can only handle a single person’s speech command one at a time and require a wake-up word before issuing such a command. The Azure Speech Service, available in preview today, is enhanced by the availability of audio-only or audio-visual microphone array devices via Microsoft's referenced Devices SDK (DDK).

The Conversation Transcription capability expands Microsoft’s existing Azure speech service to enable real-time, multi-person, far-field speech transcription and speaker attribution. Paired with a Speech DDK, Conversation Transcription can recognize conversational speech for a small group of people in a room and generate a transcription handling common yet challenging scenarios such as “cross-talk”.

Microsoft is engaging with selected customers and Systems Integration (SI) partners such as Accenture, Avanade and Roobo to customize and integrate the Conversation Transcription solution in US and China respectively.

The Conversation Transcription capability utilizes multi-channel data including audio and visual signals from a Speech DDK that is codenamed Princeton Tower. The edge device is based on Microsoft's reference-designed 360-degree audio microphone array or 360-degree fisheye camera with audio-visual fusion to support improved transcription. The edge device sends signals to Azure cloud for neural signal processing and speech recognition. Audio-only microphone array DDKs can be purchased from http://ddk.roobo.com. Advanced audio-visual microphone array DDKs are available from Microsoft's SI partners.

Microsoft's latest research progress (Project Denmark) enables dynamic creation of a virtual microphone array with a set of existing devices such as mobile phones or laptops equipped with an ordinary microphone. The virtual microphone array combines existing devices like mobile phones or laptops equipped with an ordinary microphone like Lego blocks into a single larger array dynamically. Project Denmark can potentially help Microsoft's customers more easily transcribe conversations anytime and anywhere using Azure speech services, with or without a dedicated microphone array DDK. Future application scenarios are broad. For example, Microsoft may pair up multiple Microsoft Translator applications to help multiple people communicate more effectively using mobile phones to minimize language barriers.

Accurate speech transcription is very difficult if the domain vocabulary such as acronyms is unavailable. To solve for this, Microsoft is extending Azure custom speech recognition capabilities and enabling organizations to create custom speech models using their Office 365 data. For Office 365 enterprise customers opting in for this service, Azure can automatically generate a custom model leveraging Office 365 data such as Contacts, Emails, and Documents in a completely eyes-off, secure and compliant fashion. This delivers more accurate speech transcription on organization-specific vernacular such as technical terms and people names.

Tags: MicrosoftMicrosoft azure
Previous Post
Toshiba Nominates non-Japanese Directors to New Board
Next Post
Cooler Master SK621 Wireless Bluetooth Mechanical Keyboard Now Available

Related Posts

  • Activision Blizzard King to Team Xbox

  • NVIDIA Studio Lineup Adds RTX-Powered Microsoft Surface Laptop Studio 2

  • Samsung and Microsoft Unveil First On-Device Attestation Solution for Enterprise

  • Introducing Xbox Game Pass Core, Coming This September

  • Announcing the next wave of AI innovation with Microsoft Bing and Edge

  • Microsoft Announces Security Copilot AI

  • Microsoft breaks new ground in healthcare with the next evolution of AI

  • ChatGPT is now available in Azure OpenAI Service

Latest News

Leica Ultravid 8x32 HD-Plus Special Edition in brown leather
Consumer Electronics

Leica Ultravid 8x32 HD-Plus Special Edition in brown leather

BIOSTAR INTRODUCES THE BIRPL-PAT INDUSTRIAL MOTHERBOARD
Enterprise & IT

BIOSTAR INTRODUCES THE BIRPL-PAT INDUSTRIAL MOTHERBOARD

Sony Electronics and The Associated Press complete testing of advanced In-Camera authenticity technology
Cameras

Sony Electronics and The Associated Press complete testing of advanced In-Camera authenticity technology

TEAMGROUP Launches T-FORCE SIREN GD120S AIO SSD Cooler - An Exceptional AIO M.2 2280 SSD Liquid Cooler
Cooling Systems

TEAMGROUP Launches T-FORCE SIREN GD120S AIO SSD Cooler - An Exceptional AIO M.2 2280 SSD Liquid Cooler

COUGAR Introduces the Hotrod – a motorsports-inspired gaming chair designed to support extreme gaming performance
Gaming

COUGAR Introduces the Hotrod – a motorsports-inspired gaming chair designed to support extreme gaming performance

Popular Reviews

Pioneer BDR-S13U-X Blu-Ray Recorder

Pioneer BDR-S13U-X Blu-Ray Recorder

Arctic Liquid Freezer II 360 Α-RGB

Arctic Liquid Freezer II 360 Α-RGB

Pioneer BDR-X13U-S

Pioneer BDR-X13U-S

Pioneer BDR-XD08UMB-S External Blu-Ray Recorder

Pioneer BDR-XD08UMB-S External Blu-Ray Recorder

Verbatim External 4K Slimline Blu-Ray Recorder

Verbatim External 4K Slimline Blu-Ray Recorder

Surefire KINGPIN M2 Keyboard

Surefire KINGPIN M2 Keyboard

Samsung 970 EVO Plus 2TB NVME SSD

Samsung 970 EVO Plus 2TB NVME SSD

Crucial X8 4TB PortableSSD

Crucial X8 4TB PortableSSD

Main menu

  • Home
  • News
  • Reviews
  • Essays
  • Forum
  • Legacy
  • About
    • Submit News

    • Contact Us
    • Privacy

    • Promotion
    • Advertise

    • RSS Feed
    • Site Map
  • About
  • Privacy
  • Contact Us
  • Promotional Opportunities @ CdrInfo.com
  • Advertise on out site
  • Submit your News to our site
  • RSS Feed