San Jose, 5th April 2016 – Intelligent Voice®, a leading specialist in voice and text analysis solutions, officially announced the death of traditional audio and video formats at this year’s NVidia® GTC show.
Traditionally, to understand what is contained in an audio or video file, a user has to listen to or view the whole file. This can be time-consuming, especially for longer files.
Intelligent Voice® has long specialised in the ultra-high-speed extraction of text from speech using its NVidia® GPU-accelerated decoding platform. It has taken this expertise and combined it with advanced machine learning and natural language techniques to produce the SmartTranscript™.
SmartTranscript™ is a single, self-contained HTML file that contains not only the original audio/video but also a transcript of what has been said. This is augmented by Intelligent Voice’s patent-pending topic extraction technology, which gives an instant snapshot of the key things said in the file. This allows the user to engage with the content immediately, understanding what has been said, and to navigate intuitively from topic to topic, guided by the “karaoke” playback function. Within seconds, a user can understand the key topics in audio or video without needing to listen to the whole file. Laborious manual metadata tagging is instantly rendered redundant.
Because this file is self-contained, it replaces the traditional MP3 or MP4 file: it can be emailed or sent via a file sharing link. It can be viewed without an internet connection, making it well suited to confidential or private environments. As it is part text, part audio, it can be indexed as easily as a text document, with no special indexing tools required, while maintaining all of the features of the original audio or video file.
“With the SmartTranscript™ we are allowing people to ‘see’ their audio for the first time,” says Nigel Cannings, CTO of Intelligent Voice®. “For too long, audio communication has been consigned to the ‘too hard’ pile: we aim to make it as accessible as any text document.”
Intelligent Voice® has also been chosen to take part in this year’s Early Stage Challenge, for the chance to win $100,000 in NVidia’s annual competition to find the most interesting early-stage companies using advanced GPU technology. Intelligent Voice® is exhibiting in the “Emerging Companies Pavilion” at #GTC16.