Catalogue technologique

Téléchargez le Catalogue Technologique en pdf ici.

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
    Q-Tech-Vocapia-automatic speechtranscription-visuel
    Vocapia Research develops core multilingual large vocabulary speech recognition technologies* for voice interfaces and automatic audio indexing applications. This speech-to-text technology is available for multiple languages. (* Under license from LIMSI-CNRS) Lire la suite
     

    Target users and customers

    The targeted users and customers of speech-to-text transcription technologies are actors in the multimedia and call center sector, including academic and industrial organizations interested in the automatic mining processing of audio or audiovisual documents.

    Application sectors

    This core technology can serve as the basis for a variety of applications: multilingual audio indexing, teleconference transcription, telephone speech analytics, transcription of speeches, subtitling…

    Large vocabulary continuous speech recognition is the key technology for enabling content-based information access in audio and audiovisual documents. Most of the linguistic information is encoded in the audio channel of audiovisual data, which once transcribed can be accessed using text-based tools.

    Via speech recognition, spoken document retrieval can support random access using specific criteria to relevant portions of audio documents, reducing the time needed to identify recordings in large multimedia
    databases. Some applications are data-mining, news-on-demand, and
    media monitoring.

     
    Q-Tech-Jouve-DocumentClassificationSystem-visuel
    A generic tool for classifying documents based on a hybrid learning technique Lire la suite
     

    Target users and customers

    Everyone who has to deal with document classification with a large amount of already classified documents.

    Application sectors

    • Industrial property
    • Scientific Edition
     
    ircambeat
    Ircambeat software estimates the global and time-variable tempo and meter of a music file. It also estimates the positions of the beats and downbeats over time. Lire la suite
     

    Target users and customers

    Tempo and meter of a music file are among the major perceptual characteristics of a music file. Their automatic estimation allows to get these values for large collections of music files. They can therefore be used to perform automatic music classification of large music collections, search by similarity over large music collections and automatic music play-list generation. The technology can therefore benefit to music providers, online music portals or offline media-player developers.

    Beats and downbeats define the time-grid of a music file. They are used as front-end – for the estimation of many other music parameters and – for other processings (time-stretching, segmentation, DJ-ing). The technology for their automatic estimation can therefore benefit to music software developers (music production, music DJ-ing software).

    Application sectors

    • Online music providers
    • Online music portals
    • Music players developers
    • Music software developers
     
    Bildschirmfoto vom 2013-07-23 183A153A01
    Automatic speech recognition, also known as speech-to-text, is the transcription of speech into (machine-readable) text by a computer Lire la suite
     

    Target users and customers

    • Researchers
    • Developers
    • Integrators

    Application sectors

    The use of automatic speech recognition is so manifold that it is hard to list here. The main usages today are customer interaction via the telephone, healthcare dictation and usage on car navigation systems and smartphones. These applications will with increasingly better technology extend to audio mining, speech translation and an increased use of human computer interaction via speech.

     
    Q-Tech-Jouve-HandwritingRecognitionSystem_v02
    Capture handwritten and machine-printed data from documents Lire la suite
     

    Target users and customers

    Everyone who has to deal with forms containing handwritten fields or to process incoming mails

    Application sectors

    • Banking
    • Healthcare
    • Government
    • Administration
     
    Q-Tech-Inria-SlopPy-visuel
    Slope One with Privacy Lire la suite
     

    Target users and customers

    The targeted users and customers are all the Internet actors providing personalized services to their users, interested by integrating recommender systems that are more respectful of their privacy.

    Application sectors

    • Personalization
    • Recommender systems
     
    visuel KIT_speech to text
    Transcription of human speech into written word sequences Lire la suite
     

    Target users and customers

    Companies who want to integrate the transcription of human speech into their products.

    Application sectors

    Speech-to-Text technology is key to indexing multimedia content as it is found in multimedia databases or in video and audio collections on the World Wide Web, and to make it searchable by human queries. In addition, it offers a natural interface for submitting and executing queries.

    This technology is further part of speech-translation services. In combination with machine translation technology, it is possible to design machines that take human speech as input and translate it into a new language. This can be used to enable human-to-human combination across the language barrier or to access languages in a cross-lingual way.

     
    KIT-face-recognition-cvhci-demo-MAU-cropped
    Localize and identify faces and estimate age, gender and emotions Lire la suite
     

    Target users and customers

    The targeted users are companies interested in integrating face analysis into their products.

    Application sectors

    • Digital Signage
    • User Interfaces / Human Computer Interaction
    • Entertainment
    • Safety and Security
    • Multimedia Analysis, Search & Retrieval
    • Assistive Techonologies
     
    Q-Tech-Jouve-ImageIdentificationSystem
    A generic tool to identify automatically documents, photos and text zones in scanned images Lire la suite
     

    Target users and customers

    Everyone who has to deal with document recognition like identity cards, passports, invoices…

    Application sectors

    • Administration
    • Banking
    • Insurance
     
    ircamchord-1
    Ircamchord software estimates automatically the temporal succession of music chords (C-Major, C-minor, …) that makes up a piece of music. Lire la suite
     

    Target users and customers

    One of the most important perceptual aspects of popular music is the succession of chords over time. Two tracks based on the same chord succession are perceived very similar and sometimes indicate a cover-version of the same composition. Automatic estimation of chord succession can therefore be used to perform search by similarity and play-list generation.
    It can therefore benefit to music providers, online music portals.

    Chord notation is also very popular for beginner musicians (a very large amount of guitar tabs are accessible and used over the web). Estimating automatically the chord succession of a given track can therefore be beneficial for personal users through the inclusion of the technology in local software.

    Application sectors

    • Online music providers
    • Online music portals
    • Music players developers
    • Music software developers
     
 
 
Démonstrateurs applicatifs