Catalogue technologique

Téléchargez le Catalogue Technologique en pdf ici.

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
    Q-Tech-Jouve-HandwritingRecognitionSystem_v02
    Capture handwritten and machine-printed data from documents Lire la suite
     

    Target users and customers

    Everyone who has to deal with forms containing handwritten fields or to process incoming mails

    Application sectors

    • Banking
    • Healthcare
    • Government
    • Administration
     
    audioprint-1
    AudioPrint captures the acoustical properties by computing a robust representation of the sound Lire la suite
     

    Target users and customers

    AudioPrint is dedicated to middle-ware integrators that wish to develop audio fingerprint applications (i.e. systems for live recognition of music on air), as well as synchronization frameworks for second screen applications (a mobile device brings contents directly related to the live TV program). The music recognition application can also be used by digital rights management companies.

    Application sectors

    • Second screen software providers
    • Digital right management
    • Music query software developers
     
    Q-Tech-Jouve-ColorimetricCorrectionSystem
    A specific tool to create a suitable colorimetric correction and check its stability over time Lire la suite
     

    Target users and customers

    Everyone who has to deal with highcolorimetric constraints.

    Application sectors

    • Patrimony
    • Industry
     
    Q-Tech-Vocapia-automatic speechtranscription-visuel
    Vocapia Research develops core multilingual large vocabulary speech recognition technologies* for voice interfaces and automatic audio indexing applications. This speech-to-text technology is available for multiple languages. (* Under license from LIMSI-CNRS) Lire la suite
     

    Target users and customers

    The targeted users and customers of speech-to-text transcription technologies are actors in the multimedia and call center sector, including academic and industrial organizations interested in the automatic mining processing of audio or audiovisual documents.

    Application sectors

    This core technology can serve as the basis for a variety of applications: multilingual audio indexing, teleconference transcription, telephone speech analytics, transcription of speeches, subtitling…

    Large vocabulary continuous speech recognition is the key technology for enabling content-based information access in audio and audiovisual documents. Most of the linguistic information is encoded in the audio channel of audiovisual data, which once transcribed can be accessed using text-based tools.

    Via speech recognition, spoken document retrieval can support random access using specific criteria to relevant portions of audio documents, reducing the time needed to identify recordings in large multimedia
    databases. Some applications are data-mining, news-on-demand, and
    media monitoring.

     
    Vocapia Research provides a language identification technology* that can identify languages in audio data. (* Under license from LIMSI-CNRS) Lire la suite
     

    Target users and customers

    The targeted users and customers of language recognition technologies are actors in the multimedia and call center sectors, including academic and industrial organizations, as well as actors in the defense domain, interested in the processing of audio documents, and in particular if the collection of documents contains multiple languages.

    Application sectors

    A language identification system can be run prior to a speech recognizer. Its output is used to load the appropriate language dependent speech recognition models for the audio document.

    Alternatively, the language identification might be used to dispatch audio documents or telephone calls to a human operators fluent in the corresponding identified language.

    Other potential applications also involve the use of LID as a front-end to a multi-lingual translation system. This technology can also be part of automatic system for spoken data retrieval or automatic enriched transcriptions.

     
    Q-Tech-Jouve-DocumentStructuringSystem
    A generic tool to recognize the logical structure of documents from a OCR stream Lire la suite
     

    Target users and customers

    Everyone who has to deal with electronic document encoding of from the original source material and needs to consider the hierarchical structure represented in the digitized document.

    Application sectors

    • Industry
    • Service
    • Patrimony
    • Administration
     
    Q-Tech-Technicolor-MovieChaptering-visuel
    An Automatic temporal segmentation of video Lire la suite
     

    Target users and customers

    • Content providers
    • ISP (Internet Service Provider)
    • Video editing software companies

    Application sectors

    • Video structuring
    • Video archiving
     
    Hybrid braodcast broadband synchronization_12_09_2013
    Personalized audio, Multi-view on multi-screen, Hybrid stereoscopic 3D TV Lire la suite
     

    Target users and customers

    • Broadcasters (Sat, Cable, Terrestrial)
    • ISP

    Application sectors

    Personalized audio: the technology offers the user the possibility of enjoying a broadcast TV program in his/her favorite language. Additional languages are streamed on demand from a server and can be rendered either on the main TV screen or on a personal device (e.g. smartphone with headphone).

    Multi-view on multi-screen: the user can enrich the broadcast TV program (e.g. music live concert or sport event) by selecting additional points of views rendered on a second screen, e.g. a tablet.

    Hybrid stereoscopic 3D TV: it consists in rendering a 3D side-by-side content without monopolizing a broadcast channel. One view is transmitted over a broadcast network whilst the other view is delivered over Internet. Each view can be rendered independently as a 2D content.

     
    ircambeat
    Ircambeat software estimates the global and time-variable tempo and meter of a music file. It also estimates the positions of the beats and downbeats over time. Lire la suite
     

    Target users and customers

    Tempo and meter of a music file are among the major perceptual characteristics of a music file. Their automatic estimation allows to get these values for large collections of music files. They can therefore be used to perform automatic music classification of large music collections, search by similarity over large music collections and automatic music play-list generation. The technology can therefore benefit to music providers, online music portals or offline media-player developers.

    Beats and downbeats define the time-grid of a music file. They are used as front-end – for the estimation of many other music parameters and – for other processings (time-stretching, segmentation, DJ-ing). The technology for their automatic estimation can therefore benefit to music software developers (music production, music DJ-ing software).

    Application sectors

    • Online music providers
    • Online music portals
    • Music players developers
    • Music software developers
     
    KIT-multimedia-demo-settings
    Identifying actors in movies and TV series Lire la suite
     

    Target users and customers

    • Multimedia Content Providers
    • Movie/TV Streaming Providers
    • Movie/TV Industry Actors

    Application sectors

    • Movie/TV Streaming & Playback
    • Second Screen
     
 
 
Démonstrateurs applicatifs