Data Annotation

For large-scale data processing needs, a professional annotation team converts raw data into usable data:

  1. Data classification and cleaning: including evaluation content classification, picture classification, adult-content filtering, valid speech screening, etc.
  2. Data calibration and evaluation: including text grammar checks, image relevance evaluation, search relevance evaluation, sentiment evaluation, etc.
  3. Data content extraction: including image text extraction, text keyword extraction, speech-to-text transcription, web page summary writing, etc.

Text Annotation

The Data Center of Tuosi provides more than 20 kinds of text and speech data resources covering more than 50 languages, as well as multilingual data annotation services such as page relevance and sentiment annotation. In addition, our experienced multilingual international project team offers deeper processing of text corpora, such as classification and topic induction, to meet natural language processing requirements at different levels.
We also hold full intellectual property rights to large-scale, high-quality text corpora that can be licensed for customers' engineering applications, such as multilingual machine translation parallel corpora, mobile phone messages, and emails.
Our text annotation services mainly include the following types:

Text phonetic notation
Annotation of text corpora for word segmentation, prosody, entities, part of speech, syntax, grammar, and semantics
Sentiment annotation of text corpora
Topic and event induction
Semantic disambiguation annotation
Multilingual machine translation and transcription of corpora

Image Annotation

Based on customers' specific requirements and large-scale data processing needs, the annotation team provides services such as filtering, classification, and annotation, and converts source data into usable data.

  1. Data classification and cleaning: including evaluation content classification, picture type classification, junk content cleaning, valid speech screening, etc.
  2. Data validation and evaluation: including text grammar checks, image relevance evaluation, search relevance evaluation, sentiment assessment, etc.
  3. Data content extraction: including image text extraction, text keyword extraction, speech-to-text transcription, web page summary writing, etc.

Video Annotation

According to the customer's specific requirements, we provide the following video annotation services:

Video subject classification
Character and object attribute annotation
Subject trajectory analysis
Subject heading annotation
Image start point annotation

Phonetic Annotation

India Tuosi has an international multilingual transcription team whose members come from various language backgrounds and are professionally trained in transcription. The team can transcribe speech in more than 110 languages and process more than 6,000 hours of speech per month.
As customer demand grows, our speech data processing capacity grows with it.
To meet specific transcription and annotation requirements, we can also develop and provide customized speech transcription/annotation tool software, which improves data processing efficiency and accuracy and speeds up delivery.

Phonetic Data Transcription Service

In addition to standard transcription, we offer specialized transcription services to meet customers' growing needs in model training, research, and algorithm testing:
Orthographic transcription
Phonetic transcription based on a phonetic alphabet such as SAMPA or X-SAMPA, reflecting the actual pronunciation
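
To illustrate the difference between the two transcription types, here is a minimal Python sketch that converts an X-SAMPA string into IPA symbols. The mapping table is a small hand-picked subset of X-SAMPA, and the function name and record layout are hypothetical, chosen for illustration only; they are not a description of the tools used in our service.

```python
# Small, illustrative subset of the X-SAMPA to IPA correspondence.
# A complete table covers many more symbols and diacritics.
XSAMPA_TO_IPA = {
    "tS": "tʃ",
    "dZ": "dʒ",
    "S": "ʃ",
    "Z": "ʒ",
    "T": "θ",
    "D": "ð",
    "N": "ŋ",
    "@": "ə",
    "{": "æ",
    "I": "ɪ",
    "U": "ʊ",
    "E": "ɛ",
    "O": "ɔ",
    "V": "ʌ",
    "A": "ɑ",
    ":": "ː",
    '"': "ˈ",
    "%": "ˌ",
}

# Try longer X-SAMPA symbols first so that "tS" is not read as "t" + "S".
_KEYS_BY_LENGTH = sorted(XSAMPA_TO_IPA, key=len, reverse=True)


def xsampa_to_ipa(text):
    """Convert an X-SAMPA string to IPA using the subset table above.

    Characters without an entry (for example "p", "t", "k") are copied
    unchanged, since X-SAMPA and IPA share them.
    """
    out = []
    i = 0
    while i < len(text):
        for key in _KEYS_BY_LENGTH:
            if text.startswith(key, i):
                out.append(XSAMPA_TO_IPA[key])
                i += len(key)
                break
        else:
            out.append(text[i])
            i += 1
    return "".join(out)


# Hypothetical record pairing an orthographic and a phonetic transcription.
record = {
    "orthographic": "thing",
    "xsampa": "TIN",
    "ipa": xsampa_to_ipa("TIN"),
}
print(record)
```

The orthographic field holds the standard spelling, while the X-SAMPA field records how the word was actually pronounced using plain ASCII characters, which is why such alphabets are convenient for keyboard-based transcription.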

Phonetic data annotating service

To support modeling and testing for speech synthesis, phonetic recognition, and speech recognition, we offer the following annotation services:
Text to speech (TTS): phoneme segmentation and labeling, and stress and prosodic labeling of synthesized speech data.
Word and word-boundary annotation.
Background environment and noise annotation (ASR).
Speaker role annotation, etc.
Multilingual ToBI annotation.
Annotation of speakers, actions, etc. in video data.
