Resources

Success Stories

Discover how leading companies use Innodata’s solutions and platforms to turn data into a valuable commodity.

Data Annotation

Objective

A leading scientific information provider required labeled data for content related to drug discovery and research funding.

Solution

  • Innodata created structured labeled data in XML from structured and unstructured data sources.
  • Professional teams of bio-medical and chemistry experts labeled millions of pages of scientific patent information and created structured data for predictive and prescriptive analytics platforms.
  • We used advanced entity extraction, text and data mining, augmented by professional scientific SMEs to create high-quality labeled scientific datasets.

Results

  • Innodata provided comprehensive information that has helped the client build and maintain reliable, up-to-date information of global patterns that matter.
  • The patent analytics platform is used by leading global drug and chemical manufacturers in efforts to avoid patent infringement, identify potential investments in new drug development and research fund attribution.

Objective

Post-market adverse drug reaction surveillance is a mandatory regulatory responsibility of pharmaceutical companies. The adverse reaction data is hidden in the scientific content published in research journals.

Thus, a leading pharmacovigilance platform required labeled datasets for training AI models based on this information in order to help its clients improve their drug safety workflow processes.

Solution

  • Innodata setup a team of bio-medical subject matter experts to label unstructured data; the adverse drug reactions are published across various journals and written as scientific research abstracts.
  • Innodata’s medical experts annotated the abstracts from scientific journals broken into individual words with a customized labeling application.
  • The SMEs studied the abstracts and labeled individual words to a controlled ontology of medical terms to create an annotated dataset for machine learning.

Results

  • Innodata’s labeled datasets have helped the company build machine learning models that predict adverse drug events with high accuracy and enables pharma companies to navigate the increasingly complex and ever-changing regulatory environment.
  • Today, our client can confidently offer its customers a fully compliant, configurable, easy-to-adopt and audit-ready solution for scientific literature monitoring and pharmacovigilance. 

Objective

A leading AI photography platform required annotated data for photographs. In order to teach the tool to recognize “context” from its photos, the company needed large volumes of footage with detailed labeling, and required it within 24 hours.

Solution 

  • Innodata gathered a team of annotation resources, setup the infrastructure and trained the team in 4 weeks.
  • An initial POC test was conducted with multiple vendors, Innodata ranked #1.

Results

  • The client has an on-demand, flexible and scalable data annotation team on-call.
  • Client has been able to meet the market demands effectively.
  • To-date, Innodata has processed 2.5 million images in 20 weeks and delivered the batches in near real-time.

Data Transformation

Objective

A global financial institution was facing millions of dollars in penalties due to non–compliance issues. Therefore, our client needed custom regulatory data feeds to support their global compliance teams.

Solution

  • Innodata partnered with the client and setup a team of 280 dedicated legal experts.
  • SEC Regulatory law and other data were automatically extracted from over 15 years of legacy data from 68 web sources.
  • Innodata built a ML-enabled solution for entity recognition, labeling and content structuring to the client’s specific XML schema.
  • All linked data in the regulations were identified and a custom linked dataset was created.
  • The complete database was built in less than 24 months, during which we processed millions of records.

Results

  • The client has access to a customized regulatory law database with accurate information.
  • The data feed is continuously updated each day from 280 global SEC regulation web sources.
  • The ML-enabled data solution powers the predictive analytics engine that is now used by their global compliance teams to successfully manage compliance.

Objective

With a massive volume of industry-specific contracts being signed by multiple parties, it’s paramount that our client is able to stay informed of the latest changes in order to stay compliant with regulatory changes.

Solution

  • Innodata employed a team of lawyers with expertise in analyzing complex derivative contracts like ISDA, GSLA, and SLA.
  • Our team extracted over 1200 data points in ISDA contracts and tagged them according to our clients’ schema
  • Our team of derivative lawyers unraveled critical data from these contracts for predictive and prescriptive analytics platforms.

Results
The data created by our SMEs has been successfully used within predictive analytics systems in investment banks for regulatory compliance/margin management/risk management.

Objective
A leading US publisher needed to extract specific data points from contracts while taking a systematic approach to rights management for managing risk.

Solution

  • Innodata employed a team of paralegal SMEs to analyze the contracts and identify the data points to be extracted.
  • We leveraged advanced entity extraction, text and data mining, augmented by our team of accredited legal experts to extract critical contract data points from contract documents with high accuracy.
  •  

Results
The data points are ingested into the contracts intelligence platform to enable descriptive and predictive analytics for more precise contract risk management.

Data Curation

Objective
A leading business intelligence enterprise wanted to maintain a competitive edge in the marketplace by offering the most up-to-date company information to help its customers guide decision making and stay ahead of industry trends.

Solution

  • Innodata’s engineering team implemented a robust and technology-driven process for the research and normalization of company information data from company websites and financial reports (e.g., 10K and annual reports).
  • To ensure timeliness, an account team consisting of financial experts was created to provide the client with daily updates.

Results

  • The client is a global preferred source of company data with the most accurate information.
  • The client has substantially reduced the overall cost of operations with a proven process customized to their business needs.

Objective
One of the most respected advisory firms sought to overhaul its database with clean, accurate, verified business information.

Solution

  • Innodata analyzed the database and divided the data into 3 buckets of top, mid and low priority company datasets.
  • 16,000 companies in top priority were cleaned up in 4 weeks.
  • 150,000 companies in the mid priority were completed in 12 weeks; the remaining were completed in another 16 weeks.

Results

  • Our client has a clean database with accurate information.
  • High quality data has led to improved customer satisfaction and brand reputation.
  • Our client is now able to add more companies to the database periodically, providing a richer product to the market.​

Objective
A global database provider wanted to increase coverage of its product database with updated information and create comprehensive product descriptions and company profiles for target suppliers from various industry domains to enhance searchability.

Solution

  • Innodata provided a team of highly trained engineers with writing skills and product expertise across various domains.
  • SMEs viewed product catalogues and scripted product descriptions and marketing content.
  • The team aimed to increase scope of the content covered in the database and ensure update cycles are more frequent for existing products.

Results

  • Our client is now a leading source of data for manufacturers and buyers of engineering products.
  • 500K+ supplier profiles, 6 million+ products, 300K+ white papers and articles are on the platform.
  • Scalable delivery model enables continued business growth.​

Intelligent Automtation

Objective

Our client needed to adopt open industry standards for metadata management, ensure interoperability with vendors and third party aggregators and publish adopted and originated standards content in a timely and effective manner.

Solution

  • Workflow integration with Content Management System (CMS) enabling role-based access and cross-functional user task flow.
  • CMS capable of handling multiple formats – audio, video, PDF, Word, XML, etc.
  •  Creation of XML schema to standardize feed ingestion from SDO’s –ISO/IEC/CEN/CLC.
  • Integration of Smart Content Repository (multi-channel delivery application)  for both data and content delivery.

Results

  • Innodata provided comprehensive information that has helped the client build and maintain reliable, up-to-date information of global patterns that matter.
  • The patent analytics platform is used by leading global drug and chemical manufacturers in efforts to avoid patent infringement, identify potential investments in new drug development and research fund attribution.

Objective

A renowned global geospatial information provider tasked us with implementing more efficient processes and workflows to ensure the delivery of critical geophysical intelligence to the market in a timelier manner.

Solution

  • Innodata analyzed the client’s entire information distribution process and examined the steps being taken to deliver the current product.
  • Innodata rationalized the process incorporating Six Sigma principles
  • All non-value add and value-added steps were segregated.
  • We applied a combination of AI-enabled classification for the information extraction and classification, and implemented RPA to automate the manual tasks.

Results

  • Today, our client is successfully able to deliver information to its Oil and Energy clients 50% faster.
  • The combination of RPA and AI has delivered excellent automation results.
  • Their previously cumbersome workflow process has been streamlined to 12 steps, of which 11 are fully automated.

Objective

One of the world’s most respected professional associations relied on a high degree of manual labor to deliver bibliographic information in industry standard XML formats and needed to eliminate inefficient business processes.

Solution

  • Technology modernization & process consulting – Workflow integration with Content Management System (CMS) enabling role-based access and cross-functional user task flow.

  • Source acquisition and ingestion fully automated.

  • Remodelling of data and introduction of modular entities to achieve linkage and eliminate redundancy.

  • Creation of XML Schema for each of the entities to standardise feed ingestion from various sources into a unified format.

Results

  • Increased productivity through improved business processes performed through centralized workflows.

  • Increased automation – reduction in TAT by 30%.

  • Smart & linked content achieved through efficient data model.

  • Hassle free & fully automated delivery to various distribution partners in requested formats.

Recent Articles

Take the next step

Contact us
Request a demo
Speak With An Expert
(NASDAQ: INOD) Innodata is a leading data engineering company. Prestigious companies across the globe turn to Innodata for help with their biggest data challenges. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of over 3,000 subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of digital data and ubiquitous AI.

Contact

  • 55 Challenger Road, Suite 202 Ridgefield Park, New Jersey 07660
  • 201-371-8000
Scroll to Top