Legal·Data Annotation Service·Australia

Structured extraction labels for an enterprise contract platform

OCR-grade key-value extraction across 90K commercial contracts – clause typing, party tagging, and renewal flags at 99.2% field-level accuracy.

Stack of printed contract documents next to a laptop on a desk

90K

Contracts processed

99.2%

Field-level accuracy

Challenge

An enterprise contract-lifecycle platform serving in-house legal teams needed labelled training data for structured extraction across 28 contract types – party identification, clause classification, key-date tagging, and renewal-condition flags – at a quality bar that justified replacing manual review.

Free-text variation in commercial agreements was breaking the platform's previous extraction approach, with field-level accuracy stalling around 91% on the messier contract families.

Approach

We assembled an annotation team led by paralegals with corporate-law backgrounds and built a custom labelling tool that highlighted ambiguous clauses for second-pass review. Schema design was iterated jointly with the platform's ML and legal-product leads, with edge-case rulings captured in a versioned playbook.

Every document went through automated pre-OCR, primary annotation, and a sampling-based QA layer that audited 10% of completed work blind.

Outcome

Delivered 90K labelled contracts at 99.2% field-level accuracy on the validation set, lifting the platform's production extraction model from 91% to 97.6% on a customer-blind benchmark.

Their largest customer cut average contract-review time per document from 41 minutes to 9 – the cornerstone metric in their renewal pitch and a direct lever on a A$3.4M ARR retention.

More case studies

View all

Luxury hotel lobby with guests being attended to at the concierge desk

Hospitality·Thailand

24/7 AI concierge for a luxury hotel group

Busy urban street with vehicles, pedestrians, and lane markings – the kind of scene 2D autonomous-driving perception models train on

Mobility·Japan

2D image annotation for an autonomous-driving programme

Clinician reviewing medical imaging scans on a workstation display

Healthcare·APAC

Pixel-grade segmentation for diagnostic medical AI

Ready to Get Started

Let's build what's next

Share your challenge – AI, data, or infrastructure. We'll scope your project and put the right team on it.

Start a Conversation See Case Studies