Negent Clean — The missing foundation for your AI

Why Clean is critical

AI can't guess
what's true.

Most companies don't lack data — they lack structured, trustworthy knowledge. Without cleaning, your AI agents start hallucinating the moment the pilot ends.

🔀

Version anarchy

15 versions of the same contract. No one knows which one is valid. Signed? Which one? The "final" or the "REAL_final"?

🧩

Missing context

A contract without its amendments, annexes, and decision history is useless. These relationships exist nowhere in exploitable form.

📉

Zero trust

Your search returns something relevant — but is it the right version? Is it legally binding? Doubt persists and slows decisions.

🔓

ACL violations

Without native ACL enforcement, "smart search" becomes a security breach. A user can retrieve content they shouldn't see.

🏷️

Metadata chaos

Inconsistent naming, missing tags, heterogeneous taxonomies. Automation fails. Humans spend hours just finding things.

📈

Doesn't scale

POC works. Production fails. Without observability, metrics, and continuous improvement, your AI assistant stays a fragile prototype.

What Clean delivers

Your documents are a liability.
Clean makes them an asset.

Clean doesn't just index. It arbitrates, reconciles, and governs — so your AI answers with the right version, from the right scope, for the right user.

Negent in action One document selected — its card revealed ↓

Negent AI

Enterprise Intelligence

Dashboard

Sources

Conversations

Administration

Organisation

Taxonomies

Groups & Rules

Master

Monitoring

Prompts

☰

Total Documents

6 852

✓

Categorized

6 613

Action needed

239

NAME / SOURCE UPLOAD DATE SIZE STATUS ACTIONS

S-AMAR_Annex_16.1__Executed_LNTP_1.pdf

Contractual Documentation › Contract

Mar 30, 10:28

606 KB

⋯

AMAR_Annex_5.1_-_Updated_Time_Schedule.pdf

Technical Report › Technical Report

Mar 30, 10:27

844 KB

⋯

S-AMAR_Amendment_3_Executed.docx.pdf

Contractual Documentation › Amendment

Mar 30, 10:29

505 KB

⋯

S-AMAR_Annex_11_-_Updated_Certificates.docx.pdf

Tender › Award

Mar 30, 10:28

373 KB

⋯

S-AMAR_Amendment_2_v221129_Rev_VF.docx.pdf

Contractual Documentation › Amendment

Mar 30, 10:28

541 KB

⋯

S-PT_1.1.1_-_Grid_Requirement_-_P73_2020.pdf

Technical Report › Aucune

Mar 30, 10:29

2.7 MB

⋯

S-PT_1.1.1.1_-_Grid_Code_and_Scope_of_works.pdf

Technical Report › Technical Report

Mar 30, 10:29

1.4 MB

Error

⋯

S-PT_1.1.1_-_Grid_Requirement_-_DL172_2006.pdf

Statutory meetings › None

Mar 30, 10:29

492 KB

⋯

S-PT-EPC_Contract_Gensun_AMAR_VF.docx.pdf

Contractual Documentation › Contract

Mar 30, 10:22

1.7 MB

⋯

S-PT_2.2.2_-_HSE_-_Penalties.pdf

HSSE › Aucune

Mar 30, 10:30

238 KB

⋯

S-PT_2.2_-_HSE_Requirements_Cover_Page.pdf

HSSE

Mar 30, 10:30

167 KB

⋯

S-AMAR_Amendment_2_v221129_Rev_VF.docx.pdf

Document Card — Negent AI Enterprise Intelligence

Categorisation

Contractual Documentation avenant

Document Identity

Titre Amendment #2 to the Lump Sum Contract — Solar Park of Amar

Langue Français (fr)

Nature Contractual amendment

Purpose Modifier les termes du contrat EPC initial pour le parc solaire d'Amar.

Business Scope

Anchor type

projetcontrat

Projet Solar Park of Amar

Companies AMAR, UNIPAR LDA · VOLTEX PVS, S.A.

Phase Construction

Temporality & Status

Doc date 11 January 2023

Signed 23 July 2021

Statut Final Signé Approuvé

Semantic summary

« Avenant #2 daté du 11 January 2023 — modifie le contrat EPC initial signé le 23 July 2021. Révise l'Advance Payment (Art. 8.1) et les délais LNTP 2 au plus tard le 1er mars 2023, Notice to Proceed au plus tard le 30 June 2023. »

Single Source of Truth

One canonical reference per content "family" — no more contradictions, no more competing versions. Clean selects, justifies, and audits every promotion.

Canonical promotion

Semantic Foundation

clean embeddings layer that gives your AI agents reliable, up-to-date sources. Encrypted text, normalized metadata, semantic + full-text index.

AI-ready index

Knowledge Graph

Intelligently connect your content — amendments, contracts, annexes — to reconstruct full business context. Relations traced, explained, and correctable.

Relationship mapping

Continuous Governance

Your taxonomy and business rules apply in real time across your entire document flow. ACLs propagated to document level. Automatic delta. Full audit trail.

Delta - ACL - HITL

The Doc Card

Each document becomes
actionable intelligence.

For every processed file, Clean produces a complete identity card — structured, comparable, and queryable by your AI without ever opening the source document.

Categorisation

Contractual Documentation avenant

Document Identity

TitreAmendment #2 to the Lump Sum Contract — Solar Park of Amar

LangueFrançais (fr)

NatureContractual amendment

Sous-typeAmendment to EPC contract

PurposeAmend the terms of the initial EPC contract for the Amar solar park.

Business Scope

Anchor type

projetcontrat

ProjetSolar Park of Amar

CompaniesAMAR, UNIPAR LDA
VOLTEX PVS, S.A.

Contrat réf.Lump Sum Contract — Solar Park of Amar

PhaseConstruction

Acteurs

ÉmetteursAMAR, UNIPAR LDA · VOLTEX PVS, S.A.

Rôles

EmployerContractorParties

Temporalité

Doc date11 January 2023

Signed23 July 2021

Avenant #111 February 2022

LNTP 21 March 2023 (planned)

NTP30 June 2023

Statut Formel

StatutFinal

SignéOui

ApprouvéOui

PertinenceHaute

Signaux discriminants

Amendment #2 Contrat signé: 23/07/2021 Avenant #1: 11/02/2022 Avenant #2: 11/01/2023 Modification Advance Payment Art. 8.1 Commencement of Work LNTP 2: 01/03/2023 NTP: 30/06/2023

Relations documentaires

↗Lump Sum Contract — Solar Park of Amar

↗Annex 16.1 LNTP 1

↗Annex 16.2 LNTP2 Template

Rev_VF (final version)Potential translation

Semantic summary

« Cet avenant contractuel (Amendment #2), daté du 11 January 2023, modifie le contrat initial de construction du parc solaire d'Amar signé le 23 July 2021. Il révise l'Advance Payment et l'article 8.1 relatif au Commencement of Work, incluant les délais LNTP 2 au plus tard le 1er mars 2023 et Notice to Proceed au plus tard le 30 June 2023. »

Category & identity — without opening the file

Every document receives a business category, a normalised title, and an automatically extracted purpose. Your teams and AI know exactly what it is — before opening it.

Business taxonomy · Normalisation

Actors, dates and formal status captured

Who signed, when, with what legal validity. Critical metadata is extracted, structured and comparable across all your documents — regardless of origin.

Extraction · Structuring

The document chain reconstructed

Parent contract, amendments, annexes — inter-document relations are traced automatically. Your AI queries the full context, not an isolated fragment stripped of its history.

Relationship mapping · Graph

The semantic summary — your LLM's fuel

The AI queries the card to find, then the document to answer. Analysing a Doc Card costs 100× less than a raw 80-page document. Less noise, lower costs, greater reliability.

RAG · LLM-ready

×100

Analysing a Doc Card costs 100× less than a raw 80-page document for your LLM. Fewer tokens, fewer calls, more reliable answers at scale.

Configuration

Configure Clean for your business.
Zero code required.

Taxonomy, classification rules, metadata fields, extraction prompts — all editable in the interface, versioned and reactivatable at any time. Your business teams stay in control, without depending on IT.

Structure

Tender ▶

Rules/RFP

Award

Package

+ Add Subcategory

Statutory meetings ▶

Financial Statement approval

Shareholders decisions ●

Board decisions

+ Add Subcategory

Land Register ▶

Technical Report ▶

Studies ▶

Business Plan ▶

Claims ▶

HSSE ▶

Financial Report ▶

Lender Audit Report ▶

Subcategory Details

Active Component

Name

Shareholders decisions

Code (Identifier)

SHR-DCN

Description

Décisions d'assemblée / shareholders resolutions : approbations, nominations, opérations sur capital. Mots-clés : résolution, shareholders meeting, assemblée des actionnaires, approval, decision, minutes / procès-verbal. À ne pas confondre : Board decisions, Articles of association.

Metadata Fields

Define data points specifically extracted for this component.

+ Add Field

date

Date ▾

✓

Required

Description

Decision date

Options / Examples

2024-01-01

+ Define a new metadata field…

①

Your vocabulary, not a generic one

Categories reflect exactly how your teams name things: "EPC Amendment" rather than "Amendment", "Site Report" rather than "Document". Your people recognise their corpus instantly — and so does your AI.

Categories · Subcategories · Identifier codes

②

Classification rules in plain language

Describe how to tell a contract from an amendment, a board resolution from minutes. Add keywords, examples, edge cases. Clean applies these rules uniformly across your entire corpus.

Description · Keywords · Edge cases

③

Per-category typed metadata fields

Each subcategory defines its own fields: a decision date for board meetings, an amount for invoices, a project phase for site reports. These fields feed directly into every Doc Card.

Date · Amount · List · Required field

Return on investment

Clean isn't a cost.
It's a recovery.

Beyond AI quality, Clean delivers measurable gains from the first weeks — across five concrete operational dimensions.

-70%

Coûts de
stockage

ROT data eliminated

×100

Réduction
coûts LLM

Doc Card vs raw document

96.5%

Précision
classification

Reliable AI answers

3×

Recherche
plus rapide

Clean corpus

Document sans
permission

ACL propagées

Reduced storage costs

40 to 70% of enterprise data is ROT — Redundant, Obsolete, Trivial. Clean identifies and qualifies every file for deletion or archiving, directly reducing storage, backup and indexation costs.

Storage · Backup · Indexation

Leaner backups from the pilot phase

Indexation costs reduced proportionally

eDiscovery 3× faster on a clean corpus

Optimised LLM costs

The AI queries Doc Cards to find, then documents to answer. A card costs 100× less to analyse than a raw 80-page document — fewer tokens, fewer calls, less noise in context.

Tokens · API calls · Latency

Token consumption reduced at scale

Unnecessary calls eliminated by pre-filtering

Context transmitted more precisely, less noisy

Your path to
an AI-ready foundation.

Six steps. No file migration. A reliable, secure, governed foundation — built on top of what you already have.

Sources

Where your content lives.

Connect Negent to your existing systems — SharePoint, emails, servers, ECM, knowledge tools. Read-only. No disruption. Your workflows stay untouched.

Native connectors · Read-only mode · ACLs captured at connection

Scope

Scope Configuration

Teach Negent your language. Define your own rules, tags, and taxonomies — the platform adapts to how your business actually works, not the other way around.

Custom Taxonomy · Versioning rules · Confidence thresholds

Secure Extraction

Your data stays yours

Only essential text and metadata are temporarily extracted, encrypted in transit, then processed. Your source files remain secure in your infrastructure.

TTL/purge · Minimal storage · Encryption in transit · Optional BYOK

Foundation Build

Your AI-ready core.

The system normalizes formats, cleans noise, segments content, and creates embeddings to enable semantic search across your entire corpus, regardless of source.

Normalization · Semantic chunking · Embeddings · Text index

Unification & Resolution

One truth. No contradictions.

Negent resolves semantic conflicts automatically, surfaces version families, and maps relationships across related content. Where ambiguity remains, human validation steps in.

Deduplication · Version chains · HITL on at-risk content

Governance

Continuous Loop

Your foundation stays reliable as your business evolves. Every new piece of content or rule change is automatically propagated. Logs, SLAs, drift alerts — full observability.

Delta sync · Audit trail · Targeted reprocessing · Full observability

Negent Architecture

Source SystemsSharePoint - Drive - ECM - Network - Intranet

See Clean ↗

↓ connectors - read-only - ACLs captured

Negent CleanTruth Layer - AI-ready Index - Business Graph

You are here ↗

↓ trusted corpus - native ACLs - embeddings

Negent IntelligenceSource RAG - Semantic Search - Records

Discover ↗

↓ robust - traceable - governed foundation

Negent AgentiqueAutomated Actions - Workflows - Agents

Coming soon ⋯

Clean builds the foundation.
The rest becomes possible.

Intelligence and Agentic can't scale on broken ground. A RAG built on a chaotic corpus doesn't fix the chaos — it amplifies it, answering confidently from the wrong source.

Activate Clean first and you create the conditions for the hardest transition in AI: from POC to production. Reliable, versioned, traceable — with permissions enforced at document level.

→ Source files never leave your environment.

FAQ

We have the answers
you're looking for.

Does Negent Clean replace our ECM/SharePoint ?

No. Clean connects to your source systems without replacing or touching them. Your files stay exactly where they are. It builds a truth layer on top of your existing repositories — an intelligent index, not a migration. Your DMS keeps running exactly as before. Clean just makes it AI-ready.

Do we need perfect metadata or a finalized taxonomy before we start ?

No — and that's the point. Clean is designed for imperfect corpora: inconsistent naming, incomplete tags, fragmented environments. It infers structure, enriches metadata, and proposes a taxonomy from what exists. You refine the rules over time through the scope configurator.

Where is our data stored with Clean ?

Source files remain in your systems (SharePoint, Drive, servers). Clean never moves them. Only metadata, the semantic index, and the relationship graph are stored in Negent — with at-rest and in-transit encryption, tenant isolation, and propagated ACLs. You maintain full control: your documents never leave your infrastructure.

How does the corpus stay reliable over time ?

Delta mode keeps it current without rebuilding from scratch. Clean continuously detects changes in your source systems and reprocesses only the affected families — nothing more. Rule change? Only impacted families are updated. Full observability through logs, SLAs, queues, and drift alerts means you always know the state of your foundation.

How long before Clean delivers results ?

Days, not months. From the connection and inventory phase — typically on a pilot scope — you get a document chaos report, identified duplicate families, and reconstructed version chains. Concrete evidence before any full deployment decision.

Why not just use SharePoint Search or our existing ECM?

Native tools (SharePoint, ECM) index everything without arbitration. Result: your search returns 15 versions of the same contract without telling you which one is binding. Clean solves this upstream: it identifies the authoritative reference, reconstructs version chains, detects semantic duplicates, and applies your business rules. SharePoint Search indexes. Clean governs.

Who decides when Clean detects a conflict or ambiguity?

Clean does — until it shouldn't. Clear-cut cases are resolved automatically based on your business rules. When ambiguity crosses a threshold — two signed versions dated close together, conflicting metadata — Clean triggers Human-in-the-Loop mode. A team expert is notified and validates the call in the interface. Automation handles the volume. Humans handle the judgment calls.

Contact

Let's talk about
your corpus.

Request a demo on your own scope. We assess your document chaos level and show you what Clean delivers — before any commitment.

First name & Last name

Work Email

Company

Estimated corpus size

Context & objective

🕐

Response within 24 hours

A Negent expert will reach out to scope your needs and propose a demo tailored to your environment.

🔎

Free diagnostic

Before any commitment, we produce a document chaos report on a sample of your corpus.

🔒

NDA available

For sensitive discussions, a confidentiality agreement can be signed from the very first contact.

Get your data right.
Then do AI.

AI can't guess
what's true.

Version anarchy

Missing context

Zero trust

ACL violations

Metadata chaos

Doesn't scale

Your documents are a liability.
Clean makes them an asset.

Single Source of Truth

Semantic Foundation

Knowledge Graph

Continuous Governance

Each document becomes
actionable intelligence.

Configure Clean for your business.
Zero code required.

Your vocabulary, not a generic one

Classification rules in plain language

Per-category typed metadata fields

Clean isn't a cost.
It's a recovery.

Your path to
an AI-ready foundation.

Where your content lives.

Scope Configuration

Your data stays yours

Your AI-ready core.

One truth. No contradictions.

Continuous Loop

Clean builds the foundation.
The rest becomes possible.

We have the answers
you're looking for.

Let's talk about
your corpus.

Request sent

Build your AI
on a real fondation.

Get your data right.Then do AI.

AI can't guesswhat's true.

Version anarchy

Missing context

Zero trust

ACL violations

Metadata chaos

Doesn't scale

Your documents are a liability. Clean makes them an asset.

Single Source of Truth

Semantic Foundation

Knowledge Graph

Continuous Governance

Each document becomesactionable intelligence.

Configure Clean for your business.Zero code required.

Your vocabulary, not a generic one

Classification rules in plain language

Per-category typed metadata fields

Clean isn't a cost.It's a recovery.

Your path toan AI-ready foundation.

Where your content lives.

Scope Configuration

Your data stays yours

Your AI-ready core.

One truth. No contradictions.

Continuous Loop

Clean builds the foundation.The rest becomes possible.

We have the answersyou're looking for.

Let's talk aboutyour corpus.

Request sent

Build your AIon a real fondation.

Negent Agentic

Get your data right.
Then do AI.

AI can't guess
what's true.

Your documents are a liability.
Clean makes them an asset.

Each document becomes
actionable intelligence.

Configure Clean for your business.
Zero code required.

Clean isn't a cost.
It's a recovery.

Your path to
an AI-ready foundation.

Clean builds the foundation.
The rest becomes possible.

We have the answers
you're looking for.

Let's talk about
your corpus.

Build your AI
on a real fondation.