EU AI Act Art.84: Commission Evaluation, Regulatory Evolution, and Long-Term Compliance Strategy for Developers (2026)
EU AI Act Article 84 governs something that compliance programmes often treat as background noise: the Regulation itself changes over time. Art.84 creates a structured, legally mandated process through which the European Commission must evaluate the Regulation's application and report to the European Parliament and Council — with the explicit authority to recommend amendments to Annex I, Annex III, Art.5, Art.99, and the conformity assessment framework.
For developers, Art.84 is not just procedural furniture. It is the mechanism by which an AI system classified as non-high-risk today can become high-risk tomorrow, by which a practice permitted under Art.5 today can become prohibited, and by which penalty thresholds that shape compliance investment decisions can be revised upward. Building a compliance programme without accounting for Art.84's review cycle is building on a foundation that will shift.
Art.84 applies from the Regulation's general date of application, 2 August 2026. The first major Commission evaluation is due three years later, by 2 August 2029.
Art.84 in the Final Provisions Architecture
Art.84 sits in the EU AI Act's closing chapters alongside Art.85 (right of recourse), Art.86 (transparency), and the final transitional provisions. Its position reflects its function: it is not about day-to-day compliance but about the long-term governance of the regulatory framework itself.
The article creates a feedback loop between implementation experience and regulatory content. The Commission, informed by market surveillance data, AI Board evaluations, NCA reports, and stakeholder input, must assess whether the original Annex categories remain appropriate as AI technology and deployment practice evolve. This is a deliberate design choice: the drafters recognised that AI capability advances faster than legislative cycles, and Art.84 is the mechanism to close that gap without requiring full re-legislation each time.
| Layer | Instrument | Art.84 Scope |
|---|---|---|
| Techniques | Annex I | New AI approaches not covered at entry into force |
| Use-case risk | Annex III | High-risk classification list and thresholds |
| Prohibitions | Art.5 | Whether new practices should be prohibited |
| Enforcement | Art.99 | Penalty threshold proportionality |
| Conformity | Art.43–46 | Whether procedures remain adequate |
| GPAI | Art.51–55 | GPAI model provision effectiveness |
| Governance | AI Office, NCAs | Resource adequacy and implementation effectiveness |
Art.84(1): The Core Review Obligation
Art.84(1) establishes the primary review cycle: three years after the date of application (i.e., by 2 August 2029), and every four years thereafter (2033, 2037, ...). The Commission must evaluate and report to the European Parliament and Council on eight distinct scope items.
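The cadence is simple enough to encode directly. A minimal sketch of the schedule described above (dates taken from this section, not from an official calendar):

```python
from datetime import date

# Art.84(1) schedule as described above: first report three years after the
# date of application (2 August 2026), then every four years thereafter.
APPLICATION_DATE = date(2026, 8, 2)

def art84_report_dates(count: int) -> list[date]:
    """Return the due dates of the first `count` Art.84 evaluation reports."""
    first = APPLICATION_DATE.replace(year=APPLICATION_DATE.year + 3)
    return [first.replace(year=first.year + 4 * i) for i in range(count)]

print(art84_report_dates(3))  # 2029-08-02, 2033-08-02, 2037-08-02
```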
(a) Annex I — AI Techniques and Approaches
Annex I currently defines AI systems by reference to machine learning (supervised, unsupervised, reinforcement), logic- and knowledge-based approaches, statistical approaches, Bayesian estimation, search and optimisation methods, and similar techniques. The Commission must assess whether this list remains technically accurate — whether new paradigms (e.g., neuro-symbolic architectures, foundation model fine-tuning methods, quantum-assisted learning) should be added.
Developer implication: A system not currently classified as an AI system under Annex I may be reclassified in 2029 if it uses a technique added to the Annex. Compliance programmes must monitor Annex I amendment proposals in the 2027–2028 pre-review period and assess whether products fall within scope of proposed expansions.
(b) Annex III — High-Risk AI Systems List
This is the most commercially significant review scope item. Annex III currently lists eight categories of high-risk AI use cases: biometric identification and categorisation, critical infrastructure management, education and vocational training, employment and worker management, access to essential services, law enforcement, migration and border control, and administration of justice.
The Commission must evaluate:
- Whether the existing categories remain appropriately calibrated
- Whether the threshold for high-risk classification within each category is set correctly
- Whether new use cases should be added — for example, AI systems used in healthcare diagnostics, autonomous vehicle control, or financial credit decisions are candidates for inclusion
- Whether any current categories should be narrowed or removed based on actual risk evidence from market surveillance
The reclassification risk is asymmetric. Systems added to Annex III become subject to the full high-risk compliance regime (QMS, technical documentation per Annex IV, conformity assessment, EU database registration). Systems removed gain significant compliance relief. Given the trajectory of AI deployment across critical societal functions, additions are far more likely than removals.
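The compliance delta from an Annex III addition can be made explicit in tooling. A hedged sketch; the obligation labels are illustrative shorthand for the regime named above, not an exhaustive statement of it:

```python
# Illustrative shorthand for the high-risk obligations named in this section.
HIGH_RISK_OBLIGATIONS = {
    "quality management system",
    "technical documentation (Annex IV)",
    "conformity assessment",
    "EU database registration",
}

def obligations_delta(current_obligations: set[str]) -> set[str]:
    """Obligations newly triggered if a product is reclassified into Annex III."""
    return HIGH_RISK_OBLIGATIONS - current_obligations

# A currently non-high-risk product with only basic documentation in place:
print(sorted(obligations_delta({"technical documentation (Annex IV)"})))
```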
(c) Art.99 Penalty Thresholds
The Commission must assess whether the Art.99 penalty structure remains proportionate: EUR 35M or 7% of global turnover for prohibited practices, EUR 15M or 3% for other obligations, and EUR 7.5M or 1.5% for supplying incorrect, incomplete or misleading information to notified bodies and authorities. This review works in both directions: if enforcement experience shows companies absorbing penalties as a cost of doing business, thresholds may increase; if penalties prove disproportionate for SMEs, adjustments may introduce more granular tiers.
Developer implication: Compliance investment calculations that rely on penalty thresholds as one input to risk-based decision-making should be updated after each Art.84 review cycle.
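The Art.99 tiers cap fines at the higher of a fixed amount and a share of global turnover, so exposure scales with company size. A minimal sketch using the figures quoted above:

```python
def max_fine_eur(fixed_eur: float, turnover_pct: float,
                 global_turnover_eur: float) -> float:
    """Art.99-style cap: the higher of the fixed amount and the turnover share."""
    return max(fixed_eur, turnover_pct * global_turnover_eur)

# Prohibited-practice tier (EUR 35M / 7%) for a EUR 2bn-turnover undertaking:
print(max_fine_eur(35_000_000, 0.07, 2_000_000_000))  # 140000000.0
```

For smaller undertakings the fixed amount dominates; for large ones the turnover share does, which is exactly why threshold revisions under Art.84 shift the risk calculus differently across company sizes.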
(d) Art.5 — Prohibited AI Practices
The Commission must review whether the eight current categories of prohibited AI practices remain adequate or whether additional practices should be prohibited. Current prohibitions cover subliminal manipulation, vulnerability exploitation, social scoring, real-time remote biometric identification in publicly accessible spaces (with narrow law-enforcement exceptions), emotion recognition in workplace and education settings, biometric categorisation to infer sensitive attributes, predictive policing based solely on profiling, and untargeted scraping of facial images to build facial recognition databases.
Candidates for future extension — based on AI Board recommendations and public consultation input from 2024–2025 — include:
- Behavioural prediction systems used in insurance underwriting
- AI-driven content ranking systems with addictive design as a primary objective
- Autonomous lethal weapons decision systems in non-military contexts
Providers whose systems operate near the margins of current Art.5 categories should treat this as an active risk horizon, not a resolved question.
(e) Conformity Assessment Procedures
The Commission must assess whether the Art.43–46 conformity assessment framework — self-assessment via internal control for most high-risk systems, third-party notified body assessment for specific categories such as remote biometric identification — remains adequate. If conformity assessment proves insufficient to assure safety in practice, new or additional procedures may be mandated.
(f) Implementation, Market Surveillance, and Enforcement
This scope item focuses on the operational effectiveness of the framework: are NCAs adequately resourced? Are Art.74–80 surveillance procedures working? Is enforcement consistent across member states? This is the feedback channel through which under-resourced NCA functions or divergent enforcement approaches feed back into legislative correction.
(g) General Purpose AI Model Provisions
Art.84(1)(g) directly addresses the GPAI provisions in Art.51–55. Given that GPAI regulation was added late in the legislative process and reflects a rapidly changing technology landscape, the Commission must evaluate whether the classification thresholds (10²⁵ FLOPs cumulative training compute for the systemic-risk presumption), the obligations framework, the AI Office's enforcement capacity, and the Code of Practice system are working as intended.
This is the highest-velocity review item. Foundation model capability and deployment patterns are evolving faster than any other segment of the AI landscape. The 2029 review will occur after three years of AI Office enforcement experience and may result in substantial amendments to the GPAI regime.
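Because the systemic-risk presumption rests on a single numeric threshold, it is worth treating as a configurable parameter rather than a hard-coded constant. A sketch; the value reflects the current presumption quoted above and is exactly the kind of figure an Art.84 cycle may revise:

```python
# Current presumption threshold for GPAI systemic risk (cumulative training
# compute, in FLOPs). Treat as a parameter subject to review, not a constant.
SYSTEMIC_RISK_FLOP_THRESHOLD = 1e25

def presumed_systemic_risk(training_compute_flop: float,
                           threshold: float = SYSTEMIC_RISK_FLOP_THRESHOLD) -> bool:
    """True if a GPAI model is presumed to pose systemic risk under the threshold."""
    return training_compute_flop >= threshold

print(presumed_systemic_risk(3e25))  # True
print(presumed_systemic_risk(8e24))  # False
```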
(h) General Need to Amend
The Commission retains a catch-all authority to recommend amendments to the Regulation as a whole based on the overall evaluation.
Art.84(2): First Evaluation Report — Additional Scope
The first evaluation report (due 2 August 2029) carries additional requirements beyond the standard Art.84(1) scope. Art.84(2) mandates that the first report also address:
(a) AI Office Implementation
The AI Office, established under the Commission as the primary GPAI enforcer and Art.84 coordination function, must be evaluated for implementation effectiveness. This includes: whether the AI Office's enforcement powers are adequate, whether the staffing and resource allocation matches the workload, and whether the relationship between the AI Office and NCAs is functioning correctly.
(b) NCA Resources
Article 57 required member states to designate NCAs by August 2025. The 2029 review will assess whether the NCAs that have been operational for three years are adequately resourced to perform their Art.74–80 surveillance and enforcement functions. Persistent NCA resource gaps — a well-documented issue in GDPR DPA enforcement — are likely to be a significant finding.
(c) GPAI Model Provision Implementation
In addition to the ongoing Art.84(1)(g) review, the first report must specifically address whether the GPAI provisions as a whole are being implemented correctly — a recognition that the GPAI regime is the most novel element of the Regulation and warrants particular first-cycle scrutiny.
(d) Adequacy of Delegated and Implementing Acts
The Commission has issued numerous delegated acts (under Art.6(6), Art.7(1)–(3), Art.51(2), Art.52(4), Art.53(3), Art.97) and implementing acts since entry into force. The first review must assess whether these secondary legislation instruments are calibrated correctly and whether gaps exist that require further secondary legislation.
The AI Board's Role in the Art.84 Process
Art.84(3) specifies that the Commission's report must take account of the AI Board's evaluation. The AI Board — composed of one senior NCA representative from each member state — is positioned as the primary source of implementation feedback for the Art.84 review. This creates a structured path from NCA enforcement experience to regulatory amendment:
- NCAs conduct Art.74–80 surveillance and enforcement
- NCAs report findings to the AI Board
- The AI Board aggregates findings and formulates an evaluation
- The Commission incorporates the AI Board evaluation into the Art.84 report
- The report may recommend Annex or article amendments
- Parliament and Council act on recommendations
For developers, this means that the evidence base for Art.84 amendments is being built now, through every enforcement action, MSA investigation, and corrective measure issued under the current framework. The 2029 review will reflect three years of accumulated enforcement data.
Art.84 and the CLOUD Act Risk Multiplier
For organisations storing compliance documentation on US-cloud infrastructure, Art.84 introduces a compounding risk dimension. The CLOUD Act (Clarifying Lawful Overseas Use of Data Act) allows US law enforcement to compel US-based cloud providers to disclose data stored outside the United States without notifying the data subject or the EU data controller.
Art.84 makes this risk dynamic rather than static. As the Commission amends Annex I, Annex III, or the conformity assessment framework:
- Existing documentation must be updated to reflect new classification status
- New conformity assessments may be required for systems reclassified as high-risk
- New documentation formats may be mandated if Annex IV or conformity assessment procedures change
If technical documentation, QMS records, conformity assessment reports, and Annex IV materials are stored on US infrastructure under CLOUD Act jurisdiction, each regulatory amendment cycle creates a new disclosure window: US law enforcement can demand updated documentation at any point in the compliance refresh cycle. The risk is not just static disclosure — it is disclosure of the current compliance posture at the precise moment when it is most in flux, during a reassessment triggered by an Art.84 amendment.
EU-incorporated infrastructure providers with no US parent entity sit outside CLOUD Act jurisdiction by design. Storing compliance documentation with such providers reduces regulatory-change exposure to the compliance task itself, without this compounding legal risk layer.
Practical Compliance Strategy: Building for Regulatory Horizon
Art.84 transforms compliance from a point-in-time exercise into a continuous programme. The following strategic posture accounts for the Art.84 review cycle:
1. Annex III Watch Programme
Establish an internal process to monitor Commission consultation documents, AI Board opinions, and European Parliament reports in the 18 months preceding each Art.84 review cycle. Annex III amendment proposals typically circulate in draft form before formal publication. Early identification of likely reclassifications allows time for technical documentation pre-preparation, conformity assessment scheduling, and product roadmap adjustment before mandatory compliance dates.
The 2028–2029 window is the first critical horizon. By Q1 2028, organisations should have an Annex III reclassification impact assessment in place covering their current product portfolio.
2. Annex I Technology Monitoring
New AI paradigms entering Annex I expand the scope of what counts as an AI system. Organisations using techniques that are not currently on the Annex I list — including hybrid neuro-symbolic models or emerging quantum-assisted approaches — should monitor AI Office technical guidance and AI Board opinions for signals that these techniques are being evaluated for Annex I inclusion.
3. Art.5 Margin Analysis
Systems that operate near the margins of current Art.5 categories require active risk tracking. If the product uses behavioural inference, social scoring, or emotion recognition in contexts adjacent to (but currently outside) the prohibited categories, an Art.84 extension of Art.5 could require product withdrawal or fundamental redesign. Identifying this risk early — before an Art.84 amendment cycle — allows orderly strategic response rather than emergency compliance reaction.
4. Conformity Assessment Procedure Flexibility
Compliance programmes that assume self-assessment will remain available for a given high-risk category should maintain readiness for third-party notified body engagement. If the Art.84 review determines that self-assessment has been inadequate for specific Annex III categories, mandatory third-party assessment may be introduced for those categories in the next regulatory cycle.
5. Penalty Threshold Sensitivity Analysis
Financial risk models that include penalty exposure as a variable should incorporate a scenario analysis for penalty threshold increases. If the Commission finds that current thresholds have insufficient deterrent effect based on enforcement data, the 2029 review may recommend upward adjustment. A 50% increase in the EUR 35M/7% threshold for prohibited practices would materially alter the expected value calculation for non-compliance decisions.
Python ReviewTracker: Monitoring Art.84 Compliance Horizon
```python
from dataclasses import dataclass, field
from datetime import date, timedelta
from typing import Optional
from enum import Enum


class ReclassificationRisk(str, Enum):
    HIGH = "HIGH"            # Near-margin for Annex III inclusion or Art.5 extension
    MEDIUM = "MEDIUM"        # Potentially in scope of known consultation proposals
    LOW = "LOW"              # No current signals of reclassification
    MONITORED = "MONITORED"  # On watch list, no current active signal


@dataclass
class AnnexIIIWatchItem:
    product_name: str
    current_classification: str  # "non-high-risk", "high-risk-[category]", "GPAI", "general"
    reclassification_risk: ReclassificationRisk
    risk_rationale: str
    consultation_signals: list[str] = field(default_factory=list)
    last_reviewed: Optional[date] = None
    next_review_due: Optional[date] = None

    def days_to_review(self) -> Optional[int]:
        if self.next_review_due:
            return (self.next_review_due - date.today()).days
        return None

    def is_overdue(self) -> bool:
        return self.next_review_due is not None and self.next_review_due < date.today()


@dataclass
class Art84ReviewCycleTracker:
    # Class-level constants (no type annotation, so not dataclass fields).
    APPLICATION_DATE = date(2026, 8, 2)
    REVIEW_CYCLE_YEARS = 4

    first_review_date: date = date(2029, 8, 2)
    watch_items: list[AnnexIIIWatchItem] = field(default_factory=list)

    def days_to_first_review(self) -> int:
        return (self.first_review_date - date.today()).days

    def pre_review_window_start(self) -> date:
        return self.first_review_date - timedelta(days=548)  # approx. 18 months

    def is_in_pre_review_window(self) -> bool:
        return date.today() >= self.pre_review_window_start()

    def high_risk_watch_items(self) -> list[AnnexIIIWatchItem]:
        return [i for i in self.watch_items
                if i.reclassification_risk == ReclassificationRisk.HIGH]

    def add_watch_item(self, item: AnnexIIIWatchItem) -> None:
        self.watch_items.append(item)

    def review_summary(self) -> dict:
        return {
            "days_to_first_review": self.days_to_first_review(),
            "in_pre_review_window": self.is_in_pre_review_window(),
            "pre_review_window_start": self.pre_review_window_start().isoformat(),
            "high_risk_items": len(self.high_risk_watch_items()),
            "total_watch_items": len(self.watch_items),
            "overdue_reviews": sum(1 for i in self.watch_items if i.is_overdue()),
        }


def build_sample_tracker() -> Art84ReviewCycleTracker:
    tracker = Art84ReviewCycleTracker()
    tracker.add_watch_item(AnnexIIIWatchItem(
        product_name="CreditScoringEngine v3",
        current_classification="non-high-risk",
        reclassification_risk=ReclassificationRisk.HIGH,
        risk_rationale=("AI Board 2025 Opinion on financial services AI flagged credit "
                        "scoring for Annex III addition. Commission consultation Q3 2027 "
                        "expected."),
        consultation_signals=["AI Board Opinion 2025/3", "EBA technical guidance draft 2026"],
        last_reviewed=date(2026, 8, 15),
        next_review_due=date(2027, 8, 15),
    ))
    tracker.add_watch_item(AnnexIIIWatchItem(
        product_name="HealthDiagnosticAssistant v2",
        current_classification="non-high-risk",
        reclassification_risk=ReclassificationRisk.MEDIUM,
        risk_rationale=("Medical diagnostic AI not yet in Annex III but within scope of "
                        "Art.84 consultation topics. MDR intersection creates parallel "
                        "regulatory track."),
        consultation_signals=["Commission Staff Working Document 2026/SWD/142"],
        last_reviewed=date(2026, 9, 1),
        next_review_due=date(2027, 3, 1),
    ))
    return tracker


if __name__ == "__main__":
    tracker = build_sample_tracker()
    summary = tracker.review_summary()
    print(f"Days to first Art.84 review: {summary['days_to_first_review']}")
    print(f"In pre-review window: {summary['in_pre_review_window']}")
    print(f"Pre-review window opens: {summary['pre_review_window_start']}")
    print(f"High-risk reclassification items: "
          f"{summary['high_risk_items']}/{summary['total_watch_items']}")
    for item in tracker.high_risk_watch_items():
        print(f"  [{item.reclassification_risk.value}] {item.product_name}: "
              f"{item.risk_rationale[:80]}...")
```
Art.84 vs Adjacent Provisions
Art.84 is the primary long-term review mechanism, but it intersects with several other evolutionary provisions:
| Provision | Mechanism | Timescale |
|---|---|---|
| Art.84 | Comprehensive review → recommend amendments | First after 3 years, then every 4 years |
| Art.7(1) | Commission can amend Annex III by delegated act without full review | Ongoing — triggered by specific conditions |
| Art.6(6) | Commission can amend Annex II by delegated act | Ongoing |
| Art.97 | General delegated act authority (5-year period Aug 2024–Jul 2029) | Within 5-year window |
| Art.71 | Delegated act scrutiny by Parliament/Council (3 months) | Per delegated act |
The key distinction is procedural scope: delegated acts under Art.7(1) can amend Annex III without an Art.84 review cycle if the Commission determines that new AI uses satisfy the existing criteria for high-risk classification. Art.84 is the vehicle for broader structural assessment, but Annex III can expand between Art.84 cycles through delegated acts.
This means the 3-year review cycle is not the only reclassification horizon. Organisations must monitor both: the formal Art.84 cycle and interim delegated act proposals from the Commission that can amend Annex III on a rolling basis.
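Operationally, both channels belong in one watch calendar. A sketch with hypothetical interim delegated-act proposal dates (illustrative only; no such proposals are asserted here):

```python
from datetime import date

# Channel 1: formal Art.84 cycle dates (from the schedule in this article).
art84_cycle = [date(2029, 8, 2), date(2033, 8, 2)]
# Channel 2: hypothetical interim Art.7(1) delegated-act proposal dates.
delegated_act_proposals = [date(2027, 3, 1), date(2028, 6, 15)]

# One merged reclassification horizon, sorted chronologically.
watch_calendar = sorted(
    [(d, "Art.84 review") for d in art84_cycle]
    + [(d, "Art.7(1) delegated act") for d in delegated_act_proposals]
)
for due, channel in watch_calendar:
    print(f"{due.isoformat()}  {channel}")
```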
Art.84 in the EU AI Act Enforcement Arc
Viewing the Chapter VIII–IX arc as a whole, Art.84 closes the regulatory feedback loop that begins with Art.57 (NCA designation) and runs through market surveillance, enforcement, and penalty:
Art.57 NCA designation
→ Art.74–80 market surveillance + enforcement
→ Enforcement experience accumulates in AI Board
→ Art.84 Commission report synthesises evidence
→ Annex I/III/Art.5/Art.99 amendments
→ New compliance obligations for providers
→ Art.74–80 surveillance of updated obligations
→ cycle repeats
For a developer, this arc means that compliance is a living obligation, not a one-time certification event. A system that achieves full conformity on 2 August 2026 must be re-evaluated against any Annex III or Art.5 amendments that result from the 2029 Art.84 cycle. The conformity assessment performed in 2026 certifies compliance with the 2026 Regulation — it does not certify compliance with the 2029 post-amendment Regulation.
10-Item Developer Checklist: Art.84 Regulatory Horizon
- Annex III exposure map — Document which current products would be affected by the most likely Annex III extension candidates identified in AI Board opinions and Commission consultation documents.
- Annex I technology audit — Identify whether any products use AI techniques not currently listed in Annex I; assess likelihood of inclusion in the 2029 review.
- Art.5 margin analysis — For products using behavioural inference, social scoring-adjacent functions, or emotion recognition, assess distance from current prohibition scope and monitor AI Board guidance.
- Pre-review calendar entry — Set an internal horizon date of 1 February 2028 (18 months before the first review) as the trigger for a structured Annex III reclassification impact assessment.
- GPAI reclassification monitoring — If products include or depend on GPAI models, monitor AI Office technical guidance for changes to the systemic risk threshold (currently 10²⁵ FLOPs) and Code of Practice updates.
- Penalty threshold sensitivity — Include Art.99 threshold increase scenarios (e.g., +50%, +100%) in financial risk models for compliance investment decisions.
- Conformity assessment readiness — Maintain notified body relationships and pre-assessment readiness for product categories currently relying on self-assessment, in the event third-party assessment becomes mandatory.
- CLOUD Act documentation posture — Ensure compliance documentation (QMS records, technical documentation, conformity assessment reports) is stored on infrastructure not subject to CLOUD Act compelled disclosure, reducing risk during regulatory update and re-assessment cycles.
- Delegated act monitoring — Establish an ongoing process (not limited to Art.84 cycles) to monitor Art.7(1) and Art.6(6) delegated acts that can amend Annex III and Annex II between formal review cycles.
- Compliance programme versioning — Maintain dated compliance programme versions keyed to the applicable Regulation version; update the programme upon each Annex amendment, delegated act, or Art.84 cycle change.
See Also
- EU AI Act Art.83: Substantial Modification to High-Risk AI Systems — Conformity Re-Assessment
- EU AI Act Art.71: Exercise of Delegation — Commission Delegated Acts Framework
- EU AI Act Art.70: Penalties and Administrative Fines
- EU AI Act Art.57: National Competent Authorities Designation
- EU AI Act Art.74: Market Surveillance and Control of High-Risk AI Systems