Private Generated Connector Pilot Source Selection — 2026-04-23

add-polyfill-connector-systemProject noteWorking notes

Created Apr 25, 2026openspec/changes/add-polyfill-connector-system/design-notes/private-generated-connector-source-selection-2026-04-23.mdView on GitHub →

Status: owner recommendation (project-scoped, non-normative)
Purpose: choose the right kind of source for the first generated/private pilot.

Depends on:

agent-generated-custom-connectors-open-question-2026-04-23.md
private-generated-connector-pilot-brief-2026-04-23.md
pdpp-trust-model-framing.md

Question

What source shape gives the first generated/private connector pilot the best chance of producing evidence instead of noise?

This note is not choosing a specific product yet. It is choosing the selection criteria and the default owner posture for that choice.

Recommendation

Choose a source that is:

personally valuable
single-surface
snapshot-like or append-only
easy to verify before extraction
rich enough to feel real
simple enough that failure teaches us something local

That points toward a first pilot that looks more like:

a history list
a transaction/activity table
a saved-items/archive list
a personal ledger or exports page

and less like:

a full inbox
an edited/deleted chat product
a highly relational productivity workspace
a source that requires heavy anti-bot heroics

Why this matters

The first pilot is not trying to prove:

marketplace-grade reuse
broad source coverage
rich mutable-state sync
deep semantic understanding

It is trying to prove:

a generated/private artifact can be PDPP-conformant
it can have honest rerun semantics
a generic consumer can still browse, page, filter, and search the data

So the source should maximize signal on those three questions and minimize orthogonal chaos.

The wrong first-source traps

1. Mutation trap

Sources with frequent edits, deletes, merges, or thread reshaping make it too easy to confuse:

bad connector generation
bad incremental semantics
source mutation itself

Examples of bad first-pilot shapes:

chat inboxes
collaborative docs/tasks with constant edits
issue trackers with lots of state mutation

2. Ontology trap

Some sources are only useful after a long modeling exercise.

That is bad for pilot one because it becomes unclear whether failure was about:

agent authoring
schema design
or the connector/runtime lane itself

3. Anti-bot trap

If staying authenticated already requires brittle automation tricks, then failure tells us too much about browser evasion and not enough about PDPP.

4. Consumer-usability trap

A source may be easy to scrape but still produce data that no generic consumer can meaningfully use without source-specific logic.

That is also a bad pilot, because generic downstream usability is one of the core questions we are trying to answer.

Selection rubric

Score each candidate 1–5 on each dimension.

A. Owner value

Would the owner actually care about keeping and querying this data later?

Good signals:

recurring personal usefulness
audit/history value
likely to be queried again

B. Stable identity

Can records get a stable key cheaply and honestly?

Best:

source-native durable ids

Acceptable:

obvious synthesized key from durable fields

Bad:

no stable identity without fuzzy matching

C. Incremental tractability

Can reruns be described honestly as either:

append-only since checkpoint, or
snapshot refresh of a clearly bounded surface?

Best:

obvious "new items since X"
or honest whole-surface snapshot semantics

D. Verifier tractability

Can a verifier cheaply determine whether:

the session is valid
the right surface is loaded
extraction should proceed

Best:

obvious page invariant
obvious authenticated identity cue

E. Schema tractability

Can an agent produce a useful schema without a heroic ontology exercise?

Best:

small number of obvious fields
mostly flat record shape

F. Generic-consumer usability

If the connector emits truthful schema and field metadata, can a generic consumer do something useful with the resulting stream?

Best:

table-like rows
obvious sort/search fields
naturally page-able history

G. Collection robustness

How likely is the source to keep working without anti-bot or heavy UI drift dominating the experiment?

Best:

stable, boring web surface
simple auth/session flow

H. Semantic pressure

Does the source create a legitimate future need for semantic retrieval without making semantic retrieval a prerequisite for pilot success?

Best:

lexical retrieval is enough to make the data usable
semantic retrieval would be a clear additive improvement later

Scoring guidance

Strong candidate

mostly 4s and 5s
especially strong on:
- stable identity
- incremental tractability
- generic-consumer usability

Weak candidate

any 1 or 2 in:
- stable identity
- verifier tractability
- incremental tractability

Those should usually be disqualifying for pilot one.

Default preferred source shapes

These are the shapes I would bias toward first.

1. Transaction / ledger / history table

Why this is strong:

naturally row-shaped
stable identity often exists
obvious append or snapshot semantics
generic consumers understand it immediately

2. Saved-items / bookmarks / archive list

Why this is strong:

simple object model
good owner value
easy generic browsing/search
semantic retrieval would be additive later, not required

3. Account activity / audit log

Why this is strong:

append-only shape
explicit time ordering
excellent fit for page/filter/search

Default avoided source shapes

1. Full messaging/chat systems

Bad first pilot because:

edits/deletes/threading complicate identity and sync
semantic expectations rise immediately
anti-bot/auth complexity is often higher

2. Deeply relational productivity tools

Bad first pilot because:

multiple object types
many relationships
hard to judge whether failure was schema complexity or connector generation

3. Sources whose only faithful representation is documents/blobs first

Bad first pilot because:

generic downstream usability gets murky quickly
lexical/semantic retrieval questions overwhelm the core connector question

What success should look like

A winning first source is one where, after the pilot, we can honestly say:

yes, the artifact bundle was durable enough
yes, reruns were honest
yes, the resulting stream was still useful to a generic consumer

even if the source itself was not glamorous.

Consequence for the next step

Before launching the pilot, score 3–5 candidate sources against this rubric and pick the highest-signal boring winner.

If a candidate only looks attractive because it is impressive or AI-demo-friendly, that is a reason to distrust it for pilot one.

Question

Recommendation

Why this matters

The wrong first-source traps

1. Mutation trap

2. Ontology trap

3. Anti-bot trap

4. Consumer-usability trap

Selection rubric

A. Owner value

B. Stable identity

C. Incremental tractability

D. Verifier tractability

E. Schema tractability

F. Generic-consumer usability

G. Collection robustness

H. Semantic pressure

Scoring guidance

Strong candidate

Weak candidate

Default preferred source shapes

1. Transaction / ledger / history table

2. Saved-items / bookmarks / archive list

3. Account activity / audit log

Default avoided source shapes

1. Full messaging/chat systems

2. Deeply relational productivity tools

3. Sources whose only faithful representation is documents/blobs first

Recommended owner posture

What success should look like

Consequence for the next step