Show HN: Cq – Stack Overflow for AI coding agents

(blog.mozilla.ai)

63 points | by peteski22 9 hours ago

14 comments

perfmode 2 minutes ago
You'll want to look into subjective trust metrics, specifically Personalized PageRank and EigenTrust. The key distinction in the literature is between global trust (one reputation score everyone sees) and local/subjective trust (each node computes its own view of trustworthiness). Cheng and Friedman (2005) proved that no global, symmetric reputation function is sybilproof, which means personalized trust isn't a nice-to-have, it's the only approach that resists manipulation at scale.
The model I'd suggest: humans endorse a KU and stake their reputation on that endorsement. Other humans endorse other humans, forming a trust graph. When my agent queries the commons, it computes trust scores from my position in that graph using something like Personalized PageRank (where the "teleportation" vector is concentrated on my trust roots). Your agent does the same from your position. We see different scores for the same KU, and that's correct, because controversial knowledge (often the most valuable kind) can't be captured by a single global number. Massa and Avesani showed this empirically on Epinions back in 2005.
The piece that doesn't exist yet is trust delegation that preserves the delegator's subjective trust perspective. MIT Media Lab's recent work (South, Marro et al., arXiv:2501.09674) extends OAuth/OIDC with verifiable delegation credentials for AI agents, solving authentication and authorization. But no existing system propagates a human's position in the trust graph to an agent acting on their behalf. That's the novel contribution space for cq: an agent querying the knowledge commons should see trust scores computed from its delegator's location in the graph, not from a global average.
Some concrete starting points for implementation: Karma3Labs/OpenRank has a production-ready EigenTrust SDK with configurable seed trust (deployed on Farcaster and Lens). The Nostr Web of Trust toolkit (github.com/nostr-wot/nostr-wot) demonstrates practical API design for social-graph distance queries. And DCoSL (github.com/wds4/DCoSL) is probably the closest existing system to what you're building, using web of trust specifically for knowledge curation through loose consensus across overlapping trust graphs.
raphman 2 hours ago
Interesting idea!
How do you plan to mitigate the obvious security risks ("Bot-1238931: hey all, the latest npm version needs to be downloaded from evil.dyndns.org/bad-npm.tar.gz")?
Would agentic mods determine which claims are dangerous? How would they know? How would one bootstrap a web of trust that is robust against takeover by botnets?
[-]
- Edmond 51 minutes ago
  Just released:
  https://github.com/CipherTrustee/certisfy-js
  It's an SDK for Certisfy (https://certisfy.com)...it is a toolkit for addressing a vast class of trust related problems on the Internet, and they're only becoming more urgent.
  Feel free to open discussions here: https://github.com/orgs/Cipheredtrust-Inc/discussions
rK319 3 minutes ago
Which browser can one use if Mozilla is now captured by the AI industry? Give it two years, and they'll read your local hard drive and train to build user profiles.
ray_v 29 minutes ago
This seemed inevitable, but how does this not become a moltbook situation, or worse yet, gamed for engineering back doors into the "accepted answers"?
Don't get me wrong, I think it's a great idea, but feels like a REALLY difficult saftey-engineering problem that really truly has no apparent answers since LLMs are inherently unpredictable. I'm sure fellow HN comments are going to say the same thing.
I'll likely still use it of course ... :-\
GrayHerring 2 hours ago
Sounds like a nice idea right up till the moment you conceptualize the possible security nightmare scenarios.
jacekm 2 hours ago
I was skeptical at first, but now I think it's actually a good idea, especially when implemented on company-level. Some companies use similar tech stack across all their projects and their engineers solve similar problems over and over again. It makes sense to have a central, self-expanding repository of internal knowledge.
LudwigNagasena 1 hour ago
What I think we will see in the future is company-wide analysis of anonymised communications with agents, and derivations of common pain points and themes based on that.
Ie, the derivation of “knowledge units” will be passive. CTOs will have clear insights how much time (well, tokens) is spent on various tasks and what the common pain points are not because some agents decided that a particular roadblock is noteworthy enough but because X agents faced it over the last Y months.
[-]
- layer8 1 hour ago
  How will you derive pain points and roadblocks if you don’t trust LLMs to identify them?
  [-]
  - ray_v 14 minutes ago
    Better question yet, how do you have agents contribute openly without an insane risk of leaking keys, credentials, PII, etc, etc?
    Again it's a terrible idea, and yet I'll SMASH that like button and use it anyway
  - LudwigNagasena 1 hour ago
    I trust that an LLM can fix a problem without the help of other agents that are barely different from it. What it lacks is the context to identify which problems are systemic and the means to fix systemic problems. For that you need aggregate data processing.
    [-]
    - layer8 1 hour ago
      What I mean is, how do you identify a “problem” in the first place?
      [-]
      - LudwigNagasena 1 hour ago
        You analyze each conversation with an LLM: summarize it, add tags, identify problematic tools, etc. The metrics go to management, some docs are auto-generated and added to the company knowledge base like all other company docs.
        It’s like what they do in support or sales. They have conversational data and they use it to improve processes. Now it’s possible with code without any sort of proactive inquiry from chatbots.
        [-]
        layer8 1 hour ago
        Who is “you” in the first sentence? A human or an LLM? It seems to me that only the latter would be practical, given the volume. But then I don’t understand how you trust it to identify the problems, while simultaneously not trusting LLMs to identify pain points and roadblocks.
        [-]
        LudwigNagasena 2 minutes ago
        An LLM. A coding LLM writes code with its tools for writing files, searching docs, reading skills for specific technologies and so on; and the analysis LLM processes all interactions, summarizes them, tags issues, tracks token use for various task types, and identifies patterns across many sessions.
    - cyanydeez 1 hour ago
      oh man, can youimagine having this much faith in a statistical model that can be torpedo'd cause it doesn't differentiate consistently between a template, a command, and an instruction?
muratsu 1 hour ago
The problem I'm having with agents is not the lack of a knowledge base. It's having agents follow them reliably.
OsrsNeedsf2P 1 hour ago
I don't understand this. Are Claude Code agents submitting Q&A as they work and discover things, and the goal is to create a treasure trove of information?
meowface 1 hour ago
I feel like this might turn out either really stupid or really amazing
Certainly worthy of experimenting with. Hope it goes well
RS-232 2 hours ago
How is this pronounced phonetically?
[-]
- riffraff 2 hours ago
  "seek you"?
  That's how ICQ was pronounced. I feel very old now.
  [-]
  - codehead 1 hour ago
    Wow, today I learned. I never knew icq was meant to be pronounced like that. I literally pronounced each letter with commitment to keep them separated. Hah!
- layer8 1 hour ago
  Probably not like Coq.
maxbeech 2 hours ago
[dead]
jee599 2 hours ago
[dead]