Postgres extension complements pgvector for performance and scale

(github.com)

82 points | by flyaway123 5 days ago

4 comments

aunty_helen 1 hour ago
Related discussion for pgvector perf: https://news.ycombinator.com/item?id=45798479
[-]
- tacoooooooo 1 hour ago
  the main issue with pgvectorscale is that it's not available in RDS :(
  [-]
  - omg2864 52 minutes ago
    Yes, RDS seems to really hold PG back on AWS, with all the interesting pg extensions getting released now (pg_lake). It is a share I can't move to other PG vendors because it is a pain in the ass to get all privacy, legal docs in order.
ricw 3 hours ago
I’ve been using this since early this year and it’s been great. It was what convinced me to just stick to Postgres rather than using a dedicated vector db.
Only working with 100m or so vectors, but for that it does the job.
[-]
- pqdbr 2 hours ago
  Are you using a dedicated pg instance for vector or you keep all your data in a single pg instance (vector and non-vector)?
  [-]
  - ComputerGuru 2 hours ago
    The biggest selling point to using Postgres over qdrant or whatever is that you can put all the data in the same db and use joins and ctes, foreign keys and other constraints, lower latency, get rid of effectively n+1 cases, and ensure data integrity.
    [-]
    - dalberto 2 hours ago
      I generally agree that one database instance is ideal, but there are other reasons why Postgres everywhere is advantageous, even across multiple instances:
      - Expertise: it's just SQL for the most part - Ecosystem: same ORM, same connection pooler - Portability: all major clouds have managed Postgres
      I'd gladly take multiple Postgres instances even if I lose cross-database joins.
      [-]
      - throwaway7783 1 hour ago
        Yep. If performance becomes a concern, but we still want to exploit joins etc, it's easy to set up replicas and "shard" read only use cases across replicas.
- esafak 2 hours ago
  What kind of performance do you observe with what setup?
isoprophlex 1 hour ago
The linked blogpost is an interesting read, too, comparing well-tuned pgvector to pinecone:
https://www.tigerdata.com/blog/pgvector-vs-pinecone
mmmeff 1 hour ago
This is still unsupported in RDS, right?
[-]
- tacoooooooo 1 hour ago
  correct afaik :(
  https://github.com/timescale/pgvectorscale/issues/113