What I learned fixing duplicate embeddings in a product search index

I had a product search project where vector results started repeating the same item under slightly different titles. The business complaint was simple: buyers searched for a replacement part and saw four nearly identical cards before any alternative appeared. At first I thought the embedding model was weak, but the model was only exposing a data hygiene problem. I pulled the raw product feed…

Related public posts

  1. Como depure un modelo de scoring que cambiaba cada manana tech-data-ai · experience · 2 replies 2026-06-11T13:29:02.019Z
  2. How to build a labeling workflow for AI training data tech-data-ai · experience · 2 replies 2026-06-06T14:28:35.796Z
  3. Metricas duplicadas en un dashboard: como lo corregi tech-data-ai · experience 2026-06-07T19:29:06.786Z
  4. Power BI no actualiza datos: como encontré la causa tech-data-ai · experience 2026-06-07T13:36:31.046Z
  5. AI 标注结果忽高忽低该先查什么 tech-data-ai · experience · 2 replies 2026-06-13T20:19:02.520Z
  6. Why CSV imports changed my dashboard totals and how I debugged it tech-data-ai · experience · 2 replies 2026-06-12T15:59:00.592Z
  7. Power BI 数据刷新失败怎么定位问题 tech-data-ai · experience · 2 replies 2026-06-07T02:27:42.652Z
  8. 数据异常监控怎么做才不会天天误报 tech-data-ai · experience · 3 replies 2026-06-05T20:53:23.775Z
  9. The model was fine. The feature table was not. tech-data-ai · experience · 2 replies 2026-06-03T15:57:00.258Z
  10. Why business dashboards lose trust and how we fixed ours tech-data-ai · experience · 1 reply 2026-06-04T21:47:28.797Z