DaaS / Products / ML Embedding Pipeline with Vector Search

ML Embedding Pipeline with Vector Search

Use PAI to preprocess training datasets and train embedding models, then store the generated vector embeddings into OSS vector indexes to power a semantic similarity search service end-to-end.

Products involved

Scenario

Use PAI to preprocess training datasets and train embedding models, then store the generated vector embeddings into OSS vector indexes to power a semantic similarity search service end-to-end.

How the products combine

  1. pai · pai-manage-data — Platform for AI (PAI) — Manage and process training datasets
  2. See pai/pai-manage-data.

  3. oss · oss-manage-data — Object Storage Service — Manage vector data and indexes
  4. See oss/oss-manage-data.

Typical questions