Document transformers

📄️ Google Cloud Vertex AI Reranker

The Vertex Search Ranking API is one of the standalone APIs in Vertex AI Agent Builder. It takes a list of documents and reranks them by how relevant each one is to a query. Unlike embeddings, which capture only the semantic similarity between a document and a query, the ranking API gives precise scores for how well each document answers the query. It is typically applied after an initial set of candidate documents has been retrieved, to improve the quality of the final search results.
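The retrieve-then-rerank pattern described above can be sketched in a few lines. This is only an illustration of the general idea, not the Vertex AI API: `rerank` and `toy_overlap_score` are hypothetical names, and the term-overlap scorer is a stand-in for whatever relevance model a real ranking service would call.

```python
from typing import Callable, List, Tuple


def rerank(
    query: str,
    docs: List[str],
    score: Callable[[str, str], float],
    top_n: int = 3,
) -> List[Tuple[str, float]]:
    """Score every candidate against the query and keep the top_n best."""
    scored = [(doc, score(query, doc)) for doc in docs]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


def toy_overlap_score(query: str, doc: str) -> float:
    """Hypothetical scorer: fraction of query terms that appear in the doc."""
    q_terms = set(query.lower().split())
    d_terms = set(doc.lower().split())
    if not q_terms:
        return 0.0
    return len(q_terms & d_terms) / len(q_terms)


# Candidates as they might come back from a first-stage retriever.
candidates = [
    "The sky appears blue because of Rayleigh scattering.",
    "Bananas are rich in potassium.",
    "Why is the sky blue? Shorter wavelengths scatter more.",
]

results = rerank("why is the sky blue", candidates, toy_overlap_score, top_n=2)
for doc, relevance in results:
    print(f"{relevance:.2f}  {doc}")
```

In a real pipeline the `score` callable would be replaced by a call to a ranking model or service; the surrounding logic (score each candidate, sort, truncate) stays the same.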

📄️ RankLLM Reranker

RankLLM is a flexible reranking framework supporting listwise, pairwise, and pointwise ranking models. It includes RankVicuna, RankZephyr, MonoT5, DuoT5, LiT5, and FirstMistral, with integration for FastChat, vLLM, SGLang, and TensorRT-LLM for efficient inference. RankLLM is optimized for retrieval and ranking tasks, leveraging both open-source LLMs and proprietary rerankers like RankGPT and RankGemini. It supports batched inference, first-token reranking, and retrieval via BM25 and SPLADE.
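The difference between pointwise, pairwise, and listwise ranking can be shown with a small sketch. It does not use RankLLM's API: every function name below is hypothetical, and the toy term-count scorer stands in for the model call that a real reranker would make at each step.

```python
from itertools import combinations
from typing import Callable, List


def pointwise_rank(query: str, docs: List[str],
                   score: Callable[[str, str], float]) -> List[str]:
    """Pointwise: score each document independently, then sort."""
    return sorted(docs, key=lambda d: score(query, d), reverse=True)


def pairwise_rank(query: str, docs: List[str],
                  prefer: Callable[[str, str, str], bool]) -> List[str]:
    """Pairwise: compare every pair and order documents by number of wins."""
    wins = {i: 0 for i in range(len(docs))}
    for i, j in combinations(range(len(docs)), 2):
        if prefer(query, docs[i], docs[j]):
            wins[i] += 1
        else:
            wins[j] += 1
    order = sorted(wins, key=lambda i: wins[i], reverse=True)
    return [docs[i] for i in order]


def listwise_rank(query: str, docs: List[str],
                  permute: Callable[[str, List[str]], List[int]]) -> List[str]:
    """Listwise: the model sees the whole list and returns an ordering."""
    return [docs[i] for i in permute(query, docs)]


# Toy stand-ins for model calls (assumptions for illustration only).
def toy_score(query: str, doc: str) -> float:
    return float(len(set(query.split()) & set(doc.split())))


def toy_prefer(query: str, a: str, b: str) -> bool:
    return toy_score(query, a) >= toy_score(query, b)


def toy_permute(query: str, docs: List[str]) -> List[int]:
    return sorted(range(len(docs)), key=lambda i: -toy_score(query, docs[i]))


docs = ["bm25 retrieval", "listwise reranking with llm", "llm reranking"]
query = "listwise llm reranking"

pointwise = pointwise_rank(query, docs, toy_score)
pairwise = pairwise_rank(query, docs, toy_prefer)
listwise = listwise_rank(query, docs, toy_permute)
print(pointwise, pairwise, listwise, sep="\n")
```

With a consistent toy scorer all three strategies agree; with real models they differ in cost (one call per document, one per pair, or one per list) and in how much cross-document context the model sees.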