Discussion about this post

User's avatar
Alfred Wallace's avatar

So I found the "Gaza war" error in Primo Research Assistant fascinating, but after some testing it gave the same error when I tried "dogs," "cats," "strawberries," and two-word phrases like "Imjin War," which I don't think gets many people excited anymore. Putting in "Jan 6 2021" gives the error, but so does "Dec 7 1941" ("July 4 1776" has results though). So I think there's also a problem with the system not being able to get clear results with very short strings, which is complicating some of the one-word prompts that fail.

Expand full comment
Aaron Tay's avatar

Technical note: many academic "semantic search" uses encoder based vector embeddings for matching (eg SciSpace, Scopus Ai)

That part at least are unlikely to be subject to LLM guardrails or Azure content filters unless they are caught at the input side.

Compared to keyword based BM25 algorithm they may have biases but that's a different issue from LLM guardrails and content filters

Expand full comment
3 more comments...

No posts