Private AI

FastQuery: Communication-efficient Embedding Table Query for Private LLMs inference