In this work we harness the power of Microsoft Phi3 Mini 3.8 on ONXX CPU runtime. We build a PDF Q/A system with nomic-embed-text-v1 as embedding moel
faiss as Vector DB.
- pre_processing.py: Contains code for parsing PDF file, creating Embedding and Vector DB.
- application.ipynb: This notebook for creating a pdf Q/A pipeline.
- app.py: Code for Gradio Application. The app is hosted on
HF Space
- LLM: Phi3 Mini
- Embedding Model: nomic-embed-text-v1
- Vector DB: faiss
- Application: Gradio
- Install libraries with
make install - Prepare Phi3 Mini with
ONXX CPU Runtime in Linuxwithmake phi3_dependency - Run run the app execute
python app.py
Microsoftfor the open source Phi3 Mini Quantized along with ONXX Runtime support.Hugging Facefor the all the educational and open source resources.
