Transform PDFs into Digital Forms with the GPT-4 Vision API

12 December 2023
0
71

PDF to Digital Form using GPT4 Vision API

A POC that uses GPT 4 Vision API to generate a digital form from an Image using JSON Forms from https://jsonforms.io/

Inspired by:

Both repositories demonstrate that the GPT4 Vision API can be used to generate a UI from an image and can recognize the patterns and structure of the layout provided in the image.

Demo

Click the thumbnail to watch on YouTube:

Running using Local Environment 💻

Frontend

cd into frontend directory

cd ai-json-form

Install Packages and run

npm install
npm run dev

Backend

cd into directory

cd backend

Install Packages

poetry install
# alternatively, you can use pip install
pip install -r requirements.txt

Setup Environment Variables

export OPENAI_API_KEY=
# optional
export OPENAI_ORG=

If you plan to use the Mock response only, you should set OPENAI_API_KEY to any value.

python main.py

Running using Docker 🐳

export the environment variables

echo "OPENAI_API_KEY=YOUR_API_KEY" > .env
# The following is optional
echo "OPENAI_ORG=YOUR_ORG" >> .env

Run the docker-compose

docker-compose up --build

Open the browser and visit http://localhost:8080/aijsv/

Disclaimer

I am new to Vue, so the code might not be the best practice. I am still learning and improving. Should you have any suggestions, please feel free to PR.

Flow Explain

Upload PDF files of up to three pages from the frontend

If you want to adjust the number of pages, you can change the MAX_PDF_PAGES variable in backend/app/socket.py

When the backend receives the PDF file in Base64 string format, it does the following processes:

Convert the URL String Back to Bytes
Read the PDF file, convert it to a JPG image, and save it to the /tmp folder using the package pdf2image.
Extract the strings from the same PDF file using the package PyPDF2. The extracted strings will become part of the prompt sent to the GPT4 model to enhance accuracy.
Prepare the prompts and send them along with the PDF screenshot to the GPT4 Vision API
Send the chunk to the frontend via Socket.IO incrementally.

Whenever the frontend receives the chunk, it appends it to the codemirror editor, and checks if the current content is a valid YAML. If it’s a valid YAML, it will apply it to the JSON Scheme to force the UI to re-render.

GitHub

View Github

REACTJS

Unleashing the Power of GPT-4 Vision API: React App for Image Content Analysis and Description

15 November 2023

InsightforGeeks

Transform PDFs into Digital Forms with the GPT-4 Vision API

PDF to Digital Form using GPT4 Vision API

Demo

Running using Local Environment 💻

Frontend

Backend

Running using Docker 🐳

Disclaimer

Flow Explain

GitHub

Related Posts

Unleashing the Power of GPT-4 Vision API: React App for Image Content Analysis and Description

Categories

Popular Posts

BrowserVideoEdit: A feature-rich video editor created using fabric.js and Next.js, all within the convenience of your web browser

A weather app that allows users to view real-time weather information based on their locations

Add Login and Register page into your Nuxt 3 project using Supabase authentication

A powerful Flutter package that allows you to easily create and control glitch effects

A Library for Rendering 3D Models in React.js and Next.js Views

Recent Posts

ഇടുക്കിയിലെ മലയോര മേഖലകളിൽ രാത്രിയാത്ര നിരോധിച്ചു. രാത്രി ഏഴു മുതൽ രാവിലെ ആറു വരെയാണ് നിരോധനം

ഏന്തയാർ ഈസ്റ്റിൽ പ്രളയത്തിൽ തകർന്ന പാലത്തിന് പകരം പുതിയ പാലം നിർമ്മിക്കുവാൻ താത്ക്കാലിക പാലം പൊളിച്ച് നീക്കി

Explore the Investment Opportunities: A Comprehensive Guide to Different Types of Mutual Funds

Title: Understanding Mutual Funds: A Beginner's Guide to Investing

തീവ്രമഴ മുന്നറിയിപ്പിന്റെ പശ്ചാതലത്തിൽ സംസ്ഥാനം ജാഗ്രതയിൽ

250,000 അപേക്ഷകൾ വർദ്ധിച്ചതിനാൽ ട്രാൻസ്‌പോർട്ട് കമ്മീഷണർ പരിശോധന പുനരാരംഭിക്കും

ഏലക്കയിൽ കീടനാശിനി സാന്നിധ്യം; ആറര ലക്ഷത്തിലധികം ടിൻ അരവണ നശിപ്പിക്കാൻ ടെൻഡർ ക്ഷണിച്ച് ദേവസ്വം ബോർഡ്‌

ചക്രവാതച്ചുഴി:അതിശക്തമായ മഴ വരുന്നു

പ്ലസ് വൺ പ്രവേശനം. അക്ഷയയിൽ തിക്കി തിരക്കേണ്ട, നെറ്റിവിറ്റി/ജാതി തെളിയിക്കാൻ പത്താംതരം സർട്ടിഫിക്കറ്റ് മതി

InsightforGeeks