Participating Frequently
February 8, 2025
Question

AI Solution for Extracting Data from Large Scanned PDFs

  • February 8, 2025
  • 2 replies
  • 1329 views

Subject: AI Solution for Extracting Data from Large Scanned PDFs

Dear Adobe Team,

I hope you're doing well. I wanted to inquire if Adobe has the capability to extract data from scanned PDFs, especially when the file size is large. If this is a challenge, I’d love to connect and discuss a solution—I’ve already built an AI-powered system using Gemini 1.5 Flash that effectively processes scanned textbooks to generate structured notes and provide accurate Q&A.

Key Features of My Solution:
✅ Processes large scanned PDFs into structured study notes
✅ Extracts accurate answers from the provided material, ensuring syllabus-specific learning
✅ Intelligent resource handling—prioritizing user content while offering external support when necessary

If this aligns with your goals, please feel free to DM me. I’d be happy to share insights and explore potential collaborations.

Looking forward to your response.

Best regards,
Vivek Avdhesh Sharma

2 replies

droopydog500
Community Manager
February 8, 2025

I moved this to the Acrobat forum from Firefly.

droopy

Adobe Community Expert (not an Adobe employee)
Participating Frequently
February 10, 2025

@droopydog500
Right now, Adobe can't process scanned PDFs. My project aims to take a scanned PDF, run extraction on it, pull all the data out of the PDF, keep it in memory, and then provide answers from that data.

Note: Adobe handles normal PDFs (documents converted to PDF), while my project handles scanned PDFs, and there are many differences between the two. In addition, my project provides answers using the extracted data.
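
A minimal sketch of the flow described in this reply (extract text from each scanned page, keep it in memory, answer from the stored text). It is not the poster's actual system. Everything here is an assumption for illustration: `OcrFn`, `PageStore`, `extract`, `tokenize`, and `fake_ocr` are hypothetical names, the OCR step is a stand-in that would be swapped for a real OCR engine or model call, and plain word overlap replaces whatever retrieval the real project uses.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Set, Tuple

# Hypothetical OCR hook: receives one page image as bytes, returns its text.
OcrFn = Callable[[bytes], str]


def tokenize(text: str) -> Set[str]:
    """Lowercase the text and return its distinct alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


@dataclass
class PageStore:
    """In-memory store of extracted text, keyed by 1-based page number."""
    pages: Dict[int, str] = field(default_factory=dict)

    def add(self, number: int, text: str) -> None:
        self.pages[number] = text

    def search(self, question: str, top_k: int = 3) -> List[Tuple[int, int]]:
        """Return (page_number, score) pairs, best match first.

        The score is the number of distinct question words (longer than
        three characters) that also appear on the page.
        """
        wanted = {word for word in tokenize(question) if len(word) > 3}
        scored = []
        for number, text in self.pages.items():
            score = len(wanted & tokenize(text))
            if score > 0:
                scored.append((number, score))
        scored.sort(key=lambda item: (-item[1], item[0]))
        return scored[:top_k]


def extract(page_images: Iterable[bytes], ocr: OcrFn) -> PageStore:
    """Run the OCR hook on each page, one at a time, and store the text."""
    store = PageStore()
    for number, image in enumerate(page_images, start=1):
        store.add(number, ocr(image))
    return store


def fake_ocr(image: bytes) -> str:
    """Stand-in for a real OCR call: treats the bytes as UTF-8 text."""
    return image.decode("utf-8")


if __name__ == "__main__":
    scanned_pages = [
        b"Photosynthesis converts light into chemical energy",
        b"Mitosis is cell division",
    ]
    store = extract(scanned_pages, fake_ocr)
    print(store.search("What does photosynthesis convert light into?"))
```

Feeding pages through an iterable one at a time means a large scan never has to be held as images all at once; only the extracted text stays in memory. The pages returned by `search` would then be passed to whatever component produces the final answer.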

Monika Gause
Community Expert
February 8, 2025
Participating Frequently
February 10, 2025

I tried Adobe, but it can't process scanned PDFs; it can't extract data from them.