
ModuleNotFoundError: No module named 'pdfservices_sdk'

New Here, Feb 05, 2024

Hi,

Not sure if anybody uses this community forum anymore, as most posts are much older. I'll try to get help anyway.

 

For setup, I followed these steps:

1) Downloaded my credentials under the Adobe PDF Extract API Free Tier

2) Added separate variables for client_id, client_secret, and organization_id to my user environment variables

3) Created and activated a virtual environment with Python 3.8

4) Installed pdfservices_sdk using pip install, and all requirements with pip install -r requirements.txt

5) Tried running the following script:

import os
from pdfservices_sdk.pdfservices import PDFServices
from pdfservices_sdk.auth import Credentials

# Access environment variables
client_id = os.environ.get('adobe_pdf_extract_client_id')
client_secret = os.environ.get('adobe_pdf_extract_client_secret')
organization_id = os.environ.get('adobe_pdf_extract_organization_id')

# Initialize SDK with credentials from environment variables
credentials = Credentials.service_account_credentials_builder()\
    .with_client_id(client_id)\
    .with_client_secret(client_secret)\
    .with_organization_id(organization_id)\
    .build()

pdf_services = PDFServices(credentials=credentials)

# Directory containing PDF files
pdf_dir = "C:\\Users\\hamza\\Downloads\\CODING\\Adobe_pdf_extract\\adobe-dc-pdf-services-sdk-python\\ALL"
output_dir = "C:\\Users\\hamza\\Downloads\\CODING\\Adobe_pdf_extract\\adobe-dc-pdf-services-sdk-python\\ALL\\retrieved_text_from_pdfs"

# Ensure output directory exists
os.makedirs(output_dir, exist_ok=True)

# Process each PDF in the directory
for pdf_file in os.listdir(pdf_dir):
    if pdf_file.endswith(".pdf"):
        input_pdf_path = os.path.join(pdf_dir, pdf_file)
        output_json_path = os.path.join(output_dir, pdf_file.replace(".pdf", "_text.json"))

        # Call the Extract API
        extract_options = pdf_services.extract_pdf_options_builder().add_element_to_extract("text").build()
        operation = pdf_services.create_extract_pdf_operation(extract_options)
        operation.add_input(input_pdf_path)

        # Execute the operation and save the output
        result = operation.execute()

        with open(output_json_path, "wb") as f:
            f.write(result)
        print(f"Text extracted from {pdf_file} and saved to {output_json_path}")

However, I am consistently getting ModuleNotFoundError: No module named 'pdfservices_sdk'. I tried reinstalling it and changing the Python version to 3.11, but I still get the same error.

Can someone tell me what I am missing here?

 

TOPICS
Bug , General , How to , PDF Extract API , Python SDK
Adobe Employee, Feb 07, 2024

Not sure why you think this forum isn't used - we get posts every day. But I have seen it sort oddly sometimes. 🙂

From the quickstart, https://developer.adobe.com/document-services/docs/overview/pdf-extract-api/quickstarts/python/, it does look like you are doing your imports differently. Can you try matching the style used there?
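
The name you pip install and the name you import do not have to match, so it may help to check which top-level module names the active interpreter can actually see. Below is a minimal sketch of such a check. The helper name module_available is just illustrative, and the assumption is that the quickstart's imports start with the top-level package adobe rather than pdfservices_sdk.

```python
import importlib.util


def module_available(name: str) -> bool:
    """Return True if the current interpreter can locate a top-level module."""
    return importlib.util.find_spec(name) is not None


# 'pdfservices_sdk' is the name from the failing import in the script above;
# 'adobe' is assumed to be the top-level package the quickstart imports from.
for name in ("pdfservices_sdk", "adobe"):
    status = "found" if module_available(name) else "not found"
    print(f"{name}: {status}")
```

If adobe is reported as found and pdfservices_sdk is not, that would point to the import lines rather than the installation.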

New Here, Feb 10, 2024

Hi, thanks for responding.

 

I matched the style suggested in the documentation.

Now I am consistently getting

ModuleNotFoundError: No module named 'adobe'.
 
I already tried the solution given here (https://community.adobe.com/t5/acrobat-services-api-discussions/modulenotfounderror-no-module-named-...), but it did not work.

I am using Windows 11 64-bit. Steps followed:
1) Cloned the pdfservices-python-sdk-samples repo from GitHub
2) Put my credentials in the JSON file
3) Created environment variables in user variables (double-checked with
print("Client ID:", os.getenv('PDF_SERVICES_CLIENT_ID'))
print("Client Secret:", os.getenv('PDF_SERVICES_CLIENT_SECRET'))
)
4) pip install -r requirements (all good)
5) Checked that the API is working by requesting an access token (working)
6) Ran extract_txt_from_pdf.py, getting:
[Running] python -u "c:\Users\hamza\Downloads\CODING\ADOBE\pdfservices-python-sdk-samples\src\extractpdf\extract_txt_from_pdf.py"
Traceback (most recent call last):
  File "c:\Users\hamza\Downloads\CODING\ADOBE\pdfservices-python-sdk-samples\src\extractpdf\extract_txt_from_pdf.py", line 14, in <module>
    from adobe.pdfservices.operation.auth.credentials import Credentials
ModuleNotFoundError: No module named 'adobe'

[Done] exited with code=1 in 0.106 seconds
 
my_virtual_environment>Lib>site-packages contains an 'adobe' folder, and the environment was active when I tried running the script.

Any idea what's happening here?
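
One thing that may be worth ruling out: the [Running] python -u ... line shows the script being launched with a bare python command, which is not necessarily the interpreter from the activated virtual environment. Below is a minimal sketch to check this, run the same way as the failing script. The helper name describe_interpreter is illustrative, and the virtual-environment test assumes an environment created with the standard venv module.

```python
import sys


def describe_interpreter() -> dict:
    """Collect the facts that decide where 'import adobe' is looked up."""
    return {
        # Full path of the Python binary that is running this script.
        "executable": sys.executable,
        # In a venv, sys.prefix points at the environment and
        # sys.base_prefix at the installation it was created from.
        "in_virtualenv": sys.prefix != sys.base_prefix,
        # Directories searched, in order, when resolving imports.
        "search_path": list(sys.path),
    }


info = describe_interpreter()
print("Interpreter:", info["executable"])
print("Inside a virtual environment:", info["in_virtualenv"])
print("Import search path:")
for entry in info["search_path"]:
    print("  ", entry)
```

If the interpreter path is outside my_virtual_environment, or its site-packages folder does not appear in the search path, the script is probably running under a different Python than the one the SDK was installed into.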