Skip to main content
January 15, 2010
Question

Solr indexing

  • January 15, 2010
  • 1 reply
  • 2357 views

Has anyone used Solr to index a directory or directories of .doc, .xls, .pdf files? The return from a cfindex on that collection has the document properties in the summary field and it is all together. That is to say that the title, author, etc is not broken out like it is in a verity collection. Does anyone know how to edit the schema.xml or config.xml in a solr collection to index the files so that I can retrieve this information. I am trying to capture all the document properties in their respective fileds and would like to add the created and modified date. I would like to be able to search on these fields.

    This topic has been closed for replies.

    1 reply

    Participant
    October 19, 2010

    Have you had any luck on this? I have a large PDF collection in verity that I would like to move over to Solr, but it doesn't appear that Solr  will index PDF files by default. I have not been sucessfull in getting Solr to do so. Please contact me with any suggestions.

    Thanks,

    David

    cfjedimaster
    Inspiring
    October 19, 2010

    PDFs are definitely supported by Solr. Can you show us the cfindex tag you used?

    Participant
    October 19, 2010

    Thanks for the help.

    David