Highlighted

CF index is hung

New Here ,
Feb 16, 2015

Copy link to clipboard

Copied

related to my earlier slow solr post.  Running a refresh on the first file then an update on all others. Recurse may not be necessary.

cfindex

action=refresh

collection="this_name_solr"

key="c:\inetpub\wwwroot\myappname\thefolder\thesubfolder\thefile.ext"

type="file"

urlpath="http://#application.root#/appname/thefolder/thesubfolder/thefile.ext"

extensions=".html, .htm, .xls, .xlsm, .doc, .docx, .pdf, .txt"

recurse="yes"

status="alldocs"

language="English"

Does fine until it hits a corrupt PDF File.  If I try to open the file manually in PDF reader I get the message that the file may be corrupt.

I need it to get past this file and continue indexing the rest. I have tried a request timeout of three minutes but that does not work.  I have attempted CFPDF Info extraction but it hangs reading it too.  I do not know how to test the doc to see if it is corrupt.

Ultimately I would like it to give up on the file after about 3 minutes.

Any suggestions

Views

185

Likes

Translate

Translate

Report

Report
Community Guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more

CF index is hung

New Here ,
Feb 16, 2015

Copy link to clipboard

Copied

related to my earlier slow solr post.  Running a refresh on the first file then an update on all others. Recurse may not be necessary.

cfindex

action=refresh

collection="this_name_solr"

key="c:\inetpub\wwwroot\myappname\thefolder\thesubfolder\thefile.ext"

type="file"

urlpath="http://#application.root#/appname/thefolder/thesubfolder/thefile.ext"

extensions=".html, .htm, .xls, .xlsm, .doc, .docx, .pdf, .txt"

recurse="yes"

status="alldocs"

language="English"

Does fine until it hits a corrupt PDF File.  If I try to open the file manually in PDF reader I get the message that the file may be corrupt.

I need it to get past this file and continue indexing the rest. I have tried a request timeout of three minutes but that does not work.  I have attempted CFPDF Info extraction but it hangs reading it too.  I do not know how to test the doc to see if it is corrupt.

Ultimately I would like it to give up on the file after about 3 minutes.

Any suggestions

Views

186

Likes

Translate

Translate

Report

Report
Community Guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Feb 16, 2015 0
New Here ,
Feb 18, 2015

Copy link to clipboard

Copied

Wow - Rather shocked no one has experienced this.

I have isolated a couple of folders that have the issue and have run every test i can think of to make it move past this.  Really need an assist.

Likes

Translate

Translate

Report

Report
Community Guidelines
Be kind and respectful, give credit to the original source of content, and search for duplicates before posting. Learn more
Reply
Loading...
Feb 18, 2015 0