<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0" xmlns:dc="http://purl.org/dc/elements/1.1/">
    <channel>
        <title>search pdf-files WITHOUT using pdftotext.exe</title>
        <description> </description>
        <link>http://www.sphider.eu/forum/read.php?3,3703,3703#msg-3703</link>
        <lastBuildDate>Wed, 19 Jun 2013 20:51:54 +0300</lastBuildDate>
        <generator>Phorum 5.2.10</generator>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,9183#msg-9183</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,9183#msg-9183</link>
            <description><![CDATA[ Thanks hmc. But I see no longer the php script of kashmir's post 3 Mar 08 (12:09am) to which you refer. Where can I find it?]]></description>
            <dc:creator>ben2012</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Tue, 31 Jan 2012 22:35:13 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,8550#msg-8550</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,8550#msg-8550</link>
            <description><![CDATA[ refer this for PDF file search<br />
[<a href="http://techpdf.in/blog/2011/07/php-file-search-with-out-my-sql/" rel="nofollow" >techpdf.in</a>]]]></description>
            <dc:creator>azhagu</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Mon, 04 Jul 2011 19:51:21 +0300</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,5993#msg-5993</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,5993#msg-5993</link>
            <description><![CDATA[ This looks like an excellent way to index pdfs without the need for an exe on the server. <br />
<br />
Has anybody managed to get this working with a pdf version 1.4 file however? <br />
<br />
Thanks in advance<br />
<br />
Robert]]></description>
            <dc:creator>robert_t</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Wed, 15 Apr 2009 18:30:26 +0300</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3844#msg-3844</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3844#msg-3844</link>
            <description><![CDATA[ Thanks very much. It works very well with sphider. I tested on localhost (win xp) and on the yahoo server (BeOS). The files were nicely indexed. <br />
<br />
However, it did not work with sphider-plus. Not only were the pdf files not indexed but the changes to the sphiderfunc and conf files screwed up the existing instllation. <br />
<br />
Any ideas on how to make it compatible with sphider-plus?]]></description>
            <dc:creator>sorlag</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Wed, 26 Mar 2008 22:23:48 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3824#msg-3824</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3824#msg-3824</link>
            <description><![CDATA[ A &quot;how to&quot; list, that worked for me (eventually!). <br />
<br />
(Note: I still have to find a way to force admin.php to reindex an entire site without having to &quot;change&quot; every html page - there's some help in the forum that I'll try.)<br />
<br />
Make a backup copy of your working sphiderfuncs.php and conf.php files<br />
<br />
Copy kashmir's post 3 Mar 07 0009h (12:09am) (from &lt;?php of course) and save to the admin folder, overwriting the existing sphiderfunc.php<br />
<br />
Change conf.php as follows:<br />
<br />
Suggest this line be up to date:<br />
$version_nr	= '1.3.4b';<br />
<br />
Set logging on so you can see what is happening:<br />
$keep_log = 1;<br />
<br />
Set pdf on:<br />
// Index pdf files<br />
$index_pdf	= 1;<br />
<br />
Comment out:<br />
//executable path to pdf converter<br />
//$pdftotext_path = 'c:\temp\pdftotext.exe';<br />
<br />
Transfer the amended conf.php and sphiderfuncs.php to your domain host<br />
<br />
Ensure at least one v1.2 pdf file exists on your domain, using download3k.com if needed to create a v1.2 pdf from a later version. It is best if this pdf has at least one word in it that does not exist elsewhere on the site, so you can prove to yourself the pdf indexing routines are working properly.<br />
<br />
Note: The pdf you use as a test must have recognisable text in it, in other words it cannot just be a scanned image, it has to be a pdf file that has been produced either by using Acrobat Distiller, or some other pdf creation utility from a text or doc file, or has been &quot;recognised&quot; (to use Adobe's terminology).<br />
<br />
Transfer the v1.2 pdf to your domain host, and create a new link to it on any existing page - this will ensure the re-indexing process will see the new link and index it and the v1.2 pdf file, irrespective of your re-indexing settings and even if your site has previously been re-indexed. <br />
<br />
Then re-index your site and then look at the log. Did the test pdf file get indexed? It should have been! All of the words in the test pdf will have been parsed and added as needed to the sphinder database.<br />
<br />
Then do a search for that unique word in the v1.2 test pdf online within your domain to prove all is well. <br />
<br />
If it all works, the next step - to convert all of your site's pdf files back to v1.2!]]></description>
            <dc:creator>hmc</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Mon, 24 Mar 2008 07:02:12 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3809#msg-3809</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3809#msg-3809</link>
            <description><![CDATA[ I tried Sphider-Plus. It is better (but needs php 5.x) in most aspects from the regular sphider. However, in so far as pdf files are concerned. all it does is to include pd2text.exe in the converter package. That exe still needs to run on the server. So, if the server prohibits exe files, you are where you are.<br />
<br />
This is why a php converter would be great. <br />
<br />
A step by step guide please would be greatly appreciated.]]></description>
            <dc:creator>sorlag</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Sat, 22 Mar 2008 07:55:32 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3807#msg-3807</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3807#msg-3807</link>
            <description><![CDATA[ ]]></description>
            <dc:creator>Picalo</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Fri, 21 Mar 2008 23:17:02 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3805#msg-3805</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3805#msg-3805</link>
            <description><![CDATA[ i am very interested too]]></description>
            <dc:creator>antpeks</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Fri, 21 Mar 2008 20:13:12 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3804#msg-3804</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3804#msg-3804</link>
            <description><![CDATA[ This is great. I am hosted on yahoo and they do not allow pdftotext.exe to run. So your solution will be superb if it works. Could you post a step-by-step guide on how to get this working.]]></description>
            <dc:creator>sorlag</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Fri, 21 Mar 2008 20:03:00 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3709#msg-3709</guid>
            <title>Re: search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3709#msg-3709</link>
            <description><![CDATA[ ]]></description>
            <dc:creator>kashmir</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Mon, 03 Mar 2008 00:09:38 +0200</pubDate>
        </item>
        <item>
            <guid>http://www.sphider.eu/forum/read.php?3,3703,3703#msg-3703</guid>
            <title>search pdf-files WITHOUT using pdftotext.exe</title>
            <link>http://www.sphider.eu/forum/read.php?3,3703,3703#msg-3703</link>
            <description><![CDATA[ ]]></description>
            <dc:creator>kashmir</dc:creator>
            <category>Sphider Mods</category>
            <pubDate>Sun, 02 Mar 2008 11:44:53 +0200</pubDate>
        </item>
    </channel>
</rss>
