Forums
This topic is locked
Suggestions?
Posted 09 Jul 2003 22:37:15
1
has voted
09 Jul 2003 22:37:15 Jelle-Jacob de Vries posted:
Hi all,I dont know where to start so I'll start from the beginning.
My client asked me to convert 5000 printed newspaper articles to usable text with OCR (Optical Character Recognition) for use on there intranet which runs on an windows server. The also would like an option to search by keyword through the content of the whole range of articles.
I know how to convert these articles to usable text. But my question is, what is the most usable/fastest way of getting this articles in a database and which database MS Acces or SQL server? I've also have an option to export directly from my OCR program to HTML or XML.
I hope someone can give me some advice.
Replies
Replied 14 Jul 2003 14:46:00
14 Jul 2003 14:46:00 Vince Baker replied:
Hi,
The problem you will have is when you want to have the keyword search function.
To be able to search for this I would suggest using the Index Service function on IIS.
This function will allow you to search for a keyword in the body of an html / word / excel doc and it is very fast.
This way, you can choose what format you want the pages to be in and then search the body text of them easily.
I only hope you are running a windows server with IIS.....
Regards
Vince
Visit my home: www.chez-vince.com
VBScript | ASP | HTML | SQL | Oracle | Hosting
The problem you will have is when you want to have the keyword search function.
To be able to search for this I would suggest using the Index Service function on IIS.
This function will allow you to search for a keyword in the body of an html / word / excel doc and it is very fast.
This way, you can choose what format you want the pages to be in and then search the body text of them easily.
I only hope you are running a windows server with IIS.....
Regards
Vince
Visit my home: www.chez-vince.com
VBScript | ASP | HTML | SQL | Oracle | Hosting
Replied 14 Jul 2003 14:53:28
14 Jul 2003 14:53:28 Julio Taylor replied:
If you're not running Windows, the you're probably using Linux and PHP + MySQL, in which case you can make a fulltext search of your text fields in the database, which is probably going to be the best solution by default, simply because it wasn't made by microsoft... <img src=../images/dmxzone/forum/icon_smile_wink.gif border=0 align=middle>
MySQL also works in Windows 98+, if need be, and it's useable with ASP. It's cheaper (in fact, it's completely FREE) and faster and (in my opinion) much better than MSQL and Access.
Some links:
----------------
www.mysql.com/
www.mysql.com/doc/en/Fulltext_Search.html
www.php.net/
www.interakt.ro/
I hope that helps!
------------------------
Julio
PHP | MySQL | DWMX
ICQ: 19735247
MSN:
Edited by - poolio on 14 Jul 2003 14:54:22
MySQL also works in Windows 98+, if need be, and it's useable with ASP. It's cheaper (in fact, it's completely FREE) and faster and (in my opinion) much better than MSQL and Access.
Some links:
----------------
www.mysql.com/
www.mysql.com/doc/en/Fulltext_Search.html
www.php.net/
www.interakt.ro/
I hope that helps!
------------------------
Julio
PHP | MySQL | DWMX
ICQ: 19735247
MSN:
Edited by - poolio on 14 Jul 2003 14:54:22