Some time back I had done some work on extracting topics from an arbitrary piece of text using Wikipedia data. Recently I thought of a concept to put that algorithm to work. As a part of this project, I need to extract relevant text from an arbitrary HTML page. By relevant I mean the “meat” [...]
Categories: programming,python,text processing,web
- Published:
- July 27, 2009 – 4:46 pm
- Author:
- By prashanthellina
As a part of a project I am working on, I had to cluster urls on a page. After some light googling I found, python-cluster. You can find below a simple python script to illustrate the usage of python-cluster library.
Categories: programming,python,text processing
Tagged: clustering, python, script
- Published:
- July 25, 2009 – 9:24 am
- Author:
- By prashanthellina