TODO
Here we list some subprojects that can possibly be done for ASPseek.
If you are developer and want to show your excellent coding ability,
please feel free to pick up one of the projects below:
Search statistics
Currently searchd stores all search statistics to SQL table, but
no tool exists yet to show it in a nice way. Interesting things to do would
be: show most popular queries, show site load graph per day (and so),
compress stats table etc.
URL exit statistics
It would probably make sense to let s.cgi generate not plain links
to result URLs, but a redirect so that each link user follows will be
encountered and stored into DB. This data can be further used for statistics
analysis.
Graphical User Interface
A nice windowed (GTK+/QT/whatever) or Web GUI can be written to simplify
configuration of index and searchd. It would be nice
to show real-time indexing and search stats, too.
More sophisticated HTML parser
Implement poison words (spam filter), take the link text into account, custom [meta] tags (more than just title/keyword/description/body).
More search filters
Filtering by language, custom meta tags, etc.
Support for more SQL databases
Implement backends for PostgreSQL, DB3, other DBMSes.
Clusterization
Made ASPseek a distributed application, thus virtually removing the limits
on speed and number of URLs in the database.