We have a scraper script which we need to added functionality which include but not limited to:
1.Adding multithreading capabilites - though originally provided for this function isn't enabled and we need someone with the knowledge and experience of asynchronous process esp. in an amazon instance.
2.Extract additional information - the script currently parses just a portion of the data and we would like to get more data from the document so you will have experience with dom, xpath and reg ex.
3.Better database connectivity and usage.
4.Error detection and reporting capabilities
The script is in php and our database is mysql.