Laptop 6: Laptop Reloaded

Aug 20, 2008 19:07

Brought sniper module up-to-date, but didn't set up the CRON job yet -- not enough training data for good sniping. In the process found and fixed a calculation error there, old version was reporting slightly incorrect results some of the time (guess what, normalize(a,min,max)-normalize(b,min,max) does not equal normalize(a-b,min,max) ).

Also rewrote my request search query to look like
"(la*, no*, in*, am*) -(broken,parts,cracked,dead,damaged,as is,bad,no,not,lot,only,repair,repairs,fix,for)"
This should produce as many fully-functional laptops as possible, if you simply enter it in eBay laptops category search field. Older query of the form "* - (crap)" probably worked through API, but if you enter it in eBay search it'll actually barf at you. Also turned off "search in descriptions", because those frequently have words like "not damaged" or "if broken we'll accept return" which kicks good laptops out of my search results. This seem to have improved both quality and quantity of the data I get -- now it's about 400 laptops/day, whereas before it was about 300. Also, the test error seems to be dropping very slowly. Looks like about 18.5% yesterday. If I demand no missing fields, it'll actually go down to about 16%, but that means all Gateway and Asus laptops are automatically disqualified since I don't have "Family" specified for them in the DB.

dev

Previous post Next post
Up