Web Scrapers
Posted: Thu Jul 18, 2013 3:38 pm
by nileriver
I am starting a new blog on basketball statistics. I have strong SQL skills and am excited to see what I can do with them. However, I need good, detailed data, specifically play-by-play information. Does anyone know of any existing web scrapers for espn.com or nba.com? Any other suggestions on where or how to get data would be much appreciated. I have already gotten data from these two sources:
http://basketballvalue.com/downloads.php
http://www.basketballgeek.com/data/
Thanks in advance, and I look forward to sharing my blog with you all in the upcoming months.
Re: Web Scrapers
Posted: Sun Jul 28, 2013 7:43 pm
by Crow
There are some past threads that deal with this general issue. Search around a bit, if you haven't already.
I think this member posted some stuff that may be of use:
memberlist.php?mode=viewprofile&u=579
Perhaps you could try emailing him for conversation / assistance.
Apparently he now works for an NBA team.
Re: Web Scrapers
Posted: Mon Jul 29, 2013 3:25 pm
by nileriver
Thanks. I will look into those.
Re: Web Scrapers
Posted: Mon Jul 29, 2013 7:32 pm
by Crow
Threads from Chris on this general topic:
viewtopic.php?f=2&t=8099&hilit=web+scraper
His GitHub was taken down after he was hired.
viewtopic.php?f=2&t=8077
(has other contact info)
viewtopic.php?f=2&t=8106
viewtopic.php?f=2&t=7958
There may be earlier threads by other people findable by other key words.
Re: Web Scrapers
Posted: Thu Aug 08, 2013 2:41 pm
by kpascual
My scraper is open-source:
https://github.com/kpascual/nbascrape
I believe I need to update the SQL schema; otherwise an error gets thrown. Once that is done, it should work pretty easily.
Re: Web Scrapers
Posted: Mon Oct 20, 2014 8:58 pm
by dougadamsmith
This is great work by kpascual. Unfortunately, I can't seem to get it to work. It runs error-free, but doesn't actually update data in SQL. Has anyone else encountered this issue (I'm using Windows)? I've spent quite some time trying to troubleshoot, but I'm new to Python...
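One general cause of "runs without errors but nothing shows up in the database" in Python is a write that is never committed. Whether that is what is happening with nbascrape here is only a guess, since the thread does not say. The sketch below illustrates the symptom using the standard-library sqlite3 module rather than the scraper's actual database, and the `plays` table and its columns are made up for the demonstration.

```python
import os
import sqlite3
import tempfile


def count_rows(conn):
    """Return the number of rows in the (hypothetical) plays table."""
    cur = conn.execute("SELECT COUNT(*) FROM plays")
    rows = cur.fetchall()  # read the full result so no read lock lingers
    cur.close()
    return rows[0][0]


def demo_missing_commit():
    """Show that an uncommitted INSERT is invisible to other connections."""
    with tempfile.TemporaryDirectory() as tmp:
        path = os.path.join(tmp, "demo.db")
        writer = sqlite3.connect(path)  # stands in for the scraper
        reader = sqlite3.connect(path)  # stands in for your SQL client
        try:
            # Hypothetical table, not the nbascrape schema.
            writer.execute(
                "CREATE TABLE plays (game_id TEXT, description TEXT)"
            )
            writer.commit()

            # The INSERT succeeds without any error...
            writer.execute(
                "INSERT INTO plays VALUES (?, ?)", ("G1", "made jump shot")
            )
            # ...but a second connection cannot see it yet.
            before = count_rows(reader)

            # Only after commit() does the row become visible elsewhere.
            writer.commit()
            after = count_rows(reader)
        finally:
            # Close both connections before the temp directory is removed
            # (Windows refuses to delete files that are still open).
            writer.close()
            reader.close()
    return before, after


if __name__ == "__main__":
    before, after = demo_missing_commit()
    print("rows visible before commit:", before)
    print("rows visible after commit:", after)
```

If the scraper's database code behaves the same way, checking that `commit()` is called on the connection after the inserts (and that the script is pointed at the same database you are querying) would be a reasonable first troubleshooting step.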