Scraping data from MP (or how about a public API?)
|
Since MP doesn't have a public API (as say The Crag: thecrag.com/article/api ), what are the rules to scraping the data? |
|
No, it is not kosher - for many reasons, not least of which is you would be violating the copyright of all the submitted content. The content was submitted to MP, not another site and the TOS is between us and the user. By displaying the content on a third party site/app you are violating the original copyright holder unless you receive explicit permission from each contributor. |
|
So what is the TOS between a third party and MP? Does one even exist? If one exists, please elaborate the difference between profit vs. non-profit. |
|
Contributed by Colonel Mustard. I like it. |
|
Eric Thomas wrote:So what is the TOS between a third party and MP? Does one even exist? If one exists, please elaborate the difference between profit vs. non-profit. "By displaying the content on a third party site/app you are violating the original copyright holder unless you receive explicit permission from each contributor." Why wouldn't attributing credit to a contributor qualify?"Receive" and "attribute" mean vastly different things. This first one's free...after that I bill my standard rate of $400 an hour. Andy, of course, gets all the free legal consulting he wants. |
|
Because the contributing users have not given you explicit permission to use their content within your application/site. Eric Thomas wrote:So what is the TOS between a third party and MP? Does one even exist? If one exists, please elaborate the difference between profit vs. non-profit. "By displaying the content on a third party site/app you are violating the original copyright holder unless you receive explicit permission from each contributor." Why wouldn't attributing credit to a contributor qualify? |
|
I don't really have anything to add beyond what Andy's said, but feel compelled to chime in. |
|
Fair enough. I probably should have been more explicit with my request, but I was interested in an API to build off of (requested here with no response: mountainproject.com/v/mount… ) |
|
Just DON'T do it!!!! |
|
since when is this not a 'public' app? |
|
John Marsella wrote:My app will replace all the beta pictures with cat pictures and probably rename routes cat-themed things. I will call it MEOWntainProject. Is that halal?Best damn post I've seen in a long time. |
|
Andy Laakmann wrote: If I steal someone's photograph from one book and put it into another book, just displaying the original photographers name does not solve the fact that I did not get permission to copy that photograph into a new book.There seems to be some type of rule differentiations between people who are reporting about a subject (e.g. college students, news reporters, ect.) and someone who is using an external source for direct gain. For example, if I wanted to write a research paper on the growth of Internet-based climbing forums, I would probably pull statistical information about, and from, this website. As so long as I cite the information in accordance with APA guidelines, no professor would have a problem with my essay, and I dont need permission to cite in my paper information I learn from MP.com. Let me provide a more-realistic example of a blurb a college student might include in a paper. Copyright wars rage on among website administrators and website users. For example, a recent thread on the Mountain Project website sparked an inter-site debate between users and administrators about how user rights relate to copying information. Website user Eric Thomas believes that the Mountain Project website does not take ownership of the information on its website (Thomas, 2013). But website administrator, Andy Laakmann (2013), said "The content was submitted to MP, not another site and the TOS is between us and the user." (para 1). Reference Thomas, E. (2013). Scraping data from MP (or how about a public API?). Retrieved from mountainproject.com/scripts… Laakmann, A. (2013). Scraping data from MP (or how about a public API?). Retrieved from mountainproject.com/scripts… I could write that information in a research paper and copyright law does not prevent me from doing so. Where is the line drawn then? |
|
John Marsella wrote:My app will replace all the beta pictures with cat pictures and probably rename routes cat-themed things. I will call it MEOWntainProject. Is that halal?For the win!!! |
|
But remember, phone books are not copyright-able. |
|
It would be nice to have an API to work with. Many people who have responded to this thread obviously do not understand what that means. It doesn't mean that someone is going to make a rival site or even try to make money off of the data in MP.com. |
|
We actually do now have a rudimentary API. It's primary use is to retrieve personal tick data. Because it is new, and I may change it (and break your code that uses it), contact me if you'd like to give it a try. |
|
That's awesome. Thanks for replying. I will message you to get more info. |
|
Great! I would be very interested in an API to gain access to my ToDo routes. Ideally JSON returns with area location coordinates but I realize you are just getting started with this. These are steps in a great direction! |
|
I understand that MP already took a stance on their Terms of Use but I would like to direct some consideration toward how Open Street Map opens their crowd sourced data. For those that don't know, Open Street Map is a crowd sourced map of the world with thousands of active contributors and many more users. They have a completely open data policy which means that as a contributor I in turn have the benefit of being able to use the data I submit along with all the rest of the map data. This is a huge benefit to contributors as the core app provides a specific service but the data can be extracted for additional niche purposes. This has resulted in a host of customized web maps being offered by agencies like MapQuest as well as empowering smaller groups to build data on an existing data storage option that almost immediately benefits them as a service. The best example of this is with the humanitarian group HOT OSM which goes to impacted areas like Heiti after the earth quake and partners with locals to rebuild a map of the areas destroyed to help with rebuilding. MP isn't mapping such critical data but can still be very useful for all kinds of applications including as a potential proxy measurement for health analytics. In an ideal world I think this kind of model is what is needed from sites like MP and I hope they move toward this model. |
|
This thread is pretty outdated, but I wanted to ask - does it violate the terms of service to copy only "public information" (areas, routes, and basic info about each such as type, grade, etc)? From what it sounds like, the main problem the Admins are stating is to do with user-submitted content (comments, pictures, possibly even route descriptions)? If I was to create an application where a user asks for information about a route by search query and I provide a snippet of that route's information (maybe type, grade, location, etc) with a link to the route's page on MountainProject, is this acceptable? |
|
dantesupertramp wrote: I'm also curious about this. As the person above me said, it seems that the that "public information" on MP as stated above (minimally speaking the route name and location), is not subject to copywrite from what I understand. Presumably it would be OK to scrape this data from MP into a personal project? The new API is nice, but does not allow one to get route IDs (for instance by crag, by state, etc.) Maybe that is what the lat/long method is for, but I haven't checked it out yet. Thanks |