Median vs. mean consensus grades

Follow topic: Email Notify on site

Post Reply

Original Post

	MikeJ · Jun 26, 2012 · Unknown Hometown · Joined Aug 2011 · Points: 0 Consensus grades on the site are currently calculated using the mean opinion. Switching to the median opinion would make the estimate more robust to outliers, and would prevent this sort of nonsense: mountainproject.com/v/debbi… (Administrator decides he doesn't like a 5.11a climb so he gives it a 5.0 rating and screws up the consensus grade.) 1 Flag Quote
	Nick Wilder · Jun 26, 2012 · Boulder, CO · Joined Jan 2005 · Points: 4,098 I deleted the bogus ratings, and the route comments related to them. 2 Flag Quote
	a d · Jun 26, 2012 · Los Angeles, CA · Joined Mar 2010 · Points: 5 If the sample size is large enough and its normally distributed then the mean and median should be the same. With that said, I'm not sure what the sample size is... 0 Flag Quote
	Wiled Horse · Jun 26, 2012 · Unknown Hometown · Joined Dec 2002 · Points: 3,669 alexdavis wrote:If the sample size is large enough and its normally distributed then the mean and median should be the same. With that said, I'm not sure what the sample size is... beat me to it. except there is nothing normal about the distribution on MP! 4 Flag Quote
	slim · Jun 26, 2012 · Unknown Hometown · Joined Dec 2004 · Points: 1,103 i agree that the median is the way to go. 3 Flag Quote
	a d · Jun 26, 2012 · Los Angeles, CA · Joined Mar 2010 · Points: 5 Young Gun wrote: Switching to the median opinion would make the estimate more robust to outliers, and would prevent this sort of nonsense: mountainproject.com/v/debbi… (Administrator decides he doesn't like a 5.11a climb so he gives it a 5.0 rating and screws up the consensus grade.) agreed. 1 Flag Quote
	Danny Andrews · Mar 20, 2018 · Austin, TX · Joined Oct 2017 · Points: 286 Is the mean still used? I can't figure it out. 0 Flag Quote
	chris tregge · Mar 20, 2018 · Madison WI · Joined May 2007 · Points: 11,036 https://www.mountainproject.com/forum/topic/109084255/changes-to-route-consensus-ratings 1 Flag Quote
	reboot · Mar 20, 2018 · . · Joined Jul 2006 · Points: 125 I believe it's median, but the page author's rating counts 3 times (or something). 2 Flag Quote
	Eric L · Mar 20, 2018 · Roseville, CA · Joined Jan 2015 · Points: 145 I'm happier with median because it actually represents a real opinion (not the calculated "2.6 children per family"). Different people have different styles and body types which may make the climb feel different to each person giving the rating. That said, I don't think it should matter much because the ratings should be in close proximity anyway. If someone rates a 5.5 as a 5.14 to skew (screw) the rating, they are an @$$hat and the data should be dumped. 0 Flag Quote
	amarius · Mar 20, 2018 · Nowhere, OK · Joined Feb 2012 · Points: 20 Here is how route grade voting is done on https://www.redriverclimbing.com/RRCGuide/?type=route&id=39 2 Flag Quote
	Cole T · Mar 20, 2018 · Cincinnati, OH · Joined May 2012 · Points: 496 I prefer the way RedRiverClimbing shows the distribution using the charts and uses the consensus grade (median) 1 Flag Quote
	reboot · Mar 20, 2018 · . · Joined Jul 2006 · Points: 125 Cole T wrote: I prefer the way RedRiverClimbing shows the distribution using the charts and uses the consensus grade (median) Yes the charts are nice, but median consensus grade? Doesn't seem to be working for these: https://www.redriverclimbing.com/RRCGuide/?type=route&id=1066 https://www.redriverclimbing.com/RRCGuide/?type=route&id=2226 0 Flag Quote
	amarius · Mar 20, 2018 · Nowhere, OK · Joined Feb 2012 · Points: 20 reboot wrote: Yes the charts are nice, but median consensus grade? Doesn't seem to be working for these: https://www.redriverclimbing.com/RRCGuide/?type=route&id=1066 https://www.redriverclimbing.com/RRCGuide/?type=route&id=2226 Disclaimer - this is wank on, wank off data manipulation Difficult to say when sampling sizes are that small. If I fire Matlab, and use convention 13.25 = 13a, etc, "God's Own Stone" is right on the money: median([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.75, which would be 13d. mean([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.9, whatever that is. "Southern Smoke" is median([14.25 14.25 14.5 14.75])=14.38 - depends on programming what this could be assigned to 14a or 14b, but 14b would make more sense mean([14.25 14.25 14.5 14.75])=14.44 - my guess would be 14b. 0 Flag Quote
	Andrew Krajnik · Mar 20, 2018 · Plainfield, IL · Joined Jul 2016 · Points: 1,739 amarius wrote: Disclaimer - this is wank on, wank off data manipulation Difficult to say when sampling sizes are that small. If I fire Matlab, and use convention 13.25 = 13a, etc, "God's Own Stone" is right on the money: median([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.75, which would be 13d. mean([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.9, whatever that is. "Southern Smoke" is median([14.25 14.25 14.5 14.75])=14.38 - depends on programming what this could be assigned to 14a or 14b, but 14b would make more sense mean([14.25 14.25 14.5 14.75])=14.44 - my guess would be 14b. I'm nit-picking here, but I think you need to review your number conventions... dont you mean 13a would be 13.0? If an a is 13.25, and a d is 13.75, what are b and c? Edit to add: I used a convention of a=.0, b=.25, c=.5, and d=.75 Numbers round to the nearest .25, so ratings are plus or minus .125. (13a would be any number between 12.875 and 13.125, for instance.) Doing so, the mean for God's own stone is 13.8, or 13d. The median is 13.75, also 13d. For southern smoke, the mean is 14.1875, or 14b. The median is 14.125, which also rounds up to 14b. For both of these routes, of course, there are no extreme outliers, which the median would help guard against. 1 Flag Quote
	amarius · Mar 20, 2018 · Nowhere, OK · Joined Feb 2012 · Points: 20 Andrew Krajnik wrote: I'm nit-picking here, but I think you need to review your number conventions... dont you mean 13a would be 13.0? If an a is 13.25, and a d is 13.75, what are b and c? Edit to add: I used a convention of a=.0, b=.25, c=.5, and d=.75 Numbers round to the nearest .25, so savings are is plus or minus .125. (13a would be any number between 12.875 and 13.125, for instance.) Doing so, the mean for God's own stone is 13.8, or 13d. The median is 13.75, also 13d. For southern smoke, the mean is 14.1875, or 14b. The median is 14.125, which also rounds up to 14b. For both of these routes, of course, there are no extreme outliers, which the median would help guard against. Your nitpicking and comments regarding my brainfarting are right on the money. 1 Flag Quote
	Nick Sweeney · Mar 20, 2018 · Spokane, WA · Joined Jun 2013 · Points: 969 agd wrote: If the sample size is large enough and its normally distributed then the mean and median should be the same. With that said, I'm not sure what the sample size is... That's only true if the data is valid and reliable. In the case of someone frivolously rating a 5.11a climb 5.0 for the purpose of messing up the page, that data point is not valid. 0 Flag Quote
	kenr · Mar 20, 2018 · Unknown Hometown · Joined Oct 2010 · Points: 16,608 The statistical theory analysis of robustness of median versus mean assumes that the data points are independent of each other. But with real humans in a public social setting, the assessments of route difficulty are surely not independent. There is substantial "herding" of opinions to follow what other people think (or what a favorite guidebook once said). Also social bias/skewing in that nobody wants to admit they felt they found the route _harder_ than what other people already said. Then there are rare cases of deliberate anti-herd contrarian game-playing. But as Nick just demonstrated, there are other ways to deal with those. Ken 1 Flag Quote
	David K · Mar 21, 2018 · The Road, Sometimes Chattan… · Joined Jan 2017 · Points: 423 Eric L wrote: I'm happier with median because it actually represents a real opinion (not the calculated "2.6 children per family"). Technically that's only true if there's an odd number of data points. If there are four families, with 1, 2, 3, and 4 children, the median is 2.5 children per family. 2 Flag Quote