Mountain Project Logo

Median vs. mean consensus grades

Original Post
MikeJ · · Unknown Hometown · Joined Aug 2011 · Points: 0

Consensus grades on the site are currently calculated using the mean opinion. Switching to the median opinion would make the estimate more robust to outliers, and would prevent this sort of nonsense:

mountainproject.com/v/debbi…

(Administrator decides he doesn't like a 5.11a climb so he gives it a 5.0 rating and screws up the consensus grade.)

Nick Wilder · · Boulder, CO · Joined Jan 2005 · Points: 4,098

I deleted the bogus ratings, and the route comments related to them.

a d · · Los Angeles, CA · Joined Mar 2010 · Points: 5

If the sample size is large enough and its normally distributed then the mean and median should be the same. With that said, I'm not sure what the sample size is...

Wiled Horse · · Unknown Hometown · Joined Dec 2002 · Points: 3,669
alexdavis wrote:If the sample size is large enough and its normally distributed then the mean and median should be the same. With that said, I'm not sure what the sample size is...
beat me to it.
except there is nothing normal about the distribution on MP!
slim · · Unknown Hometown · Joined Dec 2004 · Points: 1,103

i agree that the median is the way to go.

a d · · Los Angeles, CA · Joined Mar 2010 · Points: 5
Young Gun wrote: Switching to the median opinion would make the estimate more robust to outliers, and would prevent this sort of nonsense: mountainproject.com/v/debbi… (Administrator decides he doesn't like a 5.11a climb so he gives it a 5.0 rating and screws up the consensus grade.)
agreed.
Danny Andrews · · Austin, TX · Joined Oct 2017 · Points: 286

Is the mean still used? I can't figure it out.

chris tregge · · Madison WI · Joined May 2007 · Points: 11,036

https://www.mountainproject.com/forum/topic/109084255/changes-to-route-consensus-ratings

reboot · · . · Joined Jul 2006 · Points: 125

I believe it's median, but the page author's rating counts 3 times (or something).

Eric L · · Roseville, CA · Joined Jan 2015 · Points: 145

I'm happier with median because it actually represents a real opinion (not the calculated "2.6 children per family").  Different people have different styles and body types which may make the climb feel different to each person giving the rating.  That said, I don't think it should matter much because the ratings should be in close proximity anyway.  If someone rates a 5.5 as a 5.14 to skew (screw) the rating, they are an @$$hat and the data should be dumped.

amarius · · Nowhere, OK · Joined Feb 2012 · Points: 20

Here is how route grade voting is done on

https://www.redriverclimbing.com/RRCGuide/?type=route&id=39

Cole T · · Cincinnati, OH · Joined May 2012 · Points: 496

I prefer the way RedRiverClimbing shows the distribution using the charts and uses the consensus grade (median)

reboot · · . · Joined Jul 2006 · Points: 125
Cole T wrote:

I prefer the way RedRiverClimbing shows the distribution using the charts and uses the consensus grade (median)

Yes the charts are nice, but median consensus grade? Doesn't seem to be working for these:

https://www.redriverclimbing.com/RRCGuide/?type=route&id=1066

https://www.redriverclimbing.com/RRCGuide/?type=route&id=2226

amarius · · Nowhere, OK · Joined Feb 2012 · Points: 20
reboot wrote:

Yes the charts are nice, but median consensus grade? Doesn't seem to be working for these:

https://www.redriverclimbing.com/RRCGuide/?type=route&id=1066

https://www.redriverclimbing.com/RRCGuide/?type=route&id=2226

Disclaimer - this is wank on, wank off data manipulation

Difficult to say when sampling sizes are that small.

If I fire Matlab, and use convention 13.25 = 13a, etc,

"God's Own Stone" is right on the money:
median([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.75, which would be 13d.
mean([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.9, whatever that is.

"Southern Smoke" is
median([14.25 14.25 14.5 14.75])=14.38 - depends on programming what this could be assigned to 14a or 14b, but 14b would make more sense
mean([14.25 14.25 14.5 14.75])=14.44 - my guess would be 14b.

Andrew Krajnik · · Plainfield, IL · Joined Jul 2016 · Points: 1,739
amarius wrote:

Disclaimer - this is wank on, wank off data manipulation

Difficult to say when sampling sizes are that small.

If I fire Matlab, and use convention 13.25 = 13a, etc,

"God's Own Stone" is right on the money:
median([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.75, which would be 13d.
mean([13.5 13.5 13.75 13.75 13.75 14.25 14.25 14.25 14.25])=13.9, whatever that is.

"Southern Smoke" is
median([14.25 14.25 14.5 14.75])=14.38 - depends on programming what this could be assigned to 14a or 14b, but 14b would make more sense
mean([14.25 14.25 14.5 14.75])=14.44 - my guess would be 14b.

 I'm nit-picking here, but I think you need to review your number conventions... dont you mean 13a would be 13.0? If an a is 13.25, and a d is 13.75, what are b and c?

Edit to add: I used a convention of a=.0, b=.25, c=.5, and d=.75

Numbers round to the nearest .25, so ratings are plus or minus .125. (13a would be any number between 12.875 and 13.125, for instance.)

Doing so, the mean for God's own stone is 13.8, or 13d. The median is 13.75, also 13d.

For southern smoke, the mean is 14.1875, or 14b. The median is 14.125, which also rounds up to 14b.

For both of these routes, of course, there are no extreme outliers, which the median would help guard against. 

amarius · · Nowhere, OK · Joined Feb 2012 · Points: 20
Andrew Krajnik wrote:

 I'm nit-picking here, but I think you need to review your number conventions... dont you mean 13a would be 13.0? If an a is 13.25, and a d is 13.75, what are b and c?

Edit to add: I used a convention of a=.0, b=.25, c=.5, and d=.75

Numbers round to the nearest .25, so savings are is plus or minus .125. (13a would be any number between 12.875 and 13.125, for instance.)

Doing so, the mean for God's own stone is 13.8, or 13d. The median is 13.75, also 13d.

For southern smoke, the mean is 14.1875, or 14b. The median is 14.125, which also rounds up to 14b.

For both of these routes, of course, there are no extreme outliers, which the median would help guard against. 

Your nitpicking and comments regarding my brainfarting are right on the money.

Nick Sweeney · · Spokane, WA · Joined Jun 2013 · Points: 969
agd wrote: If the sample size is large enough and its normally distributed then the mean and median should be the same. With that said, I'm not sure what the sample size is...

That's only true if the data is valid and reliable.  In the case of someone frivolously rating a 5.11a climb 5.0 for the purpose of messing up the page, that data point is not valid.

kenr · · Unknown Hometown · Joined Oct 2010 · Points: 16,608

The statistical theory analysis of robustness of median versus mean assumes that the data points are independent of each other.

But with real humans in a public social setting, the assessments of route difficulty are surely not independent. There is substantial "herding" of opinions to follow what other people think (or what a favorite guidebook once said).

Also social bias/skewing in that nobody wants to admit they felt they found the route _harder_ than what other people already said.

Then there are rare cases of deliberate anti-herd contrarian game-playing. But as Nick just demonstrated, there are other ways to deal with those.

Ken

David K · · The Road, Sometimes Chattan… · Joined Jan 2017 · Points: 423
Eric L wrote:

I'm happier with median because it actually represents a real opinion (not the calculated "2.6 children per family").

Technically that's only true if there's an odd number of data points. If there are four families, with 1, 2, 3, and 4 children, the median is 2.5 children per family.

Guideline #1: Don't be a jerk.

Discuss MountainProject.com
Post a Reply to "Median vs. mean consensus grades"

Log In to Reply
Welcome

Join the Community

Create your FREE account today!
Already have an account? Login to close this notice.

Get Started