XTRA! XTRA! READ ALL ABOUT IT: KIck Posterior QA by JNew and KENPOM...

KUSTEVE

@JayHawkFanToo Kenpom’s numbers are illogical. 46th in defense…really?

JayHawkFanToo

I know the players are Frank Mason and Trey Burke of Michigan. Both records are for their sophomore years and Burke’s last year in college. Burke was a consensus 1st team All-American and much like Mason, the driving force in his team, so he would get the nod, although Mason’s stats are pretty gosh darn good and by the time he is a senior, he might well be a 1st team All-American as well.

Jesse Newell

@JayHawkFanToo Here’s the best way I can describe the way I look at it: Every team’s season is a book, and KenPom is able to read all the chapters.

A lot of the arguments that you made — and a lot of people make them — center on one team winning head to head or records against only certain teams. “Chapter 3 and 5 and 9 are good, so how can the book be bad?”

For me, KenPom’s computer is much better than me because it strips biases. Each chapter can add to the overall story. And teams can improve their stock even if they aren’t playing top 50 or 100 or 200 opponents.

In Michigan State’s case … it is the exact opposite of KU: It has lost every close game and blown out almost every other opponent. 30-point wins over top-100 KenPom teams should mean something. Losing an overtime game at Notre Dame (No. 14) also probably tells us MSU is better than its record.

We only get so many data points. Lots of people are content at stopping at head-to-head record or only looking at wins and losses. KenPom’s projection numbers, if you look, are almost a reflection of Vegas’ lines. And that’s because he takes each chapter of a team’s book into consideration.

Jesse Newell

@KUSTEVE Yes, 46th. Easily KU’s worst defense under Self. Jayhawks are 193rd in eFG% after never finishing worse than 64th in any other year under Self. This team doesn’t block shots like the past, and it never turns anyone over. Hard to make an effective defense when both those areas are bad.

Again, the numbers aren’t taking into account possible improvement (it usually happens under Self), but up to this point, I’d say that 46th number accurate reflects what this year’s team is: a good defense for normal team standards, a horrible defense for Bill Self standards.

jaybate 1.0

@Jesse-Newell

You raise a intriguing point: the similarity between the Vegas Lines and KENPOM.

Having worked some with stats and computer modeling, I believe it is rather unusual for nonparametric statistical modeling to coincide closely in predictions with other non parametric statistical modeling, unless both are using essentially like modeling assumptions, and like non parametrically applied statistical models, and like data sets.

Is Vegas relying on KENPOM?

Or Vegas oddsmakers working with a stat model essentially the same as KENPOM?

How do they come out similarly?

If I recall correctly, and I am not absolute sure that I do, you and others have noted a lot of variance between KENPOM and other ranking and predictive systems regarding rankings and predictive spreads.

KUSTEVE

@Jesse-Newell I’m sorry, Jesse …I don’t see a bad defense like last year’s edition.

My 12th grade education is preventing me to fully appreciate jb’s last post, so I Google’d “non parametrically applied statistical models”. Even my computer crashed.

wrwlumpy

Parametric Models - I think I dated one in college.

wrwlumpy

@jaybate-1.0 runway+models+Mark+Fast+Runway+LFW+Autumn+Z-Ki0bfDYFXl.jpg

Crimsonorblue22

@wrwlumpy fifi and lumpy! Ahhh

JayHawkFanToo

@Jesse-Newell

I agree that a computerized model treats all teams equally, but it also can introduce the developer’s own bias depending of how he sees different parameters. Many of the models have been developed by mathematicians with little knowledge of the game and who try to use numbers to account for everything when in real life you really cannot.

I did not based my comment on just the head-to-head result; you can look at all the parameters across the board and KU has better numbers in most if not all of them. Neither team played and yet MSU moved past KU in spite of all the numbers favoring KU. Like I said in my post, I understand that as the season moves on and more data is available the models get better, but at this point in the season, KenPoms seem to be unrealistic. You mention that MSU is losing close games and that is helping them, but you are completely overlooking the obvious…they are still losing! Do you think it is better to lose 20 games by one point each or to win the same 20 games by 1 point each? Under your premise, the teams that is 0-20 is rewarded for losing very close games while the teams that 20-0 is penalized for winning very close games…I understand that it is an extreme case and other factors are involved but the gist of it is still valid.

As far as the Vegas lines, you have to be careful on how you look at them. Initially they are set as close as possible to the most likely outcome with the sole purpose of having the bets balanced. If the betting is not balanced, and depending on the final outcome, Vegas can make a lot of money or lose a lot of money…Vegas does not like that uncertainty and the only guaranteed way for them to make money is to have the bets perfectly balanced and make money on the “vig” or commission it collects. Vegas will tweak the line not to better predict the outcome but to balance the bets, and on games where there are heavy emotions involved and one party is heavily betting one way, the final tweaked line is not close to what the outcome is expected to be but to the number that will bring the bets in balance, Vegas could not care less who wins a game, they only care about making money.

The gambling industry is a multi-billion business and I will guess they have access to resources that we don’t know and the models they use are considerably more sophisticated and confidential that the public ones, plus they have access to “inside information” such as confidential injury reports, that KenPom, Sagarin and Massey can only dream of.

Very interesting discussion and several perspectives that reflects our own bias. Good to have you here to keep us on our toes and get our thinking caps on.

Jesse Newell

Here’s this week’s update. KU makes a nice jump up the board.

jaybate 1.0

@wrwlumpy

I don’t usually like models, but Fifi has a serious chassis.

JayHawkFanToo

@Jesse-Newell

I would be interested in your personal comments about the KenPom numbers.

Obviously KenPom’s computer does not watch the actual games, and as I indicated before, the model is pretty flawed until close to the end of the season when the outcome is mostly obvious anyway.

Do you believe that OU that just lost to bottom feeder KSU at home has the best chance of winning the conference? It looks like once again, a team gets more credit for losing than winning a close game.

I watched ISU play OSU and WVU and it had to hold on for dear life to beat OSU at home and barely won at West Virginia; both wins were by 2 points . Does this mean that OSU is better and WVU worse than we know or that ISU is wildly inconsistent, and if the later is the case, should they really have the 3rd bets odds to win the championship? On the other hand, ISU is obviously in a better position than Oklahoma since it won two games it could have easily lost while OU lost a game it most certainly should have won, wouldn’t you agree?

Just a couple of observation by quickly glancing at the numbers, and again, I would be curious on what you think.

Jesse Newell

The response above I had to you “A few random thoughts” is how I feel about KenPom. Don’t have much to say other than that. We obviously aren’t going to agree on this … you say KenPom’s computer “doesn’t actually watch the games” while I would argue that KenPom’s computer is best at watching the games, being able to take every single possession of every game into account without human bias.

So yes, I believe OU is still a good team. I think there are many games (like the Texas road win) that prove that. Just as I wouldn’t say KU is a bad team because of the Temple game, I’m not going to use one game to end my judging for OU.

ISU barely beat OSU, IMO, because OSU is a really good team. I think you’ll see that tonight when the Cowboys play at Allen Fieldhouse.

Bottom line: From all the information we’ve gathered, OU is the best team in the conference, albeit just slightly. Knowing that, it remains the slight favorite at the top to win or share the title.

jaybate 1.0

@Jesse-Newell

At this point of the season, does it take several games to significantly change a team’s stats and standings in KENPOM? I always wondered how many games into a season that things start taking several games to change. I guess I am asking about how many games before the stats and rankings get fairly stable.

And, hey, I am psyched up for the OSU-KU Live Blog!

Hope you and Blake and KHas will all be sitting in.

JayHawkFanToo

@Jesse-Newell

The issue with computer rankings is not that they look at teams differently, they most definitely have no bias in this regard as they treat all teams equally; however, the real issue is that they introduce the bias/vision of what the model creator thinks is important. If the computer models were perfect, then the results would be very close. They are not. For example look at the current ranking for KU by 3 of the top rankings:

Ken Pomeroy - 16
Jeff Sagarin - 12
Kenneth Massey - 6

Many believe that Massey’s rankings are the most full-featured and scientific rankings. Even Pomeroy has spoken highly of Massey’s methods…and yet, all three have widely varying results that highlight the emphasis (read bias) each model developer has introduced into his own model.

Here is a link to a summary of all computerized rankings; you can see KU is ranked as high as #4 and as low as #19…and all of these models are using basically the same data, just looking at it differently and coming up with different results, so…which one is correct . See my point?

BTW, what is your prediction for tonight’s game? Are you taking OSU?

jaybate 1.0

@JayHawkFanToo

My suggestion is: the one that is closest to correct would be the one that averages the best forecasts of final game scores, predicted conference finishes, and final ranking in the NCAA tournament.

I suppose it wouldn’t be to hard for a quant with a budget to input that data for all the different services for the last 5 years and come to a conclusion.

Jesse Newell

@jaybate-1.0 Agree JB. Or even better, as Ken has suggested, take an aggregate of the top proven systems so that outliers are filtered out.

@JayHawkFanToo I didn’t think I was arguing Pom’s merits against Sagarin and Massey, which obviously are well respected as well. I can tell you that Pomeroy’s numbers almost directly reflect the Vegas line, which to me shows he’s right on the money in most cases. He also does test and retest and isn’t afraid to make changes if it improves his projections.

And I’ve got KU beating OSU by two.

JayHawkFanToo

@Jesse-Newell

As I mentioned before, the Vegas lines (and I am on the records saying that I like them a lot) are not predictor of outcome, they are set to make sure bets are evenly spread and they are tweaked, not based on any sports criteria, but uniquely to ensure even betting and profit for the house on the vig. Maybe, just maybe, the opening line could be…but it quickly changes to balance the bets. For example, against TTU, KU open favored by 15 and by game time it went down to 14-1/2 (might be off by 1’2 point) and KU ended up winning by how much? 32 points?

As far as tonight’s game, OSU has a short bench and in the last 6 games (as far back as I checked) they had only 5 players in each game score more than 2 points; the key is limiting Forte’s points. In the their 3 (not close) loses Forte had 5 points against South Carolina and 13 points against Maryland. BTW, Vegas opened at 6-1/2 KU and the consensus is now at 7-1/2; I will surprise you and take KU by 7.

Jesse Newell

@JayHawkFanToo The Vegas line is closer to reality than the actual game score. We’re trying to determine what is going to happen before the game. After it starts … well, stuff happens. You think Florida State is 30-plus points worse than Oregon? Or Oregon is 20-plus points worse than Ohio State? Of course not. Stuff happened. A lot of luck is involved. So if they played the games 100 times, who would win most? That’s what we’re trying to figure out, and the Vegas line (like you said, especially the opening line) is about as close as we can come to that.

jaybate 1.0

@JayHawkFanToo

The statistics explanation of how bet balancing works, at least assuming it is NOT corruptly engineered with managed hype and a lot of narco and intel money laundering distorting it massively, is that over large numbers of bets the error factors of each bet by each bettor cancel out to a probabilistic estimation of the most likely actual spread, especially if the betting process is biased with the initial line setting based on sophisticated initial prediction. Essentially betting becomes a huge series of tiny interactions around the initial line. So, statistically speaking, bet balancing is just another technique of predicting a probable outcome. The logic that bet balancing is not about the estimating the final outcome contradicts the reality of what all of the betters are in fact trying to do. They are each betting with their biases on what the actual outcome will be. The beauty of bet balancing is that it is continually interacting and canceling out the error factor of each individual bet and bettor. So the probability is that the Vegas line, with an enormous betting volume, is actually going to tend to be more accurate than any single estimation system, especially if it were to start with, say, KENPOM. Or at least that is how I have had it explained to me by a couple of folks that I would tend to think know quite a bit about this sort of thing.

drgnslayr

@JayHawkFanToo

“As I mentioned before, the Vegas lines (and I am on the records saying that I like them a lot) are not predictor of outcome, they are set to make sure bets are evenly spread and they are tweaked, not based on any sports criteria, but uniquely to ensure even betting and profit for the house on the vig.”

That’s it. That’s how the line works in Vegas. Because their money-making model is to reduce their risk to nothing while taking 10% fees from the losing side. The Vegas line is a measurement of how the betting world sees the game outcome at that very moment. That’s why lines change over time, even if most (or all) the real factors involved in the game remain the same, because bets come in and start changing the scales of dollar amounts on each side.

We can criticize computer prediction models all we want… because they only offer their results off the limited information they have.

If you want to put it to the ultimate test… try to have a computer predict specific player match ups. Well… if I wanted a machine to accurately predict results I’d want match up info in there and I’d want it to be accurate. How can that work?

Computer results tend to be really wild early in the year because there isn’t enough data yet.

Let’s all put ourselves in the shoes of these computers. Imagine a season where you didn’t see a single game. No visuals. And you don’t read blogs and news that give other information. The only thing you have to go off of is a stat book. Now you predict the future! We blast computer interpretations, but this is how a computer has to deal with it all, with some additional programmable formulas developed by someone… Whoever is the person (or people) writing the formulas and codes… there is nothing in this for them to develop inaccurate results. The big prize for them is to develop something that has a higher accuracy over time.

There will be improvements as we move forward, just like the fact that we will see improvements in all areas of computing. I saw a nerdy friend wearing this:

The only sure thing in life is:

Death

Taxes

Software Updates

JayHawkFanToo

@Jesse-Newell

Bottom line: From all the information we’ve gathered, OU is the best team in the conference, albeit just slightly. Knowing that, it remains the slight favorite at the top to win or share the title.

Do you still think OU is the best team in the conference? They just lost to WVY by 19…after losing to KSU at home.

Crimsonorblue22

@JayHawkFanToo ha ha!!

JayHawkFanToo

@Jesse-Newell

The Vegas line is closer to reality than the actual game score. We’re trying to determine what is going to happen before the game.

With all due respect, I don’t believe you understand how the Vegas line works. Although they seem to closely resemble margins of victory, the Vegas lines do not predicts game outcomes and they are not meant to do that, they simply predict the numbers that will force money to be bet evenly, which is the only guaranteed way it has of making money and this is why they are constantly changing, again, so money bets are balanced. Vegas does not care who wins or loses or what the margin of victory is, all they care is that the bets are balanced so they can make money.

BTW, nothing is closer to reality than the actual game score…the game score IS reality, right?

Again, no malice or disrespect intended.

JayHawkFanToo

@jaybate-1.0

Again with all due respect and no malice intended, you like Jesse are under the impression that the Vegas lines are trying to predict the outcome of games, the Vegas lines do no do such thing, they try to predict the number that will cause balanced betting, even when they resemble the predicted outcomes. I will leave it at that and you can research it for yourself.

jaybate 1.0

@drgnslayr

I learned long ago not to quarrel about statistics, but I do want to take one more pass at what prediction and bet balancing are about, as far as I have been able to learn.

Accuracy of estimation in stochastic realm is all about canceling prediction biases and limiting variance to random error.

There is perhaps no better way to cancel biases than by bet balancing at high volumes of betting. It is a marvelously effective technique.

The larger the number the better the cancelling of biases.

And the “vig” doesn’t bias the outcome of the bet balancing at all. In fact, the vig is irrelevant to the final betting line. In effect the vig comes half out of the range above the betting line and half out of the range below the betting line.

I am not sure where this urban legend of bet balancing being nothing more than a meaningless indicator of what bettor’s think got started. It is not that anyone of the betters know diddly squat. It is that all together cancel out each other’s biases.

I know for a time that I learned this urban legend and assumed it was true for a time. But because I was trained some in stats I thought finally that there was something wrong with that logic and I worked through it and logically it is just an urban legend.

For the same reason that a Vegas line is a very effective way of predicting the outcome of games because of using high Ns of betting to balance, Ken Pomerory suggests using all the different estimation models, not just his, or Sagarin’s, or someone else’s, to make predictions. The more estimation models you use, if they are all based on the same data and varied, but sound algorithms, the more likely it is that the biases of their algorithms will cancel out and you will end up with a closer approximation of the actual outcome.

But when it comes to stats, again, I never argue. Just sharing some food for thought.

JayHawkFanToo

@drgnslayr

You obviously understand how the Vegas lines work; it is one of the most misunderstood concepts. I learned the concept myself the first time I visited Vegas 30+ years ago.