Every American senior building a college list opens at least one of these screens. Some are walled gardens used inside the counseling office. Some are free apps anyone can try. A few are paid. We tested seven platforms in March 2026 to see how they really decide what counts as a reach.
CollegeKickstart is the platform parents and counselors turn to when they want a sharper take. Founded in 2014 by an MIT alum in Pleasanton, it now serves more than 750 institutions and sells direct to families for $80 to $125 a year.
Its core feature is a list balance grader: every school on a student's list is sorted into Likely, Target, Reach, or Unlikely. The rule is mechanical and published: Likely needs an admit rate above 50% and a student in the top quartile of last year's pool; Unlikely is anything below 25% with a student below the typical profile.
Underneath sits a more interesting product, an Early Admission Strategy engine that ranks ED, ED II, EA, and Restrictive Early Action options based on a student's profile and the school's historical patterns. CollegeKickstart publishes the data — you can read its annual Class of 2030 results blog and see, for example, that Duke's ED rate clocked in at 13.8% and Brown's at 16.5%.
Rules give clarity but not probability. The next category claims to give both.
CollegeVine is the most sophisticated commercial chancing tool — and the one most students see first. Free for the user, with revenue from essay reviews and a B2B AI-recruiter product, it pulled $30.7 million in funding led by Fidelity and runs on roughly $7.5 million in annual revenue.
In May 2021 it threw out its rule-based engine and went what its lead data scientist called "fully ML-forward." Today the model trains on more than 100,000 outcomes from IPEDS, Common Data Sets, and self-reported user results. It refreshes annually.
The richest layer is its extracurricular taxonomy: four tiers, three sub-tiers each, twelve levels in all — from Tier 1 (USAMO, Intel, published novel) to Tier 4 (general club participation). ECs alone account for roughly 35% of the prediction. The model also handles GPA, course rigor, rank, gender, intended major, first-gen status, low-income status, and round (added May 2022).
And, rare for the industry, CollegeVine publishes calibration. When it predicts 30%, the actual rate is 28.7%. At 80%, actual is 82.2%. They compare themselves to the FiveThirtyEight NBA model.
Calibration is one thing. Coverage is another.
Scoir is the platform Naviance has reason to worry about. Roughly 12% market share, 2,200 high schools, 40-50% YoY growth, and pricing about a third lower per student than Naviance ($4.80 vs $7.11). It also became the Coalition Application platform itself, and integrates with Common App for near-real-time document delivery.
Its January 2025 release, Admission Intelligence 2.0, did what Naviance has not: it shipped a per-student, per-college, per-round probability trained on what the company calls "tens of millions of de-identified outcome records." A student gets one number for ED, one for EA, one for RD — distinct, by college.
The model uses GPA (weighted and unweighted), test scores, first-gen status, geography, race/ethnicity, sex, and high-school profile. It does not include course rigor, ECs, essays, recommendations, or legacy/donor status — those last two get annotated on the scattergram but never enter the math.
All these tools have one thing in common: they push behavior. The next chapter shows in which direction.
Two academic studies bracket the question of whether tools like these help. Mulhern (2021), published in the Journal of Labor Economics, found that Naviance increases four-year college enrollment for underrepresented students at local public flagships. That is the upbeat finding.
Tomkins, Grossman, Page, and Goel (2023), published in PNAS, ran the data the other way. Among high-achieving students (SAT above 1310), Naviance adoption increased undermatching by about fifty percent. The mechanism is the scattergram itself: when a student sees only two prior dots near the admit line and a cluster of red ones above, they retreat to a less selective school where their profile fits the green dots — even though holistic factors would have made them genuinely competitive.
The chart on the right is the headline result. A two-variable visualization, the authors argued, dissuades exactly the applications that should be made.
The paid tools have models. The free ones have habits — and a much wider audience.
Most students still try a free chancing site before they ever talk to a counselor. PrepScholar, CampusReel, CollegeData, AdmitYogi, Parchment, GradGPT, and the famously chaotic r/ChanceMe all promise a number — and most of them deliver one calculated in roughly the same way: a percentile lookup against published Common Data Set ranges.
Some are smarter. AdmitYogi trains on 6,000 real outcomes but only for the top 20 colleges. GradGPT runs an LLM-flavored scorer that claims 90% accuracy. CollegeData uses a three-step comparison and is the only free tool that shows ED, EA, and RD rates separately. Reddit's r/ChanceMe is methodologically worthless and sociologically fascinating.
The chart sorts these tools by what they actually use as input: a single point (GPA + SAT) at one end, full profile parsing at the other. Most cluster at the simpler end.
Strip away the marketing copy and these tools are mostly arguing about which features to ignore.
Every chancing tool is, at heart, a list of features it bothers to look at and a list it doesn't. Hooks — recruited athlete, donor, legacy, first-generation — are where the gap shows up most clearly. Naviance, CollegeKickstart, and Scoir don't model legacy or donor status at all. CollegeVine handles first-gen and (until SFFA) ethnicity but lists legacy as "not yet implemented."
Essays appear nowhere in any commercial chancing model. Letters of recommendation, demonstrated interest, and interviews are all absent. Round handling varies sharply: Naviance and Kickstart ignore it, CollegeVine treats ED as a yield-side commitment ("100% yield, no admit boost"), Scoir produces distinct per-round probabilities, and our own simulation boosts P(admit) directly via a round multiplier in logit space.
The matrix on the right is the cleanest summary. Filled cells are features the tool models; empty cells are gaps. The picture is sparser than the marketing suggests.
Which raises the obvious question: how good can any of these models actually get?
Published research puts a hard upper bound on what any of these tools can deliver. A 2023 study found that simple logistic regression on undergraduate admissions reached only AUC 0.56 — barely better than a coin flip. Tuned random forests and XGBoost climbed to AUC 0.76. A 2024 deep-learning paper added another three points. Graduate admissions, with their narrower input space, hit 89.5% accuracy.
Why the gap? Holistic review introduces a permanent fog of unobservable factors — essay craft, recommendation tone, demonstrated interest, an admissions officer's mood on a Tuesday in February. None of that is in the training data. None of it ever will be.
On the chart, CollegeVine's self-reported numbers sit above the academic baselines. That is impressive — and not yet independently audited. The bar at the right is the rough ceiling any tool can credibly claim today.
So what is a family supposed to do with all of this?
Each of these tools answers a slightly different question. Naviance answers what happened to students like me at this school. CollegeKickstart answers is my list balanced. CollegeVine answers what would a model trained on a hundred thousand outcomes guess. Scoir answers all three at once, and breaks the answer out by round.
None of them gives a real probability with confidence intervals. None of them models hooks comprehensively. None has been independently audited. And the Tomkins finding — that the most familiar of them increases undermatching by half — should be uncomfortable for the whole industry.
Our own logistic simulation tries to plug specific gaps: hooks (athlete, donor, legacy, first-gen) in logit space, an international seat reserve per college, phantom applicants for the 98.7% of the national pool we don't model, and parental-education effects correlated with income. It is fully transparent, but it is also one model among many, and shares the AUC ceiling the academic literature has identified.
The most honest takeaway is the boring one: use more than one tool, distrust any single number, and weigh the dots on the chart against the things the chart cannot see.