Hacker News — vinext + Cloudflare Workers

▲Kimi K2.7 Code is generally available in GitHub Copilot (github.blog)

99 points by unliftedq 4 hours ago | 25 comments

kingstnap 5 minutes ago [-]

Input: $0.95

Cache hit (most important): $0.19

Output: $4.00

This matches what Moonshot charges for it and puts it at roughly the price of GPT-5.4 mini, so not a bad option.

For some context here is a stupid prompt that wastes tokens: "Play a game of tic tac toe against yourself on a 5x5 board, you need 5 in a row to win."

It costs $0.006 on Kimi K2.7, and you get to see the whole raw reasoning trace.

GPT-5.4 mini costs $0.016, and its reasoning trace is only summarized.

And in case you are wondering, both play incredibly stupidly.

Kimi:

      A   B   C   D   E
  1   .   .   .   .   .
  2   .   .   .   .   .
  3   X   X   X   X   X
  4   .   O   O   O   O
  5   .   .   .   .   .

GPT 5.4 mini:

  1: X X X X X
  2: O O . . .
  3: . . O . .
  4: . . . O .
  5: . . . . O
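
The per-request costs quoted above follow from the three prices at the top of this comment. A minimal sketch of that arithmetic, assuming the prices are per one million tokens (the comment gives no unit) and using made-up token counts, since the actual counts for the tic-tac-toe prompt are not given:

```python
# USD prices as quoted at the top of this comment.
# Assumption: they are per one million tokens (the comment gives no unit).
PRICES_PER_MILLION = {"input": 0.95, "cache_hit": 0.19, "output": 4.00}


def request_cost(input_tokens, cached_tokens, output_tokens, prices=PRICES_PER_MILLION):
    """Estimate the USD cost of a single request.

    cached_tokens is the portion of input_tokens billed at the cache-hit rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached_tokens = input_tokens - cached_tokens
    total = (
        uncached_tokens * prices["input"]
        + cached_tokens * prices["cache_hit"]
        + output_tokens * prices["output"]
    )
    return total / 1_000_000


# Hypothetical token counts, picked only so the total lands near $0.006.
short_prompt = request_cost(input_tokens=40, cached_tokens=0, output_tokens=1_490)

# Hypothetical long agent turn: same request with and without a 90% cache hit.
cold = request_cost(input_tokens=100_000, cached_tokens=0, output_tokens=2_000)
warm = request_cost(input_tokens=100_000, cached_tokens=90_000, output_tokens=2_000)

print(f"short prompt: ${short_prompt:.4f}")
print(f"cold cache:   ${cold:.4f}")
print(f"warm cache:   ${warm:.4f}")
```

For a short prompt the output price dominates, while for long agent sessions the cache-hit rate, which is one fifth of the regular input price here, is what moves the bill.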

nsoonhui 33 minutes ago [-]

I used GitHub Copilot for my VS 2026 development and switched between ChatGPT and Claude. That was before I discovered Claude Code and the Codex app. Copilot was OK for my purposes, and the USD 10 per month fee was enough for my usage.

However, last month they introduced a new pricing model (I know the old pricing was not sustainable), and my USD 10 was exhausted within days. Because of that, I switched to Claude Code and Codex and have never looked back. Yes, tokens on Claude Code and Codex are heavily subsidized, but let's just enjoy the good things while they last.

I do feel there is a difference between using Claude via Copilot versus using Claude directly in Claude Code. I'm not sure what Microsoft is doing behind the scenes.

arikrahman 25 minutes ago [-]

I had a similar experience moving away from Copilot within Zed. Now using the reasonix harness for Deepseek that makes cache hits almost free. And that's with unsubsidized American providers like Digital Ocean or Cloudflare.

k__ 8 seconds ago [-]

Nice.

I paid $6 yesterday for DeepSeek V4 Flash on OpenRouter. That's like $120 a month, and it's not even a good model.

altmanaltman 17 minutes ago [-]

My Copilot quota ran out in maybe 2-3 prompts with Claude 4.8 Opus. I was expecting it to suck, but not this badly. It was good while it lasted, though.

andhuman 2 hours ago [-]

Finally an alternative to the big dogs that a company can use. People have been asking for a way to run the Chinese models from a trusted provider. Here GitHub delivered!

The performance, if we trust the benchmarks, puts it at the level of Sonnet 4.6.

Let’s see if it’s worth it with GitHub's pricing.

MangoCoffee 2 hours ago [-]

Microsoft needs to offer a cheaper option since they changed to token-based billing. GPT-5.4 used to be 1x for yearly subscribers, but now it costs 6x. I ran out of premium requests after just a couple of prompts. GitHub Copilot for $10 used to be the best value, since you got all the US AI labs' models for cheap.

sneezychl 37 minutes ago [-]

Copilot was insanely good value while it lasted. Only moneysoft could subsidize a service that much.

johnathan101 16 minutes ago [-]

Competition in coding models has gotten intense. A year ago it felt like choosing between two options. Now the bigger question is which model to route each task to.

mmusc 2 hours ago [-]

Yes, it's significantly cheaper to run than the other models. I tried it for an hour yesterday and the results look promising.

Saw in a discussion on Reddit that the team is evaluating glm5.2 so hopefully more to come!

SeriousM 2 hours ago [-]

Who really cares? The model multipliers and the artificial currency were the final nail in the Github Copilot coffin.

sognetic 1 hours ago [-]

Enterprises still have big contracts with GitHub. Those companies are imposing tight spending limits now, and if the open-weight models make those limits last a bit longer, that will probably be quite popular.

skybrian 1 hours ago [-]

Looks like it’s the same price on Fireworks AI?

https://fireworks.ai/blog/kimi-k2p7-code

I don’t know much about them but they did a deal with Microsoft in March:

https://azure.microsoft.com/en-us/blog/introducing-fireworks...

scriptsmith 2 hours ago [-]

Is GitHub Copilot the best positioned platform for enterprise? They support Claude, GPT, Gemini, and now even open weight models. Larger orgs are paying at API rates anyway so it costs just as much as anywhere else. They have a pretty good agent CLI and SDK, and now a desktop app. They have hosted agents, and you can run their 'Agentic Workflows' in CI.

Has their reputation tanked so much that the alternatives get all the buzz? Or is it that non-enterprise users are priced out by the usage costs, so no free marketing?

gunalx 2 hours ago [-]

The rug pull of changing the pricing without further notice was not taken kindly by enterprise customers.

attentive 1 hours ago [-]

They were, until they decided to commit suicide for the service.

impact_sy 2 hours ago [-]

When will DeepSeek be available?

pkaye 2 hours ago [-]

The V4 models are already in Azure AI Foundry, so there is a good chance of it coming.

boundless88 3 hours ago [-]

When will GitHub Copilot support integrating custom models?

mvATM99 43 minutes ago [-]

It does, but it's very poorly documented and quite unstable (on purpose, I think). What the other commenter said about the VSCode BYOK seems to be the more reliable way.

I tried adding a Foundry LLM as a GitHub Copilot custom model and failed miserably. But with VSCode BYOK (and GitHub Copilot as the interface) I did get it working, and I can now use DeepSeek V4 Flash with Copilot.

Klaster_1 2 hours ago [-]

AFAIK you can already use custom models in VSCode Copilot, but probably not for cloud workloads yet.

ignoramous 2 hours ago [-]

Copilot Chat has supported BYOK in the VSCode plugin since Oct 2025: https://code.visualstudio.com/blogs/2026/06/18/byok-vscode

websap 2 hours ago [-]

Where is the inference running?

pkaye 2 hours ago [-]

Azure. It was already available on the Azure AI Foundry before.

https://docs.github.com/en/copilot/reference/ai-models/model...

Tepix 33 minutes ago [-]

On servers that are subject to the CLOUD Act. Expect no GDPR compliance.
