Novelcrafter

How much does the AI cost?

2 min read Last updated Jan 30, 2025

What is BYOK?

BYOK stands for “bring your own key”. Novelcrafter is designed first and foremost as a writing platform, and we don’t want to tie you down to a limited selection of LLMs. You can connect local models that run on your own computer (and are therefore ‘free’, save for your energy bill), or connect through vendors such as OpenRouter, OpenAI, and Featherless.

This flexibility means that no matter what your price point is, you can still write.

Please look up the pricing structure of the vendor for the model you are trying to run. Here are the links to the popular vendors we support:

Example Model Costs

Here are example costs for some models.

For a full list of the model costs that OpenRouter provides, see OpenRouter's model list. All figures are correct as of November 2024.

Input cost: the cost per 1,000 tokens in your prompt.

Output cost: the cost per 1,000 tokens in the output of the AI.

All prices are in USD.
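The two definitions above combine into one simple sum per generation. Here is a minimal sketch in Python. The function name `generation_cost` and the sample token counts and prices are illustrative assumptions, not part of Novelcrafter or any vendor's API.

```python
def generation_cost(input_tokens, output_tokens,
                    input_price_per_1k, output_price_per_1k):
    """Return the USD cost of one generation, with prices quoted per 1,000 tokens."""
    input_cost = input_tokens / 1000 * input_price_per_1k
    output_cost = output_tokens / 1000 * output_price_per_1k
    return input_cost + output_cost


# Hypothetical request: 2,000 prompt tokens and 1,000 output tokens,
# priced at 0.01 USD (input) and 0.03 USD (output) per 1k tokens.
cost = generation_cost(2000, 1000, 0.01, 0.03)
print(f"{cost:.4f} USD")
```

Because input and output are billed separately, a long context with a short reply can still cost more than a short context with a long reply.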

Example Prompt

The following assumptions will be made for our hypothetical generation (a beat to prose prompt used in the write interface):

Input

  • We have used the general-purpose system prompt, with the preceding 1,982 words of prose read as context.
  • We have called 9 codex entries, with a combined word count of 643 words.
  • 367 words of chapter summaries have been included.

Output

  • We have an output of 400 words (around 500 tokens).

Of course, your own prompt input and output will differ, depending on how large your codex is, how much prose or outline is in the context, and how long the output is.

This is still a good approximation for us to work with.
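To get a feel for how word counts turn into tokens, here is a rough sketch that reuses the ratio from the output above (400 words is around 500 tokens). The names `TOKENS_PER_WORD` and `estimate_tokens` are hypothetical, and the ratio is only an average: each model's tokenizer counts differently.

```python
# Ratio taken from the example output: 400 words is around 500 tokens.
TOKENS_PER_WORD = 500 / 400  # 1.25, a rough average that varies by model


def estimate_tokens(word_count):
    """Roughly convert a word count into a token count."""
    return round(word_count * TOKENS_PER_WORD)


# Word counts from the example input above.
context_words = {
    "prose": 1982,
    "codex entries": 643,
    "chapter summaries": 367,
}

total_words = sum(context_words.values())
context_tokens = estimate_tokens(total_words)
print(total_words, "words is roughly", context_tokens, "tokens")
```

This covers only the prose, codex entries, and summaries. The instructions in the system prompt itself are sent as well, so the real input is likely to be somewhat larger.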

High-cost

| Model | Input Cost | Output Cost | Example Prompt Cost |
| --- | --- | --- | --- |
| Claude Opus | 0.015 | 0.075 | 0.1005 |
| GPT4 (non-turbo, 32k) | 0.06 | 0.12 | 0.3100 |
| GPT4-turbo | 0.01 | 0.03 | 0.0550 |

Input and output cost measured per 1k tokens, all prices in USD.
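The example prompt's input size in tokens is not stated, but it can be estimated by working backwards from the table. The sketch below assumes that each example prompt cost is simply input cost plus output cost, with 500 output tokens. The helper `implied_input_tokens` is a hypothetical name, and the results are estimates, not official figures.

```python
OUTPUT_TOKENS = 500  # from the example prompt: 400 words is around 500 tokens

# (input price, output price, example prompt cost) per the High-cost table
high_cost_models = {
    "Claude Opus": (0.015, 0.075, 0.1005),
    "GPT4 (non-turbo, 32k)": (0.06, 0.12, 0.3100),
    "GPT4-turbo": (0.01, 0.03, 0.0550),
}


def implied_input_tokens(input_price, output_price, example_cost):
    """Back out the input token count that would produce the example cost."""
    output_cost = OUTPUT_TOKENS / 1000 * output_price
    return round((example_cost - output_cost) / input_price * 1000)


for name, prices in high_cost_models.items():
    print(name, implied_input_tokens(*prices))
```

Under these assumptions the rows imply roughly 4,000 to 4,200 input tokens. The small differences between models are consistent with each model counting tokens slightly differently.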

Mid-cost

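| Model | Input Cost | Output Cost | Example Prompt Cost |
| --- | --- | --- | --- |
| GPT3.5-turbo | 0.003 | 0.004 | 0.0145 |
| GPT4o | 0.005 | 0.015 | 0.0275 |
| Claude 3.5 Sonnet | 0.003 | 0.015 | 0.0200 |
| Mistral Large | 0.002 | 0.006 | 0.0113 |
| Mistral Medium | 0.0027 | 0.008 | 0.0153 |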

Input and output cost measured per 1k tokens, all prices in USD.

Low-cost

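| Model | Input Cost | Output Cost | Example Prompt Cost |
| --- | --- | --- | --- |
| Claude 3 Haiku | 0.00025 | 0.00125 | 0.0016 |
| Claude 3.5 Haiku | 0.001 | 0.005 | 0.0067 |
| Weaver | 0.001875 | 0.00225 | 0.00896 |
| Gemini Pro 1.5 | 0.0025 | 0.0075 | |
| Airoboros | 0.0005 | 0.0005 | 0.0023 |
| 4o mini | 0.00015 | 0.0006 | 0.00093 |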

Input and output cost measured per 1k tokens, all prices in USD.

Free (As of November 2024)

These are based on OpenRouter prices. Locally run models are of course all free too (see below).

  • Google Gemma 2 9B
  • Meta Llama 3 8B instruct
  • Google Gemini Flash 8B Experimental
  • Nous Hermes
  • Mythomist 7B
  • Toppy
  • Hugging Face Zephyr 7B

Running Local Models

Another alternative is to run a local model using a provider such as LM Studio or Ollama. This will cost electricity, and you will be limited by how powerful your computer is. However, it is worth looking into if you want to use one of the open-source models.

Fixed Rate

We now support connections to Featherless AI, Infermatic, and Arli AI for those of you who want unlimited AI use for a monthly fee, rather than pay-as-you-go.
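Whether a fixed rate pays off depends on how often you generate. As a rough sketch, the break-even point is the monthly fee divided by the pay-as-you-go cost of one generation. The 10 USD fee below is a hypothetical placeholder, not a quote from any vendor, and the per-generation costs are the example prompt costs from the Mid-cost table.

```python
import math


def break_even_generations(monthly_fee, cost_per_generation):
    """Generations per month at which a flat fee costs no more than pay-as-you-go."""
    return math.ceil(monthly_fee / cost_per_generation)


monthly_fee = 10.00  # hypothetical subscription price in USD

# Example prompt costs (USD per generation) from the Mid-cost table
example_costs = {
    "GPT4o": 0.0275,
    "GPT3.5-turbo": 0.0145,
    "Mistral Large": 0.0113,
}

for model, cost in example_costs.items():
    count = break_even_generations(monthly_fee, cost)
    print(f"{model}: about {count} generations per month")
```

This compares price only. The fixed-rate vendors and the pay-as-you-go vendors offer different models, so treat the result as a rough guide, not a like-for-like comparison.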