Token Efficiency Engine (promptz)

A production-grade Go library and CLI for optimizing LLM prompts. promptz reduces token usage while preserving semantic meaning, using only deterministic NLP techniques.

Features

Smart Normalization: Trims whitespace, collapses spans, and removes low-value filler phrases ("in order to", "as a matter of fact").
NLP-Enhanced Rewrite:
- POS Tagging: Uses jdkato/prose/v2 for accurate Part-of-Speech tagging.
- TF-IDF Scoring: Ranks word importance across the conversation context to preserve key terms.
- Sentence Compression: Strips parenthetical asides and verbose clauses.
Semantic Deduplication: Uses Jaccard similarity to detect and merge near-duplicate messages (e.g., "fix the bug" vs "please fix that bug").
Deterministic: No external AI calls required. Runs entirely locally.
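
To make the Semantic Deduplication feature more concrete, here is a minimal sketch of Jaccard similarity over word sets. It is not promptz's actual implementation: the function names (tokenSet, jaccard), the whitespace tokenization, and the lowercasing are assumptions made for illustration, and the threshold promptz uses to decide that two messages are near-duplicates is not stated in this README.

```go
package main

import (
	"fmt"
	"strings"
)

// tokenSet lowercases s, splits it on whitespace, and returns the set of
// distinct words. (Hypothetical helper; promptz may tokenize differently.)
func tokenSet(s string) map[string]struct{} {
	set := make(map[string]struct{})
	for _, w := range strings.Fields(strings.ToLower(s)) {
		set[w] = struct{}{}
	}
	return set
}

// jaccard returns |A ∩ B| / |A ∪ B| over the word sets of a and b.
// Two empty inputs are treated as identical (similarity 1).
func jaccard(a, b string) float64 {
	setA, setB := tokenSet(a), tokenSet(b)
	if len(setA) == 0 && len(setB) == 0 {
		return 1
	}
	inter := 0
	for w := range setA {
		if _, ok := setB[w]; ok {
			inter++
		}
	}
	union := len(setA) + len(setB) - inter
	return float64(inter) / float64(union)
}

func main() {
	// Shared words: "fix", "bug" (2). Union: 5 distinct words.
	fmt.Printf("%.2f\n", jaccard("fix the bug", "please fix that bug")) // prints 0.40
}
```

With plain word sets the README's example pair scores 0.40, so a real deduplicator would likely normalize the text (for example by dropping stop words) before comparing, or use a fairly low threshold.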

Modes

conservative: Minimal changes. Safe for all prompts.
balanced: Removes stop words and common filler phrases.
aggressive: Retains only high-value content words (nouns, verbs, adjectives, important adverbs).
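
As a rough illustration of what the balanced mode describes, the sketch below removes filler phrases and stop words from a string. The function name (balanced), the regular expression, and the stop-word list are all assumptions chosen for the example; promptz's real phrase and stop-word tables are not shown in this README.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

// fillerRe matches two filler phrases named in the README, case-insensitively.
// The real list in promptz is presumably longer.
var fillerRe = regexp.MustCompile(`(?i)\b(?:in order to|as a matter of fact)\b`)

// stopWords is a tiny illustrative list, not promptz's actual table.
var stopWords = map[string]bool{
	"a": true, "an": true, "the": true, "is": true, "am": true,
	"are": true, "of": true, "to": true, "in": true,
}

// balanced drops filler phrases, then drops stop words and tokens that
// consist only of punctuation, and rejoins the rest with single spaces.
func balanced(text string) string {
	text = fillerRe.ReplaceAllString(text, " ")
	var kept []string
	for _, w := range strings.Fields(text) {
		key := strings.ToLower(strings.Trim(w, ".,!?;:"))
		if key == "" || stopWords[key] {
			continue
		}
		kept = append(kept, w)
	}
	return strings.Join(kept, " ")
}

func main() {
	fmt.Println(balanced("As a matter of fact, the server is down")) // prints "server down"
}
```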

Installation

go get github.com/the-wrong-guy/promptz

CLI Usage

Build the tool:

go build -o distill ./cmd/distill

Run with JSON input:

echo '{"messages": [{"role": "user", "content": "Hello my world, whats up"}], "mode": "aggressive"}' | ./distill

Output:

{
  "optimized": [
    {
      "role": "user",
      "content": "Hello world whats up"
    }
  ],
  "tokens_before": 6,
  "tokens_after": 4,
  "savings_ratio": 0.33
}
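
The savings_ratio value above is consistent with (tokens_before - tokens_after) / tokens_before. The sketch below assumes that formula; the function name savingsRatio is hypothetical and the zero-input handling is a guess, not documented behavior.

```go
package main

import "fmt"

// savingsRatio returns the fraction of tokens removed by optimization.
// It returns 0 when before is not positive to avoid dividing by zero
// (an assumption; promptz's handling of empty input is not documented here).
func savingsRatio(before, after int) float64 {
	if before <= 0 {
		return 0
	}
	return float64(before-after) / float64(before)
}

func main() {
	fmt.Printf("%.2f\n", savingsRatio(6, 4)) // prints 0.33
}
```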

Library Usage

package main

import (
	"fmt"

	"github.com/the-wrong-guy/promptz/core/engine"
	"github.com/the-wrong-guy/promptz/core/types"
)

func main() {
	// Build a request with a single user message in aggressive mode.
	req := types.OptimizeRequest{
		Messages: []types.Message{
			{Role: "user", Content: "I am facing a database connection error in production"},
		},
		Mode: types.ModeAggressive,
	}

	// Run the optimization pipeline locally.
	resp := engine.Optimize(req)

	// Print each optimized message with its role.
	for _, msg := range resp.Optimized {
		fmt.Printf("%s: %s\n", msg.Role, msg.Content)
	}
	// Output: user: database connection error production
}

Architecture

The engine runs a 7-step optimization pipeline:

1. Token Count (Pre-optimization)
2. Normalize (Whitespace, filler phrases)
3. Compress (Strip parentheticals, verbose patterns)
4. Similarity Dedup (Merge near-duplicates)
5. NLP Rewrite (POS Tagging + TF-IDF)
6. Token Count (Post-optimization)
7. Metrics Calculation
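
The NLP Rewrite step relies on TF-IDF to decide which words matter across the conversation. The following is a minimal sketch of that scoring, treating each message as a document. It uses the textbook formula tf × ln(N / df) with whitespace tokenization; the function name tfidf and these choices are assumptions, and promptz may use a smoothed or otherwise different variant.

```go
package main

import (
	"fmt"
	"math"
	"strings"
)

// tfidf returns, for each document, a map from word to TF-IDF score.
// tf  = occurrences of the word in the document / words in the document
// idf = ln(number of documents / documents containing the word)
func tfidf(docs []string) []map[string]float64 {
	tokenized := make([][]string, len(docs))
	df := make(map[string]int) // document frequency per word
	for i, d := range docs {
		tokenized[i] = strings.Fields(strings.ToLower(d))
		seen := make(map[string]bool)
		for _, w := range tokenized[i] {
			if !seen[w] {
				seen[w] = true
				df[w]++
			}
		}
	}

	n := float64(len(docs))
	scores := make([]map[string]float64, len(docs))
	for i, words := range tokenized {
		counts := make(map[string]int)
		for _, w := range words {
			counts[w]++
		}
		scores[i] = make(map[string]float64, len(counts))
		for w, c := range counts {
			tf := float64(c) / float64(len(words))
			idf := math.Log(n / float64(df[w]))
			scores[i][w] = tf * idf
		}
	}
	return scores
}

func main() {
	docs := []string{
		"the database connection failed",
		"the database is slow",
		"restart the server",
	}
	// Words that appear in every message (here "the") score 0;
	// words unique to one message score highest.
	for w, s := range tfidf(docs)[0] {
		fmt.Printf("%-12s %.3f\n", w, s)
	}
}
```

A rewrite pass could then keep the highest-scoring words in each message and drop the rest, which is one way the key-term preservation described under Features might work.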

Benchmarks

Run the benchmarks with go run scripts/benchmark/main.go. Sample results, with Before and After given in tokens:

Sample Name	Mode	Before	After	Reduction
Tech Support (Verbose)	aggressive	114	45	60.5%
General Chat (Greeting)	aggressive	32	19	40.6%
Code Request (Complex)	aggressive	74	33	55.4%
Repeated Context	aggressive	26	13	50.0%

Contributing

We welcome contributions to make promptz even better!

How to contribute:

1. Fork the repository
2. Create your feature branch (git checkout -b feature/amazing-feature)
3. Commit your changes (git commit -m 'Add some amazing feature')
4. Push to the branch (git push origin feature/amazing-feature)
5. Open a Pull Request

Development:

Run tests: go test -v ./...
Ensure code is idiomatic Go and formatted with gofmt.

License

MIT
