eth-vanity-metal

A Metal GPU-accelerated Ethereum vanity address generator for Apple Silicon.

Generate custom Ethereum addresses with specific prefixes or suffixes using Apple Silicon's native Metal GPU acceleration.

Performance Leader: Metal GPU native implementation now achieves ~265 MH/s, 2.6x faster than profanity2's OpenCL on the same hardware!

Performance

Mode	Speed	Hardware
Metal GPU Native	~265 MH/s	Apple M4 Pro
profanity2 (OpenCL)	~100 MH/s	Apple M4 Pro
CPU multi-threaded	~2 MH/s	14-core M4 Pro

Metal native is now 2.6x faster than profanity2's OpenCL on Apple Silicon! The Metal GPU implementation uses optimized Jacobian coordinates with efficient field arithmetic, achieving industry-leading performance for vanity address generation on macOS.

Difficulty Estimates (at 265 MH/s)

Pattern Length	Combinations	Approximate Time
1 character	16	Instant
2 characters	256	Instant
3 characters	4,096	Instant
4 characters	65,536	Instant
5 characters	1,048,576	< 1 second
6 characters	16,777,216	~1 second
7 characters	268,435,456	~1 second
8 characters	4,294,967,296	~16 seconds

Features

Metal GPU Acceleration - Native Apple Silicon GPU compute for maximum performance
Prefix Search - Find addresses starting with custom hex patterns (e.g., 0xdead...)
Suffix Search - Find addresses ending with patterns (e.g., ...beef)
Simpler than TRON - No Base58, just hex! Pattern matching is straightforward
Real-time Progress - Live speed, match counter, and time elapsed
Secure - All keys generated locally, never transmitted
Validated - Generated addresses verified against reference implementations

Installation

Prerequisites

macOS 12.0+ (Monterey or later)
Apple Silicon Mac (M1/M2/M3/M4) or Intel Mac with AMD GPU
Rust 1.70+

Build from Source

git clone https://github.com/mrtozner/eth-vanity-metal.git
cd eth-vanity-metal
cargo build --release

The binary will be at ./target/release/eth-vanity-metal.

Usage

GPU-Accelerated Search (Recommended)

# Find address starting with "dead"
./target/release/eth-vanity-metal -p dead --gpu-native

# Find address ending with "beef"
./target/release/eth-vanity-metal -e beef --gpu-native

# Find address starting with "cafe" AND ending with "42"
./target/release/eth-vanity-metal -p cafe -e 42 --gpu-native

CPU-Only Search (Slower, for compatibility testing)

./target/release/eth-vanity-metal -p dead

Command Line Options

Options:
  -p, --prefix <PREFIX>    Hex prefix pattern (e.g., 'dead' for 0xdead...)
  -e, --end <SUFFIX>       Hex suffix pattern (e.g., 'beef' for ...beef)
      --gpu-native         Use Metal GPU acceleration (recommended)
      --benchmark          Run performance benchmark
      --info               Show hardware information
  -t, --threads <N>        Number of CPU threads (CPU mode only)
  -h, --help               Print help

Output Format

Found vanity address!
========================
Address:      0xdead5d0d95924acc737fb0f7e5289a5a8d0bb6bd
Private Key:  58fd1dac4f979cb27d4acb35119e27a6a759a3c655636f3ad7995dab52d50a59

WARNING: Keep your private key secure! Anyone with this key can access your funds.

IMPORTANT: Save the private key immediately! It's only displayed once.

Technical Details

Algorithm

Generate random 256-bit private keys in batches (268M per GPU batch)
Compute public key points using precomputation table lookup
Apply Keccak-256 hash to uncompressed public key (64 bytes, excluding 0x04 prefix)
Take last 20 bytes as Ethereum address
Check if address matches hex pattern
Return matching private key

GPU Optimizations (Current)

Precomputation Table - 8,160 precomputed EC points for fast scalar multiplication
64-bit Limbs - Optimized uint256 arithmetic using 4x64-bit limbs
Jacobian Coordinates - Avoids per-operation inversions (~16 field muls per point addition)
Optimized Keccak-256 - Unrolled 24-round permutation for Metal shaders
131K GPU Threads - Parallel processing with 2048 steps each
Fast Modular Reduction - Exploits secp256k1 prime structure: 2^256 ≡ 2^32 + 977 (mod P)

Planned Optimizations

Montgomery Batch Inversion - Amortize 1 inverse across 255 keys (8x speedup potential)
deltaX/lambda representation - Reduce to ~2 field muls per iteration (vs current ~16)

Architecture

┌─────────────┐     ┌──────────────────┐     ┌─────────────┐
│  Rust CLI   │────▶│   Metal Shader   │────▶│  GPU Cores  │
│  (main.rs)  │◀────│ (search_native)  │◀────│  (M4 Pro)   │
└─────────────┘     └──────────────────┘     └─────────────┘
      │                      │                      │
      │         ┌────────────┴────────────┐       │
      │         │                         │       │
      │    ┌────┴────┐             ┌─────┴─────┐  │
      │    │ Precomp │             │ EC Point  │  │
      │    │  Table  │             │ Addition  │  │
      │    │(8160 pt)│             │ (Jacobian)│  │
      │    └─────────┘             └───────────┘  │
      │                                           │
      │         ┌──────────────────┐             │
      └────────▶│    Keccak-256    │◀────────────┘
                │  (unrolled 24r)  │
                └──────────────────┘

Precomputation Table

The GPU uses a precomputation table of 8,160 elliptic curve points:

32 byte positions × 255 non-zero values
Allows computing k×G with only ~32 point additions instead of ~256
Table size: ~510 KB (fits comfortably in GPU cache)

Security

Private keys are generated using cryptographically secure random number generator
All computation happens locally on your machine
No network connections or telemetry
Keys are displayed once and not stored
Addresses validated against reference implementation (eth_account Python library)

WARNING:

Never share your private key
Test the generated address with a small amount first
Consider using a hardware wallet for large amounts
Verify the address checksum before use

Comparison

Tool	GPU Support	Speed	Platform
eth-vanity-metal	Metal (native)	~265 MH/s	macOS (Apple Silicon)
profanity2	OpenCL	~100 MH/s	Linux/Windows/macOS
vanity-eth (Node)	CPU only	~50 KH/s	Cross-platform
ethvanity (Go)	CPU only	~200 KH/s	Cross-platform

eth-vanity-metal is the fastest Ethereum vanity address generator, achieving 2.6x the performance of profanity2 on Apple Silicon.

Ethereum Address Format

Unlike TRON (which uses Base58Check), Ethereum addresses are simpler:

20 bytes (40 hex characters)
Prefixed with 0x
No checksum in the address itself (EIP-55 provides optional mixed-case checksum)
Example: 0xdead5d0d95924acc737fb0f7e5289a5a8d0bb6bd

This means pattern matching is straightforward - what you search for is exactly what appears in the address!

Roadmap

Current Status: ~265 MH/s (Production Ready) 🎉

Target Achieved! Metal native implementation now exceeds profanity2's OpenCL performance by 2.6x on Apple Silicon.

Completed Optimizations

Optimized Jacobian coordinates - Efficient field arithmetic achieving 265 MH/s
Metal shader optimization - Native GPU compute outperforming OpenCL
Fast modular reduction - Exploiting secp256k1 prime structure
Precomputation table - 8,160 precomputed EC points for fast scalar multiplication

Future Enhancements

Multi-GPU support - Distribute work across multiple Metal devices
EIP-55 checksum patterns - Support mixed-case checksum matching
GUI application - Native macOS app with SwiftUI interface
Further optimizations - Explore additional Metal compute optimizations

Why Metal Wins

Metal's native integration with Apple Silicon provides lower overhead and better performance than OpenCL. The 265 MH/s achievement proves that Metal is the superior choice for GPU compute on macOS.

Contributing

Contributions welcome! Areas of interest:

Implementing profanity2-style batch inversion in Metal
GLV endomorphism optimization
Multi-GPU support
Further Metal shader optimizations
Support for EIP-55 checksum patterns

License

MIT License - see LICENSE

Changelog

See CHANGELOG.md for version history.

Acknowledgments

profanity2 for GPU optimization techniques
secp256k1 curve implementation techniques from libsecp256k1
Montgomery batch inversion algorithm
Apple Metal Compute documentation

Made with Metal for Apple Silicon

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.cargo		.cargo
examples		examples
src		src
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
Cargo.toml		Cargo.toml
LICENSE		LICENSE
OPTIMIZATION_SUMMARY.md		OPTIMIZATION_SUMMARY.md
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

eth-vanity-metal

Performance

Difficulty Estimates (at 265 MH/s)

Features

Installation

Prerequisites

Build from Source

Usage

GPU-Accelerated Search (Recommended)

CPU-Only Search (Slower, for compatibility testing)

Command Line Options

Output Format

Technical Details

Algorithm

GPU Optimizations (Current)

Planned Optimizations

Architecture

Precomputation Table

Security

Comparison

Ethereum Address Format

Roadmap

Current Status: ~265 MH/s (Production Ready) 🎉

Completed Optimizations

Future Enhancements

Why Metal Wins

Contributing

License

Changelog

Acknowledgments

About

Uh oh!

Releases

Packages

Languages

License

mrtozner/eth-vanity-metal

Folders and files

Latest commit

History

Repository files navigation

eth-vanity-metal

Performance

Difficulty Estimates (at 265 MH/s)

Features

Installation

Prerequisites

Build from Source

Usage

GPU-Accelerated Search (Recommended)

CPU-Only Search (Slower, for compatibility testing)

Command Line Options

Output Format

Technical Details

Algorithm

GPU Optimizations (Current)

Planned Optimizations

Architecture

Precomputation Table

Security

Comparison

Ethereum Address Format

Roadmap

Current Status: ~265 MH/s (Production Ready) 🎉

Completed Optimizations

Future Enhancements

Why Metal Wins

Contributing

License

Changelog

Acknowledgments

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages