webweigh

mirror of https://codeberg.org/Pontoporeia/webweigh.git synced 2026-05-19 06:08:59 +02:00

A rust CLI that calculates the file size of a web page when loaded with all external ressources.

Rust 96.9%
Just 1.5%
Dockerfile 1%
Nix 0.6%

Find a file

Pontoporeia 4ed7f068a1 docs: improve error messages and help text for --analyze-page Add clearer explanations about why --directory is required and how paths are resolved relative to the base directory. - Added examples section to --help output - Improved --directory and --analyze-page help text - Enhanced error message when page path resolution fails		2026-04-15 10:31:09 +02:00
.cargo	Added New build system, and updated .gitignore	2025-12-05 00:15:17 +01:00
.forgejo/workflows	cross compilation with armv7l support	2025-09-24 22:34:46 +02:00
assets	SVG version with text-to-path so logo renders correctly for those without the font installed	2026-04-08 10:10:14 +02:00
src	docs: improve error messages and help text for --analyze-page	2026-04-15 10:31:09 +02:00
tests	feat(test): add browser integration test scaffold (§9) and close all remaining TODO items	2026-02-24 16:35:58 +01:00
.gitignore	Add Cargo.lock and flake.lock for nix develop compatibility	2026-03-11 11:25:17 +01:00
.jjignore	fix: resolve assets when base_url prefix matches absolute HTML URLs (issue #11 )	2026-03-31 12:00:03 +02:00
BUILDING.md	docs: Comprehensive documentation update for v0.2.0 release	2025-09-19 01:52:19 +02:00
Cargo.lock	chore: v0.4.2 — bump version, tag, cross-platform dist builds	2026-04-03 13:57:58 +02:00
Cargo.toml	chore: v0.4.2 — bump version, tag, cross-platform dist builds	2026-04-03 13:57:58 +02:00
Containerfile	Added New build system, and updated .gitignore	2025-12-05 00:15:17 +01:00
CONTRIBUTING.md	feat(test): add browser integration test scaffold (§9) and close all remaining TODO items	2026-02-24 16:35:58 +01:00
flake.lock	Add Cargo.lock and flake.lock for nix develop compatibility	2026-03-11 11:25:17 +01:00
flake.nix	release: bump version to v0.4.1	2026-03-31 12:05:20 +02:00
Justfile	refactor(core): separate asset-size collection from AssetInfo construction	2026-02-24 14:31:16 +01:00
LICENSE	Initial commit	2025-04-16 10:28:33 +00:00
README.md	SVG version with text-to-path so logo renders correctly for those without the font installed	2026-04-08 10:10:14 +02:00
TODO.md	chore: v0.4.2 — bump version, tag, cross-platform dist builds	2026-04-03 13:57:58 +02:00

README.md

webweigh is a command-line tool that scans HTML files in a directory, calculates their total page size (including referenced assets like CSS, JavaScript, and images), and optionally injects this size information into specified HTML elements.

Key capabilities:

Scan mode: Calculate total website size without modifying files
Injection mode: Update HTML files with calculated page sizes
Analysis mode: Detailed breakdown of page size calculations for debugging

This is a Rust rewrite of the Python script originally used by solar.lowtech-website, offering correct measurements, significantly improved performance and new features.

Installation

Binary releases

Currently, I only build for Linux X86. You can grab the executable from the release page.

Cargo

cargo install --git https://codeberg.org/Pontoporeia/webweigh

From Source

Prerequisites

Rust 1.70+ (recommended)
Cargo package manager

git clone https://codeberg.org/Pontoporeia/webweigh.git
cd webweigh
cargo install --path .

See BUILDING.md for detailed build instructions.

Usage

Basic Usage

Scan mode (calculate sizes without modifying files):

webweigh --directory /path/to/website

Injection mode (update HTML files with calculated sizes):

webweigh --directory /path/to/website --selector ".page-size"

Common Examples

Scan a website (read-only analysis):

webweigh --directory ./dist

Update page sizes in elements with class "page-size":

webweigh --directory ./dist --selector ".page-size"

Process with exclusions and exceptions:

webweigh --directory ./dist --selector ".size" \
  --exclude "portfolio" "demo" \
  --except "portfolio/index.html"

Remove base URL prefix from asset paths:

webweigh --directory ./dist --selector ".size" \
  --base-url "https://example.com"

Dry run (calculate and validate selectors but write nothing):

webweigh --directory ./dist --selector ".page-size" --dry-run

Verbose processing with detailed logs:

webweigh --directory ./dist --selector "#size-info" -vv

Silent operation (no output):

webweigh --directory ./dist --selector "body" --silent

Analyze a specific page (detailed breakdown for debugging):

webweigh --directory ./dist --analyze-page "/"
webweigh --directory ./dist --analyze-page "/portfolio/" --base-url "https://example.com"

This will show:

Base HTML size
Assets grouped by type (Stylesheets, Scripts, Images, Fonts, etc.)
Size and percentage contribution of each asset type
Detailed list of all assets sorted by size

Perfect for debugging discrepancies between webweigh calculations and browser dev tools.

Excludes & Exceptions

Exclude Examples:

# excludes entire directory
-e static/portfolio
-e static/portfolio templates

# regex: excludes .tmp files in static/
-e "static/.*\.tmp$" 

# regex: excludes all .html files in content/microblog/
-e "content/microblog/.*\.html$"

Exceptions Examples:

# excludes all HTML in content/microblog/ except index.html in that directory
-e "content/microblog/.*.html$" --except index.html

# excludes content/temp/ directory except files ending with important.txt
  -e content/temp --except "important\.txt$"

Command Line Options

Usage: webweigh [OPTIONS] --directory <DIR>

Options:
  -d, --directory <DIR>          Directory to traverse (required)
      --base-url <URL>           Base URL prefix to remove from asset paths
      --selector <SELECTOR>      CSS selector for size injection (optional — scans without modification if omitted)
      --analyze-page <PATH>      Analyze specific page with detailed breakdown (e.g., "/", "/portfolio/")
      --dry-run                  Calculate sizes and validate selectors but do not write any files
  -e, --exclude <PATTERN>...     Exclude paths/patterns (supports literal paths and regex)
      --except <PATTERN>...      Exception patterns within exclusion scope
  -v, --verbose...               Logging verbosity: -v (errors), -vv (info), -vvv (debug), -vvvv (trace)
  -s, --silent                   Suppress all output
  -h, --help                     Show help information
  -V, --version                  Show version

Verbosity Levels

Default: Shows only the final statistics report (no log output)
-v: Shows error-level logs only
-vv: Shows info-level logs (processing start, configuration)
-vvv: Shows debug-level logs (detailed processing information)
-vvvv: Shows trace-level logs (maximum detail)
--silent: No output at all (suppresses even the statistics report)

Supported Assets

Direct References:

HTML files (.html, .htm)
CSS stylesheets (<link rel="stylesheet">)
JavaScript files (<script src>)
Images (<img src>, CSS url())
Fonts (CSS @font-face, url())
Icons (<link rel="icon">)

Dynamic Assets (detected in JS/CSS):

Dynamic script loading (script.src = "...")
Dynamic imports (import(), require())
Fetch calls (fetch("/api/data.json"))
CSS imports (@import url(...))
Asset assignments (link.href = "...")

Dependencies

Core runtime dependencies:

clap: Command-line argument parsing (builder API, no proc-macro derive)
scraper: HTML parsing and CSS selector evaluation
regex: Pattern matching for asset discovery in CSS and JS
rayon: Work-stealing parallel processing
log: Logging facade (stderr logger is inlined; no env_logger required)
anyhow: Error handling in the binary entry point

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines on how to contribute to webweigh.

For build instructions and development setup, see BUILDING.md.

Credits

Icon: Phosphor Icons

License

This project is licensed under the GNU Affero General Public License v3.0 (AGPL-3.0).