Simplify Your Workflow: Search MiniWebtool.
Add Extension
> Word Frequency Analyzer

Word Frequency Analyzer

Paste any text and instantly see which words appear most often. Get a ranked frequency table, an animated bar chart, an interactive word cloud, lexical diversity score, and optional stop-word filtering across 6 languages. Export results as CSV.

Word Frequency Analyzer
📚 Try an example
Paste any prose — an article, an essay, a transcript, even a full novel chapter. Up to 200,000 characters per run.
Add character names, brand names, or any words you want filtered out — separated by commas or spaces.

Embed Word Frequency Analyzer Widget

About Word Frequency Analyzer

The Word Frequency Analyzer answers a simple question with surprising depth: which words does this text really use the most? Paste any block of prose — a blog post, a transcript, a chapter, a job description, a speech — and it ranks every distinct word by how often it appears, charts the distribution, and renders an interactive word cloud sized by frequency. The tool is built for writers checking for accidental word repetition, SEO specialists looking for natural keyword density, students studying an author's vocabulary, researchers running a quick lexical-diversity sanity check, and translators or linguists exploring an unfamiliar text. Everything runs in your browser or on our server and is never stored.

What makes this analyzer different

  • Live preview as you type. The side panel updates unique-word count, total words, TTR (lexical diversity), and the live top 5 instantly — without clicking Analyze. You can iterate filters in seconds.
  • Six-language stop-word lists. English, Spanish, French, German, Italian, and Portuguese — curated lists, not bloated dumps. Plus a free-form custom stop-word field for character names, brand names, or boilerplate.
  • Square-root scaled word cloud. Most cloud generators size words by raw count, which means the top word can be 50× the height of mid-rank words and visually crushes the cloud. Sqrt scaling keeps the cloud readable and is the industry-standard approach since Wordle (2009).
  • The top-3 "podium" view. A glance at the gold/silver/bronze cards tells you the words your text leans on hardest — the first thing to check when you suspect accidental repetition.
  • Lexical-diversity metrics. Type-Token Ratio and hapax-legomena count give you a richness score, not just a frequency dump. Short prose with TTR > 0.6 is rich; a TTR under 0.2 in a long document is repetitive.
  • One-click CSV export. Download or copy the full ranked table for spreadsheet analysis.

How to use this tool

  1. Paste your text. Up to 200,000 characters — roughly 30,000 words, the length of a long novel chapter or several blog posts combined.
  2. Pick a stop-word language. If you do not filter stop words, the top of the table will be "the", "of", "and" — informative once, never again. Choose the language of your text, or pick None for a true raw frequency count.
  3. Set a minimum word length. Set to 3 or 4 if you want to skip "a", "I", "it", "no". Set to 1 to keep everything.
  4. Choose how many results to display. Top 50 is the sweet spot for most prose; Top 500 gives you the full long tail.
  5. Optional toggles. Turn on case-sensitive if you care about "Paris" vs. "paris". Turn on basic lemmatization to collapse "runs", "ran", and "running" into "run". Turn on counting numbers if version numbers, years, and statistics are meaningful in your text.
  6. Click Analyze. Read the podium, scan the bar-chart table, glance at the cloud, and export the CSV if you want to dig further.

The math behind the metrics

Frequency and percentage

For each distinct word \( w \), the count is the number of times it appears in the kept token list, and the percentage is \( \text{count}(w) / N \) where \( N \) is the kept-token total. The bar width is relative to the most common word so you can see the shape of the distribution at a glance.

Type-Token Ratio (TTR)

\( \text{TTR} = U / N \) where \( U \) is the number of unique words (types) and \( N \) is the total counted tokens. TTR is the simplest measure of lexical diversity. A short news brief typically sits at 0.5–0.7; a long novel sinks to 0.15–0.25 because common words recur. TTR is length-sensitive — long texts always have lower TTR than short ones, so do not compare TTR across documents of wildly different sizes.

Hapax legomena

A hapax legomenon (Greek for "said once") is a word that appears exactly once in the text. The hapax count and hapax percentage are classic signals of vocabulary richness. In Shakespeare's complete works, roughly 14,000 of his 31,000 distinct words are hapax — about 45%. A modern blog post often hits 60% or more hapax because there is not enough text for words to recur.

Word cloud font sizing

The font size for word \( w \) in the cloud uses square-root scaling between the minimum and maximum counts on display:

\( \text{size}(w) = 60\% + 180\% \cdot \dfrac{\sqrt{\text{count}(w)} - \sqrt{\text{min}}}{\sqrt{\text{max}} - \sqrt{\text{min}}} \)

This compresses the dynamic range so a 200× word is roughly 3× the height of a 20× word, not 10×. Without this compression, the cloud is dominated by one or two giant words.

Color-coded frequency tiers

The bars and cloud words are color-coded by rank tier so you can spot the shape of your distribution at a glance:

Tier 1 — ranks 1–5The 5 words your text leans on hardest. If a content word lands here, that is your theme.
Tier 2 — ranks 6–15The supporting cast. Recurring nouns and verbs you use to develop the main idea.
Tier 3 — ranks 16–40The wider vocabulary surrounding your top themes.
Tier 4 — ranks 41–100Specialist or specific terms — proper nouns, jargon, named entities.
Tier 5 — ranks 101+The long tail. Words used once or twice. Often where the most interesting vocabulary lives.

Use cases

Writers — catching unintended repetition

You will be surprised how often a single word ("quickly", "really", "essentially", a character's name) sneaks to the top of your draft. Paste a chapter and look at the gold-silver-bronze podium. If a content word appears there that you did not consciously emphasize, you have a tic to edit out.

SEO and content marketing

Set the stop-word filter and minimum length, then read the top 25. These are the words search engines will most strongly associate with your page. If they do not match your target keyword cluster, your on-page SEO will underperform. Avoid keyword stuffing — modern algorithms penalize unnatural density. A healthy target is roughly 1–2% for your main keyword.

Literary study and stylistics

Paste a chapter of Dickens vs. Hemingway and compare TTR, hapax percentage, and average word length. The numerical fingerprints of authorial styles are remarkably consistent across their bodies of work — this is the foundation of computational stylometry.

Speech and transcript analysis

Politicians and CEOs have favorite words. Run a speech through the analyzer with stop words removed and the top 15 reveal the messaging strategy. Compare two speeches by the same speaker to see what shifted.

Translation and language learning

When working on a translation, run the source text first to see which content words dominate. Make sure your translation preserves the same emphasis. For learners, picking a 200-word article and running it with no stop-word filtering shows which function words you need to recognize fluently.

Research and academic writing

Many journals expect a controlled vocabulary in abstracts. A frequency check before submission catches accidental jargon overuse. Researchers running corpus-linguistics studies use frequency lists as the starting input for collocation, n-gram, and topic-modeling work — this tool generates that input.

Recommended settings by document type

DocumentStop wordsMin lengthTop NLemmatize
Blog post / articleEnglish (or your language)350Off
Novel chapterEnglish3100On (collapse "runs"/"ran"/"running")
Academic paperEnglish4100On
Tweet thread / short postNone125Off
SEO researchEnglish350On
Speech transcriptEnglish325Off (you want exact phrasing)
Foreign-language textMatch the language150Off (English-only lemmatizer)

Frequently asked questions

What counts as a "word"?

The tokenizer matches one or more Unicode letters, optionally joined by apostrophes or hyphens. So don't, state-of-the-art, and l'ovvio are each one word. Numbers are excluded by default — toggle "Count numbers" on if you want to include them. The tokenizer works across Latin, Cyrillic, Greek, and CJK scripts.

What does the basic lemmatizer do, and what does it not do?

It performs three lightweight transformations: drop possessive 's, collapse common verb endings (-ing, -ed), and simple plurals (-s, -es, -ies → -y). It does not do full morphological lemmatization (better → good, went → go). Full lemmatization would require shipping the WordNet lexicon and is overkill for frequency analysis where exact word forms are often what you want to see. The conservative approach also avoids the worst stemmer failure mode: collapsing semantically distinct words ("university" and "universe" share a stem under Porter).

Why do the live preview and the server result differ slightly?

The live preview only filters English stop words client-side to keep the script tiny — other languages get fully filtered on the server. The server also applies basic lemmatization when toggled. The total token count is always the same between the two.

Does the tool handle non-Latin scripts?

Yes — the tokenizer uses Unicode character classes, so Cyrillic, Greek, Arabic, Hebrew, Chinese, Japanese, and Korean text all tokenize correctly. Chinese and Japanese do not use spaces between words, so each contiguous run of CJK characters is treated as a single "token" — for true word segmentation in those languages you would need a dedicated tokenizer like jieba (Chinese) or MeCab (Japanese).

What is the upper limit on text size?

200,000 characters per run — about 30,000 English words or a typical novel chapter. Beyond that, browser memory and request size become a concern; split your text into smaller passes.

Is my text private?

Yes. The text is processed in memory to render the result page and is never written to disk. The live mini-stats while you type run entirely in your browser. We do not log, store, or analyze the content you paste.

A short history of word frequency analysis

Word frequency lists are among the oldest tools in linguistics. The first machine-generated frequency list of English was Father Roberto Busa's 1949–1980 Index Thomisticus, which counted every word in the works of Thomas Aquinas using IBM punched-card machines — widely considered the founding project of digital humanities. The Brown Corpus (1961) provided the first systematically sampled million-word frequency list of modern American English. Today, every search engine, machine-translation system, large language model, and SEO tool runs on word and token frequency statistics at scale. The same simple Counter-based ranking you see in this tool is the kernel of the field.

Reference this content, page, or tool as:

"Word Frequency Analyzer" at https://MiniWebtool.com// from MiniWebtool, https://MiniWebtool.com/

by miniwebtool team. Updated: May 27, 2026

Top & Updated:

Random PickerRandom Name PickerFPS ConverterInstagram User ID LookupLine CounterRelative Standard Deviation CalculatorSort NumbersBatting Average CalculatorMAC Address GeneratorRemove SpacesERA CalculatorJob FinderFeet and Inches to Cm ConverterWord to Phone Number ConverterMAC Address LookupRandom Truth or Dare GeneratorFacebook User ID LookupSum CalculatorSun, Moon & Rising Sign Calculator 🌞🌙✨Percent Off CalculatorSquare Root (√) CalculatorSHA256 Hash GeneratorOPS CalculatorImage ResizerLog Base 10 CalculatorMP3 LooperSaturn Return CalculatorNumber of Digits CalculatorAudio SplitterBitwise CalculatorRandom Credit Card GeneratorSlope and Grade CalculatorVertical Jump CalculatorPhone Number ExtractorAI Text HumanizerRoman Numerals ConverterSlugging Percentage CalculatorRandom Activity GeneratorCm to Feet and Inches ConverterInvisible Text GeneratorRandom Sound Frequency GeneratorMerge VideosRandom Movie PickerSalary Conversion CalculatorOn Base Percentage CalculatorRandom IMEI Generator⬛ Aspect Ratio CalculatorNumber to Word ConverterRandom Quote GeneratorWAR CalculatorRandom Poker Hand GeneratorRandom Fake Address GeneratorRandom Loadout GeneratorRandom Superpower GeneratorCaffeine Overdose CalculatorFile Size ConverterOctal CalculatorMaster Number CalculatorText FormatterDecimal to BCD ConverterBinary to Gray Code ConverterRandom Writing Prompt GeneratorVideo to Image ExtractorRandom Birthday GeneratorAdd Prefix and Suffix to TextWHIP CalculatorFirst n Digits of PiQuotient and Remainder CalculatorSteel Weight CalculatorCompare Two StringsYouTube Channel StatisticsTime Duration CalculatorWord Ladder GeneratorCM to Inches ConverterLove Compatibility CalculatorCompound Growth CalculatorBCD to Decimal ConverterName Number CalculatorRemove Line BreaksDMS to Decimal Degrees ConverterOutlier CalculatorSHA512 Hash Generator📅 Date CalculatorGray Code to Binary ConverterBattery Life CalculatorWhat is my Lucky Number?Random Meal GeneratorPercent Growth Rate CalculatorRemove AccentLeap Years ListProportion CalculatorAcreage CalculatorImage CompressorSocial Media Username CheckerRandom Object GeneratorClothing Size ConverterDay of Year CalendarStair CalculatorVideo CompressorEmail ExtractorURL ExtractorAI ParaphraserAI Punctuation AdderList of Prime NumbersDay of the Year Calculator - What Day of the Year Is It Today?Binary to BCD ConverterIP Address to Hex ConverterSort Lines AlphabeticallyHex to BCD ConverterBCD to Binary ConverterLottery Number GeneratorBCD to Hex ConverterMedian CalculatorStandard Error CalculatorList RandomizerBreak Line by CharactersAverage CalculatorModulo CalculatorPVIFA CalculatorReverse VideoHypotenuse CalculatorRemove Audio from VideoActual Cash Value CalculatorScientific Notation to Decimal ConverterNumber ExtractorAngel Number CalculatorLog Base 2 CalculatorRoot Mean Square CalculatorSum of Positive Integers CalculatorSHA3-256 Hash GeneratorAI Sentence ExpanderLbs to Kg ConverterHex to Decimal ConverterRandom Group GeneratorConvolution CalculatorMAC Address AnalyzerRandom String GeneratorRemove Leading Trailing SpacesAmortization CalculatorMarkup CalculatorPVIF CalculatorDecimal to Hex ConverterInstagram Font GeneratorSocial Media Image Size GuideTikTok Money CalculatorTwitter/X Character CounterTwitter/X Timestamp ConverterYouTube Watch Time CalculatorTwitch Earnings CalculatorYouTube Shorts Monetization CalculatorFacebook Ad Cost CalculatorSocial Media ROI CalculatorSocial Media Post Time OptimizerCTR CalculatorROAS CalculatorInfluencer ROI CalculatorForce CalculatorAcceleration CalculatorVelocity CalculatorMomentum CalculatorProjectile Motion CalculatorKinetic Energy CalculatorPotential Energy CalculatorWork and Power CalculatorDensity CalculatorPressure CalculatorIdeal Gas Law CalculatorFree Fall CalculatorTorque CalculatorHorsepower CalculatorDilution CalculatorChemical Equation BalancerStoichiometry CalculatorPercent Yield CalculatorEmpirical Formula CalculatorBoiling Point CalculatorTitration CalculatorMole/Gram/Particle ConverterIrregular Polygon Area CalculatorFrustum CalculatorTorus Calculator3D Distance CalculatorGreat Circle Distance CalculatorCircumscribed Circle (Circumcircle) CalculatorInscribed Circle (Incircle) CalculatorAngle Bisector CalculatorTangent Line to Circle CalculatorHeron's Formula CalculatorCoordinate Geometry Distance CalculatorVolume of Revolution CalculatorSurface of Revolution CalculatorParametric Curve GrapherRiemann Sum CalculatorTrapezoidal Rule CalculatorSimpson's Rule CalculatorImproper Integral CalculatorL'Hôpital's Rule CalculatorMaclaurin Series CalculatorPower Series CalculatorSeries Convergence Test CalculatorInfinite Series Sum CalculatorAverage Rate of Change CalculatorInstantaneous Rate of Change CalculatorRelated Rates SolverOptimization Calculator (Calculus)Gradient Calculator (Multivariable)Divergence CalculatorCurl CalculatorLine Integral CalculatorSurface Integral CalculatorJacobian Matrix CalculatorNewton's Method CalculatorRREF Calculator (Row Echelon Form)Matrix Inverse CalculatorMatrix Multiplication CalculatorDot Product CalculatorCross Product CalculatorVector Magnitude CalculatorUnit Vector CalculatorAngle Between Vectors CalculatorNull Space CalculatorColumn Space CalculatorCramer's Rule CalculatorMatrix Diagonalization CalculatorQR Decomposition CalculatorCholesky Decomposition CalculatorMatrix Power CalculatorCharacteristic Polynomial CalculatorBayes' Theorem CalculatorF-Test / F-Distribution CalculatorHypergeometric Distribution CalculatorNegative Binomial Distribution CalculatorGeometric Distribution CalculatorExponential Distribution CalculatorWeibull Distribution CalculatorBeta Distribution CalculatorSpearman Rank Correlation CalculatorFisher's Exact Test CalculatorContingency Table CalculatorOdds Ratio CalculatorRelative Risk CalculatorEffect Size CalculatorPermutations with Repetition CalculatorModular Exponentiation CalculatorPrimitive Root CalculatorPerfect Number CheckerAmicable Number CheckerTwin Prime FinderMersenne Prime CheckerGoldbach Conjecture VerifierMöbius Function CalculatorEgyptian Fraction CalculatorFibonacci Number CheckerDigital Root CalculatorPartition Function CalculatorBoolean Algebra SimplifierKarnaugh Map (K-Map) SolverLogic Gate SimulatorGraph Coloring CalculatorTopological Sort CalculatorAdjacency Matrix CalculatorRecurrence Relation SolverInclusion-Exclusion CalculatorLinear Programming SolverTraveling Salesman Solver (TSP)Hamiltonian Path CheckerPlanar Graph CheckerNetwork Flow Calculator (Max Flow)Stable Marriage Problem SolverFirst-Order ODE SolverSecond-Order ODE SolverDirection Field / Slope Field PlotterEuler's Method CalculatorBernoulli ODE SolverSystem of ODEs SolverGroup Theory Order CalculatorRing and Field CalculatorJordan Normal Form CalculatorMatrix Exponential CalculatorTensor Product CalculatorFast Fourier Transform (FFT) CalculatorZ-Transform CalculatorNumerical Integration CalculatorTOML to JSON ConverterJSON to CSV ConverterXML to JSON ConverterSQL to MongoDB Query ConverterCSS Flexbox PlaygroundCSS Grid GeneratorJWT GeneratorBcrypt Hash Generator / CheckerColor Code Converter (All Formats)Git Command Generator.env File GeneratorLorem Picsum / Placeholder Image GeneratorText to Binary/Hex/ASCII ConverterSyllable CounterSentence CounterParagraph CounterSpeaking Time CalculatorReading Time CalculatorWhitespace VisualizerStrikethrough Text GeneratorTorque Converter (Nm, ft-lb, kgf-cm)Data Transfer Rate ConverterFuel Efficiency ConverterAstronomical Unit ConverterRing Size ConverterPaper Size ReferenceGas Mileage CalculatorEV Range CalculatorEV Charging Time Calculator0–60 / Quarter Mile CalculatorCar Lease CalculatorVehicle Towing Capacity CalculatorExposure Triangle CalculatorCrop Factor CalculatorMegapixel to Print Size CalculatorPhoto File Size EstimatorMusic BPM TapperMusic Key TransposerVideo Bitrate CalculatorSeed Germination Rate CalculatorFertilizer Calculator (NPK)Raised Bed Soil CalculatorFrost Date CalculatorLawn Fertilizer CalculatorCompost Calculator (C:N Ratio)Solar Panel CalculatorSolar ROI CalculatorHome Energy Audit CalculatorAppliance Energy Cost CalculatorWater Usage CalculatorElectricity Generation Cost CalculatorHeat Loss CalculatorFlight Distance CalculatorTravel Budget CalculatorJet Lag CalculatorPacking List GeneratorTip Splitter (Advanced)Lease vs Buy CalculatorHourly Rate Calculator (Freelancer)Invoice Late Fee CalculatorESPP CalculatorStock Split CalculatorOptions Probability CalculatorDollar to Gold ConverterBeam Load CalculatorPipe Flow CalculatorBolt Torque CalculatorGravel, Sand & Topsoil CalculatorRandom Sentence GeneratorRandom Paragraph GeneratorRandom Math Problem GeneratorRandom Bible Verse GeneratorRandom Cat/Dog Name GeneratorRandom Debate Topic GeneratorBody Recomposition CalculatorAlcohol Calorie CalculatorMedication Dosage CalculatorPace to Calories CalculatorHydration CalculatorTrain Meeting Problem SolverAge Word Problem SolverMixture Problem SolverWork Rate Problem SolverDistance-Speed-Time Triangle CalculatorCoin Word Problem SolverNumber Bonds GeneratorCarry and Borrow VisualizerTimes Tables QuizMental Math TrainerRoman Numeral Math SolverEgyptian Multiplication CalculatorVedic Math Tricks CalculatorRussian Peasant MultiplicationSoroban Abacus SimulatorAnnuity Payout CalculatorReverse Mortgage CalculatorVariable Annuity CalculatorFixed Indexed Annuity CalculatorBond Convexity CalculatorBond Duration Calculator (Macaulay & Modified)Forward Rate CalculatorMortgage Recast CalculatorTreasury Inflation-Protected Securities (TIPS) CalculatorStock Beta CalculatorTreynor Ratio CalculatorSortino Ratio CalculatorDoppler Effect CalculatorSpring Constant CalculatorPendulum Period CalculatorCentripetal Force CalculatorAngular Velocity CalculatorMoment of Inertia CalculatorSnell's Law CalculatorCoulomb's Law CalculatorElectric Field CalculatorMagnetic Field of Wire CalculatorLens Equation CalculatorA/B Test Significance CalculatorA/B Test Sample Size CalculatorConversion Rate CalculatorCustomer Lifetime Value (CLV) CalculatorCustomer Acquisition Cost (CAC) CalculatorChurn Rate CalculatorRetention Rate Cohort CalculatorNPS (Net Promoter Score) CalculatorPareto Chart GeneratorSix Sigma Process Capability CalculatorTessellation GeneratorSpirograph GeneratorVoronoi Diagram GeneratorDelaunay Triangulation GeneratorL-System Fractal GeneratorMandelbrot Set ExplorerJulia Set GeneratorPolar Equation Plotter3D Surface PlotterSierpinski Triangle GeneratorcURL Command BuilderHTTP Status Code ReferenceUUID Validator/DecoderURL ParserQuery String BuilderSVG to React/JSX ConverterSCSS to CSS CompilerLess to CSS CompilerTypeScript PlaygroundJSON Schema GeneratorImage to ASCII Art ConverterImage to SVG TracerLipogram CheckerPangram CheckerAcronym GeneratorBackronym GeneratorPig Latin TranslatorEXIF Data Viewer/RemoverROT13 Encoder/DecoderAtbash Cipher ToolVigenère Cipher ToolPronunciation IPA ConverterHemingway-Style Readability EditorSentence Length Variance AnalyzerWord Frequency AnalyzerBusiness Days CalculatorAdd Business Days to DateHalfway Date Calculator