Class TextAnalyzer

Constructors

constructor

new TextAnalyzer(input: string): TextAnalyzer
Constructs a new TextAnalyzer instance with the provided input text.
Parameters
- input: string
  The text to analyze
Returns TextAnalyzer
- Defined in utils/TextAnalyzer.ts:53

Properties

`Private` charFrequency

charFrequency: Map<string, number> = ...

Frequency maps for characters and words

`Private` sentences

sentences: string[] = []

`Private` syllableCache

syllableCache: Map<string, number> = ...

`Private` `Optional` syllableStats

syllableStats?: {
    avg: number;
    median: number;
    mono: number;
    perWord: number[];
    total: number;
}

Cached syllable stats

`Private` `Readonly` text

text: string

The original text to analyze

`Private` wordHistogram

wordHistogram: Map<string, number> = ...

`Private` words

words: string[] = []

Tokenized words and sentences

`Private` `Static` `Readonly` REGEX

REGEX: {
    letter: RegExp;
    nonWord: RegExp;
    number: RegExp;
    sentence: RegExp;
    ucLetter: RegExp;
    vowelGroup: RegExp;
    word: RegExp;
} = ...

Regular expressions used in text analysis

Methods

`Private` computeFrequencies

computeFrequencies(): void
Computes character and word frequencies from the tokenized text.

Returns void
- Defined in utils/TextAnalyzer.ts:77

`Private` computeSyllableStats

computeSyllableStats(): {
    avg: number;
    median: number;
    mono: number;
    perWord: number[];
    total: number;
}
Computes the internal syllable statistics (total, average, median, monosyllabic count, and per-word counts).

Returns { avg: number; median: number; mono: number; perWord: number[]; total: number }
- Computed syllable stats
- Defined in utils/TextAnalyzer.ts:111

`Private` estimateSyllables

estimateSyllables(word: string): number
Estimates the number of syllables in a word using a simple heuristic. Uses caching to avoid redundant calculations for identical words.
Parameters
- word: string
  The word to estimate syllables for
Returns number
- Estimated syllable count
- Defined in utils/TextAnalyzer.ts:91
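
The heuristic itself is not spelled out here. The presence of `REGEX.vowelGroup` and `syllableCache` suggests counting vowel groups and memoizing the result per word. A minimal sketch under that assumption follows; the pattern, the lowercase cache key, and the names `VOWEL_GROUP_SKETCH`, `syllableCacheSketch`, and `estimateSyllablesSketch` are hypothetical and not taken from the class.

```typescript
// Hypothetical sketch, not the actual TextAnalyzer implementation.
// Assumed pattern: the real REGEX.vowelGroup is not shown in this reference.
const VOWEL_GROUP_SKETCH = /[aeiouyäöü]+/g;

// Cache keyed by the lowercased word, so "Banana" and "banana" share an entry.
const syllableCacheSketch = new Map<string, number>();

function estimateSyllablesSketch(word: string): number {
  const key = word.toLowerCase();
  const cached = syllableCacheSketch.get(key);
  if (cached !== undefined) return cached;

  // Each run of consecutive vowels is counted as one syllable.
  const groups = key.match(VOWEL_GROUP_SKETCH);
  // Report at least one syllable, even when no vowel group is found.
  const count = Math.max(1, groups ? groups.length : 0);

  syllableCacheSketch.set(key, count);
  return count;
}
```

A vowel-group count is only an approximation: it ignores silent vowels and language-specific rules, so results for individual words can be off by a syllable.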

getAvgSentenceLength

getAvgSentenceLength(): number
Gets the average sentence length in words.

Returns number
- Average length of sentences
- Defined in utils/TextAnalyzer.ts:160

getAvgSyllablesPerWord

getAvgSyllablesPerWord(): number
Gets the average number of syllables per word in the text.

Returns number
- Average syllables per word
- Defined in utils/TextAnalyzer.ts:310

getAvgWordLength

getAvgWordLength(): number
Gets the average word length in the text.

Returns number
- Average length of words
- Defined in utils/TextAnalyzer.ts:151

getCharFrequency

getCharFrequency(): Record<string, number>
Gets the frequency of each character in the text.

Returns Record<string, number>
- A record of character frequencies
- Defined in utils/TextAnalyzer.ts:221

getHapaxLegomena

getHapaxLegomena(): string[]
Gets the hapax legomena of the text, that is, the words that occur exactly once.

Returns string[]
- Array of hapax legomena
- Defined in utils/TextAnalyzer.ts:191
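
As an illustration of the definition, the following sketch filters a word histogram for entries with a count of one. It assumes the histogram is a `Map<string, number>` like the private `wordHistogram` property; the function name `hapaxSketch` is hypothetical.

```typescript
// Hypothetical sketch: selects the words that occur exactly once.
function hapaxSketch(histogram: Map<string, number>): string[] {
  const result: string[] = [];
  histogram.forEach((count, word) => {
    if (count === 1) result.push(word);
  });
  return result;
}

const hapaxExample = hapaxSketch(
  new Map<string, number>([
    ["the", 2],
    ["cat", 1],
    ["sat", 1],
  ]),
);
// hapaxExample is ["cat", "sat"] (Map iteration follows insertion order)
```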

getHonoresR

getHonoresR(): number
Calculates Honoré's R statistic for the text, a measure of lexical richness.

Returns number
- Honoré's R statistic
- Defined in utils/TextAnalyzer.ts:328
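
Honoré's R is usually given as R = 100 · ln(N) / (1 − V1/V), where N is the total number of words, V the number of distinct words, and V1 the number of words that occur once. The sketch below follows that formula; returning 0 for empty input or when every word is unique (where the denominator is zero) is an assumption, and `honoresRSketch` is a hypothetical name.

```typescript
// Hypothetical sketch of Honoré's R = 100 * ln(N) / (1 - V1 / V).
function honoresRSketch(words: string[]): number {
  const counts = new Map<string, number>();
  words.forEach((w) => counts.set(w, (counts.get(w) ?? 0) + 1));

  const n = words.length; // N: total words
  const v = counts.size; // V: distinct words
  let v1 = 0; // V1: words occurring exactly once
  counts.forEach((c) => {
    if (c === 1) v1 += 1;
  });

  // Assumed guard: the formula is undefined when V1 equals V or the text is empty.
  if (n === 0 || v1 === v) return 0;
  return (100 * Math.log(n)) / (1 - v1 / v);
}
```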

getLength

getLength(): number
Gets the original text length in characters.

Returns number
- Length of the text
- Defined in utils/TextAnalyzer.ts:130

getLIXScore

getLIXScore(): number
Calculates the LIX (läsbarhetsindex) score for the text. LIX is a readability index that combines average sentence length with the percentage of long words (words of more than six letters).

Returns number
- The LIX score
- Defined in utils/TextAnalyzer.ts:375
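
In its commonly cited form, LIX is the average number of words per sentence plus the percentage of words longer than six letters. The sketch below computes that form from a word list and a sentence count; whether the class uses exactly this threshold is an assumption, as is the zero result for empty input. The name `lixSketch` is hypothetical.

```typescript
// Hypothetical sketch of LIX = words / sentences + 100 * longWords / words.
function lixSketch(words: string[], sentenceCount: number): number {
  // Assumed guard for empty input.
  if (words.length === 0 || sentenceCount === 0) return 0;
  // "Long" means more than six letters in the standard definition.
  const longWords = words.filter((w) => w.length > 6).length;
  return words.length / sentenceCount + (longWords * 100) / words.length;
}

const lixExample = lixSketch(
  ["The", "cat", "sat", "on", "the", "comfortable", "mat"],
  1,
);
// 7 words in 1 sentence, 1 long word: 7 + 100 / 7
```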

getLongWordRatio

getLongWordRatio(len?: number): number
Gets the ratio of long words (words with length >= len) to total words.
Parameters
- `Optional` len: number = 7
  Minimum length for a word to be considered long
Returns number
- Ratio of long words to total words
- Defined in utils/TextAnalyzer.ts:247

getMaxSyllablesWordCount

getMaxSyllablesWordCount(max: number): number
Gets the number of words that have at most `max` syllables.
Parameters
- max: number
  Maximum syllable count for a word to be included
Returns number
- Count of words meeting the syllable criteria
- Defined in utils/TextAnalyzer.ts:301

getMedianSyllablesPerWord

getMedianSyllablesPerWord(): number
Gets the median number of syllables per word in the text.

Returns number
- Median syllables per word
- Defined in utils/TextAnalyzer.ts:319
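
For reference, a median over per-word syllable counts (such as the `perWord` array in `syllableStats`) can be computed as sketched below. The handling of empty input and the name `medianSketch` are assumptions.

```typescript
// Hypothetical sketch: median of a list of per-word syllable counts.
function medianSketch(values: number[]): number {
  if (values.length === 0) return 0; // assumed result for empty input
  // Copy before sorting so the caller's array is left untouched.
  const sorted = values.slice().sort((a, b) => a - b);
  const mid = Math.floor(sorted.length / 2);
  // Even length: average the two middle values; odd length: take the middle one.
  return sorted.length % 2 === 0
    ? (sorted[mid - 1] + sorted[mid]) / 2
    : sorted[mid];
}
```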

getMinSyllablesWordCount

getMinSyllablesWordCount(min: number): number
Gets the number of words that have at least `min` syllables.
Parameters
- min: number
  Minimum syllable count for a word to be included
Returns number
- Count of words meeting the syllable criteria
- Defined in utils/TextAnalyzer.ts:291

getMonosyllabicWordCount

getMonosyllabicWordCount(): number
Gets the number of monosyllabic words (words with exactly one syllable).

Returns number
- Count of monosyllabic words
- Defined in utils/TextAnalyzer.ts:281

getMostCommonWords

getMostCommonWords(limit?: number): string[]
Gets the most common words in the text, limited to a specified number.
Parameters
- `Optional` limit: number = 5
  Maximum number of common words to return
Returns string[]
- Array of the most common words
- Defined in utils/TextAnalyzer.ts:179
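
One way to derive such a list from a word histogram is to sort the entries by descending count and keep the first `limit` words. This is a sketch under that assumption; how the class breaks ties is not documented, and `mostCommonSketch` is a hypothetical name.

```typescript
// Hypothetical sketch: top-N words by frequency from a histogram.
function mostCommonSketch(
  histogram: Map<string, number>,
  limit: number = 5,
): string[] {
  return Array.from(histogram.entries())
    .sort((a, b) => b[1] - a[1]) // highest count first
    .slice(0, limit)
    .map((entry) => entry[0]);
}

const mostCommonExample = mostCommonSketch(
  new Map<string, number>([
    ["the", 3],
    ["cat", 1],
    ["sat", 2],
  ]),
  2,
);
// mostCommonExample is ["the", "sat"]
```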

getReadabilityScore

getReadabilityScore(metric?: "flesch" | "fleschde" | "kincaid"): number
Calculates a readability score for the text using the selected metric.

Supported metrics:
- `flesch`: Flesch Reading Ease (default)
- `fleschde`: Flesch Reading Ease, German variant (inferred from the metric name)
- `kincaid`: Flesch-Kincaid Grade Level
Parameters
- `Optional` metric: "flesch" | "fleschde" | "kincaid" = 'flesch'
  The readability metric to calculate
Returns number
- The calculated readability score
- Defined in utils/TextAnalyzer.ts:354
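
The published formulas behind these metrics can be sketched as follows. The sketch takes raw counts as input and assumes that `fleschde` refers to Amstad's German adaptation of Flesch Reading Ease; the names `MetricSketch` and `readabilitySketch` are hypothetical, and the class may round or clamp its results differently.

```typescript
// Hypothetical sketch of the three metrics, computed from raw counts.
type MetricSketch = "flesch" | "fleschde" | "kincaid";

function readabilitySketch(
  words: number,
  sentences: number,
  syllables: number,
  metric: MetricSketch = "flesch",
): number {
  const asl = words / sentences; // average sentence length in words
  const asw = syllables / words; // average syllables per word
  switch (metric) {
    case "flesch": // Flesch Reading Ease
      return 206.835 - 1.015 * asl - 84.6 * asw;
    case "fleschde": // Amstad's German adaptation (assumed meaning of "fleschde")
      return 180 - asl - 58.5 * asw;
    case "kincaid": // Flesch-Kincaid Grade Level
      return 0.39 * asl + 11.8 * asw - 15.59;
  }
}
```

Higher Flesch Reading Ease values indicate easier text, while the Flesch-Kincaid result is expressed as a school grade level, so the two scales run in opposite directions.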

getReadingTime

getReadingTime(wpm?: number): number
Estimates the reading time for the text based on words per minute (WPM).
Parameters
- `Optional` wpm: number = 200
  Words per minute for the calculation
Returns number
- Estimated reading time in minutes
- Defined in utils/TextAnalyzer.ts:340

getSentenceCount

getSentenceCount(): number
Gets the number of sentences in the text.

Returns number
- Count of sentences
- Defined in utils/TextAnalyzer.ts:144

getShortWordRatio

getShortWordRatio(len?: number): number
Gets the ratio of short words (words with length <= len) to total words.
Parameters
- `Optional` len: number = 3
  Maximum length for a word to be considered short
Returns number
- Ratio of short words to total words
- Defined in utils/TextAnalyzer.ts:260

getSyllablesCount

getSyllablesCount(): number
Estimates the number of syllables in the text.

Returns number
- Total estimated syllable count
- Defined in utils/TextAnalyzer.ts:272

getUnicodeCodepoints

getUnicodeCodepoints(): Record<string, number>
Gets the frequency of Unicode codepoints in the text.

Returns Record<string, number>
- A record of Unicode codepoint frequencies
- Defined in utils/TextAnalyzer.ts:230
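
Counting by code point rather than by UTF-16 code unit matters for characters outside the Basic Multilingual Plane, such as most emoji. The sketch below iterates with `Array.from`, which splits a string by code point. The `U+XXXX` key format and the name `codepointFrequencySketch` are assumptions; the key format used by the class is not documented here.

```typescript
// Hypothetical sketch: frequency of Unicode code points in a string.
function codepointFrequencySketch(text: string): Record<string, number> {
  const freq: Record<string, number> = {};
  // Array.from iterates by code point, so surrogate pairs stay together.
  Array.from(text).forEach((ch) => {
    const cp = ch.codePointAt(0);
    if (cp === undefined) return;
    // Assumed key format: "U+" followed by at least four uppercase hex digits.
    const key = "U+" + cp.toString(16).toUpperCase().padStart(4, "0");
    freq[key] = (freq[key] ?? 0) + 1;
  });
  return freq;
}

// "aa" followed by U+1F600, written as a surrogate pair.
const codepointExample = codepointFrequencySketch("aa\uD83D\uDE00");
// codepointExample is { "U+0061": 2, "U+1F600": 1 }
```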

getUpperCaseRatio

getUpperCaseRatio(): number
Calculates the ratio of uppercase letters to total letters in the text.

Returns number
- Ratio of uppercase letters to total letters
- Defined in utils/TextAnalyzer.ts:209

getWordCount

getWordCount(): number
Gets the number of words in the text.

Returns number
- Count of words
- Defined in utils/TextAnalyzer.ts:137

getWordHistogram

getWordHistogram(): Record<string, number>
Gets a histogram of word frequencies in the text.

Returns Record<string, number>
- A histogram of word frequencies
- Defined in utils/TextAnalyzer.ts:169

getWSTFScore

getWSTFScore(): [number, number, number, number]
Calculates the four Wiener Sachtextformel (WSTF) scores for the text. The WSTF variants are readability formulas for German text, based on word and sentence characteristics.

Returns [number, number, number, number]
- A tuple of the four WSTF scores
- Defined in utils/TextAnalyzer.ts:389
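
The four variants, as commonly cited, are linear combinations of four inputs: MS (percentage of words with three or more syllables), SL (average sentence length in words), IW (percentage of words with more than six letters), and ES (percentage of monosyllabic words). The sketch below applies those coefficients to precomputed inputs. Returning the variants in the order 1 to 4 is an assumption, and `WstfInputsSketch` and `wstfSketch` are hypothetical names.

```typescript
// Hypothetical sketch of the four Wiener Sachtextformel variants.
interface WstfInputsSketch {
  ms: number; // % of words with three or more syllables
  sl: number; // average sentence length in words
  iw: number; // % of words with more than six letters
  es: number; // % of monosyllabic words
}

function wstfSketch(
  input: WstfInputsSketch,
): [number, number, number, number] {
  const { ms, sl, iw, es } = input;
  return [
    0.1935 * ms + 0.1672 * sl + 0.1297 * iw - 0.0327 * es - 0.875, // variant 1
    0.2007 * ms + 0.1682 * sl + 0.1373 * iw - 2.779, // variant 2
    0.2963 * ms + 0.1905 * sl - 1.1144, // variant 3
    0.2744 * ms + 0.2656 * sl - 1.693, // variant 4
  ];
}
```

The results are intended to approximate a school grade level, so the four values for one text usually lie close together.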

hasNumbers

hasNumbers(): boolean
Checks if the text contains any numbers.

Returns boolean
- True if numbers are present, false otherwise
- Defined in utils/TextAnalyzer.ts:202

`Private` tokenize

tokenize(): void
Tokenizes the input text into words and sentences.

Returns void
- Defined in utils/TextAnalyzer.ts:63
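
The tokenization rules are defined by the private `REGEX` patterns, which are not shown in this reference. As a rough illustration only, a regex-based tokenizer could look like the sketch below. Both patterns, the lowercasing of words, and the names `WORD_SKETCH`, `SENTENCE_END_SKETCH`, and `tokenizeSketch` are assumptions.

```typescript
// Hypothetical sketch: regex-based word and sentence tokenization.
// Assumed patterns; the real REGEX.word and REGEX.sentence are not shown.
// The range À-ÿ is a rough Latin-1 approximation of accented letters.
const WORD_SKETCH = /[A-Za-zÀ-ÿ0-9']+/g;
const SENTENCE_END_SKETCH = /[.!?]+/;

function tokenizeSketch(text: string): {
  words: string[];
  sentences: string[];
} {
  // Lowercasing words is an assumption that keeps frequency counts case-insensitive.
  const words = (text.match(WORD_SKETCH) ?? []).map((w) => w.toLowerCase());
  // Split on runs of terminal punctuation and drop empty fragments.
  const sentences = text
    .split(SENTENCE_END_SKETCH)
    .map((s) => s.trim())
    .filter((s) => s.length > 0);
  return { words, sentences };
}

const tokenExample = tokenizeSketch("Hello world. How are you?");
// tokenExample.words has 5 entries; tokenExample.sentences has 2
```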
