emoji_density() measures how emoji-heavy each text is: the number of
emoji per character and per whitespace-delimited token. Rows with no emoji
get densities of 0; rows whose text is NA or empty get NA.
Value
data, as a tibble, with added columns .emoji_n,
.emoji_per_char (emoji per character of text) and .emoji_per_token
(emoji per whitespace-delimited token).
Examples
df <- data.frame(text = c("hi \U0001f600", "\U0001f600\U0001f600", "plain"))
emoji_density(df, text)
#> # A tibble: 3 × 4
#> text .emoji_n .emoji_per_char .emoji_per_token
#> <chr> <int> <dbl> <dbl>
#> 1 hi 😀 1 0.25 0.5
#> 2 😀😀 2 1 2
#> 3 plain 0 0 0