Skip to contents

If users would like to know how many Emojis and what kinds of Emojis each Tweet has, emoji_extract is a useful function to output a global summary with the row number of each Tweet containing Emoji and the Unicodes associated with each Tweet.

Usage

emoji_extract_unnest(tweet_tbl, tweet_text)

Arguments

tweet_tbl

A dataframe/tibble containing tweets/text.

tweet_text

The tweet/text column.

Value

A summary tibble with the original row number and Emoji count.

Examples

library(dplyr)
data.frame(tweets = c("I love tidyverse \U0001f600\U0001f603\U0001f603",
                      "R is my language! \U0001f601\U0001f606\U0001f605",
                      "This Tweet does not have Emoji!",
                      "Wearing a mask\U0001f637\U0001f637\U0001f637.",
                      "Emoji does not appear in all Tweets",
                      "A flag \U0001f600\U0001f3c1")) %>%
         emoji_extract_unnest(tweets)
#> # A tibble: 8 × 3
#>   row_number .emoji_unicode .emoji_count
#>        <int> <chr>                 <int>
#> 1          1 😀                        1
#> 2          1 😃                        2
#> 3          2 😁                        1
#> 4          2 😅                        1
#> 5          2 😆                        1
#> 6          4 😷                        3
#> 7          6 🏁                        1
#> 8          6 😀                        1