How to process example data in an R-packet in which lines with the UTF-8 index are marked

I would like to include an example dataset (from Twittertweets and metadata) in the package Rthat I am writing.

I downloaded the example data.frame with Twitter APIand saved it as .RData(with the corresponding. RData description file) in my package.

When I run R CMDCheck, I get the following NOTE,

 * checking data for non-ASCII characters ... NOTE
 Note: found 287 marked UTF-8 strings

I tried to save data.framewith help ASCII=TRUE, hoping this fixes the problem. But it persists. Any idea on how I can get R CMDCHECK to run without notes?

(I would also be open to deleting all marked lines UTF-8from the example data if this is a solution). Thank!

example line from data.frame:

First time in SF (@ San Francisco International Airport (SFO) - @flysfo in San Francisco, CA) https://t.co/1245xqxtwesr
  favorited favoriteCount replyToSN             created truncated replyToSID                 id replyToUID
1     FALSE             0      <NA> 2015-03-13 23:30:35     FALSE       <NA> 576525795927179264       <NA>
                                                   statusSource screenName retweetCount isRetweet retweeted
1 <a href="http://foursquare.com" rel="nofollow">Foursquare</a>  my_name93            0     FALSE     FALSE
      longitude    latitude
1 -122.38100052 37.61865062
+5
2

, , :

, UTF-8, , Twitter emoji.

, , , PACKAGE CMD CHECK, UTF-8.

:

nonUTF < - iconv(df$TroubleVector, from="UTF-8", to="ASCII")

, emoji, .. NA, UTF-8. - .

+5

, / , , ascii- (utf-8)

0

All Articles