Twitter

Rubiscape lets you create a dataset from Twitter data. You can then perform Twitter Sentiment Analysis on this data using the text processing algorithms provided in Text Analytics. Sentiment analysis determines the tone (Neutral, Positive, or Negative) of each tweet, which can give useful insight into market trends and competitors' policies and, in turn, help decision-makers shape marketing strategies. To retrieve the tweets for this analysis, Rubiscape uses Tweepy, a Python library designed to access the Twitter API. A request to the Twitter API returns the Tweet fields (Features), which can then be analysed using Text Analytics.
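
As a rough illustration of the kind of Tweepy call involved, the sketch below authenticates with the four credentials described in the steps that follow and checks whether the Twitter API accepts them. It assumes Tweepy 4.x. The helper names `build_api` and `credentials_are_valid` are hypothetical, and this is not Rubiscape's internal code.

```python
def build_api(api_key, api_secret, access_token, access_secret):
    """Return a Tweepy API client for the four credentials (Tweepy 4.x assumed)."""
    # Third-party package, imported lazily so the helper below works without it.
    import tweepy

    auth = tweepy.OAuth1UserHandler(api_key, api_secret, access_token, access_secret)
    return tweepy.API(auth)


def credentials_are_valid(api) -> bool:
    """Return True if the API accepts the credentials, False otherwise.

    `api` is any object with a `verify_credentials()` method, such as the
    client returned by `build_api`. In Tweepy 4.x a rejected credential set
    raises an exception, which is caught here.
    """
    try:
        api.verify_credentials()
    except Exception:
        return False
    return True


# Hypothetical usage with placeholder values:
# api = build_api("API_KEY", "API_SECRET", "ACCESS_TOKEN", "ACCESS_SECRET")
# print(credentials_are_valid(api))
```

A check of this kind is presumably what happens when you click Verify in the steps below, although the source does not describe the mechanism.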

To create a Twitter dataset, follow the steps given below.

  1. On the home page, click the Create icon. The Product Selection page is displayed.
  2. Hover over the Data Connect tile, and click Create Dataset.

    The Dataset Selection page is displayed.
  3. From the Social Media option, select Twitter.

    The Create Twitter Dataset page is displayed.
  4. Enter the Name of the dataset.
  5. Enter the Description for the dataset.

  6. Enter the Hashtag (#) for which you want to pull data. To use multiple hashtags, separate them with commas.

  7. Enter the API Key to access data from the Twitter platform.

  8. Enter the API Secret of the Twitter account.

  9. Enter the Access Token of the Twitter account.

  10. Enter the Access Secret of the Twitter account.

  11. Click Verify.

    The table given below explains the Features available for the Twitter dataset.

    Features of the Twitter dataset collected by Rubiscape

    | Feature | Description |
    |---|---|
    | tweet_id | The unique identifier for the tweet. |
    | user_id | The unique identifier for the user who posted the tweet. |
    | tweet | The actual text of the tweet. |
    | username | The name of the Twitter handle that posted the tweet. |
    | language | The language of the tweet. |
    | location | The location from where the tweet was posted. |
    | place | The place the tweet is associated with. This place is not necessarily the place from where the tweet was posted. |
    | device | The device used to post the tweet. |
    | retweet | This field represents whether the tweet is a retweet or not. Possible values are True and False. |
    | retweetcount | In the case of a retweet, this field represents the number of times the tweet has been retweeted. |
    | reply_to_userid | If the represented tweet is a reply, this field represents the original Tweet's author ID. |
    | reply_to_statusid | If the represented tweet is a reply, this field represents the integer representation of the original Tweet's ID. |
    | time | The date and time of the tweet. |


  12. Click Create after successful verification.



    The Twitter dataset is created in Rubiscape and is available for use in your workbooks and workflows.
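
Step 6 accepts several comma-separated hashtags. One plausible way to turn that input into a single Twitter search query is sketched below. The function name `build_hashtag_query` is hypothetical, and joining the tags with the `OR` search operator is an assumption: the source does not say how Rubiscape combines them.

```python
def build_hashtag_query(raw: str) -> str:
    """Turn a comma-separated hashtag field into one search query string.

    Whitespace and a leading '#' are stripped from each entry, empty entries
    and case-insensitive duplicates are dropped, and the remaining tags are
    joined with OR (an assumed choice, not confirmed by the source).
    """
    tags = []
    seen = set()
    for part in raw.split(","):
        tag = part.strip().lstrip("#")
        if tag and tag.lower() not in seen:
            seen.add(tag.lower())
            tags.append(tag)
    return " OR ".join(f"#{tag}" for tag in tags)


print(build_hashtag_query("#python, datascience ,#Python"))
```

With Tweepy 4.x, a string like this could be passed as the `q` argument of `API.search_tweets`.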

    Notes
    • To obtain the API Key, API Secret, Access Token, and Access Secret, you need to create a Twitter Developer Account.
    • Enabling the "Disable Cache" option allows you to create a dataset without generating a dataset cache.
    • When you select "Disable Cache", the dashboard does not offer the "Enable Direct Query" option. For more information, refer to the "Enable Direct Query" document.
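
To make the feature table in step 11 more concrete, the sketch below maps one tweet payload onto those thirteen column names. It assumes the raw JSON layout of a Twitter API v1.1 tweet object (the payload Tweepy exposes on a `Status` object as `_json`). The function name `tweet_to_row` and the choice of source field for each column are assumptions made for illustration, not Rubiscape's documented mapping.

```python
def tweet_to_row(tweet: dict) -> dict:
    """Map a v1.1-style tweet payload to the feature names in the table.

    Which payload field feeds each column is an assumption; missing fields
    become None rather than raising an error.
    """
    user = tweet.get("user") or {}
    place = tweet.get("place") or {}
    return {
        "tweet_id": tweet.get("id"),
        "user_id": user.get("id"),
        "tweet": tweet.get("text"),
        "username": user.get("screen_name"),
        "language": tweet.get("lang"),
        "location": user.get("location"),        # assumed: profile location
        "place": place.get("full_name"),         # None when no place is tagged
        "device": tweet.get("source"),           # assumed: posting client
        "retweet": "retweeted_status" in tweet,  # True only for retweets
        "retweetcount": tweet.get("retweet_count", 0),
        "reply_to_userid": tweet.get("in_reply_to_user_id"),
        "reply_to_statusid": tweet.get("in_reply_to_status_id"),
        "time": tweet.get("created_at"),
    }


# Made-up payload, for illustration only.
sample = {
    "id": 1,
    "text": "Trying out #rubiscape",
    "lang": "en",
    "source": "Example Client",
    "retweet_count": 0,
    "in_reply_to_user_id": None,
    "in_reply_to_status_id": None,
    "created_at": "Mon Jan 01 00:00:00 +0000 2024",
    "place": None,
    "user": {"id": 42, "screen_name": "example_user", "location": "Pune"},
}
row = tweet_to_row(sample)
print(row["username"], row["retweet"])
```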
