Question

Suppose a tweet is represented as a tuple (tweet id, Boolean, Boolean). The second Boolean element...

Suppose a tweet is represented as a tuple (tweet id, Boolean, Boolean). The second Boolean element will be True if the tweet matches Twitter REST API Query otherwise, it will be False. The third Boolean element will be True if the tweet content is positive (relevant to the topic of interest); Otherwise, it will be False.
The whole twitter space is represented as the set of tweets:
Suppose the crawled tweets using Twitter REST API Query is represented as the set of tweets: M = {(2, True, False), (3, True, True), (6, True, True), (9, True, False), (14, True, False)} Suppose the set of tweets from randomly sampled users is: D0 = {(1, False, False), (4, True, True), (6, True, True), (9, True, False), (10, False, True), (12, True, False), (13, True, False), (14, True, False), (15, True, True)}
Give the above sampled tweets, calculate the approximate values of the three metrics, including API recall, quality recall, and quality precision.

0 0
Add a comment Improve this question Transcribed image text
Answer #1

Answer;

Given that the whole set of Twitter space is represented from the mentioned 2 sets of tweets.

So in total there are the following unique tweet IDs from the 2 sets.

3, 4, 5, 7, 8, 10, 11, 12, 13, 14, 15, 17

Now API recall is the fraction of the tweets that are successfully retrieved by API.

The tweets from the whole set is(only showing the ids): 3, 4, 5, 7, 8, 10, 11, 12, 13, 14, 15, 17

Out of this API got the tweets with ids: 4, 7, 8, 10, 11, 13, 17

Therefore API recall is: 7/12

Quality recall is the fraction of the relevant(quality) tweets of the relevant total tweet set.

The relevant tweet from the whole set is(only showing the ids): 5, 10, 11, 13, 15

Out of this API got the tweets with ids: 10, 11, 13

Therefore API recall is: 3/5

Quality precision is the fraction of the retrieved(quality) tweets that are successfully retrieved by API.

Total tweet retrieved by API(only showing the ids): 4, 7, 8, 10, 11, 13, 17

The relevant tweet got by API: 10, 11, 13

Therefore Quality precision is: 3/7

The difference between precision and recall is that: precision counts the fraction of relevance among the retrieved documents whereas recall counts the fraction of relevance among the whole set of documents.

Add a comment
Know the answer?
Add Answer to:
Suppose a tweet is represented as a tuple (tweet id, Boolean, Boolean). The second Boolean element...
Your Answer:

Post as a guest

Your Name:

What's your source?

Earn Coins

Coins can be redeemed for fabulous gifts.

Not the answer you're looking for? Ask your own homework help question. Our experts will answer your question WITHIN MINUTES for Free.
Similar Homework Help Questions
  • CSC 142 Music Player You will complete this project by implementing one class. Afterwards, your program...

    CSC 142 Music Player You will complete this project by implementing one class. Afterwards, your program will play music from a text file. Objectives Working with lists Background This project addresses playing music. A song consists of notes, each of which has a length (duration) and pitch. The pitch of a note is described with a letter ranging from A to G.   7 notes is not enough to play very interesting music, so there are multiple octaves; after we reach...

  • ***JAVA: Please make "Thing" movies. Objective In this assignment, you are asked to implement a bag...

    ***JAVA: Please make "Thing" movies. Objective In this assignment, you are asked to implement a bag collection using a linked list and, in order to focus on the linked list implementation details, we will implement the collection to store only one type of object of your choice (i.e., not generic). You can use the object you created for program #2 IMPORTANT: You may not use the LinkedList class from the java library. Instead, you must implement your own linked list...

  • Item 1 In the case below, the original source material is given along with a sample...

    Item 1 In the case below, the original source material is given along with a sample of student work. Determine the type of plagiarism by clicking the appropriate radio button. Original Source Material Student Version Suppose you study a group of successful companies and you find that they emphasize customer focus, or quality improvement, or empowerment; how do you know that you haven't merely discovered the management practice equivalent of having buildings? How do you know that you've discovered something...

  • Code is in C# Your instructor would like to thank to Marty Stepp and Hélène Martin...

    Code is in C# Your instructor would like to thank to Marty Stepp and Hélène Martin at the University of Washington, Seattle, who originally wrote this assignment (for their CSE 142, in Java) This program focuses on classes and objects. Turn in two files named Birthday.cs and Date.cs. You will also need the support file Date.dll; it is contained in the starter project for this assignment. The assignment has two parts: a client program that uses Date objects, and a...

ADVERTISEMENT
Free Homework Help App
Download From Google Play
Scan Your Homework
to Get Instant Free Answers
Need Online Homework Help?
Ask a Question
Get Answers For Free
Most questions answered within 3 hours.
ADVERTISEMENT
ADVERTISEMENT