Forbes Celebrity List + Wikipedia Page Edits Data Set

The Data

Forbes Celebrities List from 1999-2017


Also known as the Celebrity 100, the annual list created by Forbes tracks the world's highest-paid entertainers. This is the definitive global index on the top money makers in the business of entertainment.

The dataset contains records from 1999 to 2017 and consists of 192 rows and 7 columns.
The names of the celebrities


The Variables:

Year - the year of the list
Rank - a celebrity's rank in a particular year
Recipient - celebrity name
Country - celebrity's country of origin
Career - the industry the celebrity is in
Tied
Title - celebrity name

Wikipedia Edits for the Celebrity Pages



Wikipedia is a free online encyclopedia that can be created and edited by just about anyone. It is probably the largest collaboratively edited reference online.

The dataset contains edit information for the Wikipedia pages of the celebrities on the Forbes Celebrity List.

The dataset contains 724,335 rows and 7 columns.
The upper part of the dataset containing variables


The Variables:

title - celebrity name
parentid - the id of the parent (previous) revision
revid - the revision id
timestamp - the time the revision was made
user - the editor's username
userid - the editor's user id
size - the size of the revision in bytes

Once merged, the dataset balloons to 1,916,973 rows and 13 columns. Here's what the combined documents look like:
The upper part of the merged Forbes and Wiki documents
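The jump in row count is what you would expect from a one-to-many join. Here is a minimal sketch of the idea, assuming the two tables are joined on the celebrity name (Title on the Forbes side, title on the Wikipedia side); the source does not state the join key, and the sample rows are made up. A celebrity who appears on the list in several years has every one of their edit rows repeated once per year, and keeping a single copy of the key gives 7 + 7 - 1 = 13 columns on the real data.

```python
from collections import defaultdict

# Made-up sample rows for illustration only, not real data.
forbes = [
    {"Year": 2015, "Rank": 2, "Title": "Celebrity A"},
    {"Year": 2016, "Rank": 1, "Title": "Celebrity A"},
    {"Year": 2016, "Rank": 2, "Title": "Celebrity B"},
]
edits = [
    {"title": "Celebrity A", "revid": 101, "user": "u1"},
    {"title": "Celebrity A", "revid": 102, "user": "u2"},
    {"title": "Celebrity B", "revid": 201, "user": "u1"},
]


def merge_on_title(forbes_rows, edit_rows):
    """Inner join: pair every edit with every Forbes row for the same name."""
    forbes_by_title = defaultdict(list)
    for row in forbes_rows:
        forbes_by_title[row["Title"]].append(row)

    merged = []
    for edit in edit_rows:
        for forbes_row in forbes_by_title.get(edit["title"], []):
            # Keep only one copy of the join key (the lowercase "title").
            combined = {k: v for k, v in forbes_row.items() if k != "Title"}
            combined.update(edit)
            merged.append(combined)
    return merged


merged = merge_on_title(forbes, edits)
print(len(merged), "rows,", len(merged[0]), "columns")  # 5 rows, 5 columns
```

Three edit rows become five merged rows here because Celebrity A is on the list twice. Any per-edit count taken from the merged table should therefore be de-duplicated on revid first.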


Top of the List

The current holder of the number one spot is musician Sean "Puffy" Combs.

Television personality Oprah Winfrey holds the record for the most times at number one. She has reached the top spot 5 times since 1999.
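A count like this can be reproduced by filtering the Forbes table to Rank 1 and tallying the names. The sketch below assumes the Rank and Recipient columns described earlier and uses made-up rows rather than the real list.

```python
from collections import Counter

# Made-up sample rows for illustration only, not real data.
forbes_rows = [
    {"Year": 2014, "Rank": 1, "Recipient": "Celebrity A"},
    {"Year": 2015, "Rank": 1, "Recipient": "Celebrity B"},
    {"Year": 2015, "Rank": 2, "Recipient": "Celebrity A"},
    {"Year": 2016, "Rank": 1, "Recipient": "Celebrity A"},
]


def count_number_ones(rows):
    """Tally how many times each celebrity held rank 1."""
    return Counter(row["Recipient"] for row in rows if row["Rank"] == 1)


number_ones = count_number_ones(forbes_rows)
print(number_ones.most_common(1))
```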

Relationship Between No. of Users and No. of Page Edits

A correlation coefficient of 0.32 indicates a weak to moderate positive linear relationship between the two variables (how you label it depends on where you set the correlation boundaries). Note that the coefficient is not a percentage.
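For reference, here is a minimal sketch of how a Pearson correlation coefficient is computed. It assumes the unit of observation is a Wikipedia page, with one count of distinct users and one count of edits per page; the source does not say how the figure was derived, and the sample numbers are made up. Squaring r gives the share of variance a straight-line fit would account for, which is about 0.10 for r = 0.32.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sum of cross-deviations and the two sums of squared deviations.
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    ss_x = sum((x - mean_x) ** 2 for x in xs)
    ss_y = sum((y - mean_y) ** 2 for y in ys)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / math.sqrt(ss_x * ss_y)


# Made-up per-page counts for illustration only.
users_per_page = [10, 20, 30, 40, 50]
edits_per_page = [15, 18, 60, 45, 90]

r = pearson(users_per_page, edits_per_page)
print(round(r, 2), round(r ** 2, 2))
```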


However, the two may share a non-linear relationship:

Pacquiao vs. Mayweather

They are arguably the two best boxers in history, and their careers seem intertwined because the public couldn't stop comparing them. The same could be said about the edit histories of their Wikipedia pages.

Here we see both of them reach a high number of edits in 2008. That year Pacquiao fought Oscar De La Hoya in what many people first thought was suicide for the smaller and (at that time) still up-and-coming Pacquiao. It turned out to be a mismatch the other way around: De La Hoya threw in the towel during the match, and the win made the Filipino a global sensation. It was also the year Mayweather retired from boxing for the first time and appeared in a much-publicized wrestling match.

The Filipino champion would reach greater heights in 2010, both in his career and in the number of Wikipedia page edits. That year he defeated his largest opponent to date, Antonio Margarito, and bagged his 8th title in as many weight classes.
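A yearly comparison like this one comes from bucketing each page's edits by the year in the timestamp column. Below is a minimal sketch, assuming the timestamps are ISO 8601 strings such as 2008-12-07T04:10:00Z (the format is an assumption, as are the placeholder page titles and rows).

```python
from collections import Counter
from datetime import datetime

# Assumed timestamp format; adjust if the real column differs.
TIMESTAMP_FORMAT = "%Y-%m-%dT%H:%M:%SZ"

# Made-up sample rows for illustration only, not real data.
edit_rows = [
    {"title": "Boxer A", "timestamp": "2008-12-07T04:10:00Z"},
    {"title": "Boxer A", "timestamp": "2008-12-07T05:30:00Z"},
    {"title": "Boxer A", "timestamp": "2010-11-14T03:00:00Z"},
    {"title": "Boxer B", "timestamp": "2008-06-06T12:00:00Z"},
]


def edits_per_year(rows):
    """Count edits for each (page title, year) pair."""
    counts = Counter()
    for row in rows:
        year = datetime.strptime(row["timestamp"], TIMESTAMP_FORMAT).year
        counts[(row["title"], year)] += 1
    return counts


yearly = edits_per_year(edit_rows)
for (title, year), n in sorted(yearly.items()):
    print(title, year, n)
```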

Career Volume

Page edits for celebrities categorized as musicians account for the highest volume of data.


Career Frequency

Their pages are also the most edited in total:
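Both totals can be computed in one pass over the merged table. This sketch assumes "volume" means the summed size column and "frequency" means the number of distinct edits; the source does not define either, so treat both as assumptions. Rows are de-duplicated on revid because a merged table repeats an edit once for each year the celebrity is on the list. The sample rows are made up.

```python
from collections import defaultdict

# Made-up merged rows for illustration only, not real data.
merged_rows = [
    {"Career": "Musician", "revid": 1, "size": 1000},
    {"Career": "Musician", "revid": 1, "size": 1000},  # same edit, second list year
    {"Career": "Musician", "revid": 2, "size": 1500},
    {"Career": "Athlete", "revid": 3, "size": 4000},
]


def career_totals(rows):
    """Return {career: {"edits": count, "bytes": summed size}} per career."""
    totals = defaultdict(lambda: {"edits": 0, "bytes": 0})
    seen_revids = set()
    for row in rows:
        if row["revid"] in seen_revids:
            continue  # skip repeats introduced by the merge
        seen_revids.add(row["revid"])
        totals[row["Career"]]["edits"] += 1
        totals[row["Career"]]["bytes"] += row["size"]
    return dict(totals)


totals = career_totals(merged_rows)
print(totals)
```

In this toy data the career with the most edits is not the one with the most bytes, which is why the two measures are worth reporting separately.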


Avidness of Fans

On average, the pages of golfers have the highest number of edits per user.
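One plausible way to get this figure is to divide the number of distinct edits in a career by the number of distinct editors in that career. That definition is an assumption (the source does not spell out the calculation), and the rows below are made up.

```python
from collections import defaultdict

# Made-up merged rows for illustration only, not real data.
rows = [
    {"Career": "Golfer", "revid": 1, "user": "u1"},
    {"Career": "Golfer", "revid": 2, "user": "u1"},
    {"Career": "Golfer", "revid": 3, "user": "u2"},
    {"Career": "Golfer", "revid": 4, "user": "u2"},
    {"Career": "Musician", "revid": 5, "user": "u1"},
    {"Career": "Musician", "revid": 6, "user": "u3"},
    {"Career": "Musician", "revid": 7, "user": "u4"},
]


def edits_per_user_by_career(rows):
    """Distinct edits divided by distinct editors, for each career."""
    revids = defaultdict(set)
    users = defaultdict(set)
    for row in rows:
        # Sets absorb any rows repeated by the merge.
        revids[row["Career"]].add(row["revid"])
        users[row["Career"]].add(row["user"])
    return {career: len(revids[career]) / len(users[career]) for career in revids}


ratios = edits_per_user_by_career(rows)
print(ratios)  # {'Golfer': 2.0, 'Musician': 1.0}
```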


Super Fans

Here are the users with the highest numbers of edits:

*Editors are allowed to change accounts on Wikipedia, so edits may be misattributed.
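A ranking of editors can be built by counting rows per user in the Wikipedia edits table. This is a minimal sketch with made-up usernames. It counts by the user column, so anyone editing under several accounts is split across several entries, which is the caveat noted above.

```python
from collections import Counter

# Made-up edit rows for illustration only, not real data.
edit_rows = [
    {"revid": 1, "user": "editor_x"},
    {"revid": 2, "user": "editor_y"},
    {"revid": 3, "user": "editor_x"},
    {"revid": 4, "user": "editor_x"},
    {"revid": 5, "user": "editor_z"},
    {"revid": 6, "user": "editor_y"},
]


def top_editors(rows, n=10):
    """Return the n users with the most edits as (user, count) pairs."""
    return Counter(row["user"] for row in rows).most_common(n)


top = top_editors(edit_rows, n=2)
print(top)  # [('editor_x', 3), ('editor_y', 2)]
```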

Bonus

