League of Legends Champion Select Assistant

This project was done in collaboration with fellow University of Maryland Information Science Student Ryan at the University of Maryland and was made possible through the contributions of the various League of Legends subreddits and Discord channels. Our mission is to understand the thought process behind selecting a champion in League of Legends. We intend to use the statistics we gained here to make informed decisions for new players to help ease the tensions of a new players' starting to play ranked. None of our work is currently monetized and this is purely a passion project, any inquiries may be made to leaguechampassist@gmail.com. Our languages of choice has been Rust and Python, Python providing robust data manipulation tools and Rust offering a fast, thread-safe language which is easy to use. (Though I am slightly biased, given I'm a rustacean.)

From December 21st to December 30th, we posted to the various champion main subreddits, along with various personal and official League of Legends Discord channels. This was an overall success, and we accrued 4805 total responses. Of those, we had to throw away 104 responses due to trolling, duplicate data, or uncleanable data (saying marksman instead of a champion name, etc...), amounting to a 2.17% loss in data. We posted to 148 subreddits, though 3 posts were removed shortly after arriving. Our post to the Caitlyn, Senna, and Shen main subreddits were taken down for lack of direct connection to the champion. The Caitlyn main Subreddit very quickly reinstated the post, with about 3 hours of downtime while the Senna and Shen main Subreddits were never reinstated. The posts from the Senna and Shen subreddits were removed by the same moderator (how's that for impartial judgement), and there were no other useful moderators willing to help so we had to keep that in mind while examining this data. Below you will find our survey results.

Response by region, unsurprisingly most responses came from North America and Europe, totaling to 4103 responses, or 87.3 percent. We did not post on Weibo to gather Chinese data due to language barriers, though we considered it.

Mapping Between Aggression Levels and Rank

Given Rank Aggression Number of Responses
Iron 4 2
Bronze 3.67 52
Silver 3.59 330
Gold 3.77 762
Platinum 3.80 678
Diamond 3.89 396
Master 3.42 36
Grandmaster 4.5 8
Challenger 3.6 11

We collected information on rank if the partcipant stated their preferred game mode was Ranked Draft. Their average aggression level was analyzed in tandem with the amount of responses from the corresponding rank. We can clearly see that Iron, Master, Grandmaster, and Challenger had very low amounts of responses (unsurprisingly given these are the smallest ranked groups by population). What was surprising was the response rate of Platinum players, as they are behind Silver in population, but had a higher response rate, as well as a similar level of aggression. This suggests that the average viewer of champion main subreddits are those dedicated to playing the game so as to be better than average, even if only at one champion.


A mapping of Total Champion Picks to those champions' average aggression levels. Notably, we see that champions with a high-risk playstyle are often played extremely aggressively, but may have a lower pick rate as they fit a niche group. A prime example is Draven, who has the highest average aggression at 4.59, while having a relatively low response rate, with only 32 total responses. Meanwhile a champion like Corki, who has been out of the meta for a long time unsurprisingly had the second lowest amount of picks, while simultaneously having the lowest aggression, suggesting a more reserved and careful playstyle, which players who frequent reddit may not be accustomed to.

FAQ

Why did we ask for your summoner name?

That was mostly for verification, it saves us a lot of trouble if we can identify someone who double submitted data by their summoner name rather than guessing if two players share a very similar champion pool. Additionally, we were aware people could troll with a fake summoner name, but fortunately it seems most trolls didn't take time to make their answers look legitimate so a small script to check if their username exists on the given server made things so much easier to root through. We hold to our decision and think it helped increase our total number of responses.

Why not have people sign in to Google? Why is this so short? Why aren't the questions super detailed?

We understand that some people were expecting a super in-depth survey where they can write an essay about how much they love playing Malphite, but fact of the matter is that the longer a survey is, or the more people have to think about their answers, the less likely they are to finish said survey and submit it. This ties in to the reason it's short and there was no google sign in required, as for every obstacle you put in someone's way, the less likely they are to finish the given task.

Why are you posting to X champion mains subreddits? Won't that skew your data?

So we were initially concerned with not getting enough data and we came to a point where we said "If we post to one champion main subreddit, we need to post to all of them, but if we do post to all of them then we know we're getting enough data." We made the decision to post to every subreddit and achieved exactly what we meant to. Initial scanning of the data seems to show that even one tricks often had fall back champions and only a very small percentage would dodge, int, or troll regardless. Additionally there was concern with the size of each subreddit, and the total number of votes by subscribed members, but I think I have a way to account for this using the size of the subreddit at the time the survey was taken combined with an average 'Online' count from that subreddit.

What did you learn about creating and maintaining a survey?

We learned a lot from making this survey. It was a first-time for both of us and using feedback from before should help our future endeavours. Specifically we learned:
1) Make as many answers as possible preset. Reading over ~4700 individual comments is not my idea of a fun night.
2) Ignore the haters. We got a lot of flak on various subreddits for "spam" and "not doing things right". We take feedback seriously but coming from people who have no idea what we're doing, it was frustrating.
3) Sometimes you can't fix an issue before you send the survey out. It's OK, it happens, put out a notice and fix what you can, no one will mind.

What's next?

Now we unfortunately must go dark, time for my friend and I to work on understanding all this data. We do not have a timeline as we both are University students and our semesters' can be hectic. We're hopeful to have a prototype by Mid-Summer, though of course we can't promise anything. I just wanted to give another big thank you to anyone who contributed to the survey or is interested in the project.

Any business inquiries can be sent to: leaguechampassist@gmail.com