Kyumin Lee
- PhD Candidate
- Infolab (Web and Distributed Information Management Lab)
- Center for the Study of Digital Libraries
- Department of Computer Science and Engineering
- Texas A&M University
- College Station, TX 77843
- kyumin (at) cse.tamu.edu
Bio
Kyumin Lee will join the Department of Computer Science at Utah State University as an assistant professor in July 2013.
He is currently a PhD candidate in the Department of Computer Science and Engineering at Texas A&M University. He is a member of Infolab, and his advisor is Dr. James Caverlee. He received the M.S. degree in computer engineering from Sungkyunkwan University, South Korea, in 2007 (advisor: Dr. Dong-Ryeol Shin) and the B.S. double degree in computer science and electronic engineering from Kyonggi University, South Korea, in 2005.
His primary research interests are in information quality and data analytics over large-scale networked information systems. Application areas are the Web, social media systems, mobile information systems, enterprise and healthcare networks, and other emerging distributed systems.
Recent Publications (Full List :: Google Scholar :: DBLP)
- Kyumin Lee, Prithivi Tamilarasan, James Caverlee. Crowdturfers, Campaigns, and Social Media: Tracking and Revealing Crowdsourced Manipulation of Social Media. 7th International AAAI Conference on Weblogs and Social Media (ICWSM), to appear [20%].
- Kyumin Lee, Krishna Kamath, James Caverlee. Combating Threats to Collective Attention in Social Media: An Evaluation. 7th International AAAI Conference on Weblogs and Social Media (ICWSM), to appear [20%].
- Kyumin Lee, James Caverlee, Zhiyuan Cheng, Daniel Z. Sui. Campaign Extraction from Social Media. ACM Transactions on Intelligent Systems and Technology (ACM TIST), to appear.
- Krishna Kamath, James Caverlee, Kyumin Lee, Zhiyuan Cheng. Spatio-Temporal Dynamics of Online Memes: A Study of Geo-Tagged Tweets. 22nd International World Wide Web Conference (WWW). Rio de Janeiro, May 2013 [15%].
- Zhiyuan Cheng, James Caverlee, Kyumin Lee. A Content-Driven Framework for Geo-locating Microblog Users. ACM Transactions on Intelligent Systems and Technology (ACM TIST), Vol.4, No.1, 2013.
- Kyumin Lee, James Caverlee, Krishna Kamath, Zhiyuan Cheng. Detecting Collective Attention Spam. 2nd Joint WICOW/AIRWeb Workshop on Web Quality (WebQuality) in conjunction with WWW 2012. Lyon, April 2012. [slides]
- Kyumin Lee, James Caverlee, Zhiyuan Cheng, Daniel Z. Sui. Content-Driven Detection of Campaigns in Social Media (short paper). 20th ACM International Conference on Information and Knowledge Management (CIKM). Glasgow, October 2011 [25%].
- Zhiyuan Cheng, James Caverlee, Krishna Kamath, Kyumin Lee. Toward Traffic-Driven Location-Based Web Search. 20th ACM International Conference on Information and Knowledge Management (CIKM). Glasgow, October 2011 [15%].
- Kyumin Lee, Brian David Eoff, James Caverlee. Seven Months with the Devils: A Long-Term Study of Content Polluters on Twitter. 5th International AAAI Conference on Weblogs and Social Media (ICWSM). Barcelona, July, 2011. [23%] (Dataset)
- Zhiyuan Cheng, James Caverlee, Kyumin Lee, Daniel Z. Sui. Exploring Millions of Footprints in Location Sharing Services. 5th International AAAI Conference on Weblogs and Social Media (ICWSM). Barcelona, July, 2011. [11%] (Dataset)
- James Caverlee, Zhiyuan Cheng, Brian Eoff, Chiao-Fang Hsu, Krishna Kamath, Said Kashoob, Jeremy Kelley, Elham Khabiri, Kyumin Lee. SocialTrust++: Building community-based trust in Social Information Systems (invited paper). 6th International Conference on Collaborative Computing (CollaborateCom). Chicago, October 2010.
- Zhiyuan Cheng, James Caverlee, Kyumin Lee. You Are Where You Tweet: A Content-Based Approach to Geolocating Twitter Users. 19th ACM International Conference on Information and Knowledge Management (CIKM). Toronto, October 2010. [13%] (Dataset)
- Kyumin Lee, James Caverlee, Steve Webb. Uncovering Social Spammers: Social Honeypots + Machine Learning. 33rd Annual ACM SIGIR Conference. Geneva, July 2010. [17%]
- Kyumin Lee, Brian David Eoff, James Caverlee. Devils, Angels, and Robots: Tempting Destructive Users in Social Media (short paper). 4th International AAAI Conference on Weblogs and Social Media (ICWSM). Washington, DC, May 2010.
- Kyumin Lee, James Caverlee, Steve Webb. The Social Honeypot Project: Protecting Online Communities from Spammers (poster). 19th International World Wide Web Conference (WWW). Raleigh, April 2010.
Professional Activities
- Program committee member: DUBMOD 2013, ECIR 2013, AINA 2013, ECIR 2012
- Reviewer: ICWSM 2013, IEEE/ACM TON, CIKM 2012, International Journal of Cooperative Information Systems, WebQuality 2012, WWW 2012, AAAI-Web 2012, ACM TISSEC, Elsevier Knowledge-Based Systems, WWW 2011, IEEE Internet Computing, CoopIS 2010, ACM TWEB, CollaborateCom 2009 and ICDM 2009
Teaching Experience
- Spring 2013: Information Storage and Retrieval (CSCE670), Guest lecturer; Instructor: James Caverlee
- Fall 2012: Information Storage and Retrieval (CSCE470), Guest lecturer; Instructor: James Caverlee
- Fall 2011: Information Storage and Retrieval (CSCE470), Teaching assistant and guest lecturer; Instructor: James Caverlee
- Fall 2010: Information Storage and Retrieval (CSCE470), Teaching assistant and guest lecturer; Instructor: James Caverlee
- Summer 2010: Programming with C&JAVA (CSCE 601), Teaching assistant; Instructor: Walter Daugherity
- Spring 2010: Information Storage and Retrieval (CSCE 670), Teaching assistant; Instructor: James Caverlee (The Best TA Award in the department)
Work Experience
- Research Intern, IBM Research - Almaden, June ~ Aug 2012 (Mentors: Dr. Jalal Mahmud, Dr. Jilin Chen and Dr. Jeffrey Nichols)
- Research Intern, eBay Research Labs, May ~ Aug 2011 (Mentors: Nish Parikh and Dr. Neel Sundaresan)
- Assistant Manager and Web Service Developer, *NHN, July 2006 ~ July 2008
*NHN operates the most popular Internet portal and search engine (Naver) and the number one online game portal (Hangame) in South Korea.
Datasets
- Social Honeypot Dataset: 40K content polluters (spammers) and legitimate users, and their 5.5 million tweets on Twitter.
- Location Sharing Services Dataset: a location sharing service dataset that includes 22 million check-ins.
- Twitter Location Dataset: a dataset of user-generated content from Twitter.
Awards
- 1st Place in Department's Industrial Affiliates Poster Competition ("Combating Threats to Collective Attention in Social Media"), Fall 2012
- 1st Place in Department's Industrial Affiliates Poster Competition ("Content-Driven Detection of Campaigns in Social Media"), Spring 2012.
- 1st Place in Department's Industrial Affiliates Poster Competition ("Seven Months with the Devils: A Long-Term Study of Content Polluters on Twitter"), Fall 2011.
- Finalist, Symantec Research Labs Graduate Fellowship, 2011.
- SIGIR Student Travel Grant, ACM Special Interest Group on Information Retrieval (SIGIR), 2010
- Graduate Teaching Assistant Excellence Award, Department of Computer Science and Engineering, Texas A&M University, 2010
- Industrial Affiliates Program Scholarship, Department of Computer Science and Engineering, Texas A&M University, 2008-2009
- National Graduate S&T Scholarship, Korea Science and Engineering Foundation, January 2007, South Korea
- The Best Presentation Paper Award, the Institute of Electronics Engineers of Korea (IEEK), December 2006, South Korea
- The Best Paper Award (the President Award), Sungkyunkwan University, August 2006, South Korea
- SIMSAN Scholarship, Sungkyunkwan University, May 2006, South Korea
- Dean's list, Kyonggi University, 1998 and 2003, South Korea
Press Coverage
- Phony Twitter Profiles Aim to Outwit Spammers, MIT Technology Review, July 2010