Email: {first}{last_initial} | umich {dot} edu    me | papers | professional | personal
Cong Yu
Research Scientist
Google Research
76 9th Ave, 4th Floor
New York, NY 10011
Recent News:

  + January 2012, Tokyo, Japan: Shonan Meeting
  + November 2011, Durham, NC: talk @ Duke
  + October 2011, Newark, NJ: talk @ NJIT
  + October 2011: DB/IR Day 2011 (co-organizer)
  + Spring 2011: teaching CS6093 @ NYU-Poly

 Brief Introduction:
  I am a Research Scientist at Google Research NYC. My main research interests are social content recommendation and mining and scalable structured data extraction and processing. Previously, I also worked on database usability and data integration. Before Google, I was with Yahoo! Research New York, from July 2007 to September 2010. While there, I was mainly involved in two projects: Purple SOX and Royal Jelly. Before Yahoo!, I graduated from the Database Group at the University of Michigan, Ann Arbor, with a Ph.D. degree in Computer Science and Engineering. My advisor is Prof. H. V. Jagadish.
  I am a devoted Michigan Wolverines fan. I have followed all the games since the Michigan-OSU game in the 2003 season. Things I read: New Yorker, Economist, New York Times, Wall Street Journal.

 Professional:  [back to top]
Program Committee (selected):
    2012: WSDM, EDBT, VLDB (E&A), SIGMOD
    2011: ICDE, APWeb (track chair), SIGMOD (repeatability), DBSocial (Co-Chair), IJCAI, VLDB
    2010: ICDE, WWW, AAAI, VLDB, CIKM (senior PC)
    2009: WWW (Developers Track Co-chair), VLDB, CIKM

Students I mentored:
    Alban Galland (INRIA); Jian Huang (Penn State, now at Google); Arnab Nandi (Michigan, now at Ohio State); Lujun Fang (Michigan); Herald Kllapi (Univ. of Athens)

 Selected Recent Publications:  [back to top]
  For a more complete list of publications, please visit DBLP, thanks to Michael Ley.

 Social Content Mining and Exploration
  mn# Mahashweta Das, Sihem Amer-Yahia, Gautam Das, and Cong Yu. MRI: Meaningful Interpretations of Collaborative Ratings. In VLDB, Seattle, Washington, 2011. [paper]
  rc# Senjuti Basu-Roy, Gautam Das, Sihem Amer-Yahia, and Cong Yu. Interactive Itinerary Planning. In ICDE, Hannover, Germany, 2011. [paper]
  mn# Munmun De Choudhury, Moran Feldman, Sihem Amer-Yahia, Nadav Golbandi, Ronny Lempel, and Cong Yu. Automatic Construction of Travel Itineraries using Social Breadcrumbs. In Hypertext, Toronto, Canada, 2010. [paper] [poster @ WWW 2010] [WSJ]
  rc# Senjuti Basu-Roy, Sihem Amer-Yahia, Ashish Chawla, Gautam Das, and Cong Yu. Constructing and Exploring Composite Items. In SIGMOD, Indianapolis, IN, 2010. [paper]
  dv# Sihem Amer-Yahia, Laks Lakshmanan, Sergei Vassilvitskii, and Cong Yu. Battling Predictability and Overconcentration in Recommender Systems. IEEE Data Engineering Bulletin. December 2009. [paper] [short paper @ RecSys 2009]
  rc# Sihem Amer-Yahia, Senjuti Basu-Roy, Ashish Chawla, Gautam Das, and Cong Yu. Group Recommendation: Semantics and Efficiency. In VLDB, Lyon, France, 2009. [paper] [extended version in The VLDB Journal, 2010]
  # Sihem Amer-Yahia, Jian Huang, and Cong Yu. Building Community-Centric Information Exploration Applications on Social Content Sites. In SIGMOD (industrial), Providence, RI, 2009. [paper]
  dv# Cong Yu, Laks Lakshmanan, and Sihem Amer-Yahia. It Takes Variety to Make a World: Diversification in Recommender Systems. In EDBT, Saint-Petersburg, Russia, 2009. [paper] [short paper @ ICDE 2009]
  # Sihem Amer-Yahia, Laks Lakshmanan and Cong Yu. SocialScope: Enabling Information Discovery on Social Content Sites. In CIDR (perspective), Asilomar, CA, 2009. [paper] (selected for Panel discussion) [L'Atelier]
Note:rc=recommendation | mn=mining | dv=diversification

 Scalable Structured Data Extraction and Processing
  rd# Xiaonan Li, Chengkai Li, and Cong Yu. Entity-Relationship Queries over Wikipedia. In ACM TIST, to appear, 2012. [paper] [workshop paper @ SMUC with CIKM 2010] [demo @ CIKM 2010]
  ex# Maria Christoforaki, Ivie Erunse, and Cong Yu. Searching Social Updates for Topic-centric Entities. In VLDS Workshop affiliated with VLDB, Seattle, WA, 2011. [paper]
  ex# Matthew Solomon, Luis Gravano, and Cong Yu. Quality Impact of Value Matching and Scoring in Topk Entity Attribute Extraction. In DBRank Workshop affiliated with VLDB, Seattle, WA, 2011. [paper]
  # Sarah Cohen, Chengkai Li, Jun Yang, and Cong Yu. Computational Journalism: A Call to Arms to Database Researchers. In CIDR, Asilomar, CA, 2011. [paper]
  sp# Arnab Nandi, Cong Yu, Philip Bohannon, and Raghu Ramakrishnan. Distributed Cube Materialization on Holistic Measures. In ICDE, Hannover, Germany, 2011. [paper]
  rd# Anish Das Sarma, Alpa Jain, and Cong Yu. Dynamic Relationship and Event Discovery. In WSDM, Hong Kong, China, 2011. [paper]
  ex# Jian Huang and Cong Yu. Prioritization of Domain-Specific Web Information Extraction. In AAAI, Atlanta, GA, 2010. [paper]
  ex# Matthew Solomon, Cong Yu, and Luis Gravano. Popularity-Guided Top-k Extraction of Entity Attributes. In WebDB Workshop affiliated with SIGMOD, Indianapolis, IN, 2010. [paper]
Note:ex=extraction | rd=relatedness discovery | sp=scalable processing

Selected earlier works:

 Database Usability
  # Cong Yu. Managing Complex Databases in a Schema Management Framework. Ph.D. Dissertation, University of Michigan, 2007. [ACM SIGMOD Dissertation Award Honorable Mention, 2008] [link]
  # Cong Yu and H. V. Jagadish. Querying Complex Structured Databases. In VLDB, Vienna, Austria, 2007. [paper]
  # H. V. Jagadish, Adriane Chapman, Aaron Elkiss, Magesh Jayapandian, Yunyao Li, Arnab Nandi and Cong Yu. Making Database Systems Usable. In SIGMOD, Beijing, China. 2007. [paper]
  # Cong Yu and H. V. Jagadish. Schema Summarization. In VLDB, Seoul, Korea. 2006. [paper]
  # Cong Yu and H. V. Jagadish. Efficient Discovery of XML Data Redundancies. In VLDB, Seoul, Korea. 2006. [paper] [extended version in The VLDB Journal, 2008]
  # Yunyao Li, Cong Yu and H. V Jagadish. Schema-Free XQuery. In VLDB, Toronto, Canada. 2004. [paper] [extended version in The VLDB Journal, 2006]
  # Shurug Al-Khalifa, Cong Yu and H. V. Jagadish. Querying Structured Text in an XML Database. In SIGMOD, San Diego, CA. 2003. [paper]
  # H. V. Jagadish, Shurug Al-Khalifa, Adriane Chapman, Laks V. S. Lakshmanan, Andrew Nierman, Stelios Paparizos, Jignesh M. Patel, Divesh Srivastava, Nuwee Wiwatwattana, Yuqing Wu, Cong Yu. TIMBER: A native XML database. In The VLDB Journal, 2002 [paper]

 Data Integration
  # Xin Dong, Alon Halevy and Cong Yu. Data Integration with Uncertainty. In VLDB, Vienna, Austria, 2007. [paper] [extended version in The VLDB Journal, 2008]
  # Jayant Madhavan, Shawn Jeffery, Shirley Cohen, Xin Dong, David Ko, Cong Yu and Alon Halevy. Web-scale Data Integration: You Can Only Afford to Pay As You Go. In CIDR, Asilomar, CA. 2007. [paper]
  # Cong Yu and Lucian Popa. Semantic Adaptation of Schema Mappings when Schemas Evolve. In VLDB, Trondheim, Norway. 2005. [paper]
  # Cong Yu and Lucian Popa. Constraint-Based XML Query Rewriting for Data Integration. In SIGMOD, Paris, France. 2004. [paper]

 Personal:  [back to top]
  Places: [ Shanghai, A^2, NYC ]
Travel: [ Kayak; OneTravel; Orbitz; ]
Links: [ FreeReport ]
Since January 1, 2001. Copyright © 2001-2012 Cong Yu Nedstat