Carlos Castillo is a Distinguished Research Professor at Universitat Pompeu Fabra in Barcelona, where he leads the Web Science and Social Computing research group. He is a web miner with a background on information retrieval, and has been influential in the areas of crisis informatics, web content quality and credibility, and adversarial web search. He is a prolific, highly cited researcher who has co-authored over 80 publications in top-tier international conferences and journals, receiving a test-of-time award, four best paper awards, and two best student paper awards. His works include a book on Big Crisis Data, as well as monographs on Information and Influence Propagation, and Adversarial Web Search.
Publications to appear in 2021
Conference papers
• Marzieh Karimi-Haghighi, Carlos Castillo: Efficiency and Fairness in Recurring Data-Driven Risk Assessments of Violent Recidivism. SIGAPP Symposium on Applied Computing (SAC), pp. 994-1002, ACM Press.
• David Solans, Christopher Tauchmann, Aideen Farrell, Karolin Kappler, Hans-Hendrik Huber, Carlos Castillo: Learning to Classify Morals and
Conventions: Artificial Intelligence in Terms of the Economics of Convention. To appear in ICWSM 2021. .
Published in 2020 (8)
Journal papers
• Marius Miron, Songül Tolan, Emilia Gomez, Carlos Castillo: Evaluating causes of algorithmic bias in juvenile criminal recidivism. Journal on Artificial Intelligence and Law, Springer.
• Pakhee Kumar, Ferda Ofli, Muhammad Imran, Carlos Castillo. Detection of Disaster-Affected Cultural Heritage Sites from Social Media Images Using Deep Learning Techniques. ACM Journal on Computing and Cultural Heritage (JOCCH)
• Rahul Pandey, Carlos Castillo, and Hemant Purohit. Ranking and Grouping Social Media Requests for Emergency Services Using Serviceability Model. Social Network Analysis and Mining 10 (22), Springer
Conference papers
• David Solans, Battista Biggio, Carlos Castillo: Poisoning Attacks on Algorithmic Fairness. ECML/PKDD 2020. .
• Francesco Fabbri, Francesco Bonchi, Ludovico Boratto, Carlos Castillo. The Effect of Homophily on Disparate Visibility of Minorities in People Recommender Systems. Accepted for publication at ICWSM, AAAI
• Valerio Lorini, Javier Rando, Diego Saez-Trumper, Carlos Castillo: Uneven Coverage of Natural Disasters in Wikipedia: the Case of Floods. ISCRAM, Virginia, USA. .
• Gemma Galdon Clavell, Mariano Martín Zamorano, Carlos Castillo, Oliver Smith and Aleksandar Matic: Auditing Algorithms: On Lessons Learned and the Risks of Data Minimization. In Proceedings of the AIES 2020 conference. ACM Press.
• Michael Mathioudakis, Carlos Castillo, Giorgio Barnabo, Sergio Celis: Affirmative Action Policies for Top-k Candidates Selection, With an Application to the Design of Policies for University Admissions. To appear in the ACM Symposium on Applied Computing (SAC), Brno, Czech Republic, March 2020.
Workshop, short, and demo papers
Valerio Lorini, Carlos Castillo, Domenico Nappo, Francesco Dottori, and Peter Salamon: Social Media Alerts can Improve, but not Replace Hydrological Models for Forecasting Floods. Web Intelligence 2020 (Short Papers).
Dougal Shakespeare, Lorenzo Porcaro, Emilia Gómez, Carlos Castillo: Exploring Artist Gender Bias in Music Recommendation. 2nd Workshop on the Impact of Recommender Systems., 2020.
Corinna Hertweck, Carlos Castillo, Michael Mathioudakis: Towards Data-Driven Affirmative Action Policies under Uncertainty. In Fairness, Accountability, and Transparency in Educational Data Cyberspace (FATED) Workshop.
Panayiotis Smeros, Carlos Castillo, Karl Aberer: SciLens News Platform: A System for Real-Time Evaluation of News Articles. In PVLDB 2020 Demos.
Meike Zehlike, Carlos Castillo: Reducing Disparate Exposure in Ranking: A Learning To Rank Approach. In WWW Short papers, Taipei, Taiwan.
Meike Zehlike, Tom Sühr, Carlos Castillo, Ivan Kitanovski: FairSearch: A Tool For Fairness in Ranked Search Results. To appear in WWW Demos, Taipei, Taiwan.
Proceedings
Mireille Hildebrandt, Carlos Castillo, Elisa Celis, Salvatore Ruggieri, Linnet Taylor, Gabriela Zanfir-Fortuna: Proceedings of the 2020 Conference on Fairness, Accountability, and Transparency (FAT*2020). ACM Press, 2020.
Published in 2019 (6)
Journal papers
Alexandra Olteanu, Carlos Castillo, Fernando Diaz, Emre Kıcıman: "Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries" Frontiers in Big Data, Volume 2, July 2019 [ssrn/previous version].
Conference papers
Ugur Kursuncu, Manas Gaur, Carlos Castillo, Amanuel Alambo, Krishnaprasad Thirunarayan, Valerie Shalin, Dilshod Achilov, I. Budak Arpinar, Amit Sheth. Modeling Islamist Extremist Communications on Social Media using Contextual Dimensions: Religion, Ideology, and Hate. In Proc. of CSCW 2019, Austin, Texas. [arxiv|doi|slides]
Marius Miron, Songül Tolan, Emilia Gomez, Carlos Castillo: Why Machine Learning May Lead to Unfairness: Evidence from Risk Assessment for Juvenile Justice in Catalonia. In Proc. of International Conference on Artificial Intelligence and Law (ICAIL), Montréal, Canada, pp. 83-92. ACM Press. [slides|acm|doi] Best paper award.
Panayiotis Smeros, Carlos Castillo, Karl Aberer: SciLens: Evaluating the Quality of Scientific News Articles Using Social Media and Scientific Literature Indicators. In Proc. of The Web Conference (WWW), pp. 1747-1758. San Francisco, USA, May 2019. [arxiv|data|doi|acm]
Valerio Lorini, Carlos Castillo, Francesco Dottori, Milan Kalas, Domenico Nappo, Peter Salamon: Integrating Social Media into a Pan-European Flood Awareness System: A Multilingual Approach. In ISCRAM. Valencia, Spain. Best CoRe paper award. [arxiv|slides|venturebeat|engadget]
Yara Rizk, Hadi Samer Jomaa, Mariette Awad, Carlos Castillo: A Computationally Efficient Multi-modal Classification Approach of Disaster-related Twitter Images. In Proc. of the Symposium On Applied Computing, Cyprus. ACM.
Short paper
Rahul Pandey, Carlos Castillo, and Hemant Purohit: Modeling Human Annotation Errors to Design Bias-Aware Systems for Social Stream Processing. In ASONAM (short papers). [doi|arxiv]
Workshop and work-in-progress papers
Fedor Vitiugin, Carlos Castillo: Comparison of Social Media in English and Russian During Emergencies. In ISCRAM. Valencia, Spain (WiPe). [slides]
Meike Zehlike, Carlos Castillo, Ivan Kitanovski. FairSearch: A Programming Library for Fair Search Results. At Data Science for Social Good Workshop. San Francisco, USA (Abstract).
Special issue
Yu-Ru Lin, Carlos Castillo, Jie Yin: Introduction to the Special Issue on AI for Disaster Management and Resilience. IEEE Intelligent Systems 34(3): 3-5 (2019).
Published in 2018 (4)
Conference papers
Aris Anagnostopoulos, Carlos Castillo, Adriano Fazzone, Stefano Leonardi, and Evimaria Terzi: Algorithms for Hiring and Outsourcing in the Online Labor Market. In KDD, London, UK, August 2018. ACM Press. [video teaser|data and code|acm]
Alexandra Olteanu, Carlos Castillo, Jeremy Boy and Kush Varshney: The Effect of Extremist Violence on Hateful Speech Online. In ICWSM, Stanford, CA, June 2018 [aaai|arxiv|slides|data and code|forbes].
Hemant Purohit, Carlos Castillo, Muhammad Imran and Rahul Pandey: Social-EOC: Serviceability Model to Rank Social Media Requests for Emergency Operation Centers. In ASONAM, Barcelona, August 2018. [slides|doi]
Hemant Purohit, Carlos Castillo, Muhammad Imran, and Rahul Pandey: Ranking of Social Media Alerts with Workload Bounds in Emergency Operation Centers. To appear in Web Intelligence, Santiago, Chile, December 2018. ACM/IEEE. [ieee|doi]
Research Report
Carlos Castillo: La oferta y disponibilidad de contenido audiovisual en la era de los datos masivos. Informe comisionado por el Consejo Audiovisual de Cataluña (CAC). Publicado en diciembre 2018. [versión en catalán|versión en castellano]
Workshop Paper, Short Paper, or Poster
Songül Tolan, Carlos Castillo, Marius Miron, Emilia Gómez: Expert assessment vs. machine learning algorithms: juvenile criminal recidivism in Catalonia. Presentation at the Algorithms and Society Workshop, Brussels, Belgium, December 2018. [slides]
Sofiane Abbar, Carlos Castillo and Antonio Sanfilippo: To Post or Not to Post: Using Online Trends to Predict Popularity of Offline Content. Short paper at Hypertext 2018. [acm|doi]
Oana Balalau, Carlos Castillo, Mauro Sozio: EviDense: a Graph-based Method for Finding Unique High-impact Events with Succinct Keyword-based Descriptions. Poster at ICWSM 2018. [data and code|version in arxiv]
Michael Mathioudakis, Carlos Castillo: Using STAN to Explore Fairness in University Admission Policies. Poster at STAN Conference 2018.
Keynote and Tutorial
Carlos Castillo: Fairness and Transparency in Ranking (Keynote at Data and Bias workshop at KDD). In SIGIR Forum, Vol. 52. No. 2, December 2018, pages 64-71. [slides|sigir forum|acm]
Alexandra Olteanu, Emre Kıcıman, Carlos Castillo, Fernando Diaz: A Critical Review of Online Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries. Tutorial at WSDM 2018, WWW 2018, SDM 2018. [doi]
Invited talks
Carlos Castillo: Big Crisis Data / Crisis Informatics. At: Russian Summer School on Information Retrieval (RuSSIR), Kazan, Russia, August 2018.
Carlos Castillo: "A Brief Overview of Sources and Manifestations of Bias When Working with Social Data." Summary of the tutorial coordinated by Alexandra Olteanu, for the Russian Summer School on Information Retrieval (RuSSIR). Kazan, Russia, August 2018.
Carlos Castillo: "Algorithmic Discrimination." Talk at BCN Analytics Data and Ethics event, Barcelona, April 2018. [youtube]
Workshop Proceedings
Yu-Ru Lin, Carlos Castillo, Jie Yin: The 5th International Workshop on Social Web for Disaster Management (SWDM'18): Collective Sensing, Trust, and Resilience in Global Crises. In Proc. WSDM 2018 [acm]
Published in 2017 (3)
Conference papers
Meike Zehlike, Francesco Bonchi, Carlos Castillo, Sara Hajian, Mohamed Megahed and Ricardo Baeza-Yates: FA*IR: A Fair Top-k Ranking Algorithm. In Proc. of CIKM. Singapore, 2017. ACM Press. [acm|arxiv|Python Library|Java Library|ElasticSearch Plug-in]
Julia Proskurnia, Ruslan Mavlyutov, Carlos Castillo, Karl Aberer and Philippe Cudre-Mauroux: Efficient Document Filtering Using Vector Space Topic Expansion and Pattern-Mining: The Case of Event Detection in Microposts. In Proc. of CIKM. Singapore, 2017. ACM Press [acm]
Julia Proskurnia, Przemyslaw A. Grabowicz, Ryota Kobayashi, Carlos Castillo, Philippe Cudré-Mauroux, and Karl Aberer: Predicting the Success of Online Petitions Leveraging Multidimensional Time-Series. In Proc. of WWW, pp. 755-764 . Perth, Australia, 2017 [bib|acm].
Tutorial
Sara Hajian and Carlos Castillo: Discovering and Mitigating Algorithmic Discrimination. Tutorial at International Conference on Computational Social Science (IC2S2). Cologne, Germany. July 2017.
Alexandra Olteanu, Emre Kıcıman, Carlos Castillo, Fernando Diaz: A Critical Review of Online Social Data: Biases, Methodological Pitfalls, and Ethical Boundaries. Tutorial at ICWSM 2016, KDD 2017.
Short paper and abstract
Michele Gentili, Sara Hajian and Carlos Castillo: A case study of anonymization of medical surveys. Short paper in Proceedings of Digital Health, pp. 77-81. London, UK, 2017. ACM [acm].
Dottori, F., Kalas M., Lorini V., Wania A., Pappenberger F., Salamon, P., Ramos M. H., Cloke, H. L., Castillo, C.: Satellites, tweets, forecasts: the future of flood disaster management?. European Geosciences Union General Assembly 2017. [EGU]
Technical report
Carlos Castillo, Francesco Fabbri, and Diego Saez-Trumper: Current Practices of Online Community Managers: A Report from Six Interviews. Technical Report, Eurecat, January 2017. [bibtex]
Invited talks
Carlos Castillo: From Discrimination Discovery to Fairness-Aware Data Mining. Invited talk at 3rd annual workshop of the Center for Semantic Web Research. Santiago, Chile. January 2017.
Carlos Castillo: Detecting Algorithmic Discrimination. Invited talk at EPFL. Lausanne, Switzerland. July 2017.
Published in 2016 (4)
Book
Carlos Castillo: Big Crisis Data: Social Media in Disasters and Time-Critical Situations. Cambridge University Press, July 2016. [Amazon, Cambridge|doi]
Journal articles
Janette Lehmann, Carlos Castillo, Mounia Lalmas, Ricardo Baeza-Yates: Story-focused Reading in Online News and its Potential for User Engagement. Journal of the Association for Information Science and Technology (JASIST), Vol. 68 No. 4. [periodismo.com (news in Spanish)]
Ferda Ofli, Patrick Meier, Muhammad Imran, Carlos Castillo, Devis Tuia, Nicolas Rey, Julien Briant, Pauline Millet, Friedrich Reinhard, Matthew Parkan, Stéphane Joost: Combining Human Computing and Machine Learning to Make Sense of Big (Aerial) Data for Disaster Response. Big Data, March 2016. [liebert]
Conference paper
Muhammad Imran, Prasenjit Mitra, Carlos Castillo: Twitter as a Lifeline: Human-annotated Twitter Corpora for NLP of Crisis-related Messages. In Proc. of LREC 2016, pp. 1638-1643. May 2016, Portorož, Slovenia. [datasets|arxiv|lrec].
Short papers
Muhammad Imran, Sanjay Chawla, Carlos Castillo: A Robust Framework for Classifying Evolving Document Streams in an Expert-Machine-Crowd Setting. Short paper in Proc. of ICDM 2016. Dec 2016, Barcelona, Catalunya-Spain. [arxiv].
Muhammad Imran, Patrick Meier, Carlos Castillo, Andre Lesa and Manuel Garcia Herranz: Enabling Digital Health by Automatic Classification of Short Messages. Short paper in Proc. of ACM Digital Health 2016. [acm|new scientist]
Tutorials
Sara Hajian, Francesco Bonchi, Carlos Castillo: Algorithmic bias: from discrimination discovery to fairness-aware data mining. Tutorial at KDD 2016. [acm]. Slides: Parts I and II: discrimination discovery, Parts III and IV: fairness-aware data mining.
Video: Part I: Introduction and Context, Part II: Discrimination Discovery, Part III: Fairness-Aware Data Mining and Part IV: Challenges and Directions for Future Research.
Workshop and Special Issue
Carlos Castillo, Fernando Diaz, Yu-Ru Lin, and Jie Yin: The Fourth International Workshop on Social Web for Disaster Management (SWDM 2016). Co-located with CIKM in Indianapolis, US. [acm]
Symeon Papadopoulos, Kalina Bontcheva, Eva Jaho, Mihai Lupu, and Carlos Castillo: Overview of the Special Issue on Trust and Veracity of Information in Social Media. ACM Transactions on Information Systems (TOIS) 34 (3), 14. 2016 [doi]
Carlos Castillo: Detecting algorithmic discrimination. Keynote at Dutch-Belgian Information Retrieval Workshop (DIR). November 2016.
Published in 2015 (4)
Journal article
Muhammad Imran, Carlos Castillo, Fernando Diaz, Sarah Vieweg: Processing Social Media Messages in Mass Emergency: A Survey. In ACM Computing Surveys, Volume 47, Issue 4, June 2015. [acm|arxiv pre-print|bib]
Conference articles
Alexandra Olteanu, Carlos Castillo, Nicholas Diakopoulos, Karl Aberer: Comparing Events Coverage in Online News and Social Media: The Case of Climate Change. In Proceedings of ICWSM 2015, pp. 288-297. May 26-29 2015 in Oxford, England. [washington post|dataset|aaai|bib|slides]
Alexandra Olteanu, Sarah Vieweg and Carlos Castillo: What to Expect When the Unexpected Happens: Social Media Communications Across Crises. In CSCW 2015, 14-18 March in Vancouver, Canada. ACM Press. [datasets|acm|slides|TG24|emergency mgmt|la stampa|irevolution]
Anubhav Jain, Julius Adebayo, Eduardo de Leon, Weihua Li, Lalana Kagal, Patrick Meier, Carlos Castillo: Mobile Application Development for Crisis Data. Procedia Engineering, Volume 107, Pages 255-262 (HumTech 2015, 12-14 May in Boston, USA). [springer].
Workshop/Symposium
Irina Temnikova, Carlos Castillo, Sarah Vieweg: The Case for Readability of Crisis Communications in Social Media. SWDM 2015, 18-22 May in Florence, Italy. [acm|dataset]
Muhammad Imran, Carlos Castillo: Towards a Data-driven Approach to Identify Crisis-Related Topics in Social Media Streams. SWDM 2015, 18-22 May in Florence, Italy.
Irina Temnikova, Carlos Castillo, Sarah Vieweg: EMTerms 1.0: A Terminological Resource for Crisis Tweets. ISCRAM 2015, 24-27 May in Kristiansand, Norway. [data]
Talks
Carlos Castillo: "Big Crisis Data, an Open Invitation." Keynote at WebMedia 2015, Manaus, Brazil. [slides]
Carlos Castillo: "Social Media Mining and Retrieval". Tutorial at ESSIR 2015, Thessaloniki, Greece. [slides]
Daniela Iosub, David Laniado, Carlos Castillo, Mayo Fuster Morell and Andreas Kaltenbrunner: "Networked Emotions and Communication Styles in Online Collaboration". Plenary talk at IC2S2, 8-11 June in Helsinki, Finland. [video]
Carlos Castillo, Gianmarco De Francisci Morales, Marcelo Mendoza and Nasir Khan: "Automatic Analysis of Television News: Media, People, Framing and Bias". Parallel session talk accepted at IC2S2, 8-11 June in Helsinki, Finland.
Other
Aris Anagnostopoulos, Ioannis Chatzigiannakis, Carlos Castillo: Algorithmic Methods of Data Mining. Teaching Materials, Sapienza University of Rome, 2015.
Muhammad Imran, Ioanna Lykourentzou, Yannick Naudet and Carlos Castillo: Engineering Crowdsourced Stream Processing Systems. Technical report. [arxiv]
Aditi Gupta, Carlos Castillo, Ponnurangam Kumaraguru: "TweetCredCrisis: Real-time Assessment of Quality of Content Posted on Twitter during Crisis Events". Poster at the CERC-IIITD Security and Privacy Symposium 2015.
Published in 2014 (8)
Journal articles
Daniela Iosub, David Laniado, Carlos Castillo, Mayo Fuster Morell, Andreas Kaltenbrunner: Emotions under Discussion: Gender, Status and Communication in Online Collaboration. In PLOS ONE. [plos]
Hemant Purohit, Amit Sheth, Carlos Castillo, Patrick Meier, Fernando Diaz: Emergency-Relief Coordination on Social Media: Automatically Matching Resource Requests and Offers. First Monday 19 (1), January 2014. [bib]
Sihem Amer-Yahia, Francesco Bonchi, Carlos Castillo, Esteban Feuerstein, Isabel Mendez-Diaz and Paula Zabala: Composite Retrieval of Diverse and Complementary Bundles. IEEE TKDE, Feb. 2014. [tkde final version|oo pre-print]
Conference articles
Sarah Vieweg, Carlos Castillo and Muhammad Imran: Integrating Social Media Communications into the Rapid Assessment of Sudden Onset Disasters. SocInfo 2014. [bib]
Aditi Gupta, Ponnurangam Kumaraguru, Carlos Castillo and Patrick Meier: TweetCred: A Real-time Web-based System for Assessing Credibility of Content on Twitter. In SocInfo 2014. Runner-up for best paper award [arxiv pre-print| daily dot|wp blogs|ktnv|new yorker|wwwhatsnew|techpresident|slides|demo]
Alexandra Olteanu, Carlos Castillo, Fernando Diaz and Sarah Vieweg: CrisisLex: A Lexicon for Collecting and Filtering Microblogged Communications in Crises. In ICWSM. Ann Arbor, MI, USA. June 2014. [bib|aaai||datasets]
Carlos Castillo, Mohammed El-Haddad, Jürgen Pfeffer and Matt Stempeck: Characterizing the Life Cycle of Online News Stories Using Social Media Reactions. In CSCW. Baltimore, USA. February 2014. [slides|arxiv pre-print|review by s.v.|doha news|wan-ifra|acm|demo]
Sofiane Abbar, Habibur Rahman, Saravanan Thirumuruganathan, Carlos Castillo and Gautam Das: Ranking Item Features by Mining Online User-Item Interactions. In ICDE. Chicago, USA. March 2014.
Conference proceedings volume
Ben Carterette, Fernando Diaz, Carlos Castillo, Donald Metzler (Eds.): Proceedings of the Seventh ACM International Conference on Web Search and Data Mining, WSDM 2014. New York, USA. ACM, Feb. 24-28, 2014.
Demos
Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier and Sarah Vieweg: AIDR: Artificial Intelligence for Disaster Response. In WWW 2014 demo [aidr.qcri.org|acm]. See also: talk at CrisisMappers conference (video).
Kiran Garimella and Carlos Castillo: FAST: Forecast and Analytics of Social Media and Traffic. In CSCW 2014 (demos). [fast.qcri.org|acm]
Symposium and workshop articles
Yelena Mejova, Amy X. Zhang. Nicholas Diakopoulos, Carlos Castillo: Controversy and Sentiments in Online News. Poster in Symposium on Computational Journalism. [arxiv]
Muhammad Imran, Carlos Castillo, Ji Lucas, Patrick Meier, Jakob Rogstadius: Coordinating Human and Machine Intelligence to Classify Microblog Communications in Crises. In ISCRAM 2014.
Muhammad Imran and Carlos Castillo: Volunteer-powered Automatic Classification of Social Media Messages for Public Health in AIDR. Public Health in the Digital Age workshop in WWW 2014.
Tutorial and talk
Carlos Castillo, Fernando Diaz, and Hemant Purohit: Leveraging Social Media and Web of Data to Assist Crisis Response Coordination. Tutorial at SDM, Philadelphia, PA, USA. April 2014.
Carlos Castillo: Crisis Computing: Finding Relevant and Credible Information in Social Media During Disasters. Keynote at Big Data Analytics. Delhi, India, December 2014. [slides]
Other
Carlos Castillo: Predicting the Future with Big Data. In Al Jazeera English / Opinion, series on Big Data. 1 March 2014.
Carlos Castillo: How Tweets and Algorithms Can Save Lives. In Al Jazeera English / Opinion, 5 December 2014.
Sarah Vieweg and Carlos Castillo: Combining Human and Machine Intelligence for Processing of Twitter Data During Mass Emergencies. STCSN e-letter vol. 2 no. 1.
Sandra Gonzalez-Bailon, Gianmarco De Francisci Morales, Marcelo Mendoza, Nasir Khan and Carlos Castillo: "Cable News Coverage and Online News Stories: A Large-Scale Comparison of Media Bias". Technical Report, 2014. [ssrn preprint]
Published in 2013 (7)
Monograph
Wei Chen, Laks V.S. Lakshmanan, and Carlos Castillo: Information and Influence Propagation in Social Networks. Synthesis Lectures on Data Management. Morgan and Claypool Publishers, October 2013. [doi|amazon].
Chapter 1: Introduction available for free.
Chapter 6: Data and Software pre-review version available.
Journal article
Carlos Castillo, Marcelo Mendoza, Barbara Poblete: Predicting Information Credibility in Time-Sensitive Social Media (+Supplementary Material). In Internet Research, Vol. 23, Issue 5, Special issue on The Predictive Power of Social Media, pp. 560-588. October 2013. [intr|irevolution|marketplace|slate magazine|huffington post|the verge|gizmodo|slashdot|sciam podcast] [Spanish: 24h (Chile)|ABC & El Pais (Spain)|Vanguardia (Mexico)]
Dino Ienco, Francesco Bonchi, Carlos Castillo: "Meme Ranking to Maximize Posts Virality in Microblogging Platforms". Journal of Intelligent Information Systems. April 2013. [springer]
Ilija Subasic, Carlos Castillo: "Investigating query bursts in a web search engine". Web Intelligence and Agent Systems, Volume 11, pp. 107-124. IOS Press. [ios]
Conference articles
Janette Lehmann, Carlos Castillo, Mounia Lalmas and Ethan Zuckerman: Transient News Crowds in Social Media. In ICWSM 2013. [mirror|bib|blogpost]
Eduardo Ruiz, Vagelis Hristidis, Carlos Castillo and Aristides Gionis: Measuring and Summarizing Movement in Microblog Postings. In ICWSM 2013.
Lilian Weng, Jacob Ratkiewicz, Nicola Perra, Bruno Gonçalves, Carlos Castillo, Francesco Bonchi, Rossano Schifanella, Filippo Menczer, Alessandro Flammini: The Role of Information Diffusion in the Evolution of Social Networks. In KDD 2013. [arxiv pre-print|acm|slides|VIDEO]
Conference article (short paper)
Diego Sáez-Trumper, Carlos Castillo and Mounia Lalmas: Social Media News Communities: Gatekeeping, Coverage, and Statement Bias (+ supplementary material). In CIKM 2013 (short paper) [acm|slides|mirror|bib|DATASET]
Tutorial
Hemant Purohit, Carlos Castillo, Patrick Meier and Amit Sheth: Crisis Mapping, Citizen Sensing and Social Media Analytics. Tutorial at ICWSM, May 2013.
Invited talks
Carlos Castillo: Social Media News Mining and Automatic Content Analysis of News. Invited talk at Tow Center, Columbia University. New York City, USA, 2013. [VIDEO|blogpost|invitation]
Carlos Castillo: News and Social Media. Keynote at the Social News on the Web (SNOW) workshop. Rio de Janeiro, Brazil, 2013. [acm|slides]
Workshop/symposium articles
Carlos Castillo, Gianmarco De Francisci Morales, Marcelo Mendoza, Nasir Khan: Says Who? Automatic Text-based Content Analysis of Television News. Workshop on Mining Unstructured Data Using NLP (UnstructureNLP) , co-located with CIKM. San Francisco, CA, 2013. [arxiv|acm]
Janette Lehmann, Carlos Castillo, Mounia Lalmas and Ethan Zuckerman: Finding News Curators in Twitter. Social News on the Web (SNOW) workshop. Rio de Janeiro, Brazil, 2013. [mirror|blogpost|slides|bib|acm]
Abdulfatai Popoola, Dmytro Krasnoshtan, Attila Toth, Victor Naroditskiy, Carlos Castillo, Patrick Meier and Iyad Rahwan: Information Verification during Natural Disasters. Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013. [slides|acm|veri.ly|new scientist (free reg) (local copy)|mit technology review|heise online|foreign policy|the national (uae)]
Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: Practical Extraction of Disaster-Relevant Information from Social Media. Social Web and Disaster Management (SWDM) workshop. Rio de Janeiro, Brazil, 2013. [acm]
Muhammad Imran, Shady Elbassuoni, Carlos Castillo, Fernando Diaz and Patrick Meier: Extracting Information Nuggets from Disaster-Related Messages in Social Media. In ISCRAM. Baden-Baden, Germany, 2013. Best paper award (see "Practical Extraction ..." for a follow-up to this work). [slides].
Soudip Roy Chowdhury, Muhammad Imran, Rizwan Asghar, Sihem Amer-Yahia, and Carlos Castillo: "Tweet4act: Using Incident-Specific Profiles for Classifying Crisis-Related Messages". In ISCRAM. Baden-Baden, Germany, 2013.
Fuming Shih, Oshani Seneviratne, Daniela Miao, Ilaria Liccardi, Lalana Kagal, Evan Patton, Patrick Meier, Carlos Castillo: Democratizing Mobile App Development for Disaster Management. To be presented at the IJCAI Workshop on Semantic Cities. Beijing, China, 2013. [mit news|wired uk|homeland security news wire]
Posters/Demos
Carlos Castillo, Gianmarco De Francisci Morales and Ajay Shekhawat: "Online Matching of Web Content to Closed Captions in IntoNow". SIGIR Demos, 2013. [acm]
Sihem Amer-Yahia, Francesco Bonchi, Carlos Castillo, Esteban Feuerstein, Isabel Mendez-Diaz and Paula Zabala: "Complexity and Algorithms for Composite Retrieval". WWW posters, 2013 [see also extended version|acm].
Published in 2012 (2)
Conference articles
Aris Anagnostopoulos, Carlos Castillo, Aristides Gionis, Luca Becchetti, Stefano Leonardi: Online Team Formation in Social Networks. In Proc. of WWW, pp. 839-848. Lyon, France, 2012. ACM Press. [acm|bib|www]
Eduardo Ruiz, Vagelis Hristidis, Carlos Castillo, Aristides Gionis and Alejandro Jaimes: Correlating Financial Time Series with Micro-Blogging Data. In WSDM, Seattle, Washington. pp. 513-522, ACM Press. 2012. [bib|slides|y!|acm]
Symposium articles
David Laniado, Andreas Kaltenbrunner, Carlos Castillo, Mayo Fuster-Morell: Emotions and dialogue in a peer-production community: the case of Wikipedia. In WikiSym 2012. [slides|acm]
Robert West, Ingmar Weber, Carlos Castillo: Drawing a Data-Driven Portrait of Wikipedia Editors. In WikiSym 2012. [acm|slides]
Tutorials and invited talks
Carlos Castillo, Wei Chen, Laks V. S. Lakshmanan: Information and Influence Spread in Social Networks, KDD 2012 Tutorial. [slides: introduction, data and software, influence maximization, other issues]
Carlos Castillo: Mining Search Behavior and User-Generated Content. In EDBT 2012, Industrial track. [acm]
Poster
Robert West, Ingmar Weber, Carlos Castillo: A Data-Driven Sketch of Wikipedia Editors. WWW Posters, 2012 [photo|acm].
Published in 2011 (6)
Monograph
Carlos Castillo and Brian D. Davison: Adversarial Web Search. In Foundations and Trends in Information Retrieval, Vol. 4, No 5, pp 377-486. Now Publishers. 2011. [now|amazon|amazon uk|bib]
Journal articles
Paolo Boldi, Francesco Bonchi, Carlos Castillo, and Sebastiano Vigna: "Viscous Democracy for Social Networks". In Communications of ACM, No 6, June 2011 [slides|acm|y|bib|cacm].
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Sebastiano Vigna: "Query Reformulation Mining: Models, Patterns and Applications". In Information Retrieval, Springer. 2010. [springer|bib]
Francesco Bonchi, Carlos Castillo, Aristides Gionis and Alejandro Jaimes: "Social Network Analysis and Mining for Business Applications". ACM Transactions on Intelligent Systems and Technology (TIST), Vol. 2 Issue 3, April 2011. [acm|y|bib]
Conference articles
Michael Mathioudakis, Francesco Bonchi, Carlos Castillo, Aristides Gionis, Antti Ukkonen: "Sparsification of Influence Networks". In proceedings of KDD, pp. 529-537. San Diego, CA, USA. 2011. [acm|y!|slides]
Carlos Castillo, Marcelo Mendoza, Barbara Poblete: "Information Credibility on Twitter". In Proceedings of WWW conference, pp. 675-684. Hyderabad, India. 2011. [bib|slides (complete, prezi)|slides (partial, pdf)|acm|ars technica|wsj]. The labels used are available on request: [request by mail]
Workshop report
Carlos Castillo, Zoltán Gyöngyi, Adam Jatowt, Katsumi Tanaka: Joint WICOW/AIRWeb workshop on web quality (WebQuality 2011). WWW (Companion Volume), pp. 313-314, 2011. [acm]
Published in 2010 (8)
Book chapter
Carlos Castillo, Ricardo Baeza-Yates, Berthier Ribeiro-Neto: "Web Crawling". Chapter 12 in Ricardo Baeza-Yates and Berthier Ribeiro-Neto, "Modern Information Retrieval, Second Edition". 2010.
Journal articles
Jacob Abernethy, Olivier Chapelle, Carlos Castillo: "Graph Regularization Methods for Web Spam Detection". In Machine Learning Journal, vol. 81, no. 2, pp. 207-225. Springer. Was: "WITCH: A New Approach to Web Spam Detection", Yahoo! Technical Report 2008-01. [VIDEO|bib|springer]
Luca Becchetti, Paolo Boldi, Carlos Castillo, Aristides Gionis: "Efficient Algorithms for Large-Scale Local Triangle Counting". ACM Transactions on Knowledge Discovery from Data, Volume 4, Issue 3. ACM Press. [acm|bib|software]
Conference articles
Aris Anagnostopoulos, Carlos Castillo, Aristides Gionis, Luca Becchetti, Stefano Leonardi: "Power in Unity: Forming Teams in Large-Scale Community Systems". Proc. of CIKM 2010, pp. 599-608.Toronto, Canada. ACM Press. [bib|acm|slides]
Ilija Subasic, Carlos Castillo: "The Effects of Query Bursts on Web Search Results". In Proc. of ACM/IEEE Web Intelligence 2010, pp. 374-381. Toronto, Canada. Best student paper award [ieee|bib|y!] (For an extended version, see: "Investigating query bursts ..." in 2013)
Ingmar Weber, Carlos Castillo: "The Demographics of Web Search". In SIGIR, pp. 523-530. Geneva, Switzerland, 2010. ACM Press. [errata|slides|bib|y!|acm|new scientist|slashdot|the economist]
Ilaria Bordino, Carlos Castillo, Debora Donato and Aristides Gionis: "Query Similarity by Projections on the Query-Flow Graph". In SIGIR, pp. 515-522. Geneva, Switzerland, 2010. ACM Press. [bib|acm|slides]
Aristides Anagnostopoulos, Luca Becchetti, Carlos Castillo and Aristides Gionis: "An Optimization Framework for Query Recommendation". In proceedings of Web Search and Data Mining (WSDM), pp. 161-170, New York, USA. 2010. [acm|slides|bib|talk blogpost]
Workshop articles and talks
Dino Ienco, Francesco Bonchi, Carlos Castillo: "The Meme Ranking Problem: Maximizing Microblogging Virality". In SIASP workshop. Sydney, Australia. [ieee|bib]
Marcelo Mendoza, Barbara Poblete, Carlos Castillo: "Twitter Under Crisis: Can we trust what we RT?". In SOMA 2010: KDD Workshop on Social Media Analytics, Washington, DC. July 2010. [acm|bib|soma|VIDEO|wall street journal|scientific american]
Ranieri Baraglia, Carlos Castillo, Debora Donato, Franco Maria Nardini, Raffaele Perego and Fabrizio Silvestri: "The Effects of Time on Query Flow Graph-based Models for Query Suggestion". In proceedings of RIAO. Paris, France, 2010. [slides]
Carlos Castillo, Aristides Gionis, Ronny Lempel, Yoelle Maarek: "When no clicks are good news". Industry track, SIGIR 2010. Geneva, Switzerland. [slides|video (teaser)]
Encyclopedia Entry
Carlos Castillo and Ricardo Baeza-Yates: "Web Retrieval and Mining". In Encyclopedia of Library and Information Sciences, Third Edition. Taylor & Francis, pp.5615-5622, 2010. [bib|request by mail]
Course Materials (in Spanish)
Mari Carmen Marcos. Entrevista a Carlos Castillo [on line]. "Hipertext.net", núm. 8, 2010.
Published in 2009 (5)
Journal articles
Francesco Bonchi, Carlos Castillo, Debora Donato and Aristides Gionis: "Taxonomy-driven lumping for sequence mining". Data Mining and Knowledge Discovery Journal, vol. 19, no. 2, pp. 227-244, 2009. Springer. [TAXOMO|VIDEO|bib|springer|slides|abstract]
Conference articles
Paolo Boldi, Francesco Bonchi, Carlos Castillo and Sebastiano Vigna: "From 'dango' to 'japanese cakes': Query Reformulation Models and Patterns".In IEEE/ACM Web Intelligence, 2009. IEEE Cs Press. Best paper award. [slides|bib|ieee|y!]
Ricardo Baeza-Yates, Christian Middleton, Carlos Castillo: "The Geographical Life of Search". IEEE/ACM Web Intelligence, 2009. IEEE Cs Press. [bib|ieee|slides]
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Sebastiano Vigna: "Voting in social networks". In CIKM 2009, pp. 777-786. ACM Press. Was TR RI 327-09 Università degli Studi di Milano. [slides|bib|acm]
Michalis Potamias, Francesco Bonchi, Carlos Castillo, Aristides Gionis: "Fast shortest path distance estimation in large networks". In CIKM 2009, pp. 867-876. ACM Press. Best student paper award [bib|slides|y!|acm]
Conference article (short paper)
Ranieri Baraglia, Carlos Castillo, Debora Donato, Franco Maria Nardini, Raffaele Perego, Fabrizio Silvestri: "Aging effects on Query Flow Graph for Query Suggestion" (short paper). In CIKM 2009, pp. 1947-1950. ACM Press. [bib|poster|acm]
Workshop articles
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Sebastiano Vigna: "Query Suggestions Using Query-Flow Graphs". Workshop on Web Search Click Data (WSCD), pp. 56-63, 2009. [acm|slides|bib]
Marcin Sydow, Francesco Bonchi, Carlos Castillo, Debora Donato: "Optimising Topical Query Decomposition". Workshop on Web Search Click Data (WSCD), pp. 43-47, 2009. [acm|slides|bib]
Talks
Video: Minería de logs de consulta (in Spanish). Universidad de Oviedo, 2009-05-27
Video: 'Análisis de enlaces y detección de spam en la Web (in Spanish). Universidad de Oviedo, 2009-05-28. Press Coverage @ La Nueva España
Query-log Mining. Universidade Federal de Minas Gerais, 2009-03-19
Published in 2008 (8)
Journal Articles
Patrizia Andronico, Marina Buzzi, Carlos Castillo and Barbara Leporini: "Evaluating a Modified Google User Interface Via Screen Reader". Journal of Universal Access in the Information Society, Vol. 7, No, 3, pp. 155-177. 2008. Springer. [bib|springer] (Extends "Testing Google interfaces modified for the blind" poster in WWW2006)
Luca Becchetti, Carlos Castillo, Debora Donato, Ricardo Baeza-Yates, Stefano Leonardi: "Link Analysis for Web Spam Detection". ACM Transactions on the Web, Vol. 2, No. 1, Art. 2, 2008. ACM Press. [bib|acm] (extends "Link-based characterization ..." in AIRWeb'06 and "Using rank propagation..." in WebKDD'06).
Josiane Xavier-Parreira, Carlos Castillo, Debora Donato, Sebastian Michel, Gerhard Weikum: "The JXP Method for Robust PageRank Approximation in a Peer-to-Peer Web Search Network". The VLDB Journal, Vol. 17, No. 2, pp. 291-313, 2008. (extends Parreira et al.'s JXP algorithm in VLDB'06 and "Computing trusted authority..." in AIRWeb'06). Springer. [bib|springer]
Conference Articles
Barbara Poblete, Aristides Gionis, Carlos Castillo: "Dr. Searcher and Mr. Browser: a unified hyperlink-click graph". Proceedings of CIKM, pp. 1123-1132.Napa Valley, CA, USA, October 2008. ACM Press. [slides||acm]
Paolo Boldi, Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis, Sebastiano Vigna: "The query-flow graph: model and applications". Proceedings of CIKM, pp. 609-618. Napa Valley, CA, USA, October 2008. ACM Press. [slides|bib|acm]
Francesco Bonchi, Carlos Castillo, Debora Donato, Aristides Gionis: "Topical query decomposition". In Proceedings of ACM KDD, pp. 52-60. Las Vegas, USA, August 2008. [bib|slides|acm]
Luca Becchetti, Paolo Boldi, Carlos Castillo, Aristides Gionis: "Efficient Semi-Streaming Algorithms for Local Triangle Counting in Massive Graphs". In Proceedings of ACM KDD, pp. 16-24. Las Vegas, USA, August 2008. ACM Press. [bib|slides|acm|software] Was tech. report RI 316-07, Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano.
Eugene Agichtein, Carlos Castillo, Debora Donato, Aristides Gionis, Gilad Mishne : "Finding high quality content in social media, with an application to community-based question answering". Proceedings of Web Search and Data Mining (WSDM), pp. 183-194. Stanford, California, USA, 2008. ACM Press. [bib|mirror|y!|slides|slides (local copy)|acm|VIDEO]. Was technical Report YR-2007-005, Yahoo! Research. Winner, Test of Time Award at WSDM 2018
Workshop articles
Carlos Castillo, Claudio Corsi, Debora Donato, Paolo Ferragina, Aristides Gionis: "Query log mining for detecting polysemy and spam". In Proc. of WebKDD, Las Vegas, USA, 2008. Springer. [bib]
Carlos Castillo, Claudio Corsi, Debora Donato, Paolo Ferragina and Aristides Gionis: "Query-log mining for detecting spam". Proceedings of AIRWeb 2008, pp. 17-20. Beijing, China. ACM Press. [bib|acm]
Jacob Abernethy, Olivier Chapelle and Carlos Castillo: "Webspam Identification Through Content and Hyperlinks". Proceedings of AIRWeb 2008, pp. 41-44. Beijing, China. [bib|acm]
Workshop/Project Report
Carlos Castillo, Kumar Chellapilla, Brian Davison, "AIRWeb'07 Workshop report". SIGIR Forum, June 2008, pp. 68-72. [bib|acm|sigirf]
Carlos Castillo, Kumar Chellapilla, Dennis Fetterly, "Fourth international workshop on Adversarial Information Retrieval on the Web (AIRWeb 2008)". In WWW Workshops, April 2008. [bib|acm]
Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi and Ricardo Baeza-Yates: "Web spam detection: Link-based and content-based techniques". In Friedhelm Meyer (Ed.), The European Integrated Project Dynamically Evolving, Large Scale Information Systems (DELIS): proceedings of the final workshop, pp. 99-113. Heinz-Nixdorf Institut, Universität Paderborn. [bib]
Poster
Antti Ukkonen, Carlos Castillo, Debora Donato, Aristides Gionis: "Searching the Wikipedia with contextual information". Proceedings of CIKM, pp. 1351-1352. Napa Valley, CA, USA, October 2008. ACM Press. [bib|acm]
Book Chapter
Marcin Sydow, Jakub Piskorski, Dawid Weiss, Carlos Castillo: "Fighting Web Spam". In F. Fogelman-Soulié et al. (eds.): Mining Massive Data Sets for Security, Vol. 19 of NATO SPSS Series D., pp. 134-153. IOS Press, 2008. [VIDEO|bib|request by mail]
Invited Column
Carlos Castillo, Yiyu Yao: "EvalWare: Granular Computing for Web Applications". IEEE Signal Processing Magazine, Vol. 25, No. 2, pp. 142-143, March 2008. [ieee|bib]
Published in 2007 (6)
Journal Articles
Ricardo Baeza-Yates, Carlos Castillo and Efthimis N. Efthimiadis:"Characterization of National Web Domains". ACM Transactions on Internet Technology, Vol. 7, No. 2, Art. 9. May 2007. ACM Press. [bib|acm]
Ricardo Baeza-Yates and Carlos Castillo: "Crawling the Infinite Web". Journal of Web Engineering, Vol. 6, No. 1, pp. 49-72. February 2007. Rinton Press (Extends our paper in WAW'04) [bib|rinton]
Gabriel Tolosa, Fernando Bordignon, Ricardo Baeza-Yates, Carlos Castillo: "Characterization of the Argentinian Web". Cybermetrics, Vol. 11, No. 1, P. 7. July 2007. [bib]
Conference Articles
Carlos Castillo, Debora Donato, Aristides Gionis, Vanessa Murdock, Fabrizio Silvestri: "Know your Neighbors: Web Spam Detection using the Web Topology". In Proceedings of SIGIR, pp. 423-430. Amsterdam, Netherlands, 2007. ACM Press. [acm|y!|bib|delis|talk in spanish at ojobuscador] Was DELIS technical report DELIS-TR-0458.
Carlos Castillo, Debora Donato, Aristides Gionis: "Estimating the Number of Citations using Author Reputation". String Processing and Information Retrieval Symposium (SPIRE), pp. 107-117. Santiago, Chile, 2007. Springer. [y!|bib|springer]
Gabriel H. Tolosa, Fernando R. A. Bordignon, Ricardo Baeza-Yates, Carlos Castillo: "Distinctive Features of the Argentinian Web". In Proc. of LA-WEB. Santiago, Chile, 2007. IEEE CS Press.
Workshop Articles
Josiane-Xavier Parreira, Debora Donato, Carlos Castillo, Gerhard Weikum: "Computing Trusted Authority Scores in Peer-to-Peer Networks". Workshop on Adversarial Information Retrieval on the Web (AIRWeb), pp. 73-80. Banff, Canada. 2007. [bib|y!|airweb|acm]
Debora Donato, Mario Paniccia, Maddalena Selis, Carlos Castillo, Giovanni Cortese, Stefano Leonardi: "New Metrics for Reputation Management in P2P Networks". Workshop on Adversarial Information Retrieval on the Web (AIRWeb), pp. 65-72. Banff, Canada. 2007. [bib|y!|airweb|acm]
Invited Paper
Ricardo Baeza-Yates, Carlos Castillo, Flavio Junqueira, Vassilis Plachouras, Fabrizio Silvestri: "Challenges on Distributed Information Retrieval" (Invited Paper). International Conference on Data Engeneering (ICDE). Istanbul, Turkey, April 2007. IEEE CS Press. [bib|talk|y!|ieee]
Workshop Proceedings
Carlos Castillo, Kumar Chellapilla, Brian D. Davison (chairs/editors): "Proceedings of the 3rd international workshop on Adversarial information retrieval on the web". ACM ICPS, Vol. 215. 2007. [bib|acm]
National Journal
Carlos Castillo, Bartlomiej Starosta, Marcin Sydow "Crawl.pl: Measuring Statistical and Structural Properties of the Polish Web", Studia Informatica, 1(8), pp. 43-73, PL ISSN : 1731-2264, Academy of Podlasie Press, 2007. [bib]
Regional Conference
Gabriel H. Tolosa, Fernando R. A. Bordignon, Ricardo Baeza-Yates, Carlos Castillo: "Caracterización del Espacio Web de Argentina" (in spanish). To be presented in CLEI. Costa Rica, 2007.
Published in 2006 (7)
Journal Articles
Ricardo Baeza-Yates, Paolo Boldi, Carlos Castillo: "Generic Damping Functions for Propagating Importance in Link-Based Ranking". Journal of Internet Mathematics, Vol. 3, No. 4, pp. 445-478, 2006. A K Peters. [bib|jim] (extends "Generalizing PageRank ..." in SIGIR'06)
Patrizia Andronico, Marina Buzzi, Carlos Castillo and Barbara Leporini: "Improving Search Engine Interfaces for Blind Users: a Case Study". Journal of Universal Access in the Information Society, special issue on Information Systems Accessibility. Vol. 5, No.1, pp. 23-40, June 2006. Springer. [bib|springer]
Ricardo Baeza-Yates, Carlos Castillo and Vicente López: "Características de la Web de España" (in spanish). El Profesional de la Información, Vol. 15, No. 1. January-February, pp. 6-17 2006. [bib|metapress]
Conference Articles
Luciana Buriol, Carlos Castillo, Debora Donato, Stefano Leonardi and Stefano Millozzi: "Temporal Analysis of the Wikigraph". In Proceedings of the Web Intelligence Conference, pp. 45-51. Hong Kong, December 2006. IEEE CS Press. [bib|acm|slides]
Carlos Castillo, Alberto Nelli and Alessandro Panconesi: "A Memory-Efficient Strategy for Exploring the Web". In Proceedings of the Web Intelligence Conference, pp. 680-686. Hong Kong, December 2006. IEEE CS Press. [bib]
Ricardo Baeza-Yates, Paolo Boldi and Carlos Castillo: "Generalizing PageRank: Damping Functions for Link-Based Ranking Algorithms". In Proceedings of ACM SIGIR, pp. 308-315. Seattle, Washington, USA, August 2006. [acm|bib |talk@pisa] (See also TR N. 305-05 Univ. of Milano, 2005).
Encyclopedic Article
Ricardo Baeza-Yates and Carlos Castillo: "Web Searching". In Keith Brown, (Editor-in-Chief), Encyclopedia of Language and Linguistics, Second Edition, Vol. 13, pp. 527-537. Oxford: Elsevier, 2006.
Workshop Articles
Luca Becchetti, Carlos Castillo, Debora Donato and Adriano Fazzone: "A Comparison of Sampling Techniques for Web Characterization". In Proceedings of the Workshop on Link Analysis (LinkKDD). Philadelphia, USA, August 2006. ACM Press. [bib|linkkdd]
Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates: "Using Rank Propagation and Probabilistic Counting for Link-Based Spam Detection". In Proceedings of the Workshop on Web Mining and Web Usage Analysis (WebKDD). Philadelphia, USA, August 2006. ACM Press. [bib|webkdd|acm|VIDEO] (See also DELIS TR-0341).
Luca Becchetti, Carlos Castillo, Debora Donato, Stefano Leonardi, Ricardo Baeza-Yates: "Link-Based Characterization and Detection of Web Spam". Workshop on Adversarial Information Retrieval on the Web (AIRWeb). Seattle, USA, August 2006. [bib|airweb|talk@bcn]
Gemma Boleda, Stefan Bott, Carlos Castillo, Rodrigo Meza, Toni Badia, Vicente López: "CUCWeb: a Catalan corpus built from the Web". 2nd Workshop on the Web as a Corpus at EACL'06. Trento, Italy, April 2006. [bib|eacl]
Newsletter
Carlos Castillo, Debora Donato, Luca Becchetti, Paolo Boldi, Massimo Santini, Sebastiano Vigna: "A Reference Collection for Web Spam". SIGIR Forum, Vol. 40, No. 2, December 2006. [dataset|www|sigirf|bib|y!|acm]. DELIS technical report DELIS-TR-0405.
Posters
Luca Becchetti and Carlos Castillo: "The Distribution of PageRank Follows a Power-Law only for Particular Values of the Damping Factor". World Wide Web Conference (posters), pp. 941-942. Edinburgh, Scotland, May 2006. [www2006|acm]
Ricardo Baeza-Yates and Carlos Castillo: "Relationship between Links and Trade". World Wide Web Conference (posters), pp. 927-928. Edinburgh, Scotland, May 2006. [delis-tr-0253|www2006|acm]
Patrizia Andronico, Marina Buzzi, Carlos Castillo and Barbara Leporini: "Testing Google Interfaces Modified for the Blind". World Wide Web Conference (posters), pp. 873-874. Edinburgh, Scotland, May 2006. [www2006|acm]
Published in 2005 (2)
Journal Article
Ricardo Baeza-Yates, Carlos Castillo and Vicente López: "Characteristics of the Web of Spain". Cybermetrics, Vol. 9, No. 1, 2005. [cybermetrics| website|bib]
Conference Article
Ricardo Baeza-Yates, Carlos Castillo, Mauricio Marin and Andrea Rodriguez: "Crawling a Country: Better Strategies than Breadth-First for Web Page Ordering". WWW Conference / Industrial Track, ACM, pp. 864-872. Chiba, Japan, 2005. [talk|bib|acm]
Workshop Articles
Ricardo Baeza-Yates, Carlos Castillo and Vicente López: "Pagerank Increase under Different Collusion Topologies". Workshop on Adversarial Information Retrieval on the Web (AIRWeb). Chiba, Japan, 2005. [airweb|talk|bib]
Ricardo Baeza-Yates and Carlos Castillo: "Link Analysis in National Web Domains". Workshop on Open Source Web Information Retrieval (OSWIR), pp. 15-18. Compiegne, France, September 2005. [bibtex|oswir|talk] (extended in "Characterization of National Web Domains" 2006)
Carlos Castillo and Ricardo Baeza-Yates: "WIRE: an Open-Source Web Information Retrieval Environment". Workshop on Open Source Web Information Retrieval (OSWIR), pp. 27-30. Compiegne, France, September 2005 . [bib|oswir|website|talk]
Albert Bifet, Carlos Castillo, Paul-Alexandru Chirita and Ingmar Weber: "An Analysis of Factors Used in a Search Engine's Ranking". Workshop on Adversarial Information Retrieval on the Web (AIRWeb), synopsis. Chiba, Japan, 2005. [bib]. Reprinted in 2007 as a chapter of the book "Internet Search Engines -- An Introduction" edited by Ravi Kumar Jain B.; Chapter 5, pp. 76-95, ICFAI University Press.
National Conference
Marco Modesto, Álvaro R. Pereira Jr., Nivio Ziviani, Carlos Castillo and Ricardo Baeza-Yates: "Un Novo Retrato da Web Brasileira" (in portuguese) , SEMISH Symposium, pp. 2005-2017. São Leopoldo, Brazil. July 2005. [bib]
Abstract
Carlos Castillo: "Effective Web Crawling (Doctoral Abstract)". ACM SIGIR Forum Vol.39 No. 1, pp. 55-56. June 2005. [acm]
Technical Reports
Carlos Castillo and Ricardo Baeza-Yates: "Practical Web Crawling". Technical Report, 2005.
Carlos Castillo and Ricardo Baeza-Yates: "Visualizing the European Trade Graph". Technical re port DELIS-TR-0252, DELIS (Dynamically Evolving Large-scale Information Systems), 2005. [delis]
Ricardo Baeza-Yates, Paolo Boldi and Carlos Castillo: "The Choice of a Damping Factor for Propagating Importance in Link-Based Ranking". Technical report RI-DSI N. 305-05 , Dipartimento di Scienze dell'Informazione, Università degli Studi di Milano, September 2005. [bib|unimi|talk@pisa] (reviewed and published in 2006 in SIGIR)
Ricardo Baeza-Yates and Carlos Castillo: "Caracterización de la Web Chilena" (in spanish). Technical report, Center for Web Research, Universidad de Chile, 2005. [website]
Patrizia Andronico, Marina Buzzi, Carlos Castillo and Barbara Leporini: "Search Engine UIs: remote usability test with blind persons". Technical report TR-15/2005, Istituto di Informatica e Telematica (IIT), Consiglio Nazionale delle Ricerche (CNR). Pisa, Italy, 2005. [request by e-mail]
Published in 2004 (5)
Book Chapter
Ricardo Baeza-Yates, Carlos Castillo and Felipe Saint-Jean: "Web Dynamics, Structure and Page Quality". In M. Levene and A. Poulovassilis (eds.) "Web Dynamics", Springer, pp. 93-109. 2004. [bib|springer]
Journal Article
A. Jaimes, J. Ruiz-del-Solar, R. Verschae, R. Baeza-Yates, C. Castillo, D. Yaksic and E. Davis: "On the Image Content of a Web Segment: Chile as a Case Study". Journal of Web Engineering, Vol. 3 No. 2, pp. 153-168. 2004. [bib|rinton]
Conferences and Workshops with Proceedings
Carlos Castillo, Mauricio Marin, Andrea Rodriguez and Ricardo Baeza-Yates: "Scheduling Algorithms for Web Crawling". WebMedia/LA-WEB 2004, IEEE Cs. Press, pp. 10-17. Ribeirão Preto-SP, Brazil, 2004. [talk|bib|ieee]
R. Baeza-Yates, J. Ruiz-del-Solar, R. Verschae, C. Castillo and C. Hurtado: "Content-based Image Retrieval and Characterization on Specific Web Collections". Conference on Image and Video Retrieval (CIVR), Springer LNCS, pp. 189-198. Dublin, Ireland, 2004. [bib|springer]
Ricardo Baeza-Yates and Carlos Castillo: "Crawling the Infinite Web: Five Levels are Enough". Workshop of Algorithms on Web Graphs (WAW), Springer LNCS, pp. 156-167. Rome, Italy, 2004. [talk|bib|springer] (extended version available, see year 2005)
National Conferences
G. Boleda, S. Bott, B. Poblete, C. Castillo, M.E. Fuenmayor, T. Badia, V. López: "CuCWeb, un corpus del català construït a partir de la web" (in catalan). Congrés Societat del Coneixement. Barcelona, España, 2004. [html]
Poster
Efthimis N. Efthimiadis, Carlos Castillo: "Charting the Greek Web". ASIST Conference (Poster), Providence, Rhode Island, USA, 2004. [bibtex]
Thesis
Carlos Castillo: "Efficient Web Crawling". PhD Thesis. Universidad de Chile, 2004. [bib]
Technical Reports
Ricardo Baeza-Yates, Felipe Lalanne, Carlos Castillo, Georges Dupret: "Comparing the characteristics of the Korean and the Chilean Web". Technical report, ITCC, DCC, University of Chile, 2004.
Ricardo Baeza-Yates, Carlos Castillo and Efthimis Efthimiadis: "Comparing the characteristics of the Chilean and the Greek Web". Technical report, Universidad de Chile, 2004.
Published in 2003 (1)
Conferences
A. Jaimes, J. Ruiz-del-Solar, R. Verschae, D. Yaksic, R. Baeza-Yates, E. Davis and C. Castillo: "On the Image Content of the Chilean Web". Latin American Web Conference (LA-WEB), IEEE Cs. Press, pp.72-83. Santiago, Chile, 2003. [bib|ieee]
Poster
Carlos Castillo: "Cooperation schemes between a Web server and a Web search engine". Latin American Web Conference LA-WEB (Extended Poster), IEEE Cs. Press, pp. 31a-35a. Santiago, Chile, 2003. [bib|ieee]
Technical Reports
Vicente López, Carlos Castillo and Joan Codina: "Information Retrieval in Mail Archives". Technical report, Cátedra Telefónica de Producción Multimedia, Universitat Pompeu Fabra, 2003.
Carlos Castillo: "Estudio de idiomas en las páginas Web españolas (dominio .ES)" (in spanish).Technical report, Cátedra Telefónica de Producción Multimedia, Universitat Pmpeu Fabra, 2003.
Published in 2002 (3)
Journal Article
Ricardo Baeza-Yates and Carlos Castillo: "Balancing Volume, Quality and Freshness in Web Crawling". in A. Abraham, J. Ruiz-del-solar, M. Köppen (Eds.), Soft-Computing Systems: Design, Management and Applications, Frontiers in Artificial Intelligence and Applications 97, IOS Press, pp. 565-572, 2002. [talk|bib|ios]
Conferences
Ricardo Baeza-Yates, Felipe Saint-Jean and Carlos Castillo: "Web Structure, Age, and Page Quality". Proceedings of String Processing and Information Retrieval (SPIRE), Springer LNCS, pp. 117-130, 2002. Lisbon, Portugal. Also presented in 2nd Web Dynamics Workshop, Hawaii, 2002. [bib|springer] (see also year 2004)
Carlos Castillo: "A Model for the Design and Implementation of Web Sites". IADIS International WWW/Internet Conference (ICWI), pp. 452-460. Lisbon, Portugal, 2002. [bib]
Poster
Carlos Castillo and Ricardo Baeza-Yates: "A New Model for Web Crawling". World Wide Web Conference (Poster). Honololulu, USA, 2002. [bib]
Published in 2001 (1)
Conference
Ricardo Baeza-Yates and Carlos Castillo: "Relating Web characteristics with link based Web page ranking". Proceedings of String Processing and Information Retrieval (SPIRE), IEEE Cs. Press, pp 21-32. Laguna San Rafael, Chile, 2001. [talk|bib|ieee] (see also year 2003)
National Conference
Carlos Castillo: "Newtenberg: Un Modelo e Implementación de un sistema de Publicaciones Digitales en la Web" (in spanish). Encuentro Chileno de Ciencias de la Computación. Punta Arenas, Chile. 2001.
Poster
Ricardo Baeza-Yates and Carlos Castillo: "Relating Web Structure and User Behavior". World Wide Web Conference (Poster). Hong Kong, 2001.
Technical Report
Ricardo Baeza-Yates and Carlos Castillo: "Analysis of Link-Based Ranking for the Web". Technical report, University of Chile, 2001.
Published in 2000
National Conference
Ricardo Baeza-Yates and Carlos Castillo: "Caracterizando la Web Chilena". (in spanish) Encuentro Chileno de Ciencias de la Computación, año 2000. [bib]
Thesis
Carlos Castillo: "Características de la Web Chilena y Extensiones a un Buscador Web" (in spanish), Memoria de título, Universidad de Chile, año 2000.