Publications

Article about demo and citizen science study at Virginia Science Festival

See the VT CS page about the 9/26/2015 event.

Theses and Dissertations

Team Term Projects in Virginia Tech Courses Related to IDEAL

CS4624: Multimedia, Hypertext, and Information Access - see final reports online:

CS4984: Computational Linguistics - There were 7 teams, each working with a different type of event and generating a series of ever-improving summaries using increasingly complex methods, for both a small and a large event of that type. Students used our Hadoop cluster for the big events. See final reports online:

CS5604: Information Retrieval, Fall 2016 - see final reports online:

CS5604: Information Retrieval, Spring 2016 - see final reports online:

CS5604: Information Retrieval, Spring 2015 - see final reports online:

CS6604: Digital Libraries - see final reports online:

CS5974: Independent Study

  • Pranav Nakate: Big Data Processing of School Shooting Archives, Report, Slides

Team Papers and Posters (see also co-authorship network graph)

  • Sangho Suh, Jaegul Choo, Joonseok Lee, and Chandan Reddy, "Local Topic Discovery via Boosted Ensemble of Nonnegative Matrix Factorization." In Proceedings of the International Joint Conference on Artificial Intelligence (IJCAI), Melbourne, Australia, August 2017. http://dmkd.cs.vt.edu/papers/IJCAI17.pdf
  • Vachik S. Dave, Mohammad Al Hasan, and Chandan K. Reddy, "How Fast Will You Get a Response? Predicting Interval Time for Reciprocal Link Creation." In Proceedings of International AAAI Conference on Web and Social Media (ICWSM), Montréal, Canada, May 2017. http://dmkd.cs.vt.edu/papers/ICWSM17.pdf
  • Vineeth Rakesh, Niranjan Jadhav, Alexander Kotov, and Chandan K. Reddy, "Probabilistic Social Sequential Model for Tour Recommendation." In Proceedings of the ACM International Conference on Web Search and Data Mining (WSDM), Cambridge, UK, February 2017. http://dmkd.cs.vt.edu/papers/WSDM17.pdf
  • Hannah Kim, Jaegul Choo, Changhyun Lee, Hanseung Lee, Chandan K. Reddy, and Haesun Park, "PIVE: Per-Iteration Visualization Environment for Real-time Interactions with Dimension Reduction and Clustering." In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), San Francisco, CA, February 2017. http://dmkd.cs.vt.edu/papers/AAAI17.pdf
  • Bhanukiran Vinzamuri, Yan Li and Chandan K. Reddy, "Pre-Processing Censored Survival Data using Inverse Covariance Matrix based Calibration", IEEE Transactions on Knowledge and Data Engineering (TKDE), 2017. http://dmkd.cs.vt.edu/papers/TKDE17.pdf
  • Mohamed Magdy Gharib Farag, Sunshin Lee, Edward A. Fox. Focused Crawling for Events. International Journal on Digital Libraries, 18:1-17, 2017. DOI: 10.1007/s00799-016-0207-1
  • Edward A. Fox, Martin Klein, and Zhiwu Xie. Guest Editors' Introduction to the Special Issue on Web Archiving. International Journal on Digital Libraries, 18, 2017. DOI: 10.1007/s00799-016-0203-5
  • Edward A. Fox, Zhiwu Xie, Martin J. Klein. Web Archiving and Digital Libraries (WADL) 2016: Highlights and Introduction to this Special Issue. Bulletin of IEEE Technical Committee on Digital Libraries, 13(1), April 2017, 3 pages, http://www.ieee-tcdl.org/Bulletin/v13n1/papers/intro.pdf
  • Mohamed Farag and Edward A. Fox. Which webpage should we crawl first? Social media-based webpage source importance guidance. Bulletin of IEEE Technical Committee on Digital Libraries, 13(1), April 2017, 1 page, http://www.ieee-tcdl.org/Bulletin//v13n1/papers/farag.pdf
  • Sunshin Lee and Edward A. Fox. Archiving and Analyzing Tweets and Webpages with the DLRL Hadoop Cluster. Bulletin of IEEE Technical Committee on Digital Libraries, 13(1), April 2017, 1 page, http://www.ieee-tcdl.org/Bulletin//v13n1/papers/lee.pdf
  • Sunshin Lee and Edward A. Fox. Archiving and Analyzing Tweets and Webpages with the DLRL Hadoop Cluster. Poster presented at WADL 2016: Third International Workshop on Web Archiving and Digital Libraries, June 22-23, 2016. In connection with ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2016, http://www.jcdl2016.org/), Rutgers Univ., Newark, NJ, http://fox.cs.vt.edu/wadl2016.html
  • Mohamed Farag, Pranav Nakate and Edward A. Fox. Big Data Processing of School Shooting Archives. Poster, in Proc. ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2016, http://www.jcdl2016.org/), Rutgers Univ., Newark, NJ, June 19-23, 2016, 271-272, http://dx.doi.org/10.1145/2910896.2925466
  • Sunshin Lee, Mohamed Magdy, Richard Gruss, Tarek Kanan, Xuan Zhang, and Edward A. Fox. Enhanced problem-based learning connecting big data research with classes. Poster at HPC Day 2016, hosted by Virginia Tech's Advanced Research Computing, Blacksburg, VA, 11 April 2016
  • Andrea L. Kavanaugh, Steven D. Sheetz, Rodrigo Sandoval-Almazan, John C. Tedesco, Edward A. Fox. Media use during conflicts: Information seeking and political efficacy during the 2012 Mexican elections. Government Information Quarterly, 33(3): 595-602, Feb. 2016, DOI: 10.1016/j.giq.2016.01.004, http://dx.doi.org/10.1016/j.giq.2016.01.004
  • Edward A. Fox; Andrea Kavanaugh; Donald Shoemaker; Steven Sheetz; Mohamed Magdy Farag; and Sunshin Lee. "Can Collecting, Archiving, Analyzing, and Accessing Webpages and Tweets Enhance Resilience Research and Education?" Presentation for Global Forum on Urban & Regional Resilience Spring 2016 Seminar Series, 2/11/2016, Virginia Tech, Blacksburg, VA
  • Edward A. Fox, Mohamed Farag, Sunshin Lee, Xuan Zhang, Richard Gruss. Conversation: Problem/project-based Learning with Big Data. Proc. 2016 Conference on Higher Education Pedagogy. Feb. 10-12, 2016. Blacksburg, VA, USA.
  • Richard Gruss, Tarek Kanan, Xuan Zhang, Mohamed Farag, Mary C. English, and Edward A. Fox. Teaching Big Data Through Project-based Learning in Computational Linguistics and Information Retrieval. Journal of Computing Sciences in Colleges (ISSN 1937-4771) 31(2): 260-270. From Proc. 29th Annual Consortium for Computing in Small Colleges: Southeastern Conference (CCSC:SE), Roanoke College, Salem, VA, November 6-7, 2015.
  • Edward A. Fox, Zhiwu Xie, Martin J. Klein. Introduction to the Web Archiving and Digital Libraries 2015 Workshop Issue: Web Archiving and Digital Libraries 2015 (WADL 2015) Overview. Bulletin of IEEE Technical Committee on Digital Libraries, 11(2), October 2015, 2 pages, http://www.ieee-tcdl.org/Bulletin/v11n2/papers/intro.pdf
  • Mohamed M. G. Farag, Edward A. Fox. Building and archiving event web collections: A focused crawler approach. Bulletin of IEEE Technical Committee on Digital Libraries, 11(2), October 2015, 2 pages, http://www.ieee-tcdl.org/Bulletin/v11n2/papers/farag.pdf
  • Sunshin Lee, Mohamed Magdy, Edward A. Fox. IDEAL: Integrated Digital Event Archive & Library. Demonstration, Center for Human Computer Interaction (CHCI) 20-Year Celebration Conference, 15-17 October 2015, Blacksburg, VA, USA.
  • Edward A. Fox and the IDEAL team. IDEAL (Integrated Digital Event Archive & Library). Virginia Science Festival, Sept. 26, 2015, Blacksburg, VA, USA. http://www.cpe.vt.edu/sciencefestival/plan.html
  • Edward A. Fox and Zhiwu Xie. Web Archiving and Digital Libraries (WADL). Workshop overview in Proc. of the Joint Conference on Digital Libraries (JCDL 2015). June 21-25, 2015. Knoxville, TN, p. 303. http://dx.doi.org/10.1145/2756406.2756934
  • Sunshin Lee, Mohammed Farag, Tarek Kanan, and Edward A. Fox. Read between the lines: A Machine Learning Approach for Disambiguating the Geo-location of Tweets. Proc. JCDL 2015. June 21-25, 2015. Knoxville, TN, 273-274. http://dx.doi.org/10.1145/2756406.2756971
  • Tarek Kanan, Xuan Zhang, Mohammed Magdy, and Edward Fox. Big Data Text Summarization for Events: a Problem Based Learning Course. Proc. of the Joint Conference on Digital Libraries (JCDL 2015). June 21-25, 2015. Knoxville, TN, pp. 87-90. http://dx.doi.org/10.1145/2756406.2756943
  • Mohamed Magdy Farag and Edward A. Fox. Web Archive Content Analysis: Disaster Events Case Study. International Internet Preservation Consortium (IIPC) 2015 General Assembly (GA2015), Stanford, April 27 - May 1, 2015
  • Kavanaugh, A.L., Sheetz, S.D., Tedesco, J.C., Sandoval-Almazan, R., and Fox, E.A. (2014). Media use during conflicts: Information seeking gratification and efficacy during Mexican Elections 2012. In ACM Proceedings of the 15th International Annual Conference on Digital Government Research (dg.o 2014), Aguascalientes, Mexico, June 18-21, 2014.
  • Sheetz, Steven; Kavanaugh, Andrea; Fox, Edward; Elmongui, Hicham; Hassan, Riham; Yang, Seungwon; Magdy, Mohammed; Shoemaker, Donald. Information Uses and Gratifications in Crisis: Student Perceptions since the Egyptian Uprising. Proceedings of the 11th International ISCRAM Conference, University Park, Pennsylvania, USA, May 18-21, 2014
  • Mohamed M. G. Farag and Edward A. Fox. Intelligent Event Focused Crawling. Proceedings of the 11th International ISCRAM Conference, University Park, Pennsylvania, USA, May 18-21, 2014
  • Yang, S., Chung, H., Lin, X., Lee, S., Chen, L., Wood, A., Kavanaugh, A. L., Sheetz, S. D., Shoemaker, D. J., and Fox, E. A. (2013). PhaseVis: What, When, Where, and Who in Visualizing the Four Phases of Emergency Management through the Lens of Social Media. Proceedings of the 10th International ISCRAM Conference. Baden-Baden, Germany, May 12-15, 2013. (Slides).
  • Kavanaugh, A.L., Sheetz, S.D., Hassan, R., Yang, S., Fox, E.A., Magdy, M., Elmongui, H.G., and Shoemaker, D. J. Between a Rock and a Cell Phone: Communication and Information Use during the Egyptian Uprising. International Journal of Information Systems for Crisis, Response and Management 5(1): 1-21, January - March 2013.
  • Yang, S., Chitturi, K., Wilson, G., Magdy, M., and Fox, E. A. (2012). A Study of Automation from Seed URL Generation to Focused Web Archive Development: The CTRnet Context. ACM/IEEE Joint Conference on Digital Libraries (JCDL'12). June 11-14, 2012. Washington D.C., USA. (Poster).
  • Sunshin Lee, Noha Elsherbiny, Edward A. Fox. A Digital Library for Water Main Break Identification and Visualization. Poster in Proceedings of the 12th ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL 2012), Washington D.C., June 10-14, 2012, 335-336, http://dx.doi.org/10.1145/2232817.2232878
  • Kavanaugh, A., Fox, E. A., Sheetz, S., Yang, S., Li, L.T., Shoemaker, D., Natsev, A., and Xie, L. (2012). Social Media Use by Government: From the Routine to the Critical. Government Information Quarterly, 29(4): 480-491, Oct. 2012, http://dx.doi.org/10.1016/j.giq.2012.06.002
  • Emergency Informatics and Digital Libraries - Workshop at JCDL 2012. Description.
  • Kavanaugh, A.L., Sheetz, S.D., Hassan, R., Yang, S., Fox, E.A., Magdy, M., Elmongui, H.G. Between a Rock and a Cell Phone: Communication and Information Use during the Egyptian Uprising. In Proceedings of the 9th International Conference on Information Systems for Crisis, Response and Management (ISCRAM), Vancouver, Canada, April 2012.
  • Yang, Seungwon and Kavanaugh, Andrea (2011) Collecting, Analyzing and Visualizing Tweets using Open Source Tools. Technical Report TR-11-14, Computer Science, Virginia Tech. Paper.
  • Kavanaugh, A., Natsev, A., Fox, E. A., Sheetz, S., Shoemaker, D., Xie, L., Yang, S., Li, L. T., Srinivasan, V. Social Media for Cities, Counties and Communities. Final Grant Report to VT CCSR. March 11, 2011. Paper.
  • Kavanaugh, A., Yang, S., Sheetz, S. D., Li, L. T., Fox, E. A. (2011). Microblogging in Crisis Situations: Mass Protests in Iran, Tunisia, and Egypt. Workshop in conjunction with the ACM Conference on Human Factors in Computing Systems (CHI'11). May 7-12, 2011, Vancouver, Canada. Paper.
  • S. Sheetz, E. Fox, A. Fitzgerald, S. Palmer, D. Shoemaker, A. Kavanaugh. "Why students use social networking sites after crisis situations". 2011. Paper.
  • S. Sheetz. A Database Driven Initial Ontology for Crisis, Tragedy, and Recovery. Draft.
  • S. Sheetz, E. Fox, A. Fitzgerald, S. Palmer, D. Shoemaker, A. Kavanaugh. "Why students use social networking sites after crisis situations", Poster presented at Information Systems for Crisis Response and Management (ISCRAM) conference 2011, Lisbon, Portugal. Poster.
  • Seungwon Yang, Andrea Kavanaugh, Nádia P. Kozievitch, Lin Tzy Li, Venkat Srinivasan, Steven Sheetz, Travis Whalen, Donald Shoemaker, Ricardo da S. Torres and Edward A. Fox. CTRnet DL for Disaster Information Services. ACM/IEEE Joint Conference on Digital Libraries (JCDL'11). June 13-17, 2011. Ottawa, Ontario, Canada. Poster summary.
  • Seungwon Yang and Andrea Kavanaugh. Half-Day Tutorial: Collecting, Analyzing and Visualizing Tweets using Open Source Tools. ACM 2011 Digital Government Research Conference (dg.o 2011), June 12-15, 2011 (College Park, MD). New York: ACM Press. Tutorial summary.
  • Kavanaugh, A., Fox, E., Sheetz, S., Yang, S., Li, L.T., Whalen, T., Shoemaker, D., Natsev, A., Xie, L. Social Media Use by Government: From the routine to the critical. In ACM Proceedings of the 12th International Annual Conference on Digital Government Research (dg.o 2011), June 12-15, 2011 (College Park, MD). New York: ACM Press. Paper.
  • Li, L.T., Yang, S., Kavanaugh, A., Fox, E. A., Sheetz, S. D. and Shoemaker, D. Twitter Use During an Emergency Event: the Case of UT Austin Shooting. ACM 2011 Digital Government Research Conference (dg.o 2011), June 12-15, 2011 (College Park, MD). New York: ACM Press. Poster summary.

Team Books (4-book series on Digital Libraries)

  1. Edward A. Fox and Jonathan P. Leidig, editors. Digital Library Applications: CBIR, Education, Social Networks, eScience/Simulation, and GIS. Morgan & Claypool Publishers, San Francisco, March 2014, 175 pages, ISBN paperback 9781627050326, ebook 9781627050333, http://dx.doi.org/10.2200/S00565ED1V01Y201401ICR032
  2. Edward A. Fox and Ricardo da Silva Torres, editors. Digital Library Technologies: Complex Objects, Annotation, Ontologies, Classification, Extraction, and Security. Morgan & Claypool Publishers, San Francisco, March 2014, 205 pages, ISBN paperback 9781627050302, ebook 9781627050319, http://dx.doi.org/10.2200/S00566ED1V01Y201401ICR033
  3. Rao Shen, Marcos Andre Goncalves, and Edward A. Fox. Key Issues Regarding Digital Libraries: Evaluation and Integration. Morgan & Claypool Publishers, San Francisco, Feb. 2013, 110 pages, ISBN paperback 9781608459124, ebook 9781608459131, http://dx.doi.org/10.2200/S00474ED1V01Y201301ICR026
  4. Edward A. Fox, Marcos Andre Goncalves, and Rao Shen. Theoretical Foundations for Digital Libraries: The 5S (Societies, Scenarios, Spaces, Structures, Streams) Approach. Morgan & Claypool Publishers, San Francisco, July 2012, 180 pages, ISBN paperback 9781608459100, ebook 9781608459117, http://dx.doi.org/10.2200/S00434ED1V01Y201207ICR022, supplementary website https://sites.google.com/a/morganclaypool.com/dlibrary/

IDEAL Team Annual Project Reports

CTRnet Team Annual Project Reports

Team Workshop, Awards, and Grant Proposals

Team Presentation Slides

  • MESA (Middle East Studies Association) 2016 Presentation: The Use and Impact of Social Media: Tunisian Elections 2014. The meeting was held on November 17-20, 2016 in Boston, MA. (Slides).

  • IDEAL personnel ran a tutorial and presented a poster (transactional web archives; disambiguating geo-location of tweets) and short paper at JCDL 2015. A workshop on Web Archiving and Digital Libraries, WADL 2015, was also run.

  • The International Internet Preservation Consortium General Assembly (IIPC GA) was held on April 27 - May 1, 2015 in Palo Alto, California

  • The 2014 Internet Archive annual meeting was held on November 18 in Montgomery, Alabama. See a list of interesting talks (including ours) here and the meeting schedule here

  • Internal Advisory Board Meeting (Oct. 16, 2014): download

  • External Advisory Board Meeting (Sept. 23, 2014): download

  • WIRE presentation at Harvard (June 10, 2014): link

  • WADL'13 Workshop (July 25-26, 2013): download

  • ISCRAM'13 Conference (May 12-15, 2013): link

  • Virtual Workshop on Big Data and Emergency Informatics (Jan. 10, 2013): download

  • Archive-it Partner Meeting, Dec. 3, 2012 - Web Archiving: Real-time Archiving of Spontaneous Events (Use-Case: Hurricane Sandy), Analyzing and Visualizing (Tweet) Archives: download

From Other Sources:

Other Presentation Slides, from JCDL'11 Workshop - Web Archiving

  • California Digital Library (CDL)'s Web Archiving System: download.
  • Transactional Web Archives (Los Alamos National Laboratory Research Library): download.
  • Wayback, WAT, and Flying Pigs (Internet Archive): download.
  • Web and Twitter Archiving at the Library of Congress: download.

Other Books

  • The Emergency Preparedness and Disaster Resilience Collection by The National Academies Press (2004). Free books download link.
  • Newman, Katherine S. Rampage: The Social Roots of School Shootings. Basic Books (2004).
  • Hasday, Judy L. Columbine High School Shooting: student violence. Enslow Publishers (2002)
  • Roland Lazenby. April 16th: Virginia Tech Remembers. Plume Publishers (2007)
  • Beth J. Lueders. Lifting Our Eyes: Finding God's Grace Through the Virginia Tech Tragedy: The Lauren McCain Story. Berkley Trade Publishers (2007)
  • Ben Agger, Timothy W. Luke. There is a gunman on campus: tragedy and terror at Virginia Tech. Rowman & Littlefield Publishers, Inc. (April 28, 2008)

Other Scholarly Publications

  • Disaster, Crisis and Trauma Working Party. (March 2008). Database of disaster resources. The British Psychological Society. Download link.
  • Wike, Traci L., Fraser, Mark W. (May/June 2009). School shootings: making sense of the senseless. Aggression and Violent Behavior, 14:3, 162-169.
  • Geddes L. (March 2009). School shootings: what kind of environment fosters a killer? The New Scientist, 210:2700.
  • Langman P. (Jan/Feb 2009). Rampage school shooters: A typology. Aggression and Violent Behavior, 14:1, 79-86.
  • E Schwarz, J Kowalski, (November 1991). Malignant memories: PTSD in children and adults after a school shooting, Journal of the American Academy of Child and Adolescent Psychiatry, 30:6, 936-944.
  • Leary M, Kowalski R.M, Smith L, Phillips S. (April 2003). Teasing, rejection, and violence: Case studies of the school shootings, Aggressive Behavior, 29:3, 202-214.
  • Verlinden S, Hersen M and Thomas J, (2000). Risk factors in school shootings, Clinical Psychology Review, 20:1, 3-56.
  • Schwarz Eitan D, Kowalski Janice M, McNally Richard J. (October, 1993), Malignant memories: Post-traumatic changes in memory in adults after a school shooting, Journal of Traumatic Stress, 6:4, 545-553.
  • O’Toole, ME (1999). The school shooter: A threat assessment perspective. The Critical Incident Response Group (CIRG), National Center for the Analysis of Violent Crime (NCAVC), FBI Academy.