Tue 5:30-8:00pm, Davis 338A.
Date | Topics | Presenter | Bibliographic information | ||||||||||||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
02/07/2017 | Conditional functional dependencies | Ning Deng | ,
| ||||||||||||||||||||||||||||||||||||||||
02/14/2017 | Record matching and data repairing | Ladan Golshanara |
|
||||||||||||||||||||||||||||||||||||||||
02/21/2017 | Data cleaning | Meghana Ananth Gad, Poonam Kumari |
|
||||||||||||||||||||||||||||||||||||||||
02/28/2017 | Data quality of temporal records and streams. | Deepti Chavan, Sushmita Sinha |
| ||||||||||||||||||||||||||||||||||||||||
03/28/2017 | Crowdsourcing | Pruthvi Mulagala, Vaibhav Sinha |
| ||||||||||||||||||||||||||||||||||||||||
04/04/2017 | Crowdsourcing Algorithms for Entity Resolution. | Jay Narendra Shah |
| ||||||||||||||||||||||||||||||||||||||||
04/04/2017 | Causality in databases | George Gunner |
| ||||||||||||||||||||||||||||||||||||||||
04/18/2017 | Crowdsourcing queries | Prashanth Seralathan, Shreya Ravi Kumar |
| ||||||||||||||||||||||||||||||||||||||||
04/25/2017 | Stream data cleaning | Rajeev Vaswani |
| ||||||||||||||||||||||||||||||||||||||||
04/25/2017 | Approximate entity extraction | Anuradha Ashavatha Rao |
| ||||||||||||||||||||||||||||||||||||||||
05/02/2017 | Traffic monitoring and management | Himal Dwarakanath, Neeharika Nelaturu |
| ||||||||||||||||||||||||||||||||||||||||
05/02/2017 | Data fusion | Arun Sharma |
| ||||||||||||||||||||||||||||||||||||||||
05/02/2017 | Finding related tables | Shad Ullah Khan |
| ||||||||||||||||||||||||||||||||||||||||
05/09/2017 | Cleaning Urban Data | Omkar Guruprasad Neogi |
| ||||||||||||||||||||||||||||||||||||||||
05/09/2017 | Querying Raw Data Files | Mythri Jonnavittula, Barath Eswer Nagasubramaniyan |
| ||||||||||||||||||||||||||||||||||||||||
05/09/2017 | Data errors | Senthil Kumar Laguduva Yadindra Kumar |
|
http://libweb.lib.buffalo.edu/help/help.asp?ID1=442Many papers can be googled on the author pages or retrieved from
dblp.
Ziawasch Abedjan, Xu Chu, Dong Deng, Raul Castro Fernandez, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Michael Stonebraker, Nan Tang: Detecting Data Errors: Where are we and what needs to be done? PVLDB 9(12): 993-1004 (2016).
George Beskales, Mohamed A. Soliman, Ihab F. Ilyas, Shai Ben-David: Modeling and Querying Possible Repairs in Duplicate Detection. PVLDB 2(1): 598-609 (2009)
Shaoxu Song, Aoqian Zhang, Jianmin Wang, Philip S. Yu: SCREEN: Stream Data Cleaning under Speed Constraints. SIGMOD Conference 2015: 827-841
Maria Vanina Martinez, Andrea Pugliese, Gerardo I. Simari, V. S. Subrahmanian, Henri Prade: How Dirty Is Your Relational Database? An Axiomatic Approach. ECSQARU 2007: 103-114.
Wenfei Fan, Shuai Ma, Nan Tang, Wenyuan Yu: Interaction between Record Matching and Data Repairing. J. Data and Information Quality 4(4): 16:1-16:38 (2014)
Furong Li, Mong-Li Lee, Wynne Hsu, Wang-Chiew Tan: Linking Temporal Records for Profiling Entities. SIGMOD Conference 2015: 593-605
Xuan Liu, Xin Luna Dong, Beng Chin Ooi, Divesh Srivastava: Online Data Fusion. PVLDB 4(11): 932-943 (2011)
Andrei Lopatenko, Loreto Bravo: Efficient Approximation Algorithms for Repairing Inconsistent Databases. ICDE 2007: 216-225.
Zeyu Li, Hongzhi Wang, Wei Shao, Jianzhong Li, Hong Gao: Repairing Data through Regular Expressions. PVLDB 9(5): 432-443 (2016)
Jens Ehrlich, Mandy Roick, Lukas Schulze, Jakob Zwiener, Thorsten Papenbrock, Felix Naumann: Holistic Data Profiling: Simultaneous Discovery of Various Metadata. EDBT 2016: 305-316.
Ramanathan V. Guha, Dan Brickley, Steve Macbeth: Schema.org: evolution of structured data on the web. Commun. ACM 59(2): 44-51 (2016)
.
Manish Kumar Anand, Shawn Bowers, Bertram Ludäscher: Techniques for efficiently querying scientific workflow provenance graphs. EDBT 2010: 287-298.
Yael Amsterdamer, Susan B. Davidson, Daniel Deutch, Tova Milo, Julia Stoyanovich, Val Tannen: Putting Lipstick on Pig: Enabling Database-style Workflow Provenance. PVLDB 5(4): 346-357 (2011).
James Cheney, Laura Chiticariu, Wang Chiew Tan: Provenance in Databases: Why, How, and Where. Foundations and Trends in Databases 1(4): 379-474 (2009).
Melanie Herschel: A Hybrid Approach to Answering Why-Not Questions on Relational Query Results. J. Data and Information Quality 5(3): 10:1-10:29 (2015).
Peter Buneman, Adriane Chapman, James Cheney: Provenance management in curated databases. SIGMOD Conference 2006: 539-550,
Alexandra Meliou, Wolfgang Gatterbauer, Katherine F. Moore, Dan Suciu: The Complexity of Causality and Responsibility for Query Answers and non-Answers. PVLDB 4(1): 34-45 (2010).
Alexandra Meliou, Wolfgang Gatterbauer, Joseph Y. Halpern, Christoph Koch, Katherine F. Moore, Dan Suciu: Causality in Databases. IEEE Data Eng. Bull. 33(3): 59-67 (2010).
Michael J. Franklin, Donald Kossmann, Tim Kraska, Sukriti Ramesh, Reynold Xin: CrowdDB: answering queries with crowdsourcing. SIGMOD Conference 2011: 61-72.
Norases Vesdapunt, Kedar Bellare, Nilesh N. Dalvi: Crowdsourcing Algorithms for Entity Resolution. PVLDB 7(12): 1071-1082 (2014).
Norases Vesdapunt, Kedar Bellare, Nilesh N. Dalvi: Errata for "Crowdsourcing Algorithms for Entity Resolution" (PVLDB 7(12): 1071-1082). PVLDB 8(5): 641 (2015).
Xu Chu, John Morcos, Ihab F. Ilyas, Mourad Ouzzani, Paolo Papotti, Nan Tang, Yin Ye: KATARA: A Data Cleaning System Powered by Knowledge Bases and Crowdsourcing. SIGMOD Conference 2015: 1247-1261.
Susan B. Davidson, Sanjeev Khanna, Tova Milo, Sudeepa Roy: Using the crowd for top-k and group-by queries. ICDT 2013: 225-236.
Juliana Freire, Aline Bessa, Fernando Chirigati, Huy T. Vo, Kai Zhao: Exploring What not to Clean in Urban Data: A Study Using New York City Taxi Trips. IEEE Data Eng. Bull. 39(2): 63-77 (2016).
Nikolaos Panagiotou, Nikolas Zygouras, Ioannis Katakis, Dimitrios Gunopulos, Nikos Zacheilas, Ioannis Boutsis, Vana Kalogeraki, Stephen Lynch, Brendan O'Brien: Intelligent Urban Data Monitoring for Smart Cities. ECML/PKDD (3) 2016: 177-192.
Nikolaos Zygouras, Nikos Zacheilas, Vana Kalogeraki, Dermot Kinane, Dimitrios Gunopulos: Insights on a Scalable and Dynamic Traffic Management System. EDBT 2015: 653-664.
Wei Wang, Chuan Xiao, Xuemin Lin, Chengqi Zhang: Efficient approximate entity extraction with edit distance constraints. SIGMOD Conference 2009: 759-770.
Ioannis Alagiannis, Renata Borovica, Miguel Branco, Stratos Idreos, Anastasia Ailamaki: NoDB: efficient query execution on raw data files. SIGMOD Conference 2012: 241-252.