A collection of datasets to facilitate the study of Schema Evolution. Each dataset refers to the history of a database schema as a sequence of releases. Wherever available, contextual information for this history are also published. For more explanations and findings, please refer to http://www.cs.uoi.gr/~pvassil/projects/schemaBiographies/index.html
The current collection of these data has been compiled and processed by Athanasios (Thanos) Pappas, during February 2017.
The first collection and processing of these data was performed by Ioannis Skoulis, in 2013. Unless explicitly stated otherwise (the names of the folders are indicative), the data collected refer to this version. Please refer-to/cite the following paper:
Open-Source Databases: Within, Outside, or Beyond Lehman's Laws of Software Evolution?. Ioannis Skoulis, Panos Vassiliadis, Apostolos Zarras. 26th International Conference on Advanced Information Systems Engineering (CAiSE 2014), 16-20 June 2014, Thessaloniki, Hellas. Source code, datasets, presentations available at http://www.cs.uoi.gr/~pvassil/publications/2014_CAiSE/
Initial work on collecting schema evolution datasets was performed by Carlo A. Curino
The diff's and metrics obtained for the published datasets are obtained by our tool set