Seminario Modelling and Verifying Data Quality

Los invitamos al Seminario Modelling and Verifying Data Quality dictado por el investigador Sagar Sen, del Simula Research Laboratory de Noruega.

Data is ubiquitous and we are now able to collect large quantities of it in complex and distributed software systems such as the cloud. Lot of decision making is also based on analyzing and mining this data. For instance, at the Cancer Registry of Norway a data is collected about patients taking differnet types of screening tests. This data is then exported to reseachers for analysis and they come up with trends in public health and influence policies for new treatments. However, we often fail to ask if the quality of the data is good enough that is complete, correct, valid, and timely to come up with accurate scientific claims that ultimately affect policy. Therefore, there is a need to be able to look at the data quality issues separately. This talk will focus on how we model "high or low" quality data based on the notion of "data interactions" and verify large relational databases for the satisifaction of such models. I will illustrate the functioning of our tool Depict (Discovering Patterns and Interactions in Databases) with data from the Norwegian Customs and Excise department and the Cancer Registry of Norway.

 Fecha:

Miércoles 26 de noviembre

 Hora:

2:00 PM

 Lugar:

Salón W - 501
Universidad de los Andes
Cra. 1 Este No. 19 A - 40

Additional Info

  • Fecha: 2014-11-26
  • Hora: 2:00pm
  • Lugar: W-501 - Universidad de los Andes
Read 3535 times Last modified on Friday, 28 November 2014 11:52