The Environmental Protection Agency (EPA) is at the front line of environmental protection and policing in Ireland. The EPA is an independent public body established under the Environmental Protection Agency Act, 1992. The EPA has a wide range of knowledge, regulatory and advocacy functions including responsibility for environmental licensing and enforcement of industrial, waste and wastewater activities.
Data management technologies and practices have undergone dramatic transformation since 1992. In 1992, our regulatory information and all interactions between the EPA and its licensees were recorded in paper files, which gradually evolved into electronic document storage. From 2005 to 2015, the EPA invested significant ICT time and effort in developing database management systems (DBMS). Important features of the EPA data management architecture are the Environmental Data Exchange Network (EDEN), a Microsoft CRM application to manage all authorisations data and a centralised database of environmental geographic (GIS) data. For more information please see Annex Figure 2.A.1.
In 2017, the EPA adopted an enterprise architecture approach to ICT portfolio management and examined the ICT organisational structure. The EPA had achieved a great deal in establishing robust data management systems: the next opportunity was to unlock the potential of this structured data to gather more knowledge and insights about Ireland’s environment. In 2018, the EPA established a small Analytics team of four staff and two contractors to pilot the use of data science, spatial analysis, earth observation and data visualisation techniques. The purpose of the team is to analyse data to generate knowledge and insights on the environment by working collaboratively with EPA subject matter experts. When the team of experts was formed they embarked on a series of stakeholder meetings across the EPA to identify opportunities for the application of data science and analysis. The idea for this project came from a stakeholder meeting. The project was prioritised because:
the data was available to make it a feasible project to deliver quickly, and,
the importance of UWW treatment to the protection of human health and water quality indicated that it would be a project that would have high environmental impact.