Schedule

Data disruption cannot go without IT disruption (Dutch spoken)

How will IT perform if the complexity dramatically increases? What can we actually learn in the future from all the experiences we have gained? What do we have to change in order to participate in this increasingly data-driven economy in which digital transformation is the magic word for everyone? This keynote addresses this issue, and discusses recommendations on how IT specialists and IT management need to change to be able to address the actual data disruption: the IT disruption.

The data-driven organization, digital transformation, and the data economy are all hot topics in board rooms. They all imply that the role of data changes. Organizations want to use data more widely, more effectively, and more efficiently: a real data disruption. However, data disruption clearly raises the bar for developing IT systems, because it leads to more complex IT systems, involving, for example, AI, sensor technology, and real-time analytics. Luckily, so much high-tech technology is available that we can build almost whatever the business needs. So, the technology is ready, but is IT itself ready? You would think so. IT has more than fifty years of experience in data modeling, data architectures, data strategies, data warehouses and databases. Yet if we look at our track record for developing more traditional IT systems, we have to conclude that some of our projects have not been delivered on time or within budget, and have sometimes been cancelled completely. So, how will IT perform if the complexity dramatically increases? What can we actually learn in the future from all the experiences we have gained? What do we have to change in order to participate in this increasingly data-driven economy in which digital transformation is the magic word for everyone? This keynote addresses this issue, and discusses recommendations on how IT specialists and IT management need to change to be able to address the actual data disruption: the IT disruption.

  • How good is our track record with respect to developing IT systems?
  • Why don’t we deploy more code generators and self-driving technology?
  • Has data modeling actually changed in the last 30 years?
  • How should the IT specialist change to be ready for the data disruption?
  • The importance of in-depth IT knowledge at the top of the organization.

Customer insights from EWALS and AEGON and how they continuously innovate with data

At Qlik, we have made it our mission to help organizations accelerate business value through data. To lead in the digital age, where real-time decisions are critical, you need robust data pipelines to effectively access and analyse the latest and most accurate data. Qlik offers a data integration and analytics platform that continuously ingests all your data changes, automates data warehouses, and manages data lakes to provide true data insight across your organization with world-class analytics.

Jaap-Willem Verheij van Wijk will open the session with an introduction to Qlik and then share customer insights, including how EWALS Cargo Care has continuously innovated with data to stay ahead of the curve. See how they have solved the latest data challenges in logistics and how they automate the data warehouse lifecycle. Jaap will also share how AEGON has solved its manual data stream issues and can now open up multiple data sources with a small team that supports the agility of its business demands.


Data Driven: more than just technology (Dutch spoken)

APG is the largest pension provider in the Netherlands and sees data as a crucial asset for current and future business operations. APG also wants to be a leader as a pension administrator and investor, which has led to the earmarking of data as a strategic asset. In this session Tim Schulteis tells the integrated story of the journey made in the recent past. He will discuss the development of an appropriate architecture, the building of the right knowledge and skills, and the challenges of collaboration across business units.

APG is the largest pension provider in the Netherlands and sees data as a crucial asset for current and future business operations. The government is increasingly withdrawing from providing sufficient retirement income (raising the state retirement age, decreasing pension accrual), so insight into participants’ personal situations and offering them options for action are crucial. APG also wants to be a leader as a pension administrator and investor.

All this has led to the earmarking of data as a strategic asset. But how do you get from that ambition to execution along the various axes of technology, capability, culture and organization? This presentation tells the integrated story of the journey made in the recent past: what went well, what didn’t, what we learned from it, and where we are now.
The session discusses the development of an appropriate architecture, the building of the right knowledge and skills, the combination with modern ways of working, and the challenges of collaboration across business units:

  • Business drivers
  • Architecture and knowledge
  • Culture and agility
  • Organization and sponsorship
  • Critical success factors

The Power of Combining Machine Learning Models - The Risks and Rewards of Random Forests, XGBoost and other Ensembles

Ensembling is one of the hottest techniques in today’s predictive analytics competitions. Every single recent winner of Kaggle.com and KDD competitions used an ensemble technique, including famous algorithms such as XGBoost and Random Forest. This session will provide a detailed overview of ensemble models, their origin, and show why they are so effective.

Ensembling is one of the hottest techniques in today’s predictive analytics competitions. Every single recent winner of Kaggle.com and KDD competitions used an ensemble technique, including famous algorithms such as XGBoost and Random Forest.
Are these competition victories paving the way for widespread organizational implementation of these techniques? This session will provide a detailed overview of ensemble models, their origin, and show why they are so effective. We will explain the building blocks of virtually all ensemble techniques, including bagging and boosting; a brief illustrative sketch follows the list below.

What You Will Learn:

  • What are ensemble models and what are their advantages?
  • Why are ensembles in the news?
  • The two most influential ensembling approaches: bagging and boosting
  • The core elements of ensembles and their application
  • The challenge of applying competition strategies to organizational problems.
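
As a brief illustration of the two approaches named above (not part of the session material), the Python sketch below trains a bagged tree ensemble, a Random Forest, and a gradient-boosted ensemble on synthetic data with scikit-learn; the dataset and parameters are assumptions chosen only for the example.

    # A minimal sketch, assuming synthetic data: bagging versus boosting with scikit-learn.
    from sklearn.datasets import make_classification
    from sklearn.ensemble import (BaggingClassifier, GradientBoostingClassifier,
                                  RandomForestClassifier)
    from sklearn.model_selection import cross_val_score

    # Synthetic classification data, a stand-in for a real business dataset.
    X, y = make_classification(n_samples=2000, n_features=20, random_state=0)

    # Bagging: many trees trained on bootstrap samples; their predictions are averaged.
    bagging = BaggingClassifier(n_estimators=100, random_state=0)

    # Random Forest: bagging plus a random subset of features considered at every split.
    forest = RandomForestClassifier(n_estimators=100, random_state=0)

    # Boosting: shallow trees added one after another, each correcting the errors of the
    # ensemble built so far (XGBoost follows the same idea).
    boosting = GradientBoostingClassifier(n_estimators=100, learning_rate=0.1, random_state=0)

    for name, model in [("bagging", bagging), ("random forest", forest), ("boosting", boosting)]:
        scores = cross_val_score(model, X, y, cv=5)
        print(f"{name}: mean accuracy {scores.mean():.3f}")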

Model Driven Data Vault Automation with Datavault Builder

In this session Petr Beles gives a rapid introduction to putting the data modeler back in the driver's seat. See how your data model is implemented as a working solution in real time.

Modern Data Management & Data Integration (Dutch spoken)

The digital future: think big, think highly distributed data, think in ecosystems. What integration architecture is needed to play an important role in a digital world of ecosystems with FinTech companies and other banks? Do Enterprise Data Warehouses still have a role in this landscape?

The digital future: think big, think highly distributed data, think in ecosystems. What integration architecture is needed to play an important role in a digital world of ecosystems with FinTech companies and other banks? Do Enterprise Data Warehouses still have a role in this landscape? This is what the presentation is about. Piethein will also cover:

  • Core Data Integration patterns
  • Data Ownership and Data Governance
  • Metadata as the glue
  • Data Control when data is highly distributed
  • Data distribution in the Cloud

Making Self-Service Analytics Work: Organizational, Architectural, and Governance Issues

It is notoriously difficult to achieve the promise of self-service analytics. Wayne Eckerson will explain how to empower business users to create their own reports and models without creating data chaos. He will show how to build a self-sustaining analytical culture that balances speed and standards, agility and architecture, and self-service and governance.

Self-service analytics has been the holy grail of data analytics leaders for the past two decades. Although analytical tools have improved significantly, it is notoriously difficult to achieve the promise of self-service analytics. This session will explain how to empower business users to create their own reports and models without creating data chaos. Specifically, it examines seven factors for leading a successful BI program: right roles, right processes, right tools, right organization, right architecture, right governance, and right leadership. Ultimately, it will show how to build a self-sustaining analytical culture that balances speed and standards, agility and architecture, and self-service and governance.

You will learn:

  • Trends and business dynamics driving analytics adoption
  • The conundrum of self-service analytics
  • Success factors for leading a successful BI program
  • How to survive and thrive in the new world of big data analytics
  • How to increase user adoption and facilitate self-service

Data Governance and Architecture – Making the connections

Data Governance is one of the hottest topics in data management, focusing both on how Governance-driven change can enable companies to gain better leverage from their data and on how to help them design and enforce the controls needed to ensure they remain compliant with regulations. Despite this rapidly growing focus, many Data Governance initiatives fail to meet their goals. This session will outline why Data Governance and Architecture should be connected, how to make it happen, and what part Business Intelligence and Data Warehousing will play in defining a robust and sustainable Governance programme.

With data increasingly being seen as a critical corporate asset, more organisations are embracing the concepts and practices of Data Governance. As a result, Data Governance is today one of the hottest topics in data management, focusing both on how Governance-driven change can enable companies to gain better leverage from their data through enhanced Business Intelligence, Data Analytics and so on, and on how to help them design and enforce the controls needed to ensure they remain compliant with increasingly stringent laws and regulations, such as GDPR.

Despite this rapidly growing focus, many Data Governance initiatives fail to meet their goals, with only around one in five fully achieving expectations. Why is the failure rate so high? There are many factors, but one key reason is that implementing Data Governance without aligning it with a defined enterprise and data architecture is fraught with dangers. Linking Architecture with data accountability, a core principle of Data Governance, is essential.
This session will outline why Data Governance and Architecture should be connected, how to make it happen, and what part Business Intelligence and Data Warehousing play in defining a robust and sustainable Governance programme.

This talk will cover:

  • What is Data Governance and what it is not
  • Key reasons for Data Governance failure & disappointment
  • The key components of enterprise architecture – Business, Process and Data
  • The synergies between architecture and Governance – how do they reinforce each other?
  • How artefacts from both disciplines can be combined and applied to ensure success
  • The implications for Business Intelligence and Data Warehousing
  • Several use cases of successes and lessons learned

Cloud Data Warehousing: Planning for Data Warehouse Migration

Migrating an existing data warehouse to the cloud is a complex process of moving schema, data, and ETL. The complexity increases when architectural modernization, restructuring of database schema or rebuilding of data pipelines is needed. In this session Dave Wells provides an overview of the benefits, techniques, and challenges when migrating an existing data warehouse to the cloud.

Cloud data warehousing helps to meet the challenges of legacy data warehouses that struggle to keep up with growing data volumes, changing service level expectations, and the need to integrate structured warehouse data with unstructured data in a data lake. Cloud data warehousing provides many benefits, but cloud migration isn’t fast and easy. Migrating an existing data warehouse to the cloud is a complex process of moving schema, data, and ETL. The complexity increases when architectural modernization, restructuring of database schema or rebuilding of data pipelines is needed.

This session provides an overview of the benefits, techniques, and challenges when migrating an existing data warehouse to the cloud. We will discuss the pros and cons of cloud migration, explore the dynamics of migration decision making, and look at migration pragmatics within the framework of a step-by-step approach to migrating. The tips and techniques described here will help you to make informed decisions about cloud migration and address the full scope of migration planning.
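
As a brief illustration of one early step in such a migration (not part of the session material), the Python sketch below uses SQLAlchemy to take a schema inventory of a hypothetical legacy warehouse, so that its table definitions can be reviewed and adapted to the target cloud platform; the connection details and schema name are assumptions chosen only for the example.

    # A minimal sketch, assuming a hypothetical legacy source: inventory the schema to migrate.
    from sqlalchemy import create_engine, MetaData
    from sqlalchemy.schema import CreateTable

    legacy = create_engine("postgresql://user:password@legacy-dw-host/warehouse")  # assumed source

    metadata = MetaData()
    metadata.reflect(bind=legacy, schema="sales")  # read the table definitions from the source

    for table in metadata.sorted_tables:
        # Generic DDL; data types, distribution keys, and constraints usually still need
        # manual rework for the chosen cloud warehouse.
        print(CreateTable(table))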

You Will Learn:

  • The what and why of cloud data warehousing
  • The benefits and challenges of cloud data warehousing
  • Migration analysis and decision making
  • Technology roles in migration to the cloud
  • A step-by-step framework for data warehouse migration.

Data routes: combining data vault, ensemble modeling and data virtualisation (Dutch spoken)

This session explains how data vault, ensemble modeling and data virtualization can be used together in an efficient way. Key to this is the new concept of “data routes”. This concept proposes a data-oriented way of processing built on data vault, ensemble modeling and data virtualization.

Data vault, ensemble logical modeling, data virtualization and cloud are known to every BI or data warehouse specialist. But the big question is how you can use them together to develop real-life systems and make optimum use of the power and possibilities of each component. This session explains how they can all be used together efficiently. Key to this is the new concept of “data routes”. Within a data and analytics architecture, data routes serve as fuel for the virtual data presentation layer that is accessed by end users for all their data needs.

The concept proposes a data-oriented way of processing that builds on data vault, ensemble modeling and data virtualization. This decouples data and technology, shifting the emphasis to the characteristics of the data and the requirements set by the use cases. The result is offered as a virtual (semantic) data layer to a broad group of data users. With the help of data virtualization, a virtual data collection is built as a data portal for data users.

  • Does a Cloud Analytics platform offer a complete solution?
  • How does the Data Routes concept fit in an existing data architecture for Data & Analytics?
  • Is data modeling not necessary anymore?
  • How do data routes and data virtualization fit together?
  • From Ensemble logical modeling to data vault databases

Turning Data into Innovation - TIBCO Connected Intelligence for Enterprises

Integration and uniformity are cardinal aspects of modern data management. Which aspects are most essential in the information landscape?

Unifying data collection is necessary in order to ultimately make better decisions. Without uniformity, a decision culture is built on quicksand and gut feeling. In our contemporary information landscape, we see a growing need to integrate more and more: cloud services and storage, new sources of information, and API-led developments. This makes data management more relevant than ever. Which aspects are essential for today’s information landscape?

Data Virtualization, Data Quality, Reference Data Management, Master Data Management, and Metadata Management, as parts of Data Management, enable organizations to align their different data silos and improve their decisions.

Our central question is: “How can TIBCO support digital transformation initiatives in this?” Data is the foundation for operational excellence, customer intimacy, and Business Reinvention. TIBCO’s Unify portfolio is the cornerstone of data-driven initiatives for Operations, Data Governance, and Analytics.

  • What is the impact of data virtualization when using a data warehouse?
  • Do API-led transformations play a role in information sharing?
  • How can users gain faster insight into all company data?
  • What is the need for standardization of reference data?
  • What is the role of streaming analytics for a Data Science environment?

Cloud Database Systems in-depth: how do they work and how do they compare? (Dutch spoken)

There are several well-known vendors that offer cloud database systems, such as Amazon, Microsoft and Google. In recent years, however, we have seen new entries from companies that specialize in cloud services, such as Snowflake and Databricks. Peter Boncz offers an in-depth technical review and comparison of the existing and new-generation cloud database systems.

In the past five years, cloud database systems have truly broken through. The cloud makes it possible to convert upfront capital investments into operational costs, so that you only pay for the capacity you actually need, and you never have to worry about capacity problems. On top of that, cloud database systems take work off your hands, in the sense that the management of the database systems and the underlying hardware lies with the cloud provider. In times of staff shortages, that is another important factor behind the success of cloud database systems, one that often outweighs possible drawbacks around lock-in and concerns about privacy and security.

But once the decision has been made to move the database to the cloud, which one should you choose? There are already a great many cloud systems. Amazon offers, among others, Aurora, Redshift, Neptune and Athena. Microsoft has SQL Server and Cosmos DB. Google offers, among others, BigQuery. And new companies have entered the market that specialize in cloud services, such as Snowflake and Databricks.

To better understand the similarities and differences between all these new cloud systems, Peter Boncz will look at what is under the hood of these new systems. The various alternatives are technically dissected and compared with each other.

Some of the topics that will be covered:

  • An introduction to cloud data systems, with an up-to-date market overview of the main contenders
  • What the architecture of these different systems looks like in terms of query engine, data representation, elasticity and data partitioning
  • Which services are “serverless”, and what does that mean exactly?
  • Can cloud database systems optimize data automatically?
  • What is the economic model, and what are its further implications?
  • Which developments will follow in cloud database systems in the coming years? One example is data sharing in the cloud.

Data Preparation for Machine Learning: Why Feature Engineering Remains a Human-Driven Activity

In data preparation, what matters is how the human modeler creates a dataset that is uniquely suited to the business problem. In this session Keith McCormick will expose analytic practitioners, data scientists, and those looking to get started in predictive analytics to the critical importance of properly preparing data in advance of model building. He will present the critical role of feature engineering and explain how to do it effectively.

This session will expose analytic practitioners, data scientists, and those looking to get started in predictive analytics to the critical importance of properly preparing data in advance of model building. The instructor will present the critical role of feature engineering, explaining both what it is and how to do it effectively.

Emphasis will be given to those tasks that must be overseen by the modeler – and cannot be performed without the context of a specific modeling project. Data is carefully “crafted” by the modeler to improve the ability of modeling algorithms to find patterns of interest.

Data preparation is often associated with cleaning and formatting the data. While important, these tasks will not be our focus. Rather, the focus is on how the human modeler creates a dataset that is uniquely suited to the business problem; a brief illustrative sketch follows the list below.

You will learn:

  • Construction methods for various data transformations
  • The merits and limitations of automated data preparation technologies
  • Which data prep tasks are best performed by data scientists, and which by IT
  • Common types of constructed variables and why they are useful
  • How to effectively utilize subject matter experts during data preparation
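
As a brief illustration of this kind of modeler-driven work (not part of the session material), the Python sketch below constructs a few features from a hypothetical transaction table with pandas; the data, column names, and derived variables are assumptions chosen only for the example.

    # A minimal sketch, assuming a hypothetical transaction table: modeler-driven feature construction.
    import pandas as pd

    transactions = pd.DataFrame({
        "customer_id":  [1, 1, 2, 2, 2],
        "amount":       [120.0, 80.0, 15.0, 30.0, 45.0],
        "timestamp":    pd.to_datetime(["2019-01-03", "2019-02-10",
                                        "2019-01-15", "2019-01-20", "2019-03-01"]),
        "credit_limit": [1000, 1000, 200, 200, 200],
    })

    # Ratio feature: spend relative to the credit limit (domain knowledge, not in the raw data).
    transactions["utilization"] = transactions["amount"] / transactions["credit_limit"]

    # Date-derived features: algorithms rarely use raw timestamps directly.
    transactions["weekday"] = transactions["timestamp"].dt.dayofweek
    transactions["month"] = transactions["timestamp"].dt.month

    # Aggregate to the unit of analysis (one row per customer); which aggregates to build
    # depends entirely on the business question being modeled.
    customer_features = transactions.groupby("customer_id").agg(
        n_transactions=("amount", "size"),
        avg_amount=("amount", "mean"),
        max_utilization=("utilization", "max"),
    )
    print(customer_features)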

Modernizing Data Governance for the Age of Self-Service Analytics

Self-service analytics is not just about providing the “citizen” data scientist with as many data points as possible, combined with a flexible tool.

With the democratization of analytical capabilities and wider access to data, questions arise about governance and the regulatory and ethical compliance of data usage. Locking all data down is not the answer, as we would lose too much value.
The top-down, waterfall-like models of Data Governance 1.0 are not well suited to these new paradigms. The presentation focuses on the steps you need to take to get sustainable and compliant value out of your big data through (self-service) analytics.

You will learn:

  • How to adapt your data governance for the new ways of working
  • What is the distinction between Information and Big Data Governance
  • What should you put in place for properly governing self-service analytics
  • Increasing data literacy
  • Catering for the dynamics of data on-boarding and usage flows
  • Towards policy based classification and access
  • Use case governance vs Critical Data Elements

Best Practices in DataOps: Trends, Tips, and Techniques for Creating and Managing Modern Data Pipelines

The state of most data analytics pipelines is deplorable: too little automation; minimal reuse of code and data; and lack of coordination. The result is poor quality data delivered too late to meet business needs. DataOps is an emerging approach for building data pipelines and solutions. In this session Wayne Eckerson will explore trends in DataOps adoption, challenges and best practices.

When it comes to data analytics, you don’t want to know “how the sausage is made.” The state of most data analytics pipelines is deplorable. There are too many steps; too little automation and orchestration; minimal reuse of code and data; and a lack of coordination between stakeholders in business, IT, and operations. The result is poor quality data delivered too late to meet business needs.

DataOps is an emerging approach for building data pipelines and solutions. This session will explore trends in DataOps adoption, challenges that organizations face in implementing DataOps, and best practices in building modern data pipelines. It will examine how leading-edge organizations are using DataOps to increase agility, reduce cycle times, and minimize data defects, giving developers and business users greater confidence in analytic output.
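
As a brief illustration of one DataOps practice (not part of the session material), the Python sketch below adds an automated data quality gate to the load of a hypothetical orders feed, so that defective batches are stopped before they reach business users; the file, column names, and rules are assumptions chosen only for the example.

    # A minimal sketch, assuming a hypothetical CSV orders feed: an automated quality gate
    # of the kind a DataOps pipeline runs on every load.
    import pandas as pd

    def validate_orders(df: pd.DataFrame) -> list:
        """Return a list of detected problems; an empty list means the batch may be published."""
        problems = []
        if df["order_id"].duplicated().any():
            problems.append("duplicate order_id values")
        if (df["amount"] < 0).any():
            problems.append("negative order amounts")
        if df["order_date"].isna().any():
            problems.append("missing order dates")
        return problems

    def load_orders(path: str) -> pd.DataFrame:
        df = pd.read_csv(path, parse_dates=["order_date"])
        problems = validate_orders(df)
        if problems:
            # In a real pipeline this would alert the team and block the publish step.
            raise ValueError(f"rejecting batch {path}: {problems}")
        return df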

You will learn:

  • What is DataOps and why you need it
  • The dimensions of DataOps
  • The state of DataOps adoption
  • DataOps best practices and challenges

Managing and exploring data using a data lake and an analytics lab (Dutch spoken)

ASML, manufacturer of machines for the production of semiconductors, is implementing a central data lake to capture its data and make it accessible for reporting and analytics in a central environment. The data lake environment also includes an analytics lab for detailed exploration of data. In this session Jeroen Vermunt presents real-life examples of how ASML approaches the challenges of managing rapidly changing data.

In the highly complex world of semiconductor manufacturing, vast amounts of highly varied data are generated every day. ASML, the world-leading manufacturer of machines for the production of semiconductors (chips), is implementing a central data lake to capture this data and make it accessible for reporting and analytics in a central environment. The data lake environment also includes an analytics lab for detailed exploration of data. Managing all this rapidly changing data imposes some very challenging requirements. In this session, real-life examples of how ASML approaches these challenges are presented.

  • How do business users and data scientists discover information in the data lake without drowning in the amount and complexity?
  • How can users be enabled to understand the data they want to consume, including where the data comes from (data lineage)?
  • How do we make sure that the data can be trusted?
  • How can ASML control and monitor that access to data is secure?
  • What added value can the analytics lab bring?

Ten practical guidelines for modern data architectures (Dutch spoken)

Often, existing data architectures can no longer keep up with the current “speed of business change”. As a result, many organizations have decided that it is time for a new, future-proof data architecture. However, this is easier said than done. In this session, ten essential guidelines for designing modern data architectures are discussed. These guidelines are based on hands-on experiences with designing and implementing many new data architectures.

Many IT systems are more than twenty years old and have undergone numerous changes over time. Unfortunately, they can no longer cope with the ever-increasing growth in data usage in terms of scalability and speed. In addition, they have become inflexible, which means that implementing new reports and performing analyses has become very time-consuming. In short, the data architecture can no longer keep up with the current “speed of business change”. As a result, many organizations have decided that it is time for a new, future-proof data architecture. However, this is easier said than done. After all, you don’t design a new data architecture every day. In this session, ten essential guidelines for designing modern data architectures are discussed. These guidelines are based on hands-on experiences with designing and implementing many new data architectures.

  • Which new technologies are currently available?
  • What is the influence on the architecture of e.g. Hadoop, NoSQL, big data, data warehouse automation, and data streaming?
  • Which new architecture principles should be applied nowadays?
  • How do we deal with the increasingly paralyzing rules for data storage and analysis?
  • What is the influence of cloud platforms?

Limited time?
Can you only attend one day? It is possible to attend only the first or only the second conference day, as well as the full conference, of course. The presentations by our speakers have been selected in such a way that they stand on their own. This enables you to attend the second conference day even if you did not attend the first (or the other way around).

2 July

09:30 - 09:45 | Opening by the chairman
Room 1    Rick van der Lans
09:45 - 11:00 | Data disruption cannot go without IT disruption (Dutch spoken)
Room 1    Rick van der Lans
11:15 – 11:45 | Customer insights from EWALS and AEGON and how they continuously innovate with data
Room 1    Jaap-Willem Verheij van Wijk
11:45 - 13:00 | Data Driven: more than just technology (Dutch spoken)
Room 1    Tim Schulteis
11:45 - 13:00 | The Power of Combining Machine Learning Models – The Risks and Rewards of Random Forests, XGBoost and other Ensembles
Room 2    Keith McCormick
12:30 - 13:30 | Lunch break
Plenary, Room 1 
13:30 - 13:45 | Model Driven Data Vault Automation with Datavault Builder
Room 1    Petr Beles
14:00 - 15:00 | Modern Data Management & Data Integration (Dutch spoken)
Room 1    Piethein Strengholt
14:00 - 15:00 | Making Self-Service Analytics Work: Organizational, Architectural, and Governance Issues
Room 2    Wayne Eckerson
15:15 - 16:15 | Data Governance and Architecture – Making the connections
Room 1    Nigel Turner
16:15 – 17:15 | Cloud Data Warehousing: Planning for Data Warehouse Migration
Room 1    Dave Wells
16:50 | Reception
 

3 July

09:30 - 09:45 | Opening by the chairman
Room 1    Rick van der Lans
09:45 - 11:00 | Data routes: combining data vault, ensemble modeling and data virtualisation (Dutch spoken)
Room 1    Antoine Stelma
11:15 – 11:45 | Turning Data into Innovation – TIBCO Connected Intelligence for Enterprises
Room 1    Lackó Darázsdi
11:45 - 13:00 | Cloud Database Systems in-depth: how do they work and how do they compare? (Dutch spoken)
Room 1    Peter Boncz
11:45 - 13:00 | Data Preparation for Machine Learning: Why Feature Engineering Remains a Human-Driven Activity
Room 2    Keith McCormick
12:30 - 13:30 | Lunch break
Plenary, Room 1 
14:00 - 15:00 | Modernizing Data Governance for the Age of Self-Service Analytics
Room 1    Jan Henderyckx
14:00 – 15:00 | Best Practices in DataOps: Trends, Tips, and Techniques for Creating and Managing Modern Data Pipelines
Room 2    Wayne Eckerson
15:15 - 16:15 | Managing and exploring data using a data lake and an analytics lab (Dutch spoken)
Room 1    Jeroen Vermunt
16:15 – 17:15 | Ten practical guidelines for modern data architectures (Dutch spoken)
Room 1    Rick van der Lans