Heiner is a Knowledge Engineer with a specialty in semantics. Before joining OSTHUS GmbH, he worked on his PhD at Siemens AG, where he developed a semantic model for clinical information that integrates clinical data with medical knowledge, and created semantic models for annotating unstructured medical data. Heiner has successfully demonstrated, in the context of patient data, how semantic technologies can be used to gather information from various sources, from relational databases to unstructured data such as text or images.
1. Why did you decide to do a webinar with Business Review Webinars?
In client projects I learned that semantics can help people structure and understand their data better. For many people, however, semantics is still only a buzzword. I want to help people understand the basics of semantics and its role within a data-driven industry.
2. How did you get into the industry?
After completing my studies in mathematics, with a focus on graph theory, I worked on my Ph.D. thesis at Siemens in the publicly funded research project Theseus MEDICO, where we explored how semantic technologies can enable knowledge-based access to clinical data such as radiological images or clinical reports.
For the past five months I have worked for OSTHUS as a knowledge engineer, mainly in the context of the Allotrope Foundation project, where semantic technologies are used to create a shared vocabulary and data structures for data on laboratory analytical processes and their results. As a consultant, I also help clients use semantic technologies in various data management projects.
3. What do you most enjoy about your role?
Most of our clients are new to semantic technologies. I like to help them understand why semantics is a key enabler for increasing the value of their data. Having a shared vocabulary and shared data structures is the first step towards many new ways of using data. I like to help them take that first step.
4. What has been your best holiday and where would you recommend visiting?
I like to go swimming and eat a lot of seafood on my holidays. Croatia is great for that, and I've spent my last four holidays there.
5. What motivates you?
I want to bring visions to real applications that have impact and solve business questions.
Life science data is a major asset for the biopharmaceutical and chemical industries. Its availability via electronic systems is a prerequisite for collaborative work and successful innovation. Currently, most laboratories have to deal with a multitude of data sources originating from different instruments, systems, sites and external resources, all with their own data formats. As a consequence, scientists often have to spend a lot of manual effort to gain access to the data they need, and IT teams struggle to maintain a large number of different IT solutions. Data analytics becomes an inefficient process with a high integration effort.
Integrating life science data is typically very time consuming. Why is that? Data integration projects are complex because they combine data from multiple sources, and these often have different data standards, different data formats, different semantics and different data quality. Data integration is characterized by a high degree of exception handling. Typical root causes are typographical errors introduced at data entry, fuzzy definitions of concepts, and inconsistent interpretation of data by different informatics systems. To integrate data efficiently you need in-depth knowledge of both IT and the science, a combination which is quite rare. So what can you do if you don't have these people readily available?
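To make the exception handling concrete, here is a minimal Python sketch of mapping records from differently formatted sources onto one shared set of field names. It is only an illustration: the field names, the alias table and the sample records are hypothetical and are not taken from any OSTHUS or Allotrope product.

```python
# Hypothetical alias table: source-specific field names -> shared vocabulary.
FIELD_ALIASES = {
    "sample_id": "sample_id",
    "sampleid": "sample_id",
    "sample": "sample_id",
    "conc": "concentration_mg_per_ml",
    "concentration": "concentration_mg_per_ml",
}


def normalize(record):
    """Return (clean_record, problems) for one raw source record."""
    clean, problems = {}, []
    for key, value in record.items():
        # Tolerate differences in case and spacing between sources.
        field = FIELD_ALIASES.get(key.strip().lower().replace(" ", "_"))
        if field is None:
            problems.append(f"unknown field: {key!r}")
            continue
        if field == "concentration_mg_per_ml":
            try:
                # Accept a decimal comma as well as a decimal point.
                value = float(str(value).replace(",", "."))
            except ValueError:
                problems.append(f"unparseable concentration: {value!r}")
                continue
        clean[field] = value
    return clean, problems


# Made-up records, as three different systems might export them.
raw_records = [
    {"Sample ID": "S-001", "Conc": "1,5"},
    {"sampleid": "S-002", "concentration": "n/a"},
    {"sample": "S-003", "pH": 7.2},
]
results = [normalize(r) for r in raw_records]
for clean, problems in results:
    print(clean, problems)
```

Even in this toy case, two of the three records need attention from someone who knows the science: one value cannot be parsed, and one field is missing from the shared vocabulary.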
One way to make these integration projects go more smoothly is to use reference architectures and data standards. We will show several use cases to illustrate how reference architectures can support you in data integration, data curation and data migration. One of these reference architectures addresses the migration of biopharmaceutical data using an integration layer on top of a data warehouse as part of a discovery data integration process. This architecture was used in a data integration project combining multiple heterogeneous data sources, all with different data formats and standards.
Central to this architecture is a data curation platform, the operational data store, which provides an easy-to-use entry point to scientific data management. It can be used by non-IT experts, giving your scientific data experts control of the data integration. During the webinar we will show you additional reference architectures and highlight how data standards can help you tackle these complicated projects.
Do you recognize these problems? Is your company dealing with a myriad of data sources, or are you losing too much time connecting data? Join us for this webinar, in which we will show you how data integration projects can be made easier.
Register here for OSTHUS Webinar.
For more information on our company and services visit www.osthus.com or contact us at [email protected]