Activity
Mon
Wed
Fri
Sun
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
What is this?
Less
More

Memberships

Data Innovators Exchange

Public • 322 • Free

14 contributions to Data Innovators Exchange
Data Innovator Community Meetups
To all Data Innovators, Where do you want to see the next Data innovator events? Due to the huge success of the community meetups in London and Munich we want to keep them coming. In these events we got very good feedback on the focus of the workshops, so we are going to keep the focus on them and stay with the 1 day format with: - Success story - 2 Workshops - Panel discussion The team currently is brainstorming locations and would love to hear your thoughts on this 🙂 So please let us know in the comments where you want a Data Innovators Event to take place and we will give our best to make it happen 🙂 Thank you for your input!
6
9
New comment Oct 10
1 like • Oct 4
@Martin Šafránek If that was in November, this would be a wine and a goose🤷‍♂️
1 like • Oct 7
@Paul Barlow Yes!
Data Modeling in the Modern Data Stack
Data modeling is crucial due to the increased number of data sources, diverse data consumers, and the need for optimization (both speed and cost). Mike Kahan briefly showed different modeling approaches, e.g., Star Schema and Data Vault 2.0. At the end, he delves into hybrid approaches. Have you used hybrid approaches before? If so, which modeling approaches did you mix together and how?
2
1
New comment Sep 20
Data Modeling in the Modern Data Stack
2 likes • Sep 20
Interesting one. Many years ago I was, as a data engineer, on a project where the intention was to create EDW with core in 3NF model and data marts in Star - dimensional model. Basically an Inmon's idea of a data warehouse. We ended up with: 1. having biggest transactional table in core as a Fact and a Dimension, rest in 3NF'ish 2. Data marts with Star'ish model, but some dimension referenced each other directly (no via fact table) This was far from perfect modeling (and not only modeling), but business was extremely happy having EDW delivery value, so in general it was a great success. :)
Recommendations Needed: Best Resources for Learning Data Governance Strategies
Hi everyone! I'm looking for recommendations on where to learn more about Data Governance, especially from a strategic and requirements perspective, rather than just technical-focused content. Any good courses, training programs, or resources that dive deeper into governance frameworks and best practices? I’d appreciate your insights!
3
6
New comment Sep 18
0 likes • Sep 17
For all data management aspects, where data governance is part of, I would think of www.dama.org
0 likes • Sep 18
@Lorenz Kindling Not personally, but I know that they are quite active as an organization. I own the book DAMA-DMBOK. It is really set of principles for data management (like TOGAF for architecture, or ITIL for IT Management) - so quite boring read :) AI Assistant would give you probably something shorter and equally good - to start with. 🤫
Is hashing good enough for anonymising data?
https://www.rnz.co.nz/news/business/527419/inland-revenue-giving-thousands-of-taxpayers-details-to-social-media-platforms-for-ad-campaigns?fbclid=IwY2xjawFLSwJleHRuA2FlbQIxMAABHfiQoZd2lKNuLPKWDo5IrGSrtYtTKNwWBrS0kfJBtccVTTWP9FPKrjY3zg_aem_jp9YxiznNYVeVO5Oo9wqfA
5
7
New comment Sep 18
Is hashing good enough for anonymising data?
1 like • Sep 9
Seems right. What are alternatives?
0 likes • Sep 18
@Shane Gibson this is an operational solution. I what would be the technical better solution, something like “salted hashing”
Tools Data Engineers need to know in 2024
There are so many tools out there, but I found this video on YouTube. It did a good job of breaking down the essentials. Here's a quick list of the tools mentioned: - Basics: SQL, Python, Linux, bash scripting, network understanding - Technical Basics: Git, SFTP, PGP - Databases: PostgreSQL, MySQL, MongoDB - Data Platforms: Snowflake, Databricks, BigQuery, Redshift, Azure Synapse Analytics - Orchestration, ETL & Data Pipelines: Airflow, Dagster, SSAS, Azure Data Factory - Cloud: AWS, Azure, GCP - Others: Docker, Kubernetes, Terraform If you had to give advice on where to start, what would that be? What are your favored tools?
6
5
New comment Sep 17
Tools Data Engineers need to know in 2024
3 likes • Sep 15
I would add to this “data storage formats”: table, json, parquet, delta, iceberg
2 likes • Sep 17
@Lorenz Kindling Correct. I think that for each person that works with data, on each level of data stream, it is important to be at least familiar with basic modeling terminology and concepts.
1-10 of 14
Jaroslaw Syrokosz
3
8points to level up
@jaroslaw-syrokosz-8262
⭐️Analytics & Modeling Engineer | Value-Driven Data Consultant ⭐️Certified in: DataVault 2.0, Snowflake, DBT, Azure, Airflow, Teradata, MicroStrategy

Active 19h ago
Joined Jul 28, 2024
EU
powered by