---
categories:
- Data Science
date: '2023-12-04 14:40:04'
draft: false
preview: /social/08825573291b0d6f14c711f2ba28af13e26ba7557348f022676835b15573b427.png
tags:
- humour
title: Data Swamp
type: posts
url: /2023/12/04/data-swamp/
---
<!-- wp:indieblocks/like {"empty":false} -->
<div class="wp-block-indieblocks-like"><div class="u-like-of h-cite"><p><i>Likes <a class="u-url" href="https://snarfed.org/2023-12-03_51578">https://snarfed.org/2023-12-03_51578</a> by <span class="p-author">Ryan Barrett</span>.</i></p></div><div class="e-content"><!-- wp:quote -->
<blockquote class="wp-block-quote"><!-- wp:paragraph -->
<p>My dad has spent some of his retirement doing hobbyist machine learning projects. He heard the term “data lake” a while back and has taken to calling his datasets a “data swamp.” Feels like a terminology improvement the whole field could get behind.</p>
<!-- /wp:paragraph --></blockquote>
<!-- /wp:quote --></div></div>
<!-- /wp:indieblocks/like -->

<!-- wp:paragraph -->
<p>This is brilliant. I've not come across this term before, but I could definitely get behind using it to describe data that comes out of customer systems, as <a href="https://bsky.app/profile/cleverdevil.io/post/3kfom22ivtf2u">Jonathon says he already does</a> at his workplace.</p>
<!-- /wp:paragraph -->