Blog Posts & Articles

Sharing insights on data engineering, gen ai, and open source

Infinite Lambda | Dat Nguyen

Data Diff Validation in Blue-Green Deployments

During a blue-green deployment, there are discrepancies between environments that we need to address to ensure data integrity. This calls for an effective data diff solution that complements existing frameworks and improves the data validation process. If we can do that with zero budget, that is even better.

#Snowflake #dbt #BlueGreen #Deployment #DataDiff
🕥 4 min read Read Article
Medium @ datnguye

TSQL — How to convert a number to words

With raw SQL, it is 100% possible to convert a number to words in a paticular language. All we need is to have the algorithm. Let’s pick the English one — which is the most popular language.

#TSQL #Number #ToWord
🕥 2 min read Read Article
Medium @ datnguye

Understanding the Columnstore Indexes in SQL Server

Beginning with the version of 2016 (13.x) SP1, the “new” feature called Columnstore Indexes has been added to SQL Server all editions (before that, this feature was only available in the editions: Enterprise, Developer) to enhance the analytical querying performance and to optimize the way of storing indexes.

#TSQL #Columnstore #Index
🕥 7 min read Read Article
Medium @ datnguye

Generate mock/fake data with TSQL

When you don’t know where to get the data from? When you have to wait for data provided by another team but you couldn’t? When you’re trying to go typing the mock data manually — it would hences take too much time to do?

#TSQL #Mock #Data
🕥 6 min read Read Article
Medium @ datnguye

Ways to download file from FTP using TSQL

There are many many ways to play with FTP download, but when coming to TSQL level, it’s kind of “a tiny wave” in the brain that how to do it natively.

#TSQL #FTP #Download
🕥 6 min read Read Article
Medium @ datnguye

Using python script and pip packages in SQL Server

We all know that R language is great in SQL Server Machine Learning service. Problem comes with “I don’t know R, but Python!” — “No worries, that’s supported as well”, MS said.

#TSQL #Python #packages
🕥 4 min read Read Article
Medium @ datnguye

Using TSQL to send SendGrid email

As the previous article about How to call API by using TSQL?, we hence can use it to call SendGrid API to send email.

#TSQL #Sendgrid #Notification #API
🕥 3 min read Read Article