Data stories

Stories about my data science journey! Join me :)

Data science

Anything related to data science & machine learning

The Complete 2026 Data Science Roadmap for Mastering AI Agents era

Posted on January 22, 2026 (updated January 23, 2026) in Data science, Generative AI

The 2026 Data Science Roadmap is your guide to evolving from a coder into an AI Architect and mastering AI agents. AI agents are now performing tasks that used to require human Data Scientists, making ...

Essential Features in AI Products

Posted on February 17, 2024 (updated November 17, 2024) in AI product, Data science, Generative AI

In this article, we will dive into the characteristics of AI products, their most important features, and how they relate to various data product types. Compared to other data products, such as analytical ones, AI products have ...

Install Spark & Pyspark on Windows

Posted on October 26, 2021 (updated November 17, 2024) in Data science, Python, Spark

Spark is a parallel data processing framework used for big data analytics. It is an open-source project written in Scala. It supports streaming data analysis, SQL, machine learning, and graph processing. ...

Google computing options and how to choose them?

Posted on February 14, 2021 (updated January 21, 2026) in Data science

GCP offers multiple compute options, and selecting the one that fits your needs is essential for both performance and cost savings. The available compute options in GCP are: Compute Engine, Google Kubernetes ...

How to select the most suitable Google Cloud Storage solution?

Posted on January 16, 2021 (updated January 21, 2026) in Data science

Google offers many cloud services, and finding the one that matches your requirements can be confusing. Let’s just focus on Google Cloud Storage for now. The storage solutions offered by Google ...

Object oriented programming (OOP) in Python

Posted on November 20, 2020 in Data science, Python

Object-oriented programming (OOP) is a programming paradigm for building stable applications and powerful systems. It makes rapid changes in complex systems easier to handle. Classes are a blueprint of the needed functionality, which is described by ...

Some pandas functions that you should start using now!

Posted on November 14, 2020 in Data science, Python

Pandas is a popular library for exploratory analysis and data wrangling in Python. It has a lot of functions and methods. Here I list some functions that you may not know or don’t ...

© 2026 Nonlinear data. All Rights Reserved