Welcome to Data Science Blog!

Manamath Nath - Ramayana Corpus

less than 1 minute read

Manmath Nath - Ramayana Corpus Corpus Introduction Corpus License Bala Kanda Ayodhya Kanda Aranyaka Kanda Kishkinddha Kanda Sunder Kanda ...

KM Ganguli Mahabharat Corpus

less than 1 minute read

KM Ganguli Mahabharat Corpus Adi Parva (The Book of the Beginning) Sabha Parva (The Book of the Assembly Hall) Vana Parva or Aranyaka-Parva (The Bo...

AI Usecases in Government

2 minute read

AI Usecases in Government Preliminary Work before any AI Project with Government Keeping technology evolution, cost of latest technologies, government exp...

AI in School Education

10 minute read

AI in School Education Introduction In the ever-evolving landscape of education, a technological revolution is quietly reshaping the way students learn, t...

Data Science and Basics of Astrology

15 minute read

Basics of Jyotish Introduction In this article I am going to discuss the basics of astrology and the data science aspect of astrology. I am defending here ...

Summary of Life Changing Selfhelp Books

less than 1 minute read

Download Link to this Diary The Inspirational Leader by Gifford Thomas The 5 Elements of Effective Thinking by Edward B. Burger How to Listen by ...

Topic Modeling with BERT

1 minute read

Topic Modeling with BERT Key steps in BERTopic modelling are as following. Use “Sentence Embedding” models to embed the sentences of the article Red...

Graph of Thoughts

2 minute read

Graph of Thoughts This is a valuable resource for learning Graph of Thoughts (GoT) concepts. The YouTube video is from code_your_own_AI. I’m utilizing the...

Basics of Word Embedding

11 minute read

Basics of Word Embedding What is Context, target and window? The “context” word is the surrounding word. The “target” word is the middle word. The ...

Compressing Large Language Model

2 minute read

Compressing Large Language Model Why to compress an LLM? Large Language Models have a large number of parameters, which contributes to their performance bu...

LaTeX Capabilities

5 minute read

In the realm of document typesetting and preparation, LaTeX stands as a timeless giant, revered by professionals, researchers, students, and publishers ali...

What is Pinecone

7 minute read

What is pinecone? Pinecone is a managed vector database that provides vector search (or “similarity search”) for developers with a straightforward API and ...

ML Model Development Framework

1 minute read

ML Model Development Framework & Model Repositories Introduction There are hundreds of machine learning tasks. To do these tasks there are thousands o...

ML Model Respository from Pinto0309

30 minute read

ML Model Repository from Pinto0309 Introduction Using AI we can solve many kinds of tasks for this input can be text, structured data, image, video, audio...

Python APIs for Data

1 minute read

Python APIs for Data Bing Bing is a search engine that brings together the best of search and people in your social networks to help you spend less time s...

Distances in Machine Learning

2 minute read

Distances in Machine Learning Every sample, record, word, sentence, object, image etc in the Machine learning language is called vector. If we want to mea...

Paper with Code Resources

27 minute read

Paper with Code Resources Trending Papers of 2021 ADOP: Approximate Differentiable One-Pixel Point Rendering — Rückert et al — https://paperswithcode....

Important AI Paper List

25 minute read

Introduciton In almost all citations it becomes very difficult to read the title of research papers. Why? Because the contributors’ information is first a...

Machine Learning Metrics

19 minute read

Machine Learning Metrics Introduction In Machine Learning projects whether classical machine learning, deep learning, computer vision, speech processing,...

What is LLM

20 minute read

What is Large Language Model Introduction LLM stands for Large Language Model. It is a type of artificial intelligence (AI) model that is trained on a mas...

How to do Literature Review

5 minute read

How to Conduct Literature Review? Introduction Literature Review (LR) or Literature Survey (LS) is a process that helps you to browse the libraries, liter...

NLP Tasks

45 minute read

NLP Tasks Introduction Processing words of any language and driving some meaning from these is as old as the human language. Recently, AI momentum is taki...

SQL and Relational Algebra

2 minute read

SQL and Relational Algebra Relational algebra (RA) is considered as a procedural query language where the user tells the system to carry out a set of oper...

Types of Questions

6 minute read

Types of Questions Introduction Question-Answering task is one of the tasks in NLP-Task. To create a high-performing AI system that can understand the que...

Google Cloud APIs

12 minute read

Google Cloud APIs Introduction Hundreds of services from Google are available to consumers as API. Every API has a specific purpose. Over a period of time...

Model Tuning with VertexAI

3 minute read

Tuning Large Language Model with VertexAI Why Model Tuning? Tuning is required when you want the model to learn something niche or specific that deviates f...

Introduction to Prompt Engineering

10 minute read

Introduction to Prompt Best Engineering Prompts can contain questions, instructions, contextual information, examples, and partial input for the model to ...

Introduction to ML Model Deployment

11 minute read

Introduction to AI Model deployment Big Players Amazon Amazon has many products and one of their product is AWS Cloud. Under this product th...

AWS SageMaker Jumpstart Models

34 minute read

AWS SageMaker Jumpstart Models As of 17-Jul-23, AWS Sagemaker has 463 models in its Model Zoo. They call these models as Jumstart Models. What are the capa...

Python Decorator Function

4 minute read

Python Decorator Function What is Decorator Function in Python In Python, a decorator is a special type of function that allows you to modify or extend t...

Embedding with FastText

8 minute read

Embedding with FastText What is Embedding? What are Different Types of Embedding What is FastText? FastText is an open-source library for efficient lea...

Python Naming Convention

2 minute read

Python Naming Convention UPPERCASE / UPPER_CASE_WITH_UNDERSCORES => module-level constants lowercase / lower_case_with_underscores => for varia...

Sorting Algorithm A Summary

7 minute read

Sorting Algorithm A Summary Introduction Sorting is a fundamental operation in computer science and plays a vital role in various applications. Whether it’...

What is CAPTCHA?

1 minute read

What is CAPTCHA? CAPTCHA stands for “Completely Automated Public Turing test to tell Computers and Humans Apart.” It is a security mechanism used by websi...

What is GAN Architecture?

4 minute read

What is GAN Architecture? Generative Adversarial Networks (GANs) are a powerful class of neural networks that are used for unsupervised learning. It was de...

Capabilities of AI Transformers

28 minute read

Capabilities of AI Transformers Background Whether GPT, ChatGPT, DALL-E, Whisper, Satablity AI or whatever significant you see in the AI worlds nowdays it...

Model Garden of VertexAI

15 minute read

Model Garden of VertexAI: Unlocking the Power of Google’s VertexAI: Exploring the World of Pre-Built Models for AI Tasks Introduction:

Demystifying DevOps, MLOps, and DataOps

8 minute read

Demystifying DevOps, MLOps, and DataOps: Bridging the Gap between Software Development, Machine Learning, and Data Managemen Introduction What is DevOps...

All Resources to Learn Data Science

3 minute read

All Resources to Learn Data Science Introduction Welcome to the AI ML Resources category page, where you’ll find a wealth of knowledge on various topics ...

A Comprehensive Guide to 210+ AWS Services

13 minute read

A Comprehensive Guide to 210+ AWS Services Exploring 210+ Cloud Services and Their Purposes All these services are availalbe at Link Introduction Amazon ...

God Fathers of AI

6 minute read

God Fathers of AI In other fields of studies or in religion, there is only one god or only one godfather. But in the field of AI, that is not the case. Th...

Business Usecases of GPT

8 minute read

Business-Usecases-of-GPT Introduction You will not lose your job because of AI, but you may lose it because you didn’t learn how to use AI in your job.

The Interconnectedness of Life and Data

12 minute read

The Interconnectedness of Life and Data Introduction While reading below keep your mind open. If you can keep your religious or even scientific informati...

Types of Machine Learning

37 minute read

Types of Machine Learning Introduction Machine learning is a field of artificial intelligence that focuses on developing algorithms that can learn from da...

Linux OS Directories

2 minute read

Linux OS Directories Linux OS Folders and the purpose. Introduction: Linux is a widely used operating system that is known for its flexibility and versa...

Types of Technologies

5 minute read

How Many Types of Technologies? Introduction Often we hear various jargon of different types of technologies and sometimes during the discussion, it becom...

Cognitive Biases

33 minute read

Cognitive Biases Introduction Cognitive biases are the systematic errors that occur when individuals deviate from rational decision-making. These biases a...

Books on Conciousness

51 minute read

Books on Conciousness Introduction If you are from the software industry and especially in a data scientist role on day to day basis. Then there are high...

Responsible AI

4 minute read

Responsible AI Introduction: Artificial Intelligence (AI) is rapidly transforming the way we live, work, and interact with the world around us. As AI sys...

Application of AI in BFSI

5 minute read

Application of AI in Banking, Finance, Security and Insurance (BFSI) Introduction The banking, financial services, and insurance (BFSI) sector has been a...

GPU for Data Science Work

6 minute read

GPU for Data Science Work What is the difference between microprocessor (CPU) and GPU? A microprocessor and a GPU (graphics processing unit) are both type...

Type of Databases

9 minute read

What are the various types of databases? Introduction In the 21st Century, Data is the real oil of machines. There are different kinds of oils and there ...

AI Usecases in Agriculture Industry

2 minute read

AI Usecases in Agriculture Industry Introduction In the today world where energy saving, climate change, cost and process optimization, effectiveness is t...

AI Use Cases in Food Processing

2 minute read

AI Use Cases in Food Processing Introduction The food processing industry is a vital sector in the global economy, responsible for providing safe and nutr...

Will AI Replace Human?

5 minute read

Artificial Intelligence can Replace Human? The Success of ChatGPT has taken the world by storm. Unless you are not living in some cave you already know wh...

Timeseries Interview Questions

24 minute read

Timeseries Interview Questions What are the characterstics of time series data? Time series data is a series of data points collected over time. Some cha...

GPT Usecases

4 minute read

What is GPT? GPT is a transformer. Don’t confuse it with your electricity transformer! In Artificial Intelligence there are different kinds of neural netw...

ChatGPT Usecases

7 minute read

What is ChatGPT? ChatGPT is general purpose - “chat model” from OpenAI. It is a language model, which means if you type some text then it can understand an...

What is Computer Vision

11 minute read

What is Computer vision? Background In the digital world, scientists are working hard to create machines and robots that can interact with humans the way ...

The Science of Reasoning

11 minute read

The Science of Reasoning About Reasoning Reasoning is a unique ability in humans. Whatever civilizational advancement we see, it is because of our abilit...

What is NLP?

12 minute read

What is NLP? Humans interact with their surroundings using different kinds of inputs. Eyes deal with inputs of color, shape, and size. Ear deals with input...

Domain Knowledge in Machine Learning

1 minute read

Domain Knowledge in Machine Learning Let’s say the domain is a restaurant kitchen. A dataset with 3 variables. Two predictors and one predicted. Predictor...

Confusion Matrix Bayesian Theorem

less than 1 minute read

If you are like me then you must have struggled enough to understand the confusion matrix or still struggling to understand the metrics of this confusion m...

Folder Structure for ML Project

12 minute read

Directory Structure for ML Project is critical for any serious data science project. What we learn from a technology college and institution is useful in a...

Generalized AI Model for Prediction

5 minute read

Can we really Develop AI solutions that can predict human behavior? If you are not a technical person then don’t get overwhelmed by the next paragraph, you...

20 Reasons Why AI Project Fails

6 minute read

Contents Why Does AI Project Fails? Why Does AI Project Fails? I have been in IT project management for 2+ decades. Being in the industry for quite a...

What Are Transformers in AI

37 minute read

What Are Transformers in AI Transformer Architecture Background Whether GPT, ChatGPT, DALL-E, Whisper, Satablity AI or whatever significant you see in t...

AI ML Resources from My Diary

41 minute read

AI ML Resources from My Diary This is my personal diary which contains resources, which I know, learned or people have told me to experiment with. I start...

Data Scientists and AI, ML Researchers

1 minute read

Data Scientists and AI, ML Researchers For the initial list, I have taken content from github. I have expanded this and will keep updating this in the fut...

Datasets

18 minute read

150+ Machine Learning Datasets Introduction: Without Data there is no Machine Learning, no AI, no Deep Learning. Because of heavy automation, IOT devices...

My Daily Tools

less than 1 minute read

My Daily Tools My Daily Tools. It is an excel sheet and I keep updating this, as and when I find some new tool.

Best Resources to Learn Python

1 minute read

Best Resources to Learn Python Best Python Resources Learn Python the Hard Way Python Practice Book — Python Practice Book Making Games with Python...

High School Maths for Data Science

9 minute read

High School Maths for Data Science Algebra I How to graph an ordered pair How to find the equation of a circle How to find the equation of a curve ...

Mathematics for Data Scientist

1 minute read

Mathematics for Data Scientist To excel in the field of data science, especially as a data scientist, I would recommend you have good command over the top...

How Naive Bayes Classifier Works

less than 1 minute read

How Naive Bayes Classifier Works? Naive Bayes classifier example In this presentation, I am not going into the depth of the Naive Bayes algorithm. I am a...

Top 10 Technologies of Future

less than 1 minute read

Top 10 Technologies of Future Top 10 Technologies of Future Augmented Reality : Helps surgeons see inside their patients Help workers operating in dan...

EDA & Feature Engineering 101

19 minute read

Contents What is EDA? Importance of EDA EDA Stage 1: Basic Exploration EDA Stage 1.1: Understanding the dataset as a file. EDA Stage 1...

DS, AI, ML Online Course, Tutorial, Videos

5 minute read

DS, AI, ML Online Course, Tutorial, Videos Courses Machine Learning – Stanford by Andrew Ng in Coursera (2010-2014) Machine Learning – Caltech by Yase...

Data Science Interview Question Answers

6 minute read

Data Science Interview Question Answers Thousands of interview questions on various topic related to Machine Learning, Deep Learning, Computer Vision, NLP...

What is XAI?

4 minute read

What is XAI? XAI in Simple Language! The disciple of Data Science and AI has brought many terms in the boardroom for discussion, which looks complicated....

100+ High Level AI Usecases

17 minute read

100+ High Level AI Usecases Nowadays we are listening and reading a lot about AI and its role in shaping our present and future. But many people, sometime...

Dealing with Sensitive Data

2 minute read

Dealing with Sensitive Data Introduction One of the biggest problems for a Data Science project team is to protect the data. We know, data is the basic ra...