Interactive Notebooks

Explore my collection of machine learning notebooks across Kaggle and Google Colab. From Bengali NLP models to advanced AI benchmarking, dive into hands-on implementations with real-world insights and performance analysis.

Total Notebooks: 9 (4 Kaggle, 5 Colab)
Total Views: 19,974
Total Likes: 1,266
Downloads: 891
Featured: 4

What You'll Discover

From cutting-edge Bengali NLP models to comprehensive AI benchmarking, explore real-world implementations with detailed analysis and performance insights.

Kaggle

Bengali NLP & LLMs

5

TigerLLM benchmarking, Bengali LLaMA analysis, tokenization strategies, and language model evaluation

Kaggle

Model Evaluation & Benchmarking

1

Comprehensive model testing, performance analysis, reality checks, and capability assessment

Colab

Deep Learning Fundamentals

3

Attention mechanisms, positional encoding, sequence-to-sequence models, and neural architecture

Colab

Practical Machine Learning

2

Customer churn prediction, stock forecasting, time series analysis, and real-world applications

Kaggle

Data Processing & OCR

2

Bengali text extraction, PDF processing, tokenization comparison, and data preprocessing

Featured
Kaggle
Advanced

TigerLLM Testing and Benchmarking

Comprehensive capability assessment and benchmarking of the md-nishat-008/TigerLLM-1B-it Bengali language model. Detailed evaluation of model performance, limitations, and usage recommendations.

Risad Raihan Malik
Views: 1,250 | Likes: 89 | Downloads: 234 | 45 min | Dec 15, 2024
Tags: TigerLLM, Bengali, Benchmarking, LLM Evaluation
Featured
Google Colab
Advanced

Pre-training LLMs with HuggingFace

Complete guide to pre-training large language models using the HuggingFace transformers library. Covers data preparation, model architecture, training strategies, and optimization techniques.

Risad Raihan Malik
Views: 4,560 | Likes: 289 | 90 min | Dec 1, 2024
Tags: LLM, Pre-training, HuggingFace, Transformers
Featured
Kaggle
Intermediate

Bengali LLaMA Reality Check: hassanaliemon/bn_r-8b

Complete performance analysis of the hassanaliemon/bn_rag_llama3-8b model, showing what it is actually good at: it excels in creative tasks (4.9/5) but struggles with factual Q&A (0-25% accuracy).

Risad Raihan Malik
Views: 2,100 | Likes: 156 | Downloads: 445 | 35 min | Nov 28, 2024
Tags: LLaMA, Bengali, RAG, Performance Analysis
Featured
Google Colab
Advanced

Attention Mechanism - Positional Encoding

Deep dive into attention mechanisms and positional encoding in neural networks, with a detailed visual walkthrough of softmax operations and transformer architecture components.

Risad Raihan Malik
Views: 3,420 | Likes: 198 | 60 min | Nov 15, 2024
Tags: Attention Mechanism, Positional Encoding, Transformers, Neural Networks
Kaggle
Advanced

Corpus Bangla Dataset - BPE vs SentencePiece

Comparative analysis of Byte-Pair Encoding (BPE) and SentencePiece tokenizers trained on the OSCAR Bengali dataset (4,601 examples). Determines the optimal tokenization strategy for Bengali NLP fine-tuning.

Risad Raihan Malik
Views: 890 | Likes: 67 | Downloads: 123 | 50 min | Oct 22, 2024
Tags: BPE, SentencePiece, Bengali, Tokenization
Google Colab
Intermediate

Developing a Sequence-to-Sequence Model

Comprehensive guide to developing sequence-to-sequence models with BLEU score evaluation metrics. Complete implementation from data preprocessing to model evaluation.

Fatema Akbari +1
Views: 2,890 | Likes: 167 | 75 min | Oct 8, 2024
Tags: Seq2Seq, BLEU Score, Neural Machine Translation, Encoder-Decoder
Google Colab
Intermediate

Customer Churn Prediction

Telco customer churn prediction using tree-based models, including Random Forest, Decision Tree, and XGBoost. Complete pipeline from data analysis to model deployment with performance comparison.

Risad Raihan Malik
Views: 2,340 | Likes: 145 | 45 min | Sep 25, 2024
Tags: Churn Prediction, Random Forest, XGBoost, Decision Trees
Kaggle
Intermediate

Flusk OCR Testing for Extracting Bengali Data

Testing Flusk OCR capabilities for extracting Bengali text data from PDF documents. Comprehensive evaluation of OCR accuracy and performance for Bengali language processing.

Risad Raihan Malik
Views: 634 | Likes: 43 | Downloads: 89 | 30 min | Sep 18, 2024
Tags: OCR, Bengali, Flusk, PDF Processing
Google Colab
Intermediate

Stock Forecasting using LSTM

Amazon stock price forecasting using Long Short-Term Memory (LSTM) neural networks. Covers time series analysis, data preprocessing, model training, and prediction visualization.

Risad Raihan Malik
Views: 1,890 | Likes: 112 | 55 min | Aug 14, 2024
Tags: LSTM, Stock Forecasting, Time Series, Amazon Stock

Ready to Dive In?

Explore interactive notebooks with real-world data, detailed analysis, and practical insights. All notebooks include comprehensive documentation and reproducible results.
