Learning is a lifelong process. But you must know what, where, and how to learn? What skills to develop? What skills will help you boost your career? If not, you are at the right place! Our tutorial section at CoderzColumn is dedicated to providing you with all the practical lessons. It will give you the experience to learn Python for different purposes and code on your own. Our tutorials cover:
For an in-depth understanding of the above concepts, check out the sections below.
A detailed guide on how to use Python library "cufflinks" to create interactive data visualizations/charts. Cufflinks is built on top of Plotly and let us create charts by calling 'iplot()' method on Pandas dataframe. The 'iplot()' method tries to mimic 'plot()' API (matplotlib) of pandas dataframe to generate charts but uses Plotly.
A detailed guide on how to use Python library "memory_profiler" to profile memory usage by Python code/script/program and processes. Tutorial covers various ways of profiling with "memory_profiler" like "@profile decorator", "mprof shell command", "memory_usage() function", etc. It even covers how to use "memory_profiler" in Jupyter notebook using "%mprun" and "%memit" magic commands.
A detailed guide to creating Sankey Diagram (Alluvial Diagram) using Python data visualization libraries Plotly and Holoviews (Bokeh & Matplotlib). The charts are interactive and visualized in Jupyter Notebooks.
Tutorial explains how to use Python module "missingno" to analyze the distribution of missing data (NaNs/NULLs/None Values) in our datasets. It let us create various charts to visualize the spread of missing data from various angles which can help us make better decisions.
A comprehensive guide on how to use Python module "concurrent.futures" for multitasking (Multithreading & Multiprocessing). The "concurrent.futures" module provides a very high-level API that let us create a pool of workers (threads/processes) to which we can submit tasks for completion. It'll take care of handling resources and we don't need to do much low-level coding that we have to do if we use "threading" or "multiprocessing" modules.
A detailed guide on how to use Python library ipywidgets that let us create widgets (dropdown, radio buttons, checkboxes, buttons, etc) in Jupyter notebooks. These widgets let us add interactivity to our notebooks. It can be linked to various data viz libraries to make charts interactive like dashboards and web apps.
A brief guide on how to use pre-trained MXNet networks (GluonCV) for image classification tasks. GluonCV is an image processing toolkit of MXNet and provides many famous pre-trained networks like ResNet, VGG, InceptionNet, MobileNet, DenseNet, AlexNet, etc.
A simple guide on how to use Python library keras to create neural networks. Tutorial is a good starting point for someone who is new to keras. It provides a guide on creating simple neural networks trained on toy datasets that solve classification and regression tasks.
A detailed guide to use Python library SHAP to generate Shapley values (shap values) that can be used to interpret/explain predictions made by our ML models. Tutorial creates various charts using shap values interpreting predictions made by classification and regression models trained on structured data.
A simple guide on how to use pre-trained image classification models available from "torchvision" library of PyTorch. Torchvision is a computer vision toolkit of PyTorch which provides many pre-trained image classification networks like ResNet, VGG, AlexNet, MobileNet, InceptionNet, LeNet, etc.
Parallel Computing is a type of computation where tasks are assigned to individual processes for completion. These processes can be running on a single computer or cluster of computers. Parallel Computing makes multi-tasking super fast.
Python provides different libraries (joblib, dask, ipyparallel, etc) for performing parallel computing.
Concurrent computing is a type of computing where multiple tasks are executed concurrently. Concurrent programming is a type of programming where we divide a big task into small tasks and execute these tasks in parallel. These tasks can be executed in parallel using threads or processes.
Python provides various libraries (threading, multiprocessing, concurrent.futures, asyncio, etc) to create concurrent code.
Once our Machine Learning model is trained, we need some way to evaluate its performance. We need to know whether our model has generalized or not.
For this, various metrics (confusion matrix, ROC AUC curve, precision-recall curve, silhouette Analysis, elbow method, etc) are designed over time. These metrics help us understand the performance of our models trained on various tasks like classification, regression, clustering, etc.
Python has various libraries (scikit-learn, scikit-plot, yellowbrick, interpret-ml, interpret-text, etc) to calculate and visualize these metrics.
After training ML Model, we generally evaluate the performance of model by calculating and visualizing various ML Metrics (confusion matrix, ROC AUC curve, precision-recall curve, silhouette Analysis, elbow method, etc).
These metrics are normally a good starting point. But in many situations, they don’t give a 100% picture of model performance. E.g., A simple cat vs dog image classifier can be using background pixels to classify images instead of actual object (cat or dog) pixels.
In these situations, our ML metrics will give good results. But we should always be a little skeptical of model performance.
We can dive further deep and try to understand how our model is performing on an individual example by interpreting results. Various algorithms have been developed over time to interpret predictions of ML models and many Python libraries (lime, eli5, treeinterpreter, shap, etc) provide their implementation.
Data Visualization is a field of graphical representation of information / data. It is one of the most efficient ways of communicating information with users as humans are quite good at capturing patterns in data.
Python has a bunch of libraries that can help us create data visualizations. Some of these libraries (matplotlib, seaborn, plotnine, etc) generate static charts whereas others (bokeh, plotly, bqplot, altair, holoviews, cufflinks, hvplot, etc) generate interactive charts. Majority of basic visualizations like bar charts, line charts, scatter plots, histograms, box plots, pie charts, etc are supported by all libraries. Many libraries also support advanced visualization, widgets, and dashboards.
Basic Data Visualizations like bar charts, line charts, scatter plots, histograms, box plots, pie charts, etc are quite good at representing information and exploring relationships between data variables.
But sometimes these visualizations are not enough and we need to analyze data from different perspectives. For this purpose, many advanced visualizations are developed over time like Sankey diagrams, candlestick charts, network charts, chord diagrams, sunburst charts, radar charts, parallel coordinates charts, etc. Python has many data visualization libraries that let us create such advanced data visualizations.
Deep learning is a field in Machine Learning that uses deep neural networks to solve tasks. The neural networks with generally more than one hidden layer are referred to as deep neural networks.
Many real-world tasks like object detection, image classification, image segmentation, etc can not be solved with simple machine learning models (decision trees, random forest, logistic regression, etc). Research has shown that neural networks with many layers are quite good at solving these kinds of tasks involving unstructured data (Image, text, audio, video, etc). Deep neural networks nowadays can have different kinds of layers like convolution, recurrent, etc apart from dense layers.
Python has many famous deep learning libraries (PyTorch, Keras, JAX, Flax, MXNet, Tensorflow, Sonnet, Haiku, PyTorch Lightning, Scikeras, Skorch, etc) that let us create deep neural networks to solve complicated tasks.
Image classification is a sub-field under computer vision and image processing that identifies an object present in an image and assigns a label to an image based on it. Image classification generally works on an image with a single object present in it.
Over the years, many deep neural networks (VGG, ResNet, AlexNet, MobileNet, etc) were developed that solved image classification task with quite a high accuracy. Due to the high accuracy of these algorithms, many Python deep learning libraries started providing these neural networks. We can simply load these networks with weights and make predictions using them.
Python libraries PyTorch and MXNet have helper modules named 'torchvision' and 'gluoncv’ respectively that provide an implementation of image classification networks.