tips

Interactive Plots with Bokeh/Holoviews (and Hugo)

Bokeh and Holoviews are an amazing set of libraries to help us visualize data in an intuitive and interactive way. Here you'll learn how to use them to

'wget' Files From Google Drive (+ online tool)

Working with different datasets across machines, in Google Colab, Kaggle Kernels or your own notebooks we sometimes need to transfer large files - from datasets, to other random fragments. A relatively easy and repeatable way to move files across is to use Google Drive. Alternatively use ‘scp’ if the platfrom supports it! So, let’s assume that we have a file on Google Drive that we want to locally access. We can create a wget compatible link and use the cli to fetch it!

How to Remember Your Bash Spells with 'make'

Very often we find ourselves changing parameters of scripts or googling what was the right combination of bash commands. What’s the best way to remember those? Let’s take for instance running a machine learning project in a docker container. We all love reproducibility, and we should all do reproducible science. Because you know, otherwise it´s not science. Docker abstraction To build a consistent runtime environment we can use tools like python virtual environments, anaconda, or Docker.

OCR with OpenCV and Tesseract (Installation Guide)

Introduction Tesseract OCR provides some powerful algorithms for recognizing text in images. The text can be small, big, or skewed and this is managed through some flags in the algorithm. Another good feature is the ability to extract confidence values not only for the whole text, but to individual words and even letters. OpenCV can be viewed as the state-of-the-art library for managing anything related to computer vision. It can be used to read in static images or ones from a camera, transform them to a form suitable for the OCR algorithm and apply it.