Blog

This page contains links to some of my writings on topics that interest me. Usually, they are inspired by problems that I experience in my day-to-day life.

I like pondering over the act or the process of doing something.

Renaming files so they make sense (rename)

shell

python

rename is a bash script I wrote to automatically rename long files (the way I like it).

Nov 28, 2024

Generating tags for git repositories (mytags)

shell

mytags is a wrapper around ctags which respects your gitignore files.

Nov 17, 2024

Extracting the Module and Function Names from Python ASTs

python

How to extract the module and function name from Python Abstract Syntax Trees.

Mar 23, 2024

Visualisation Zoo

python

design

Collection of data visualisations I have created using Python.

Mar 17, 2024

Organising research projects with git

research

productivity

Some standards and conventions I follow when organising research project data using git.

Jan 19, 2024

Toggling background color in kitty and vim (yob)

shell

vim

yob is a tiny shell script which toggles between a light and dark colorscheme in Kitty, my terminal of choice.

Jan 19, 2024

Managing Scientific Bibliography using Emacs Org-mode

emacs

productivity

How I organise, search and retrieve my scientific papers using Emacs org-mode.

Nov 29, 2023

Automatically retrieving Bibtex information from DOI

shell

productivity

doi2bib is a simple Python script I wrote that fetches bibtex information from the Crossref API using the provided DOI. It can also handle pre-prints published on Arxiv.

Nov 26, 2023

Some tools & techniques I use to run a no non-sense blog using static html pages. All powered by a sane file naming convension, plaintext documents writing in markdown and exported to html using pandoc and other unix cli tools.

Feb 3, 2023

Data Validation with TFDV

data

research

SE4AI

In this lecture we will go over the basics of data validation. The first half of this lecture will be a talk on the fundamentals of data validation. We will answer what is data validation?, why should we validate our data? and how we can validate our data?. The second half of the lecture will be a hands-on tutorial on using Tensorflow Data Validation, instructions & code for which can be found on this github repo.

May 16, 2022

Effortless Parallel Execution with xargs & Friends

shell

Recently, I had to run Tensorflow Data Validation on over 500 public datasets from Kaggle to generate a baseline schema file for further analysis. I chose to do this using the xargs unix command.

May 8, 2022

Data Smells in Public Datasets

data

research

SE4AI

technical debt

In this talk I will present our recent paper titled Data Smells in Public Datasets which was published at the 1^st International Conference on AI Engineering (CAIN) 2022. I will first present the problem we are trying to solve along with the contributions that we made. I will present the methodology which was followed along with the results obtained. I will present a select few smells which I personally find interesting & hope will generate some discussion. Finally, we will conclude the talk with some high level takeaways from our study along with the limitations & future directions of work.

May 4, 2022

There and Back Again A Tale of Website Management

shell

vim

web

Managing websites using markdown, shell and vim.

Mar 4, 2022

Timestamps in the Shell (today)

shell

productivity

Creating timestamps in the terminal.

Mar 3, 2022

Aru’s Information Management System (AIMS)

shell

productivity

AIMS or Aru’s Information Management System is a collection of shellscripts to manage information in plaintext. It is inspired by org-mode, and tries to replicate a subset of its functionalities which I frequently use. AIMS is completely tuned towards my workflow as a researcher and how I manage my digital notes.

Feb 28, 2022

Privacy Preserving Deep Learning

machine learning

research

privacy

A talk on Privacy Preserving Deep Learning (PPDL) I gave to my research group. It was largly based on a literature review I did during my Msc.

Sep 7, 2021

Research Workflow in Plaintext

productivity

emacs

research

In this talk I will go over how we can use Emacs and org-mode to craft a research workflow. We will look at how we can leverage the power of Emacs and org-mode to capture, store, search and retrieve research data, all in plain text! The talk will touch upon how org-mode can be used as an environment for literate programming and reproducible research. I do not assume any prior knowledge of emacs or org-mode and I want this to be more of a discussion rather than a talk. Please ask me questions as I go along and share your thoughts, tips and techniques with others!

Jul 12, 2021

Aru’s Org Capture Template (aocp.el)

emacs

productivity

An Emacs package I wrote for managing bibliographic information.

Jun 16, 2021

Categories

Renaming files so they make sense (rename)

Generating tags for git repositories (mytags)

Extracting the Module and Function Names from Python ASTs

Visualisation Zoo

Organising research projects with git

Toggling background color in kitty and vim (yob)

Managing Scientific Bibliography using Emacs Org-mode

Automatically retrieving Bibtex information from DOI

CMS using Pandoc and Friends

Data Validation with TFDV

Effortless Parallel Execution with xargs & Friends

Data Smells in Public Datasets

There and Back Again A Tale of Website Management

Timestamps in the Shell (today)

Aru’s Information Management System (AIMS)

Privacy Preserving Deep Learning

Research Workflow in Plaintext

Aru’s Org Capture Template (aocp.el)