Recent Publications

. CAZypedia: Carbohydrate Binding Module Family 63. CAZypedia, 2018.

Source Document

CV

My CV is available in HTML form or as a PDF.

Recent Posts

More Posts

Introduction Getting data with rgbif Data cleaning Data wrangling Make the animation Another example with Kudzu Introduction Since I discovered GBIF, I’ve been hooked. What is GBIF? From their website: “GBIF—the Global Biodiversity Information Facility—is an international network and research infrastructure funded by the world’s governments and aimed at providing anyone, anywhere, open access to data about all types of life on Earth.” In 2018, GBIF passed the mark of one billion occurence records, which is just incredible.

CONTINUE READING

Introduction Trials and tribulations The solution Introduction Drama, intrigue, arrogance, dashed hopes, rock-bottom, perseverance, and eventual triumph, this post has it all! It starts with me watching Rachael Tatman’s recent live-coding video, and ends with a thrilling race-to-the-bottom between two pathetically slow functions. What lies ahead: many a WTF moment, lots of trial and error, and some useful tidyverse data wrangling tips. Rachael Tatman is a data scientist at Kaggle, and does these awesome live coding sessions every Friday.

CONTINUE READING

Disclamer: I’m a trained microbiologist/biochemist, which means most of my bioinformatics knowledge was self-taught. What you’re about to see may not be pretty; the code might be janky or the workflow inefficient. But I have gone through countless hours of googleing, reading, and trial/error to learn this, and it works pretty well for me, so it might for you too. Let me know if you spot errors or have suggestions for improvement!

CONTINUE READING

The more I use the tidyverse in my R coding, the more I ask myself: does Hadley Wickham hate dogs, or does he just need help with dog-related package names? See, of the packages Hadley has developed for the tidyverse, there are two that have cat-inspired names (forcats and purrr) but zero that pay homage to man’s best friend. It’s not like doggo names are hard to think of for R packages it took me 30 seconds to come up with baRk and woofR**.

CONTINUE READING

I’ve been going through the job application cycle recently, which meant updating my CV. You can write a CV with Microsoft Word, but I find it exceptionally frustrating to do any sort of fancy formatting in Word, and more imporantly, I want my CV to be a page on my website (not just a downloadable file), that has the responsiveness expected of any modern webpage. I found this excellent HTML/CSS template from Thomas Hardy, and decided it was the aesthetic I was going for.

CONTINUE READING

Contact