Recent Publications

. CAZypedia: Carbohydrate Binding Module Family 63. CAZypedia, 2018.

Source Document

CV

My CV is available in HTML form or as a PDF.

Recent Posts

More Posts

Trigger warning Introduction Getting and cleaning data Trends in violence by region Is South America more dangerous for transgender people, or just for all people? Country level analysis Proportion of murder victims that are transgender Number of transgender victims by age Conclusions Trigger warning This is an exploratory data analysis of murders of transgender people. The data contains graphic descriptions of violence against transgender people.

CONTINUE READING

2018 was a crazy year for me. A move, a new job, new career path, and many more ups and downs. And through all of this, was the soundtrack to my life: audiobooks. I listened to over 50 books this year, and the good news is most were excellent! So without further ado, here’s my favorite books that I read (listened to) in 2018. Note that these are not the best books released in 2018, just whichever books I read this year that I loved.

CONTINUE READING

Introduction Getting data with rgbif Data cleaning Data wrangling Make the animation Another example with Kudzu Introduction Since I discovered GBIF, I’ve been hooked. What is GBIF? From their website: “GBIF—the Global Biodiversity Information Facility—is an international network and research infrastructure funded by the world’s governments and aimed at providing anyone, anywhere, open access to data about all types of life on Earth.” In 2018, GBIF passed the mark of one billion occurence records, which is just incredible.

CONTINUE READING

Introduction Trials and tribulations The solution Introduction Drama, intrigue, arrogance, dashed hopes, rock-bottom, perseverance, and eventual triumph, this post has it all! It starts with me watching Rachael Tatman’s recent live-coding video, and ends with a thrilling race-to-the-bottom between two pathetically slow functions. What lies ahead: many a WTF moment, lots of trial and error, and some useful tidyverse data wrangling tips. Rachael Tatman is a data scientist at Kaggle, and does these awesome live coding sessions every Friday.

CONTINUE READING

Disclamer: I’m a trained microbiologist/biochemist, which means most of my bioinformatics knowledge was self-taught. What you’re about to see may not be pretty; the code might be janky or the workflow inefficient. But I have gone through countless hours of googleing, reading, and trial/error to learn this, and it works pretty well for me, so it might for you too. Let me know if you spot errors or have suggestions for improvement!

CONTINUE READING

Contact