Job Shadowing at DataCamp

Talking about my day at DataCamp

Joshua Wallace

5 minute read

The astrophysics academic job application cycle starts soon and my preparations, should I apply for academic jobs, need to start now. Because of this, I’ve been considering one last time whether I want to pursue an academic job before I close that door forever. Just in time for this, I had the opportunity to job shadow at a data science company through Princeton’s new “Take a Ph.D. to Work” Program. I shadowed David Robinson, Chief Data Science at DataCamp. David is a recent Ph.D. graduate from Princeton who actively maintains a data science blog. He got his first job post-Ph.D. as a data scientist at Stack Overflow by being invited to interview after posting the top answer to this SE Cross Validated question. I’m very grateful that he took time out of his work schedule to have me shadow him for a day and answer my questions.

DataCamp

DataCamp is a startup that works with outside instructors to produce hands-on courses about data science then hosts these courses on their platform and shares the revenue with the instructors. This business model isn’t unique: Udemy, Khan Academy, Skillshare, and even YouTube are all places where instructors can create classes and generate revenue. What is unique (I’m told) about DataCamp is how hands-on the courses are. Integrated in the courses are coding evaluations, where a problem is given and students code up a solution, which is then automatically evaluated on the spot. Such real-time hands-on learning increases retention and is DataCamp’s unique offer of value.

David, as Chief Data Scientist, evaluates data generated by the company and its users to improve the product. I imagine this is very similar to comparable positions at other companies. One thing, though, that is different at DataCamp and that I liked is that the company’s data scientists are imbedded in the various departments of the company rather than forming a separate department. For example, the growth/sales team includes a data scientist, the product development team includes a data scientist, etc. This seems like a very efficient and productive way to distribute data science skills.

The Job Shadowing Experience

My shadowing experience had three main components: meeting one-on-one with employees, accompanying David to a work meeting, and doing some actual data science.

The one-on-one meetings were my chance to ask questions of employees and get a better idea of what their jobs are like and how to better qualify myself to fill a similar job. Even though DataCamp is a startup, it’s quite well established and the employees work normal schedules. The employees also are all very excited about the service they offer and so job satisfaction is very high. A product/service I personally am proud to be contributing to is definitely something I hope for in a future job. Of the questions I asked David and the other employees, the biggest one I had was how to clearly communicate my skills to potential employers, since “Astrophysics Ph.D.” doesn’t scream “data science skills” to everyone. I got a lot of good advice, including starting a blog where I present my skills and having a good GitHub account to showcase my work.

As great and useful as the interviews were, what was most useful for me was doing actual data science with David. Given the proprietary nature of the data I don’t want to go into details, but basically I was tasked with finding sudden changes in some time series data. I was given a couple of hours and in that time developed a basic algorithm that was able to successfully find sudden changes in some situations. I presented my work to David, and we discussed the implications of my results for the company and also discussed and discovered why my algorithm didn’t always work. After identifying the algorithm’s shortcomings, we discussed ways to enhance it. It was a lot of fun to be given my favorite kind of data (time series) and work through the math and logic necessary to come up with something that produced some basic but still usable results. This is something I would enjoy for a job.

One thing that stood out to me from the interviews and hands-on data science experience is the differing levels of analysis needed to be successful in academic vs. nonacademic science. In academic science, we are usually working on the edge of what is possible to try to see what no one else has seen before. This often requires the latest and greatest techniques and data sets and usually requires months or years of work. Sometimes you are lucky enough to generate a sufficiently robust data set that there is a lot of low-hanging fruit for analysis, but that is not the typical situation. On the other hand, in work as a data science (at least in a new startups where very basic questions remain to be answered), simple analysis techniques are often sufficient to discover the desired answer. This simplicity and accelerated rate of discovery is very appealing to me.

Conclusion

This job shadowing experience was extremely valuable for me and helped me firmly make the decision to turn towards a nonacademic career. I thank David and the other employees at DataCamp who took time out of their day to talk to me. This experience has given me several tools that will be very useful as I move forward with looking for a job in the next year. I highly recommend anyone considering switching careers to find a job shadowing opportunity.

comments powered by Disqus