brainsteam.co.uk/categories/open-source/index.xml

112 lines
8.4 KiB
XML
Raw Permalink Blame History

This file contains ambiguous Unicode characters

This file contains Unicode characters that might be confused with other characters. If you think that this is intentional, you can safely ignore this warning. Use the Escape button to reveal them.

<?xml version="1.0" encoding="utf-8" standalone="yes"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
<channel>
<title>Open Source on Brainsteam</title>
<link>https://brainsteam.co.uk/categories/open-source/</link>
<description>Recent content in Open Source on Brainsteam</description>
<generator>Hugo -- gohugo.io</generator>
<language>en-us</language>
<copyright>© James Ravenscroft 2020</copyright>
<lastBuildDate>Tue, 21 Dec 2021 01:21:11 +0000</lastBuildDate><atom:link href="https://brainsteam.co.uk/categories/open-source/index.xml" rel="self" type="application/rss+xml" />
<item>
<title>test post</title>
<link>https://brainsteam.co.uk/2021/12/21/test/</link>
<pubDate>Tue, 21 Dec 2021 01:21:11 +0000</pubDate>
<guid>https://brainsteam.co.uk/2021/12/21/test/</guid>
<description>test</description>
</item>
<item>
<title>An opinionated guide to Python environments in 2021</title>
<link>https://brainsteam.co.uk/2021/04/01/opinionated-guide-to-virtualenvs/</link>
<pubDate>Mon, 12 Apr 2021 20:21:11 +0000</pubDate>
<guid>https://brainsteam.co.uk/2021/04/01/opinionated-guide-to-virtualenvs/</guid>
<description>A person overwhelmed by boxes by Cottonbro
Note: If you don&amp;rsquo;t want to read the blah-blah context and history stuff then you can jump to the recommendations The Problem The need for virtual python environments becomes fairly obvious early in most Python developers&#39; careers when they switch between two projects and realise that they have incompatible dependences (e.g. project1 needs scikit-learn-0.21 and project2 needs scikit-learn-0.24). Unlike other mainstream languages like Javascript(Node.</description>
</item>
<item>
<title>Reproducing &#39;ancient&#39; experiments with Pytorch inside docker</title>
<link>https://brainsteam.co.uk/2021/03/01/running-old-pytorch-docker/</link>
<pubDate>Mon, 01 Mar 2021 20:21:11 +0000</pubDate>
<guid>https://brainsteam.co.uk/2021/03/01/running-old-pytorch-docker/</guid>
<description>A beige analog compass by Ylanite Koppens
Introduction Open machine learning research is undergoing something of a reproducibiltiy crisis. In fairness it&amp;rsquo;s not usually the authors&#39; fault - or at least not entirely. We&amp;rsquo;re a fickle industry and the tools and frameworks were &amp;lsquo;in vogue&amp;rsquo; and state of the art a couple of years ago are now obsolete. Furthermore, academics and open source contributors are under no obligation to keep their code up to date.</description>
</item>
<item>
<title>Pickle 5 Madness with MLFlow and Python 3.6/3.7</title>
<link>https://brainsteam.co.uk/2021/01/14/pickle-5-madness-with-mlflow/</link>
<pubDate>Thu, 14 Jan 2021 11:42:28 +0000</pubDate>
<guid>https://brainsteam.co.uk/2021/01/14/pickle-5-madness-with-mlflow/</guid>
<description>A jar of pickles by Ksenia Charnaya
I recently came across an infuriating problem where an MLFlow python model I had trained on one system using Python 3.6 would not load on another system with an identical version of Python.
The exact problem was that when I ran mlflow models serve -m &amp;lt;url/to/model/in/bucket&amp;gt; the service would crash saying that the model could not be unserialized because ValueError: unsupported pickle protocol: 5.</description>
</item>
<item>
<title>Serving NLP Models with MLflow</title>
<link>https://brainsteam.co.uk/2020/12/29/serving-nlp-models-with-mlflow/</link>
<pubDate>Tue, 29 Dec 2020 09:50:28 +0000</pubDate>
<guid>https://brainsteam.co.uk/2020/12/29/serving-nlp-models-with-mlflow/</guid>
<description>MLFlow is a powerful open source MLOps platform with built in framework for serving your trained ML models as REST APIs. The REST framework will load data provided in a JSON or CSV format compatible with pandas and pass this directly into your model. This can be handy when your model is expecting a tabular list of numerical and categorical features. However it is less clear how to serve with models and pipelines that are expecting unstructured text data as their primary input.</description>
</item>
<item>
<title>Why is Tmux crashing on start?</title>
<link>https://brainsteam.co.uk/2018/11/07/why-is-tmux-crashing-on-start/</link>
<pubDate>Wed, 07 Nov 2018 07:40:45 +0000</pubDate>
<guid>https://brainsteam.co.uk/2018/11/07/why-is-tmux-crashing-on-start/</guid>
<description>I spent several hours trying to get to the bottom of why tmux was crashing as soon as I ran it on Fedora. It turns out theres a simple fix. When tmux starts it uses /dev/ptmx to create a new TTY (virtual terminal) that the user can interact with. If your user does not have permission to access this device then tmux will silently die. A good way to verify this is to try running screen too.</description>
</item>
<item>
<title>Programmatically Downloading Open Access Papers</title>
<link>https://brainsteam.co.uk/2018/04/13/programmatically-downloading-open-access-papers/</link>
<pubDate>Fri, 13 Apr 2018 16:04:47 +0000</pubDate>
<guid>https://brainsteam.co.uk/2018/04/13/programmatically-downloading-open-access-papers/</guid>
<description>(Cover image “Unlocked” by Sean Hobson)
If youre an academic or youve got an interest in reading scientific papers, youve probably run into paywalls that demand tens or even hundreds of £ just to read a scientific paper. Its ok if youre affiliated with a university that has access to that journal but it can sometimes be luck of the draw as to whether your institute has access and even if they do, sometimes the SAML login processes dont work and you still cant see the paper.</description>
</item>
<item>
<title>Dialect Sensitive Topic Models</title>
<link>https://brainsteam.co.uk/2017/07/25/dialect-sensitive-topic-models/</link>
<pubDate>Tue, 25 Jul 2017 11:02:42 +0000</pubDate>
<guid>https://brainsteam.co.uk/2017/07/25/dialect-sensitive-topic-models/</guid>
<description>As part of my PhD Im currently interested in topic models that can take into account the dialect of the writing. That is, how can we build a model that can compare topics discussed in different dialectical styles, such as scientific papers versus newspaper articles. If youre new to the concept of topic modelling then this article can give you a quick primer.
Vanilla LDA A diagram of how latent variables in LDA model are connected Vanilla topic models such as Bleis LDA are great but start to fall down when the wording around one particular concept varies too much.</description>
</item>
<item>
<title>timetrack improvements</title>
<link>https://brainsteam.co.uk/2016/12/10/timetrack-improvements/</link>
<pubDate>Sat, 10 Dec 2016 09:33:41 +0000</pubDate>
<guid>https://brainsteam.co.uk/2016/12/10/timetrack-improvements/</guid>
<description>Ive just added a couple of improvements to timetrack that allow you to append to existing time recordings (either with an amount like 15m or using live to time additional minutes spent and append them).
You can also remove entries using timetrack rm instead of remove saving keystrokes is what programming is all about.
You can find the updated code over at github.</description>
</item>
<item>
<title>timetrack a simple time tracking application for developers</title>
<link>https://brainsteam.co.uk/2016/11/23/timetrack-a-simple-time-tracking-application-for-developers/</link>
<pubDate>Wed, 23 Nov 2016 14:43:58 +0000</pubDate>
<guid>https://brainsteam.co.uk/2016/11/23/timetrack-a-simple-time-tracking-application-for-developers/</guid>
<description>Ive written a small command line application for tracking my time on my PhD and other projects. We use Harvest at Filament which is great if youve got a huge team and want the complexity (and of course license charges) of an online cloud solution for time tracking.
If, like me, youre just interested to see how much time you are spending on your different projects and you dont have any requirement for fancy web interfaces or client billing, then timetrack might be for you.</description>
</item>
</channel>
</rss>