The Easiest Way to Run LLMs Locally

A Goofy Lookin' Llama

LLMs

Unless you’ve been living under a rock for the past year, you already know what LLMs are. If you do happen to be one of the lucky few unaware of the current hype around these things, I’ll go through it real quick.

A large language model (or LLM) is a statistical model capable of “predicting” a subsequent word or letter, given a body of text. Essentially, it is a computer program capable of filling in the blank. If you let it predict the next word, then feed the result back in, you can get some pretty human-looking text.

Let’s Be Clear

I hold a lot of skepticism about the practical applications of LLMs as a tool. As a blanket rule, I never use LLMs or any similar technology in my education.

I know some people ask LLMs questions like “explain the fundamental theorem of calculus to me like I’m five.” While they may get good results for that kind of question, I do not want to lean on them as a crutch. College is not only an opportunity to learn the raw material, but also an opportunity to learn how to learn. If we know anything about LLMs, it’s that their ability to answer complex questions breaks down as you move to more specialized classes.

Which is all to say: I did not investigate this with the intention of using it as a tool; I just wanted to play around.

My Circumstance

I use arch, btw. While I enjoy the level of control it provides, I don’t think it’s for everybody. This is partly because some things are quite difficult to set up.

For example, GPU support is limited and finicky, especially if you run an Intel Arc card, like I do. While it works perfectly for some apps, like Blender, it doesn’t work so well for other things. My card only has 3 GB of VRAM, so it wouldn’t be able to fit most models anyway.

So when I took on the task of running an LLM on my local machine, I started by looking at CPU-only solutions.

Initially, I tried to raw-dog llama.cpp. That worked, but only barely. The command-line interface left a lot to be desired, and the process of downloading and loading various models was tedious and confusing.
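For context, here’s roughly what that workflow looked like (a sketch from memory; the exact binary names and flags have shifted between llama.cpp versions, and the model file here is just a placeholder):

# Build llama.cpp yourself
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp && make
# Hunt down a quantized GGUF model (usually on Hugging Face),
# then point the binary at it, along with a prompt and a token limit
./main -m ./models/llama-2-7b.Q4_K_M.gguf -p "Hello, my name is" -n 128

Every new model meant repeating the hunt-and-download step by hand.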

Ollama

That’s when I discovered Ollama. Installing it was as easy as running:

sudo pacman -S ollama

To avoid wasting resources on multiple instances of each model, Ollama uses a server architecture. You can start the server by running:

ollama serve
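If you don’t want to keep a terminal open for this, the Arch package also ships a systemd unit (at least mine did), so you can hand the server off to systemd instead:

# Start the Ollama server now and on every boot
sudo systemctl enable --now ollama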

Then, you can download and start chatting with a model:

ollama run llama2
# Or:
ollama run mistral
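A nice side effect of the server architecture is that the CLI isn’t the only way in. The server exposes an HTTP API on localhost (port 11434 by default), so once a model has been pulled, something like this should work too:

# Ask the running server for a completion over its HTTP API
curl http://localhost:11434/api/generate -d '{
  "model": "llama2",
  "prompt": "Why is the sky blue?"
}'

There are also a few housekeeping subcommands worth knowing: ollama pull to download a model without chatting, ollama list to see what’s on disk, and ollama rm to reclaim the space.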

That’s It

That’s it! It really is that simple.

Again, you might have no reason to do any of this, especially if you are happy with the privacy nightmare that is OpenAI, Google, or Anthropic, or if you already have a system that works for you.