How I Test the Tools I Review

This page exists because the audit kept asking the same question I’d want answered before trusting any “best AI tools” list: who tested these, for how long, with what money, and what would make them fail your test?

Here’s exactly how every review on OneLessHour gets built.

1. The shortlist

Before I pay for anything, I shortlist 6–12 tools per category from:

The leaders that show up across recent comparisons and roundups
Reddit threads in the relevant subreddit (r/productivity, r/notion, r/freelance) — specifically the “what do you actually use” threads, not the launch announcements
Direct recommendations from readers and people I work with
One or two underdogs that look promising but don’t get a lot of press

I don’t include a tool just because it pays the highest commission. The shortlist is locked before I look at affiliate programmes.

2. The test setup

Every tool gets:

Two weeks minimum of real, daily use. Not a free-trial click-around.
Plugged into real work — my client load, my actual inbox, my actual research projects. No demo data.
Time tracked with Toggl — I record how long the same task takes with vs. without the tool.
The same paid plan a normal user would buy — usually the entry-level paid tier. I don’t review on enterprise plans you can’t afford.

The cost adds up. I spent $165 in trial subscriptions for the AI email assistant roundup alone. That’s by design. If a tool only works on the $50/month plan, you deserve to know that.

3. What disqualifies a tool

A tool gets cut from a “best of” list if:

It saves less than ~30 minutes a week vs. doing the task manually
It silently breaks or hallucinates on real work (not just demos)
The pricing page hides the actual cost behind “contact sales”
It requires more than 30 minutes of setup just to test
The mobile experience is unusable when desktop is fine
It quietly gets worse after the free-trial period (a real pattern with some “AI” tools)

If a tool is cut, I usually still mention it in the article so you know I tested it.

4. The verdict format

Every review entry has the same shape:

What it’s best for — the one job it actually wins at
Pro tip — the non-obvious workflow that made it click for me
Skip it if — a specific situation where you should not buy this tool
Cost / plan tested — the exact tier I paid for

If a section is missing, that’s because I didn’t have a strong opinion — I’d rather leave it blank than fake confidence.

5. “Last tested” badges

Every review carries a “Last tested: [Month YYYY]” badge in the meta row. AI tools change fast — a tool that was the best in your category three months ago can be a punchline today. I re-test every roundup at least every 90 days and update the badge when I do.

If you see a badge older than six months, treat the article as out-of-date and email me. I probably need to re-test.

6. Affiliate links

Some links on this site are affiliate links. If you sign up through one, I earn a small commission at no extra cost to you.

Here’s what affiliate money does not do:

Decide which tools make a “best of” list (the shortlist is locked before I check programmes)
Decide ranking order (I rank by hours saved, period)
Buy a positive review (I’ve trashed tools that pay 40% commissions and praised tools with no programme at all)

Here’s what affiliate money does do:

Pay for the trial subscriptions, the cancelled plans, the wasted-money months
Let me keep doing this without locking the site behind a paywall

If a tool I recommend doesn’t have an affiliate programme, you’ll see a regular link. That happens often.

7. Mistakes I’ve made

I’ll get things wrong. When I do, I update the article with an Updated [date]: [what changed] line at the top. I don’t silently fix mistakes — that’s how trust dies.

If you spot something off, email me at [email protected]. I read every one.

— Alex