How I Test the Tools I Review

This page exists because the audit kept asking the same question I’d want answered before trusting any “best AI tools” list: who tested these, for how long, with what money, and what would make them fail your test?

Here’s exactly how every review on OneLessHour gets built.

1. The shortlist

Before I pay for anything, I shortlist 6–12 tools per category from:

  • The leaders that show up across recent comparisons and roundups
  • Reddit threads in the relevant subreddit (r/productivity, r/notion, r/freelance) — specifically the “what do you actually use” threads, not the launch announcements
  • Direct recommendations from readers and people I work with
  • One or two underdogs that look promising but don’t get a lot of press

I don’t include a tool just because it pays the highest commission. The shortlist is locked before I look at affiliate programmes.

2. The test setup

Every tool gets:

  • Two weeks minimum of real, daily use. Not a free-trial click-around.
  • Plugged into real work — my client load, my actual inbox, my actual research projects. No demo data.
  • Time tracked with Toggl — I record how long the same task takes with vs. without the tool.
  • The same paid plan a normal user would buy — usually the entry-level paid tier. I don’t review on enterprise plans you can’t afford.

The cost adds up. I spent $165 in trial subscriptions for the AI email assistant roundup alone. That’s by design. If a tool only works on the $50/month plan, you deserve to know that.

3. What disqualifies a tool

A tool gets cut from a “best of” list if:

  • It saves less than ~30 minutes a week vs. doing the task manually
  • It silently breaks or hallucinates on real work (not just demos)
  • The pricing page hides the actual cost behind “contact sales”
  • It requires more than 30 minutes of setup just to test
  • The mobile experience is unusable when desktop is fine
  • It quietly gets worse after the free-trial period (a real pattern with some “AI” tools)

If a tool is cut, I usually still mention it in the article so you know I tested it.

4. The verdict format

Every review entry has the same shape:

  • What it’s best for — the one job it actually wins at
  • Pro tip — the non-obvious workflow that made it click for me
  • Skip it if — a specific situation where you should not buy this tool
  • Cost / plan tested — the exact tier I paid for

If a section is missing, that’s because I didn’t have a strong opinion — I’d rather leave it blank than fake confidence.

5. “Last tested” badges

Every review carries a “Last tested: [Month YYYY]” badge in the meta row. AI tools change fast — a tool that was the best in your category three months ago can be a punchline today. I re-test every roundup at least every 90 days and update the badge when I do.

If you see a badge older than six months, treat the article as out-of-date and email me. I probably need to re-test.

6. Affiliate links

Some links on this site are affiliate links. If you sign up through one, I earn a small commission at no extra cost to you.

Here’s what affiliate money does not do:

  • Decide which tools make a “best of” list (the shortlist is locked before I check programmes)
  • Decide ranking order (I rank by hours saved, period)
  • Buy a positive review (I’ve trashed tools that pay 40% commissions and praised tools with no programme at all)

Here’s what affiliate money does do:

  • Pay for the trial subscriptions, the cancelled plans, the wasted-money months
  • Let me keep doing this without locking the site behind a paywall

If a tool I recommend doesn’t have an affiliate programme, you’ll see a regular link. That happens often.

7. Mistakes I’ve made

I’ll get things wrong. When I do, I update the article with an Updated [date]: [what changed] line at the top. I don’t silently fix mistakes — that’s how trust dies.

If you spot something off, email me at [email protected]. I read every one.

— Alex