Mastodon @Mastodon@mastodon.social

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

Testing Without Mocks: A .

So a few days ago I released this massive update to my article, "Testing Without Mocks: A Pattern Language." It's 40 pages long if you print it. (Which you absolutely should. I have a fantastic print stylesheet.) I promised a thread explaining what it's all about.

This is the thread. If you're not interested in TDD or programmer tests, you might want to mute me for a bit.

Here's the article I'm talking about: https://www.jamesshore.com/v2/projects/testing-without-mocks/testing-without-mocks

www.jamesshore.comJames Shore: Testing Without Mocks: A Pattern Language

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

2/ First, why bother? Why write 40 pages about testing, with or without mocks?

Because testing is a big deal. People who don't have automated tests waste a huge amount of time manually checking their code, and they have a ton of bugs, too.

The problem is, people who DO have automated tests ALSO waste a huge amount of time. Most test suites are flaky and SLOOOOW. That's because the easy, obvious way to write tests is to make end-to-end tests that are automated versions of manual tests.

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

3/ Folks in the know use mocks and spies (I'll say "mocks" for short) to write isolated unit tests. Now their tests are fast! And reliable! And that's great!

Except that now their tests have lots of detail about the interactions in the code. Structural refactorings become really hard. Sometimes, you look at a test, and realize: all it's testing... is itself.

Not to mention that the popular way to use mocks is to use a mocking framework and... wow. Have you seen what those tests look like?

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

4/ So we don't want end-to-end tests, we don't want mocks. What do we do?

The people really REALLY in the know say "bad tests are a sign of bad design." They're right! They come up with things like Hexagonal Architecture and (my favorite) Gary Bernhardt's Functional Core, Imperative Shell. It separates logic from infrastructure so logic can be tested cleanly.

Totally fixes the problem.

For logic.

Anything with infrastructure dependencies… well… um… hey look, a squirrel! (runs for hills)

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

5/ Not to mention that (checks notes) approximately none of us are working in codebases with good separation of logic and infrastructure, and (checks notes again) approximately none of us have permission to throw away our code and start over with a completely new architecture.

(And even if we did have permission, throwing away code and starting over is a Famously Poor Business Decision with Far-Reaching Consequences.)

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

6/ So we don't want end-to-end tests, we don't want mocks, we can't start over from scratch... are we screwed? That's it, the end, life sucks?

No.

That's why I wrote 40 pages. Because I've figured out another way. A way that doesn't use end-to-end tests, doesn't use mocks, doesn't ignore infrastructure, doesn't require a rewrite. It's something you can start doing today, and it gives you the speed, reliability, and maintainability of unit tests with the power of end-to-end tests.

James Shore @jamesshore@mastodon.online

7/ I call it (for now, anyway, jury's out, send me your article naming ideas) "Testing With Nullables."

It's a set of patterns for combining narrow, sociable, state-based tests with a novel infrastructure technique called "Nullables."

At first glance, Nullables look like test doubles, but they're actually production code with an "off" switch.

Dec 23, 2022, 12:56 AM··Web

2boosts·6favorites

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

8/ This is as good a point as any to remind everyone that nothing is perfect. End-to-end tests have tradeoffs, mocks have tradeoffs, FCIS has tradeoffs... and Nullables have tradeoffs. All engineering is tradeoffs.

The trick is to find the combination of good + bad that is best for your situation.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

9/ Nullables have a pretty substantial tradeoff. Whether it's a big deal or not is up to you. Having worked with these ideas for many years now, I think the tradeoffs are worth it. But you have to make that decision for yourself.

Here's the tradeoff: Nullables are production code with an off switch.

Production code.

Even though the off switch may not be used in production.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

10/ Okay, enough foreplay. Let's talk about how this thing works. Again, you can see all the details in the article: https://www.jamesshore.com/v2/projects/testing-without-mocks/testing-without-mocks

www.jamesshore.comJames Shore: Testing Without Mocks: A Pattern Language

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

11/ The fundamental idea is that we're going to test everything—everything!—with narrow, sociable, state-based tests.

Narrow tests are like unit tests: they focus on a particular class, method, or concept.

Sociable tests are tests that don't isolate dependencies. The tests run everything in dependencies, although they don't test them.

And state-based tests look at return values and state changes, not interactions.

(There's a ton of code examples in the article, btw, if you want them.)

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

12/ This does raise some questions about how to manage dependencies. Another core idea is "Parameterless Instantiation." Everything can be instantiated with a constructor, or factory method, that takes NO arguments.

Instead, classes do the unthinkable: they instantiate their own dependencies. GASP!

Encapsulation, baby.

(You can still take the dependencies as an optional parameter.)

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

13/ People ask: "but if we don't use dependency injection frameworks..."

I interrupt: "your code is simpler and easier to understand?" I'm kind of a dick.

They continue, glaring: "...doesn't that mean our code is tightly coupled?"

And the answer is no, of course not. Your code was already tightly coupled! An interface with one production implementation is not "decoupled." It's just wordy. Verbose. Excessively file-system'd.

(The other answer is, sure, use your DI framework too. If you must.)

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

14/ Anyway, that's the fundamentals. Narrow, sociable, state-based tests that instantiate their own dependencies.

Next up: A-Frame Architecture! This is optional, but people really like it. It's basically a formalized version of Functional Core, Imperative Shell. I'm gonna skip on ahead, but feel free to check out the article for details. Here's the direct link to the Architecture section: https://www.jamesshore.com/v2/projects/testing-without-mocks/testing-without-mocks#arch-patterns

www.jamesshore.comJames Shore: Testing With Nullables: A Pattern Language

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

15/ Speaking of architecture, the big flaw with FCIS, as far as I've seen, is that it basically ignores infrastructure, and things that depend on infrastructure.

"I test it manually," Gary Bernhardt says in his very much worth watching video: https://www.destroyallsoftware.com/screencasts/catalog/functional-core-imperative-shell

That's a choice. I'm going to show you how to make a different one.

(Not trying to dunk on FCIS here. I like it. A-Frame Architecture has a lot in common with FCIS, but has more to say about infrastructure.)

www.destroyallsoftware.com Functional Core, Imperative Shell

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

16/ So right, Infrastructure!

Code these days has a LOT of infrastructure. And sometimes very little logic. I see a lot of code that is really nothing more than a web page controller than turns around and hands off to a bunch of back-end services, and maybe has a bit of logic to gllue it all together. Very hard to test with the "just separate your logic out" philosophy. And so it often doesn't get tested at all. We can do better.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

17/ There are two basic kinds of infrastructure code:

1) Code that interfaces directly with the outside world. Your HTTP clients, database wrappers, etc. I call this "low-level infrastructure".

2) Code that *depends* on low-level infrastructure. Your Auth0 and Stripe clients, your controllers and application logic. I call this "high-level infrastructure" and "Application/UI code".

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

18/ Low-level infrastructure should be wrapped up in a dedicated class. I call these things "Infrastructure Wrappers," 'cause I'm boring and like obvious names, but they're also called "Gateways" and "Adapters."

Because it talks to the outside world, this code needs to be tested for real, against actual outside world stuff. Otherwise, how do you know it works? For that, you can use Narrow Integration Tests. They're like unit tests, except they talk to a test server. Hopefully a dedicated one.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

19/ High-level infrastructure should also be wrapped up in an Infrastructure Wrapper, but it can just delegate to the low-level code. So it doesn't need to be tested against a real service—you can just check that it sends the correct JSON or whatever, and that it parses the return JSON correctly.

And parses garbage correctly. And error values. And failed connections. And timeouts.

*fratboy impression* Woo! Microservices rock!

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

20/ At this point, people ask,

"But what if the service changes its API? Don't you need to test against a real service to know your code still works?"

To which, I respond: "What, you think the service is going to wait for you to *run your tests* before changing its API?"

(Yeah, still kind of a dick.)

You need to have runtime telemetry and write your code to fail safe (and not just fall over) when it receives unexpected values. I call this "Paranoic Telemetry."

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

21/ Sure, when you first write the high-level wrapper, you'll make sure you understand the API so you can test it properly, maybe do some manual test runs to confirm what the docs say.

But then you gotta have Paranoic Telemetry. They ARE out to get you.

True story: I was at a conference once and somebody—I think it was Recurly, but maybe it was Auth0—changed their API in a way that utterly borked my login process.

My code had telemetry and failsafes, though, and handled it fine. Paranoia FTW.

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

22/ Moving up the call chain: Application code is like high-level infrastructure. It delegates, probably to the high-level infrastructure, which turns around and delegates to low-level infrastructure.

That raises the question: how do you TEST things that eventually delegate to low-level infrastructure and talk to the outside world? Without using mocks, stubs, or spies?

And that's where Nullables come in.

("Finally!" some of you say. "Won't this guy ever shut up?" the rest of you say.)

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

23/ Nullables are production code that can be turned off.

Let's take a simple example. You've got a low-level wrapper for Stdout. If it's Nullable, then you can either say `Stdout.create()`, in which case it works normally, or you can say `Stdout.createNull()`, in which case it works normally in *every respect* except that it doesn't write to stdout.

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

24/ "Working normally" isn't such a big deal for Stdout, because there's no real logic or behavior there, but it is a big deal for your higher-level code that does have logic. For example, a Terminal that uses Stdout and has the ability to draw boxes that are exactly the width of the terminal.

(I dunno. It's hard coming with examples. This is all off the cuff. See the article for actual source code examples with more than 10 seconds of thought in them: https://www.jamesshore.com/v2/projects/testing-without-mocks/testing-without-mocks)

www.jamesshore.comJames Shore: Testing Without Mocks: A Pattern Language

**James Shore** @jamesshore · Dec 23, 2022 *

Dec 23, 2022 *

James Shore @jamesshore

25/ Your low-level infrastructure is Nullable, the high-level infrastructure that uses it is Nullable, and the application logic is Nullable. It's Nullables all the way down. (Except in your logic layer, if you're lucky enough to have one, which is beautiful and pure and mostly nonexistant for us Morlocks.)

And the thing about Nullables is that they run *real code* and *work normally* in *every way* except that they don't actually write to Stdout, or make HTTP calls, or whatever.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

26/ That's kind of a big deal for your tests, because it means that, when somebody changes your Terminal abstraction in a totally cool, awesome, smart way, and THEY BREAK ALL YOUR SHIT, your tests fail.

Let me repeat that: your tests actually fail.

You learn that they broke your shit, and you fix it.

I don't know about you, but that's worth a certain amount of ugly tradeoffs to me.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

27/ So buckle up, because I'm about to reveal the granddaddy of all tradeoffs: the magic that makes this work.

Nullables run real code because, way, way down at the bottom of your dependency chain, in the lowest of low-level infrastructure wrappers, they're implemented with an Embedded Stub.

**Eric Maino** @ericmaino@hachyderm.io · Dec 23, 2022

Dec 23, 2022

Eric Maino @ericmaino@hachyderm.io

@jamesshore love the the thread. I’m not a big fan of the term Nullables. It’s not wrong, I know what it means, however I’m desiring a different term. I don’t know what it is, but will share ideas when I have them.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

@ericmaino Thanks! The term's somewhat well established at this point, but I don't love it either, so I'm willing to change it if you think of something much better.

**Barney Dellar (he/him)** @Bdellar@mastodon.social · Dec 23, 2022

Dec 23, 2022

Barney Dellar (he/him) @Bdellar@mastodon.social

@jamesshore @ericmaino Sorry, I haven’t read the article properly yet. I’ve got a filthy cold… But am I right in thinking that this is basically the “Null Object” pattern? If so, @mfowler has renamed this to the “Special Case” pattern in his latest version of Refactoring.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

@Bdellar @ericmaino @mfowler It was originally inspired by the Null Object pattern, but it ended up completely different. Null Object is more like a test double, in that it's a separate implementation of an interface, whereas Nullables are the same implementation with a tiny encapsulated stub.

**Marcus Rådell** @marcusradell@hachyderm.io · Dec 23, 2022

Dec 23, 2022

Marcus Rådell @marcusradell@hachyderm.io

@jamesshore using a DI framework doesn't reduce coupling in any meaningful way, as you say.

It *hides* coupling and encourages excessive coupling.

**wakingrufus** @wakingrufus@bigshoulders.city · Jan 8, 2023

Jan 8, 2023

wakingrufus @wakingrufus@bigshoulders.city

@jamesshore
I love this concept for FP as well as OOP. I implement a very similar concept when testing purely functional code. I take advantage of ad-hoc polymorphism and default function parameters in #Kotlin https://wakingrufus.neocities.org/adhoc-polymorphism

wakingrufus.neocities.orgUsing Ad-hoc Polymorphism to Test Functional Kotlin

**Marcus Rådell** @marcusradell@hachyderm.io · Dec 23, 2022

Dec 23, 2022

Marcus Rådell @marcusradell@hachyderm.io

@jamesshore

note: for me that isn't into the formal definitions;

I thought interaction-based tests would test return values (interactions make me think of event sourcing and reducers) and that state-based tests would check internal state of mocked dependencies.

I still think you should use the formal language as long as it can present a mental model that anchors the meaning of the definitions.

**James Shore** @jamesshore · Dec 23, 2022

Dec 23, 2022

James Shore @jamesshore

@marcusradell Think of it in terms of what the code does. Interaction-based test check how the code *interacts* with dependencies, and state-based tests check the *state* of the code under test.

**Marcus Rådell** @marcusradell@hachyderm.io · Dec 23, 2022

Dec 23, 2022

Marcus Rådell @marcusradell@hachyderm.io

@jamesshore when seen from the perspective of the unit/code under test, it instinctively clicks for me. (Or the curse of knowledge gives me that sensation, but I think not.)

**ysb33r** @ysb33r@mastodon.social · Dec 23, 2022

Dec 23, 2022

ysb33r @ysb33r@mastodon.social

@jamesshore So here I sit at the airport, sipping fresh (overpriced) orange juice, whilst waiting for some arrivals, and I start to read this.

Thanks for spending the time writing this. It is fn awesome content.

**Rob Fulwell** @rob_fulwell@techhub.social · Dec 29, 2022

Dec 29, 2022

Rob Fulwell @rob_fulwell@techhub.social

@jamesshore I'm definitely going to follow up with your 40 pages but in the meantime, what do you mean by, 'sociable' here?

**James Shore** @jamesshore · Dec 29, 2022

Dec 29, 2022

James Shore @jamesshore

@rob_fulwell Tests that run code in dependencies. There’s a section with more detail in the article.

**Anton Stöckl** @TonyBologni@mastodon.social · Dec 29, 2022

Dec 29, 2022

Anton Stöckl @TonyBologni@mastodon.social

@rob_fulwell @jamesshore

I’m other words than James‘:
Tests that don’t micro test smallest units (class/method/function) in isolation (all collaborators mocked) but e.g. a use whole use case.
The line to integration tests is blurry, but the terms are overloaded anyways.

If it’s only business logic (no persistence, etc.) then many people (like me) still call this unit tests.
But the discussion about that quickly gets religious.

Drag & drop to upload

Recent searches

Search options

Administered by:

Server stats:

Recent searches

Search options

Administered by:

Server stats:

Back