What Are DOK Levels? A Teacher's Guide to Depth of Knowledge
DOK levels, or Depth of Knowledge levels, are a four-part framework developed by Dr. Norman Webb to measure the cognitive complexity required by a task or assessment item. The four levels range from simple recall (Level 1) to extended thinking over time (Level 4). Unlike difficulty, DOK measures how deeply students need to think, not how hard a task is.
This guide breaks down all four DOK levels with cross-subject examples, clears up the most common misconceptions, and explains how to use DOK when designing assessments that match the thinking demands of state tests.
What are DOK levels?
Dr. Norman Webb, a researcher at the Wisconsin Center for Education Research, created the Depth of Knowledge framework in 1997. His original goal was practical: he needed a way to evaluate whether standardized assessment items were asking students to think at the same cognitive level that the corresponding curriculum standards required.
That alignment problem remains equally relevant today. Teachers can cover a standard in instruction and then assess it at a lower cognitive level without realizing it. DOK gives educators a shared language to catch that mismatch.
Here’s the distinction that matters most: DOK measures cognitive complexity, not difficulty. Webb put it plainly in a 2021 Edutopia article he co-authored: “Coming up with the 37th digit of pi is a very difficult task. But it’s not a complex task.” A student can struggle mightily with a calculation and still be operating at DOK Level 1, because the thinking required is procedural rather than deep.
The four DOK levels are:
- •Level 1: Recall and reproduction
- •Level 2: Skills and concepts
- •Level 3: Strategic thinking and reasoning
- •Level 4: Extended thinking
The four DOK levels explained
Quick reference: DOK levels at a glance
| DOK Level | Name | Cognitive Work | Example Task | Common In |
|---|---|---|---|---|
| 1 | Recall | Remember, reproduce | Name the three branches of government | Warmups, vocabulary checks |
| 2 | Skills/Concepts | Apply, infer, compare | Explain how a bill becomes a law | Most daily formatives |
| 3 | Strategic Thinking | Analyze, argue, evaluate | Argue whether the Electoral College should be reformed, with evidence | Common assessments, high-stakes formatives |
| 4 | Extended Thinking | Research, synthesize, create over time | Design a civic action project and present findings | Projects, capstones |
DOK Level 1: Recall and reproduction
At Level 1, students recall or reproduce specific information. The cognitive work is retrieval, not analysis. Students are matching, naming, or executing a memorized procedure without having to make decisions about how to do it.
Key verbs: define, recall, identify, state, label, list, match, name, calculate (routine procedure)
Classroom examples:
- •3rd grade math: Identify which fraction is larger: 1/2 or 1/4
- •5th grade science: Name the stages of the water cycle in order
- •8th grade ELA: Identify the main character and setting of the passage
- •High school social studies: State the date the 13th Amendment was ratified
Level 1 is foundational. Students need factual knowledge before they can reason with it. But an assessment built almost entirely of Level 1 items will not prepare students for the cognitive demands of most state tests.
DOK Level 2: Skills and concepts
Level 2 asks students to apply knowledge in a context that requires mental processing beyond pure recall. Students make inferences, compare ideas, organize information, or explain relationships. This is the most common level in daily classroom formatives, and for good reason: it checks whether students understand a concept, not only whether they can recite it.
Key verbs: describe, explain, summarize, classify, compare, infer, organize, estimate, interpret
Classroom examples:
- •3rd grade math: Explain why you need a common denominator to add fractions with unlike denominators
- •5th grade science: Describe why condensation happens in the water cycle
- •8th grade ELA: Summarize the major events of the story in chronological order
- •High school social studies: Compare the causes and effects of two historical events
Most of what teachers assess on a daily basis lives at Level 2. The challenge is making sure Level 3 thinking also gets a seat in the assessment.
DOK Level 3: Strategic thinking and reasoning
Level 3 requires students to reason, analyze evidence, and support conclusions. There is usually more than one defensible answer, and students need to justify their thinking. This is where academic rigor lives for most grade levels, and it is the primary target of state standardized assessments.
Key verbs: analyze, evaluate, critique, develop an argument, hypothesize, investigate, formulate, draw conclusions
Classroom examples:
- •3rd grade math: Explain a strategy for comparing two fractions with unlike denominators, and show why it works
- •5th grade science: Use local rainfall data to explain why a drought occurred in a specific region
- •8th grade ELA: Analyze how the author’s use of foreshadowing creates tension in the narrative
- •High school social studies: Evaluate a primary source document for bias and explain how that perspective influenced the historical record
DOK 3 items take more time to write well. The student does real intellectual work, not only retrieving or applying. A good DOK 3 question has no single “right” answer that can be looked up.
DOK Level 4: Extended thinking
Level 4 tasks involve sustained, iterative work over time. Students research, synthesize information from multiple sources, reflect on their process, and produce something original. This level is not typically found on formative assessments. It belongs in projects, research papers, and capstone work.
Key verbs: design, synthesize, create, research, produce, collaborate, justify over time
Classroom examples:
- •5th grade science: Design a two-week investigation comparing water evaporation rates in different environments and present findings
- •8th grade ELA: Compare foreshadowing techniques across three texts and produce a literary analysis
- •High school social studies: Synthesize multiple primary sources to argue a historical claim and present findings to an audience
Level 4 is powerful for deeper learning, but it does not fit in a 20-minute formative. Understanding this distinction helps teachers stop expecting DOK 4 from formatives and reserve it for the contexts where it belongs.
DOK vs. Bloom’s taxonomy: what’s the difference?
Teachers frequently conflate DOK and Bloom’s Taxonomy, but they measure different things and work best together.
Bloom’s Taxonomy, developed by Benjamin Bloom in 1956, categorizes the type of thinking a task requires: remember, understand, apply, analyze, evaluate, create. DOK measures the depth of cognitive complexity a task demands.
A useful shortcut: Bloom’s asks what kind of thinking; DOK asks how deeply must a student think.
Why this matters: A task can involve Bloom’s highest level (“Create”) but still be DOK 1. If students are given a template and asked to fill it in, the verb is “create” but the cognitive demand is routine and procedural. Conversely, a question using a basic verb like “explain” can be DOK 3 if it requires students to reason through a complex, non-routine situation.
| Bloom’s Taxonomy | DOK Levels | |
|---|---|---|
| Asks: | What TYPE of thinking? | HOW COMPLEX is the thinking? |
| Levels: | 6 (Remember through Create) | 4 (Recall through Extended Thinking) |
| Best used for: | Writing learning objectives | Evaluating task and assessment complexity |
| Created by: | Benjamin Bloom (1956) | Norman Webb (1997) |
Use both. Write your learning objectives with Bloom’s in mind, then check your assessment items with DOK to ensure the complexity matches what the standard requires.
Three DOK misconceptions that trip up teachers
Misconception 1: The DOK wheel IS the DOK framework.
The DOK wheel is a visual planning aid that lists verbs associated with each level. It is widely used in professional development, but it is not the actual framework. Webb himself has noted that verb placement on the wheel can be inconsistent: the same verb can operate at multiple DOK levels depending on context. The wheel is a useful starting point, not a rule book.
Misconception 2: Higher-level verbs always mean higher DOK.
If a student is asked to “analyze” a two-word sentence with an obvious answer, that question may operate at DOK 2 despite the higher-level verb. DOK is determined by the full context of the task, the complexity of the thinking required, and how independently the student must reason, not by the verb alone.
Misconception 3: Difficult means high DOK.
A hard arithmetic problem with many steps is still DOK 1 if the student is executing a memorized procedure. A straightforward-sounding question about a historical document can be DOK 3 if students must evaluate bias and justify their reasoning. Effort and complexity are different dimensions.
How to balance DOK levels in your assessments
Most state standardized assessments distribute items across DOK levels with roughly 20 percent at Level 1, 50 percent at Level 2, and 30 percent at Level 3, according to the ISBE Webb Align DOK overview (Webb, 2019). Level 4 rarely appears on standardized tests because of the time and collaboration it requires.
If a formative assessment consists mostly of multiple-choice recall questions, it is assessing at DOK 1. Students who score well on that assessment may still struggle on state tests that weight DOK 2 and 3 heavily. The goal is to build formatives that mirror the cognitive distribution of the assessments students will ultimately face.
A practical check: When building a formative, ask whether you have at least a few items that require students to explain, compare, or justify, not only identify or recall. Those Level 2 and Level 3 items are where the diagnostic value lives.
Wayground’s Official Library pre-tags every assessment item by DOK level. Instead of manually categorizing each question, teachers can filter items by level when building an assessment, making it straightforward to achieve the right cognitive distribution from the start.
How to write better questions at every DOK level
The most reliable way to identify and write at the right DOK level is to think about what the student needs to do mentally to answer the question, not what the question stem says.
DOK 1 question stems: “Define...”, “List...”, “Identify...”, “Name...”, “What is...”
DOK 2 question stems: “Explain why...”, “Compare...and...”, “Describe the relationship between...”, “Summarize...”, “How does...affect...”
DOK 3 question stems: “Develop an argument for...”, “Analyze how...”, “Evaluate the effectiveness of...”, “What evidence supports...”, “Justify your reasoning...”
DOK 4 question stems: “Design a...that...”, “Research and synthesize multiple sources to...”, “Produce a... that demonstrates...”
A practical process: Start with the learning objective. Identify the DOK demand the standard requires. Write items at that level. If the standard calls for students to “analyze,” your assessment items should ask for analysis, not recall.
If writing higher-DOK questions from scratch feels time-consuming, Wayground’s DOK Question Generator creates standards-aligned questions at any DOK level in seconds.
Putting DOK to work in your classroom
Understanding DOK levels changes how you look at every assessment you build. The key takeaways:
- •DOK measures cognitive complexity, not difficulty. A hard question can be DOK 1.
- •Level 2 and Level 3 are where most state test demands live. Building formatives that include those levels closes the gap with state test complexity.
- •The DOK wheel is a planning aid, not a rulebook. Context determines DOK, not verb choice alone.
- •A balanced formative does not require a DOK 4 task. It requires a thoughtful mix of Level 1, 2, and 3.
- •You do not have to manually tag every item. A pre-tagged item bank handles that work for you.
Choose one assessment you use regularly and evaluate it through the DOK lens. Count how many items sit at each level. The result will tell you quickly whether your formative is preparing students for the complexity of your state test.
Wayground’s Official Library includes thousands of standards-aligned items pre-tagged by DOK level. Browse items at the right complexity for your unit, or use the DOK Question Generator to build your own DOK-balanced assessments from scratch.
Find your way forward
Frequently Asked Questions
What does DOK stand for in education?
DOK stands for Depth of Knowledge, a framework developed by Dr. Norman Webb in 1997 to measure the cognitive complexity required by educational tasks and assessment items. It was originally designed to evaluate alignment between curriculum standards and standardized tests.
What is the difference between DOK levels and Bloom's taxonomy?
Bloom's Taxonomy categorizes the type of thinking a task requires, from remembering to creating. DOK measures the depth of cognitive complexity required to complete a task. They complement rather than replace each other: use Bloom's when writing learning objectives and DOK when evaluating whether an assessment matches the complexity of a standard.
Can the same question be at multiple DOK levels?
Yes. The same question can represent different DOK levels depending on what students need to do to answer it. "Describe the water cycle" is DOK 1 if students are reciting memorized steps, but DOK 2 or 3 if they must explain the mechanisms or connect them to a local weather event. Context determines DOK, not the question stem alone.
What DOK level should most formative assessments target?
A balanced formative assessment mirrors state test distributions: roughly 20 percent DOK 1, 50 percent DOK 2, and 30 percent DOK 3. Most teacher-built formatives skew too heavily toward DOK 1. Including a few explanation or justification questions raises the cognitive bar significantly.
Is DOK Level 4 required in every assessment?
No. DOK 4 is most appropriate for extended projects, research tasks, or capstone work that unfolds over multiple days or weeks. Most formative and summative assessments appropriately focus on DOK Levels 1 through 3.
How is DOK different from task difficulty?
DOK measures cognitive complexity, not how hard a task is to complete. A difficult multi-step calculation is still DOK 1 if it requires executing a memorized procedure. A simpler question about a historical document can be DOK 3 if the student needs to analyze bias and justify a claim.
What is the DOK wheel?
The DOK wheel is a visual planning tool that lists verbs associated with each DOK level. It is a useful classroom reference but is not Webb's actual framework. Verb placement on the wheel is a guideline, not a fixed rule, and the same verb can appear at multiple levels depending on the task context.
How do I know what DOK level a standard requires?
Read the standard and ask: what does the student actually need to do to demonstrate proficiency? Standards using verbs like "identify" or "recall" typically target DOK 1. Standards using "explain" or "describe relationships" target DOK 2. Standards calling for "analysis," "evaluation," or "justification" target DOK 3.