Today, large language models (LLMs) can generate human-like responses from patterns derived from vast data sets, with little effort on the part of users and usually within minutes (Mohaimenul et al., 2024). They also offer users decision-making support and can even undertake analysis of textual documents themselves (Essien et al., 2024). However, they pose a risk to users of “cognitive off-loading” (Gerlich, 2025) or undertaking what passes for critical thought instead of leaving such thinking t…
Read moreToday, large language models (LLMs) can generate human-like responses from patterns derived from vast data sets, with little effort on the part of users and usually within minutes (Mohaimenul et al., 2024). They also offer users decision-making support and can even undertake analysis of textual documents themselves (Essien et al., 2024). However, they pose a risk to users of “cognitive off-loading” (Gerlich, 2025) or undertaking what passes for critical thought instead of leaving such thinking to the user. In this paper, we describe how we adapted Richard Paul and Linda Elder’s (2006) selection of the marks of critical thinking to analyze a brief literature review and simultaneously evaluate AI-generated analysis and evaluation of the same literature review applying the exact same criteria. While we note some benefits, especially with respect to summarization, we also find deficiencies in the AI analysis and propose possible solutions for strengthening critical thinking and related skills that users need when they seek the assistance of AI in analysis of texts.