Skip to main content

Using an LLM to Find Themes in an Academic Career

thematic analysis and GenaI for analyssing a career steps involved


Brief recap

In an earlier post, “Same prompt, four AIs: why answers aren’t always the same”, I looked at what happened when the same prompt was given to four different LLMs. Unsurprisingly, perhaps, the answers were not identical. That raised an interesting follow-on question: what was the prompt actually trying to do?

The answer is that it was trying to support a form of thematic analysis. In that case, the object of analysis was an academic profile, including my Google Scholar profile. The version I discuss here is a modified prompt, used with Claude.ai, where I uploaded a CV and asked the system to find other appropriate public resources connected with a named person and institution. The aim was not simply to summarise the CV, but to identify visible and less visible themes across a whole career.

Why use thematic analysis?

Thematic analysis is widely used by academics, especially in qualitative research. Braun and Clarke’s well-known paper, Using thematic analysis in psychology (V. Braun and V. Clarke, Qualitative Research in Psychology, 3(2), 2006, pp. 77–101, DOI: 10.1191/1478088706qp063oa), sets out a staged approach involving familiarisation with the data, coding, searching for themes, reviewing themes, defining and naming themes, and producing the final account.

For a personal academic reflection exercise, this is useful because a career is rarely just a list of outputs. A standard CV summary might say what roles someone held, what they published, or which grants and activities they were involved in. A thematic analysis asks a different sort of question: what patterns seem to run through this work?

Some of those patterns may be semantic themes: themes visible on the surface of the material. For example, a CV might show repeated work around computing education, employability, widening participation, or public engagement. Others may be latent themes: deeper or more interpretive patterns that are not stated directly but may be inferred across the material. For example, the profile might suggest a continuing concern with access, community benefit, interdisciplinarity, or building bridges between formal education and informal learning.

What the prompt does

The prompt asks the LLM to analyse an uploaded CV alongside other available and appropriate resources on a named person and their institution or organisation. It sets the scope as the whole career, asks for all available data to be considered, and requests both semantic and latent themes.

One useful feature is that the prompt asks the LLM to ask questions before beginning the analysis. This is not a minor detail. The questions are part of the value, because they force clarification around purpose, scope, sources, and constraints before the tool starts interpreting the material.

In my case, the output was useful for reflection. For someone with an interdisciplinary publication record and non-traditional academic activity, such as setting up Code Clubs, leading STEM-related groups, or undertaking public and community roles, the thematic lens can surface connections that may not be obvious from a chronological CV.

Using the output cautiously

This is not about outsourcing judgement to an LLM. It is better understood as a practical but imperfect tool for structured reflection. The themes it produces should be treated as provisional suggestions, not objective truths.

There are several reasons for caution. First, the analysis is only as good as the sources. If the CV omits important teaching, leadership, collegiality, public engagement, or service work, the LLM may not identify those themes. That absence can still be useful: it may show where the CV needs to capture more of the career story.

Second, if the LLM searches for public sources, it may find irrelevant, incomplete, or incorrect material. Third, latent themes are interpretive. They can be helpful, but they can also be flattering, speculative, or biased. The user needs to check whether each theme is actually supported by the evidence.

There are also privacy and ethical issues. Uploading a CV to an AI tool should be done with care, especially if it includes personal data, unpublished work, confidential projects, student information, or institutional material.

What could be improved next?

A useful future improvement would be a two-stage process. First, ask the LLM to list the sources it has found and ask the user to confirm which should be included. Only after that should it carry out the thematic analysis. This would reduce the risk of spurious sources being used.

Another improvement would be to ask the LLM to provide evidence for each theme, ideally by quoting or citing the relevant CV sections, publication titles, abstracts, institutional pages, or other sources. Future versions of the prompt could also specify extra sources directly, such as an ORCID profile, Google Scholar page, institutional profile, or selected public engagement evidence.

How I would use the output

I would use the generated themes as a reflective aid; comparing them with my own understanding of my career, looking for gaps, challenging anything that feels too neat, and considering whether the CV itself needs rewriting to make important themes more visible.

Different LLMs may produce different results from the same prompt, as the earlier post showed. That is not necessarily a weakness, but it is a reminder that the output is an interpretation, not a verdict.

The prompt

Here is the lightly corrected version of the prompt I used:

From the uploaded file and the other available appropriate resources on <person’s name and institution/organisation>, carry out a thematic analysis. Before doing the analysis, please ask any questions you need, then do the analysis.

Please use the steps in Braun and Clarke (V. Braun, V. Clarke, Using thematic analysis in psychology, Qualitative Research in Psychology, 3(2), 2006, pp. 77–101, DOI: 10.1191/1478088706qp063oa) to do this.

Constraints

Scope: Whole career
Depth of data: All available data, including publication titles, abstracts, and other appropriate sources
Analytical lens: No specific research question
Target themes: Look for semantic themes and then provide a separate analysis of latent themes

Please ask any clarification questions before starting the analysis.




Try adapting it cautiously, and remember: the value is not just in the themes the LLM suggests, but in how those themes help you reflect, revise, and question your own career narrative.




All opinions in this blog are the Author's and should not in any way be seen as reflecting the views of any organisation the Author has any association with. Twitter @scottturneruon

Comments

Popular posts from this blog

GenAI Productivity: Ideas to project proposal 1

One of the ways I use Generative AI with students is to take basic ideas for projects, usually a title, and get these tools to greater ideas and start of a project proposal. This is with all the usual caveats  Check the references (if any); It is going to be basic, so extend it. In this example I am going to use Co-pilot but the ChatGPT, etc can be used, employing a few basic prompt engineering basics: personas (who is the target audience?) and Templates (how do I want it to look?) to start this process. Example:  Project ideas for MSc Data Intelligence students (persona)  on a particular topic. The reply will include subheadings and relevant (hopefully) content for  TITLE, INTRODUCTION, PROBLEM STATEMENT. The prompt: " Taking the topic "Leveraging open-source tools to measure and present academics publications automatically from public domain data.". Give five innovative projects for a Master's level student dissertation in Data Intelligence. Each project example wi...

GenAI Productivity: Ideas to project proposal ideas from Google Scholar

From Google Scholar to Project Ideas or Using AI to Map the Future of Your Research Generated as well by Google Gemini Have you ever looked at a researcher’s Google Scholar profile and felt overwhelmed? Between the long lists of citations and dense technical titles, "connecting the dots" of a decade-long career is a massive cognitive lift. Whether you are a student hunting for a dissertation topic or a professional scouting for a collaborator, understanding the trajectory of research is harder than reading the papers themselves. In my latest experiment, I used Google Gemini to see if it could bridge this gap. I gave it a specific challenge: Analyze my own research profile and design 10 compelling project ideas for a final-year Computer Science student. The Prompt: Turning Data into Direction The secret to a good AI output is giving it a clear "anchor." By providing a URL to a live dataset (my Scholar profile), I bypassed the need to copy-paste thousands of words....

Getting multiple viewpoints with ChatGPT

Well sort of! There are approaches where we can get the generative AI to look at a problem from multiple perspectives (or personas) and bring the ideas generated, ideally informed by the others. to a final plan. One of the main strategy is called Tree of Thoughts (see here for more detail  https://www.forbes.com/sites/lanceeliot/2023/09/08/prompt-engineering-embraces-tree-of-thoughts-as-latest-new-technique-to-solve-generative-ai-toughest-problems/?sh=5ce79bdb2c8b ). The central idea is get a number of expert opinions, allow potential cross-fertilization of ideas, come up with actions or plans. Let see this action.  Scenario: Find out about the UK Government's plans on Disability support and then use Tree of Thoughts to produce some ideas for a company making disability equipment based on their website. Google's Gemini will be used. Stage 1 "UK Governments plans on Disability support ": Prompt:  Read, convert to plain text and consolidate information from the followi...