codebase
computational social science workflows for students and scholars
My workflows are guided transformations and sensemaking of data (in R, mostly). Everything I do here you can do as well, for free, on a reasonably modern computer. You can choose to read or code along.
Recommended guides to complement the content on this page:
These cloud-based workflows transform and explore exports from research databases (SCOPUS, Web of Science, etc.) using network and language processing methods. An embedded Gemini AI assistant can help explain the workflows or adapt them to your needs.
Identify large communities in a co-author network. Use network methods to obtain a purposive sample of papers.
Conduct a basic content analysis of a text corpus. Find most frequent words, compare across categories and over time.
Make sense of a large corpus of texts using network methods and an LDA topic model, using algorithms to sample, summarize, and relate.
Create a co-author network from a SCOPUS database export.
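The core step behind a co-author network can be sketched in a few lines. This is a minimal illustration in Python, not the R code the notebooks use, and it rests on assumptions: that each paper's authors arrive as one string separated by semicolons (check your own export, since delimiters vary), and that a tie is simply two names appearing on the same paper. The function name `coauthor_edges` and the sample rows are hypothetical.

```python
from collections import Counter
from itertools import combinations


def coauthor_edges(author_fields, sep=";"):
    """Count how often each pair of authors appears on the same paper."""
    edges = Counter()
    for field in author_fields:
        # Split the raw field, trim whitespace, drop blanks and duplicate names.
        authors = sorted({name.strip() for name in field.split(sep) if name.strip()})
        # Every unordered pair of authors on one paper counts as one tie.
        for pair in combinations(authors, 2):
            edges[pair] += 1
    return edges


# Hypothetical author fields, one string per paper (assumed ";"-separated).
papers = [
    "Lee, A.; Khan, B.; Ortiz, C.",
    "Khan, B.; Lee, A.",
    "Ortiz, C.",
]

edges = coauthor_edges(papers)
for (a, b), weight in sorted(edges.items()):
    print(a, "--", b, "weight:", weight)
```

The resulting weighted edge list is the usual input for network tools, including community detection. Single-author papers contribute no ties, and matching on raw name strings will treat spelling variants of one person as different authors.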
Basic data transformation
Import and export data in different formats, some basic useful transformations.
Use Excel as a CAQDAS by converting text from Word document(s) to an Excel file with one document/paragraph/sentence per row.
Generate a QR code based on a URL. Randomly assign the order of presentations or anonymize student names. Curve grades based on a target mean and median.
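Two of these classroom tasks are simple enough to sketch. The example below is in Python rather than the notebooks' R, and it is an illustration under assumptions, not the notebook's method: the curve is the simplest possible one, a constant shift toward a target mean, and it ignores the median. The function names, student names, and grades are hypothetical.

```python
import random
from statistics import mean


def presentation_order(names, seed=None):
    """Return the names in random order; a fixed seed makes the draw repeatable."""
    rng = random.Random(seed)
    order = list(names)  # copy so the caller's list is left untouched
    rng.shuffle(order)
    return order


def curve_to_mean(grades, target_mean, cap=100.0):
    """Shift every grade by the same amount so the class mean moves to target_mean.

    Grades are capped at `cap`; if the cap is hit, the final mean ends up
    slightly below the target.
    """
    shift = target_mean - mean(grades)
    return [min(cap, g + shift) for g in grades]


# Hypothetical class data.
students = ["Ana", "Bo", "Chen", "Dee"]
grades = [62.0, 70.0, 78.0, 90.0]

print(presentation_order(students, seed=2025))
curved = curve_to_mean(grades, target_mean=80.0)
print(curved, "mean:", mean(curved))
```

A constant shift preserves the ranking of students and the gaps between them. Hitting a target mean and a target median at once needs a transformation with at least two free parameters.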
These notebooks are static chunks of annotated R code along with their output. These programs require a bit more juice to run, so they're best run on your laptop or desktop. To code along, please install R and RStudio on your machine, in that order.
Requirements: An internet connection and a Twitter developer account with an approved Academic Track project.
Applications: Analyzing and representing changes in texts over time.
Applications: Inductively identifying key themes and identifying "fingerprints" of authors or publications in large longitudinal text data. Purposive sampling of papers based on theme rather than specific keywords. Measuring and representing similarity/dissimilarity/change over time (a good starting point for literature reviews).
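One simple way to put a number on similarity and change over time is to compare word-count vectors from adjacent time slices. The sketch below is in Python rather than the notebooks' R and is only an illustration: it uses cosine similarity on raw word counts, which is an assumed stand-in and not necessarily the measure these notebooks use, and the three-line corpus is made up.

```python
import math
import re
from collections import Counter


def word_counts(text):
    """Lowercase the text and count alphabetic tokens."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(a, b):
    """Cosine between two word-count vectors: 0 means no shared words,
    1 means identical word proportions."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


# Hypothetical toy corpus: one short text per time slice.
corpus_by_year = {
    2000: "technology shapes organization",
    2010: "technology shapes work and organization",
    2020: "platforms shape digital work",
}

years = sorted(corpus_by_year)
counts = {year: word_counts(corpus_by_year[year]) for year in years}

# Compare each slice with the one that follows it.
for earlier, later in zip(years, years[1:]):
    sim = cosine_similarity(counts[earlier], counts[later])
    print(earlier, "->", later, "similarity:", round(sim, 2))
```

A drop in similarity between adjacent slices flags a period where the vocabulary shifted. On real corpora, stop-word removal and stemming or lemmatization usually come first, because raw counts treat "shape" and "shapes" as unrelated words.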
11/19/25 Added a QR code generator to Classroom Tools notebook
10/01/25 Updated doc2excel to handle odd formatting
06/23/25 Updated TIM research conversation notebook for the 2025 academy meeting
Coming soon...
Tracing evolving meanings of central vocabularies using diachronic embeddings
Creating a co-citation network from data exported from Web of Science/SCOPUS
Codebase is primarily a by-product of one of my research projects, sometimes referred to as JRNLS, where I explore how social processes are entangled with knowledge production at the level of the community.
Codebase was recently used to conduct a machine-assisted literature analysis featured in Faraj, S., & Leonardi, P. M. (2022). Strategic organization in the digital age: Rethinking the concept of technology. Strategic Organization, 20(4), 771-785.
To cite this page for code or research methodology, please use:
Bhardwaj, A. (2025). Codebase: Computational social science tools for students and scholars. https://www.anandb.net/code