Research & Data Analyst | OSINT & Language Technology

I am an interdisciplinary analyst earning degrees in linguistics, international relations (intelligence, security, and diplomacy), and a certificate in symbolic, cognitive, and linguistic systems, seeking opportunities in research analysis, OSINT, and language / data driven roles.I have experience conducting qualitative and quantitative research using Python (Jupyter, Pandas, SudachiPy), SQL, and other programming languages such as Java and C#. My work focuses on analyzing language, information, and open-source data to identify patterns, extract insights, and support evidence-based analysis.I am a native English speaker with working proficiency in Spanish, reading proficiency in French, Italian, and Portuguese. I am actively developing proficiency in Japanese, and have some familiarity with Mandarin Chinese and German. Additionally, I have 3 years of experience in post-secondary ESL education.
My Projects
Come back soon! :)
Tokenization Challenges in Japanese: Morphology, Orthography, and NLP Segmentation - Undergraduate presenter at Arizona State University's Graduate Linguistics, Applied Linguistics, and TESOL Symposium (Feb 2026).
Tools used: Python, SudachiPy, MeCab (UniDic / IPADIC), spaCy, GiNZA, Hugging Face tokenizer, Pandas, Jupyter, & Matplotlib.Political Discourse Keyword Analysis (Jan 2026)
Tools: Python, Regex, spaCy, Jupyter
Exploratory analysis of political speeches and public statements, focusing on keyword frequency, collocations, and framing across speakers or time periods.
Skills
Technical (Learning & Applied): Python - text processing, data analysis, scripting
SQL - querying, joins, basic analytics
Jupyter Notebooks - exploratory analysis and documentation
Pandas - data manipulation and analysis
Regex - text cleaning and pattern extraction
spaCy - tokenization, POS tagging, NER
SudachiPy - Japanese tokenization and morphological analysis
Streamlit - Simple NLP and data exploration apps
Git and Github - version control, project documentationUsed in coursework and independent projects; currently building NLP-focused analysesResearch and Analysis:
Corpus analysis (frequency, concordances)
Linguistic Analysis
Political Discourse Analysis
Text-based Data Analysis
Research Design and Literature Review
Technical and Academic Writing
Let's get in contact: