About Me

Hello! I am Aisha Khatun, a computer science graduate student at the University of Waterloo with interests in AI, ML, and everything Data! As a graduate researcher, I analyze the capabilities and limitations of Large Language Models (LLM) in answering questions about sensitive topics, instruction following, and prompt variations. I am passionate about AI, solving complex problems by applying ML techniques, and extracting valuable information through data analysis. Let's connect and discuss exciting opportunities in the field of AI and computer science!
www.linkedin.com/in/tanny411 | aysha.kamal7@gmail.com | aisha.khatun@uwaterloo.ca

Technical Skills

Languages: (Proficient) Python, SQL, SPARQL, Javascript, C++ (Comfortable) Java, PHP, Scala
Frameworks and tools: NumPy, Pandas, Spark, PySpark, Matplotlib, Plotly, Seaborn, SciPy, Scikit-Learn, Keras, Pytorch, Fast.ai, Tensorflow2, HuggingFace, Hadoop, Airflow, Tableau, Power BI
Database: MongoDB, PostgreSQL
Version Control and Collaboration: Git, GitHub, GitLab, Gerrit, Phabricator
Web Technologies: HTML/CSS, JQuery, NodeJS, React-Redux, Express.js, Flask

Experience

See details, volunteer experience, and awards in the experience and education page.

Graduate Research Student
University of Waterloo

I am working with Professor Daniel G. Brown on the ability of LLMs to respond appropriately and consistently to sensitive topics with prompt variations. We analyzed over 30 models, both open and closed source, and found that most models can barely understand the task at hand. They are sensitive to slight variations in prompt wording and have different responses in different settings.
Paper. Paper.

Research Data Scientist (NLP)
Wikimedia Foundation

I worked with the Research Team as a Research Data Scientist (NLP) to develop Copyediting as a structured task. We used Wiktionary to curate a list of commonly misspelled words and detected misspellings in Wikipedia articles in all languages in an automated fashion.
Meta page, Report, Code.
Currently, I am working on addressing deployment bottlenecks and improving the Wikipedia link recommendation system accuracy by creating a language-agnostic model to replace the 300+ individual language-dependent models.
Meta page, Report Code.

Data Analyst
Wikimedia Foundation

As a Data analyst, I worked with the Search and Analytics team to help scale the Wikidata query service. We performed data analysis on Wikidata dumps and combined it with user's SPARQL query analysis to identify the most frequently queried Wikidata subgraphs. This helped inform decisions to split Wikidata to handle the ever-increasing size of the graph (Ticket).
Analysis work: wikitech/User:AKhatun.

Outreachy Intern
Wikimedia Foundation

Selected as an Intern in Outreachy to work with the Abstract Wikipedia project under Wikimedia Foundation. Performed data analysis and source code similarity analysis using unsupervised machine learning to identify important modules for centralization in Abstract Wikipedia.
See more in Abstract_Wikipedia/Data

Machine Learning Engineer
Therap BD Ltd.

Performed data analysis and applied machine learning algorithms on computer vision and time-series data for pattern recognition and prediction generation. Used state-of-art face detection models, OCRs, and sensor readings for fall detection. Analysed sleep patterns from sleep-mats to identify abnormal activities at night, and analysed server logs to identify ideal downtime for application release.

Research Assistant
SUST NLP Lab

Worked on developing large datasets and implementing transfer learning based deep learning approaches for Authorship Attribution in Bengali Literature, thus far surpassing the existing systems. Work available in GitHub. Datasets available in Mendeley.

Education

See details and online courses I have done in the experience and education page.

Cheriton School of Computer Science, University of Waterloo

Master of Mathematics in Computer Science and Engineering

CGPA: 3.96/4.00

Advisor: Daniel G. Brown

Courses taken:
CS848 F22: The Art and Science of Empirical Computer Science
CS848 F22: Knowledge Graphs
CS889 W23: InfoVis for AI Explainability
CS889 S23: Value-Driven Technology

Shahjalal University of Science and Technology

B.Sc. in Computer Science and Engineering

CGPA: 3.89/4.00 (2nd in Class)

Completed undergraduate thesis on Authorship Attribution in Bangla Literature. Applied deep learning NLP techniques to achieve high performing scalable systems.
Advisor: Md Saiful Islam, Ayesha Tasnim
Thesis Report

Core Courses: Algorithm Design and Analysis, Data Structure, Database System, Object Oriented Programming, Software Engineering and Design Patterns, Technical Writing and Presentation, Artificial Intelligence, Introduction to Data Science, Machine Learning

Publications

Aisha Khatun & Daniel G. Brown. (2024). A Study on Large Language Models' Limitations in Multiple-Choice Question Answering.

Aisha Khatun and Daniel G. Brown. (2023). Reliability Check: An Analysis of GPT-3’s Response to Sensitive Topics and Prompt Wording. In Proceedings of the 3rd Workshop on Trustworthy Natural Language Processing (TrustNLP 2023), pages 73–95, Toronto, Canada. Association for Computational Linguistics.

Khatun, A. et al. (2021) Authorship attribution in Bangla literature (AABL) via transfer learning using ulmfit, ACM Transactions on Asian and Low-Resource Language Information Processing.

Khatun, A., Rahman, A., & Islam, M. S. (2019, December). Authorship Attribution in Bangla literature using Character-level CNN. In 2019 22nd International Conference on Computer and Information Technology (ICCIT) (pp. 1-5). IEEE.

Khatun, A., Rahman, A., Chowdhury, H. A., Islam, M. S., & Tasnim, A. (2020). A Subword Level Language Model for Bangla Language. In Proceedings of International Joint Conference on Computational Intelligence (pp. 385-396). Springer, Singapore.

Chowdhury, H. A., Imon, M. A. H., Rahman, A., Khatun, A., & Islam, M. S. (2019, December). A Continuous Space Neural Language Model for Bengali Language. In 2019 22nd International Conference on Computer and Information Technology (ICCIT) (pp. 1-6). IEEE.

Media

University of Waterloo News: LLMs Validate Misinformation

DefenseOne: Generative AI can push misinformation

Projects

GitHub repositories that I've built.

GPT3-Reliability-Check

Jupyter Notebook 3 1

Wikidata-WDQS-Analysis

Analysis on Wikidata and Wikidata Query Service to help figure out ways to scale the service. Repository contains analysis code, written articles on the findings and visualizations.

Jupyter Notebook 1 0

Authorship-Attribution-using-Transfer-Learning

Jupyter Notebook 0 1

Competitive-Programming

Contains my codes for various programming competitions and practices including learning Data Structure and Algorithms.

HTML 0 0

Machine-Learning-Projects

This repository contains some collection of my machine learning, deep learning and AI projects. This includes Kaggle, Courses and Personal projects.

Jupyter Notebook 0 0

GroupProject

A social networking Web App aiming to make group interaction easier and more organized. Features include posting, commenting, files upload and file/folder organizing system along with a group shared whiteboard API for group sharing experiences, all within one or more groups as organized by the members of a group.

PHP 1 0

Blog posts

Articles I've written. See all blog posts here.

First Time Attending An Acl Conference

Jul 17, 2023

Me and my future goals

Feb 03, 2021

Modifying Expectations

Jan 20, 2021

What's Abstract Wikipedia?

Jan 04, 2021

Struggle and Grow

Dec 21, 2020

Internship with Wikimedia

Dec 05, 2020

Aisha Khatun