I've reviewed every PDF editor out there - then I had ChatGPT build me a better one ...
SMILES Pair Encoding (JCIM) first learns a vocabulary of high frequency SMILES substrings from a large chemical dataset (e.g., ChEMBL) and then tokenizes SMILES based on the learned vocabulary for ...
CodeSim is a research toolkit that implements and benchmarks 23 different unsupervised similarity measures for detecting code clones in Java source code. This work addresses the critical challenge of ...
Developed in Python with LangGraph and Streamlit, the system translates user questions into optimized SQL queries, validates them with dry-run checks, enforces guardrails such as partition filters, ...
My paper on explainable AI is cited 1000 times in just three years after publication. Back then, we were deep into programming deep learning algorithms while working at a hospital, where high-stakes ...