βοΈ The PubMed Open-Access (OA) subset shares a metadata for 35 Million articles. Suddenly, the existing article parser represents a Hugging Face dataset that was supported up until 2024. ncbi/pubmed Moreover, the pubmed data represent a compressed XLM which is beneficial for efficiency but limits processing technique application.
π’ To bridge this gap, excited to share pubmed_articles_iter project, which bridges this gap by providing: βοΈ 1. Downloader for the raw files βοΈ 2. No-string iterator over pubmed articles, utilized for converting them into JSON.
So far, BioASQ organizers as CLEF-2025 reveal the complete leaderboard of other submissions (see image below).
Our distil-tuned Qwen2.5-0.5B (BU-Team) has been ofically ranked as the second-best performing system in French! π«π· We also investigate the strongest recall of key aspects among all participants β demonstrating the value of adopted fine-tuning strategy.
π’ For those who interested in adopting streaming with bare minimum dependencies and setting up a GenAI powered demo in Web, this post might be relevant. Streaming support is inevitable for running local or remote models. Delighted to share the first part of the tutorial.
From which you will learn how to: βοΈ Use pure JS for fetching streaming from the specific provider (Replicate) βοΈ Use pure JS with custom proxy streaming provider (FastAPI)
β¨ TLDR: We review POST-based approaches for fetching data readers and adopting data parsers. Using FastAPI as a proxy, we explain how to take control over transferred data.
π’ For those who planning to start a PhD or research in the UK π¬π§ (including AI field in particular) but facing ATAS (Academic Technology Approval Scheme) issues. Excited to share the ultimate guide for dealing with ATAS refusals and how to write effective rebuttal letters.
π From the video you will find: 1. Why appealing an ATAS decision matters even if your visa is approved 2. Which docments to use in understanding the principles behind sponsorship decisions 3. Key tips for proper rebuttal letter structuring