About me


Babbling and Rambling of a Data Baffoon

My story

Hello there, I am János, also known as “Jamesz”, and welcome to my humble little website which aims to catalogue and present all my projects which contribute to my understanding and work with Data. My curiosity and eagerness to learn has lead me to Data Analytics to understand problems.

I have always been fascinated to understand issues and find their characteristics, since I have started my education within Biological Sciences, and onwards. My education and career within science and industry has given me an analytical and critical-thinking mindset to strategically investigate problems and find solutions to them.

I have developed a strong foundation in the life sciences and a passion for using data to uncover meaningful insights. I am excited to bring my technical, analytical and ever-expanding skills to the field of data. I have applied my skills and developed myself to work with complex data, identifying trends, and out-of-the-box thinking to reframe and view problems from a new angle. I have a deep seated interest & experience in data analysis & science as I have held roles focusing on various facets of data from developing bionformatic pipelines to process Next-Generation Sequencing data, all the way to creating Power BI-based tools for better decision making and data insight showcase.

For my CV, click on the link.

For Diplomas & Certifications, see the Certifications section of my website.

For Data Projects, which I have worked on, see the Projects section of my website.

Skills

Technical Skills

Skills Competency
Excel Pivot Tables, Dashboard & Reporting, LookUps, Macros, Visualization, Statistical modelling, Statistical & Data Analysis, Data Quality Checks & Processing
Python Web scraping, Text scraping & parsing, Data Analytics (Pandas, Numpy), Machine Learning (Selenium, PyTorch), Visualization (Matplotlib), Bioinformatics & Genomic data analysis (Anaconda, Mamba, BioPython)
R Data analytics & processing (Tidyverse), Visualization (ggplot2), Genomic data processing (QDNASeq), Data Modelling (Scater), Statistical testing & Modelling
Power BI Power Automate, Power Query, DAX, Visual Dashboard design, Data loading & transformation pipelines
SQL (BigQuery, PostgreSQL) Data retrieval & transformation (Create, Insert, Select, Merge, Joins, Group By, Order By), Data analytics
Git & Github Project Management, Back-up & Version Control
VS Code Coding environment used for general coding needs
Documentation Markdown files, Jupyter Notebooks for reproducible coding
Snowflake Data engineering, Layered data importation & transformation
Google Cloud (BigQuery) Google Cloud access & data retrieval and handling through BigQuery
Shell Virtual Environment, Bioinformatic Pipeline Development, Iterating, Genomic Data cleaning & processing (FastQC, Bowtiw2, Samtools, Picard, Bedtools, Macs2), Executing Python Script
Veeva Vault Cloud SaaS platform, RIM/QMS/DMS management, Data & Report retrieval, Handling of Quality Document & Processes (CAPA, Audits, Supplier information)
Sharepoint Central Document Management
MS Office High-level use and familiarity with Microsoft Office Suite (PowerPoint, Onedrive, Word, Excel)

Soft Skills

  • Project Management
  • Communication & stakeholder management
  • Supplier & Business Analytics
  • Problem-solving
  • Agile methodology
  • Presentation Skills & Story telling
  • Process & Solution development
  • Quality mindset & Critical thinking

Other: Project Management, Veeva Vault, Github, Sharepoint, MS Office

My Aim

I have held a deep-seated drive to apply myself with data in projects that have meaning and to make the world a little bit better and trying to understand problems. One of my drives is to apply this to Clinical & Medical Data analysis as I believe if we better understand how biological processes work, we can make the life of so many people better, and I wish to be part of that.

What I am working on now

I am currently working on the following data projects to further broaden my skills in data analysis, predictive modelling & database management: