My story
Hello there, I am János, also known as “Jamesz”, and welcome to my humble little website which aims to catalogue and present all my projects which contribute to my understanding and work with Data. My curiosity and eagerness to learn has lead me to Data Analytics to understand problems.
I have always been fascinated to understand issues and find their characteristics, since I have started my education within Biological Sciences, and onwards. My education and career within science and industry has given me an analytical and critical-thinking mindset to strategically investigate problems and find solutions to them.
I have developed a strong foundation in the life sciences and a passion for using data to uncover meaningful insights. I am excited to bring my technical, analytical and ever-expanding skills to the field of data. I have applied my skills and developed myself to work with complex data, identifying trends, and out-of-the-box thinking to reframe and view problems from a new angle. I have a deep seated interest & experience in data analysis & science as I have held roles focusing on various facets of data from developing bionformatic pipelines to process Next-Generation Sequencing data, all the way to creating Power BI-based tools for better decision making and data insight showcase.
For my CV, click on the link.
For Diplomas & Certifications, see the Certifications section of my website.
For Data Projects, which I have worked on, see the Projects section of my website.
Skills
Technical Skills
| Skills | Competency |
|---|---|
| Excel | Pivot Tables, Dashboard & Reporting, LookUps, Macros, Visualization, Statistical modelling, Statistical & Data Analysis, Data Quality Checks & Processing |
| Python | Web scraping, Text scraping & parsing, Data Analytics (Pandas, Numpy), Machine Learning (Selenium, PyTorch), Visualization (Matplotlib), Bioinformatics & Genomic data analysis (Anaconda, Mamba, BioPython) |
| R | Data analytics & processing (Tidyverse), Visualization (ggplot2), Genomic data processing (QDNASeq), Data Modelling (Scater), Statistical testing & Modelling |
| Power BI | Power Automate, Power Query, DAX, Visual Dashboard design, Data loading & transformation pipelines |
| SQL (BigQuery, PostgreSQL) | Data retrieval & transformation (Create, Insert, Select, Merge, Joins, Group By, Order By), Data analytics |
| Git & Github | Project Management, Back-up & Version Control |
| VS Code | Coding environment used for general coding needs |
| Documentation | Markdown files, Jupyter Notebooks for reproducible coding |
| Snowflake | Data engineering, Layered data importation & transformation |
| Google Cloud (BigQuery) | Google Cloud access & data retrieval and handling through BigQuery |
| Shell | Virtual Environment, Bioinformatic Pipeline Development, Iterating, Genomic Data cleaning & processing (FastQC, Bowtiw2, Samtools, Picard, Bedtools, Macs2), Executing Python Script |
| Veeva Vault | Cloud SaaS platform, RIM/QMS/DMS management, Data & Report retrieval, Handling of Quality Document & Processes (CAPA, Audits, Supplier information) |
| Sharepoint | Central Document Management |
| MS Office | High-level use and familiarity with Microsoft Office Suite (PowerPoint, Onedrive, Word, Excel) |
Soft Skills
- Project Management
- Communication & stakeholder management
- Supplier & Business Analytics
- Problem-solving
- Agile methodology
- Presentation Skills & Story telling
- Process & Solution development
- Quality mindset & Critical thinking
Other: Project Management, Veeva Vault, Github, Sharepoint, MS Office
My Aim
I have held a deep-seated drive to apply myself with data in projects that have meaning and to make the world a little bit better and trying to understand problems. One of my drives is to apply this to Clinical & Medical Data analysis as I believe if we better understand how biological processes work, we can make the life of so many people better, and I wish to be part of that.
What I am working on now
I am currently working on the following data projects to further broaden my skills in data analysis, predictive modelling & database management: