At T.S.I., I’m continuing my work with the Parasecurity Group, spearheading our research on Large Language Models (LLMs) and artificial intelligence. I am developing a Mixture-of-Experts LLM architecture that uses alternative attention schemes and other optimizations to improve efficiency.
- Developing advanced neural network architectures and deep learning optimization techniques
- Researching AI model compression techniques for real-world deployment
- Working on my MSc thesis: Retentive Networks for malicious code detection
- Contributing to EU Horizon Projects on cybersecurity applications
- Publishing research on Large Language Models and AI for cybersecurity