A Personal Opinion: 2017 Trends in Infocomm Jobs Posting on Careers@Gov

I am curious of the skills in demand for infocomm jobs within the Civil Service. In my own personal capability, I set forth to analyze and summarize my observations of the trends within infocomm job postings on Careers@Gov.


TeBaC-NET Design Considerations

In my previous post on TeBaC-NET, I talked about the reason why I created it. In this post, I talk about why I created it the way it is. Design Considerations #1 Cross Platform One of the most important consideration is that it should be platform agnostic. A simple tool that can run on any […]

Text Based Custom Named Entity Tagger (TeBaC-NET)

I was recently exploring spaCy¬†for some NLP work, and found that the default model was not sufficient for tagging entities in the domain I was exploring. The documentation was very helpful in explaining how I could train the statistical model of the named entity recognizer, but I needed training and evaluation data. While I could […]

Logging in Java Applications (Using java.util.logging Package)

Having a logging component within a library or service (especially in a microservices architecture) is useful to the downstream applications that use it. When properly set up, it allows them to choose their own logging implementation, and level of verbosity, without having to recompile the code. While there are many different logging implementations (e.g. log4j, […]

Late Night Coding – With and Without Git Rebase

Typical code repositories usually have the following branches (with multiple synonyms): master/production – a version of code that your actual customers use beta – a version of code that is used for UAT or just A/B testing alpha/pre-release – a version of code used for end-to-end and/or integration testing development – a version of code […]

Creating a Development CI Pipeline for Appian

A while back I did several posts on Appian, specifically one regarding setting it up on Docker. The objective was two fold – the first was to solve an important development CI pipeline challenge, and second was to evaluate if the software was deploy-able on a PaaS (ans: yes it is, but not via the […]

Apache Hadoop Data Capacity Planning

Planning capacity for a Hadoop cluster is not easy as there are many factors to consider – from the software, hardware, and data aspect. Planning a cluster with too little data capacity and/or processing power may limit the amount of operations/analytics that can be run on it, while planning for every possible scenario may be […]