GBM Data Engineer
City of London (Greater London) IT development
Job description
Role Title: GBM Data Engineer
Business: Operations
New or Existing Role? New
Grade: GCB4
GBM Data Vision
Build a world-class “data-driven” organization that rivals our competitors and inspires our employees, leveraging a revolutionary Data Analytics ecosystem to generate business insights and provide a great customer experience from well-managed and trusted data assets.
Our Mission Statement
The GBM Data Analytics Ecosystem solves complex problems using cutting-edge technologies, helping to rapidly turn data into insights that drive more informed decision making. We will deploy smart machines to process large and complex sets of data, which is impossible with legacy manual mining methods. Data underpins everything we do, from risk & regulatory management, through monetization, to predicting client behaviour.
The Team
Our Data Science & Engineering teams are partnering with IT to deliver an ecosystem of curated, enriched and protected sets of data – created from global, raw, structured and unstructured sources. Our GBM Big Data Lake is the largest aggregation of data ever assembled within HSBC. We have over 300 sources which equate to more than 20PTB of data, with a use case portfolio of over 110 projects that span all the business lines within GBM. We are utilising the latest machine learning tools and technologies to address these use cases and deliver value and truly unique insights.
The Opportunity
GBM are looking for Data Engineers who will work on collecting, storing, processing, and analysing huge sets of data. The primary focus will be on choosing optimal solutions for these purposes, then implementing, maintaining, and monitoring them. You will also be responsible for integrating them with the architecture used across the company and for helping to build out some of the core services that power our Machine Learning and advanced analytics systems.
Desired profile
Qualifications:
Required Skills
· Ability to process and rationalise structured data, message data and semi-structured/unstructured data, and to integrate multiple large data sources and databases into one system
· Proficient understanding of distributed computing principles and of the fundamental design principles behind a scalable application
· Strong knowledge of the Big Data ecosystem, with experience of Hortonworks/Cloudera platforms
· Practical experience in using HDFS.
· Practical expertise in developing applications and using querying tools on top of Hive, Spark (PySpark)
· Strong Scala skills
· Experience in Python, particularly the Anaconda environment and Python-based ML model deployment
· Experience of Continuous Integration/Continuous Deployment (Jenkins/Hudson/Ansible)
· Experience with using Git/GitLab as a version control system.
· Experience of working in teams using Agile methods (Scrum) and Confluence/JIRA
· Good communication skills (written and spoken), with the ability to engage with different stakeholders and to synthesize information.
Nice to Haves
· Knowledge of at least one Python web framework (preferably Flask, Tornado, and/or Twisted)
· Basic understanding of front-end technologies such as JavaScript, HTML5, and CSS3
· Good understanding of global markets, markets macrostructure and macroeconomics
· Knowledge of the Elasticsearch stack (ELK)
· Experience with Google Cloud Platform (Dataproc/Dataflow)
As a business operating in markets all around the world, we believe diversity brings benefits for our customers, our business and our people. This is why HSBC is committed to being an inclusive employer and encourages applications from all suitably qualified applicants irrespective of background, circumstances, age, disability, gender identity, ethnicity, religion or belief and sexual orientation.
We want everyone to be able to fulfil their potential, which is why we provide a range of flexible working arrangements and family-friendly policies.
https://www.hsbc.co.uk/1/2/popups/uk-privacy-statement#/ overview