Home       |     Overview      |     Candidate Login      |     Post Resume       |     Contact us
 
  
     
     
Search Jobs
     
Keywords,Title,Skills,Company  Location,City,State,Zip  
  Advanced Search
     
Python Pyspark Spark JSON Hadoop SQL Unix Environments. Excellent Communication Skill
 
Requirement id 115348
Job title Developer
Job location in DC Metro Commercial, DC
Skills required Python, Pyspark, Spark, JSON Hadoop SQL Unix Environments. Excel
Open Date 23-Mar-2021
Close Date
Job type Contract
Duration 7 Months
Compensation DOE
Status requirement ---
Job interview type ---
   Email Recruiter: coolsoft
Job Description Developer: Python, Pyspark, Spark, JSON Hadoop SQL Unix Environments. Excel

Note: Location: This position will be required to be onsite in Mclean, VA, following COVID. Interview Information: - 1st Round: Questionnaire (24-48 Hours to Complete) - 2nd Round: 60 Minute Final Interview.

Candidate must be our W2 Employee.

Job Description:

Senior Big Data Developer

Responsibilities include:

• Cleanse, manipulate and analyze large datasets (Structured and Unstructured data – XMLs, JSONs, PDFs) using Hadoop platform.

• Develop Python, PySpark, Spark scripts to filter/cleanse/map/aggregate data.

• Be able to build Dashboards in R/Shiny for end user consumption

• Manage and implement data processes (Data Quality reports)

• Develop data profiling, deduping logic, matching logic for analysis

• Programming Languages experience in Python, PySpark and Spark for data ingestion

• Programming experience in BigData platform using Hadoop platform

• Present ideas and recommendations on Hadoop and other technologies best use to management

Qualifications:

• 5+ years of experience in processing large volumes and variety of data (Structured and unstructured data, writing code for parallel processing, XMLS, JSONs, PDFs)

• 3+ years of programming experience in Hadoop, Spark, Python for data processing and analysis.

• Strong SQL experience is a must

• 3+ years of experience – using Hadoop platform and performing analysis. Familiarity with Hadoop cluster environment and configurations for resource management for analysis work

• Ability to work in a UNIX environment

• Detail oriented. Excellent communication skills (verbal and written)

• Must be able to manage multiple priorities and meet deadlines

• Degree in Computer Science, Statistics, Economics, Business, Mathematics or related field

Project Info / Reason for Opening: Supporting Risk Analysis project - they need to be comfortable with Python PySpark - This is really a development position. They get tons of data from the business, in a bunch of different formats (xml, json) they have to develop programs to parse all of these files into individual data elements and store in database.
 
Call 502-379-4456 Ext 100 for more details. Please provide Requirement id: 115348 while calling.
 
Other jobs in DC: Washington (6),
Python job openings in DC Metro Commercial, DC
Jobs List

Sr. Product Analyst - 73310
Create date: 25-Mar-2021
Note: *Please include answers to these question at the top of candidate resume. Submittals without answers will not be considered. Candidate responses will not be accepted via email. Additions or alterations to answers will not be accepted * 1) What challenge have you encountered in data projects? Interview Information: 2 Rounds 1st Round - 60 Minu.... (This job is for - Java Python Jobs in DC DCMetroCommercial Analyst - (in DC Metro Commercial, DC))

Python Developer - 72815
Create date: 10-Mar-2021
Note: Working on automating a solution for migration of FHFA project on the business resiliency/disaster recovery team.

Candidate must be our W2 Employee.

Job Description: Technology Infrastructure

• Strong knowledge of Unix, Database technologies, reading application logs, Autosys jobs.

• Perform automat.... (This job is for - Python Implementation Jobs in DC DCMetroCommercial Developer - (in DC Metro Commercial, DC))
 
 Python job openings in other states
Jobs List

NCAOC - Power BI Developer/Programmer (731551)
Create date: 20-Mar-2024
Remote

start date :04/01/2024

End date :06/30/2024

submission deadline :03/26/2024

client info :NCAOC

Description :

The North Carolina Administrative Office of the Courts (NCAOC) Fiscal Services Division is seeking a candidate to devel.... (This job is for - Python Jobs in NC Raleigh Developer - (in Raleigh, NC))

Software Engineer-121639
Create date: 01-Dec-2023
Start date :01-02-2024

End date :01-03-2025

Submission deadline : 12/6/2023

Client info : Starbucks Corporation - Seattle

Description :

Independently design and develop programs and tools to support data pipelines including ingestion, curation,.... (This job is for - Python Hadoop Jobs in WA Seattle Engineer - (in Seattle, WA))

System Analyst 6 - 116944 -SP
Create date: 11-May-2023
Start date : 05/29/2023
End Date : 1 Years from projected start date

Submission deadline :5/15 at 10am EST.

Client Info :DHHS

Note:
* Interview Process: Virtual Interview via MS Teams video. Please use laptop and be prepared so share screen if asked. Use of headphones is strongly discouraged. A screensho.... (This job is for - Python JavaScript Java Jobs in MI Lansing Analyst - (in Lansing, MI))

NJDOH Senior Consultant - Cloud Engineer (708389)
Create date: 12-Apr-2023
start date : 05/01/2023

End date : 05/31/2023

submission deadline : 4/17/2023

client info : DOH

Note ;

* Hybrid

Description :

The State of NJ is seeking an Cloud Data Engineer that will assist in maintaining, and monitoring i.... (This job is for - Python Jobs in NJ Trenton Engineer - (in Trenton, NJ))

Business Analyst (IT) III Senior - 20230407090751
Create date: 07-Apr-2023
Start date :4/28/2023
End Date :8+Months from the start date

Submission deadline :04-14-2023

Note:
* Job Site On-Site

Description :

Huntington Bank is seeking a passionate, data-savvy Analyst to join the Enterprise Analytics team to fuel our mission of growth through data-driven insights and opp.... (This job is for - Python Jobs in OH Columbus Analyst - (in Columbus, OH))
 
 
(Developer: Python, Pyspark, Spark, JSON Hadoop SQL Unix Environments. Excel in DC Metro Commercial, DC)
     
Search Jobs
     
Keywords,Title,Skills,Company  Location,City,State,Zip  
  Advanced Search
     

    About Us     Services    Privacy policy    Legal     Contact us