Bioinformatics/Python: a quick help needed with a simple data analysis

$10-30 USD

Completed

Posted

about 1 year ago

$10-30 USD

Paid on delivery

Hi there, For those of you familiar with bioinformatics, I have an input .csv file (attached) that contains a table with Uniprot IDs that correspond to the proteins of my interest. For each organism listed in the column A, I need to calculate the average % of I, V, Y, W, R, E, and L amino acid residues in these proteins. It means that for each organisms we need to merge their protein sequences into one long sequence and then simply divide the total count of I, V, Y, W, R, E, and L letters by the total number of all letters in this sequence. Please ask if something is unclear. BW

Project ID: 36581989

About the project

6 proposals

Remote project

Active 1 yr ago

Looking to make some money?

Email address

Benefits of bidding on Freelancer

Set your budget and timeframe

Get paid for your work

Outline your proposal

It's free to sign up and bid on jobs

Awarded to:

@Fazeennazar

*** Python Expert for your project : CSV data analysis : average values for some columns based on Column A *** I read your project description very carefully. I have a deep understanding and experience in the areas of python that you mentioned. I've previously worked on so many projects for other employers. Here is my profile URL: https://www.freelancer.com/u/Fazeennazar Check out my past reviews and skills. So, I would like to go through more specific discussions with you to provide successful results. Thank you, Mohamed F.

$30 USD in 1 day

5.0

(217 reviews)

7.3

6 freelancers are bidding on average $93 USD for this job

@marufadnan16

Greetings, This is Maruf. I read your project description. My team is working with these stuffs (Python, Data Mining, Data Scraping and Bioinformatics) for almost 5 years and we are here to assist you now. I am checking your attachment, I'll update you shortly... Feel free to contact us to discuss about your project. A fast response is appreciated. Thanks.

$25 USD in 3 days

0.0

(0 reviews)

0.0

@YulianOptimize

Hello, Sergey M. I am the best fit as I've worked on a similar project before. I am very familiar with skills including Data Mining, Bioinformatics, Python and Data Scraping. Please review my profile and portfolios. https://www.freelancer.com/u/YulianOptimize Please feel free to contact me with any questions or to discuss my bid further. I look forward to hearing from you soon. Sincerely, Yulian

$200 USD in 1 day

0.0

(0 reviews)

0.0

@moazizmiami

I understand that you have an input .csv file containing Uniprot IDs and you need to calculate the average percentage of specific amino acid residues in the proteins associated with each organism mentioned in column A. To accomplish this, we can follow these steps: Read the input .csv file: We'll start by parsing the .csv file to extract the necessary information, specifically the Uniprot IDs and the corresponding organisms. Retrieve protein sequences: Using the Uniprot IDs, we'll fetch the protein sequences for each organism from the Uniprot database or any relevant protein database. Merge protein sequences: Once we have the protein sequences for each organism, we'll merge them into a single long sequence for each organism. Calculate the average percentage: Next, we'll count the occurrences of the specified amino acid residues (I, V, Y, W, R, E, and L) in each merged sequence. We'll then divide the total count of these residues by the total number of all letters in the sequence to obtain the average percentage. Generate results: Finally, we'll store the calculated average percentages for each organism, which can be outputted in a suitable format such as a new .csv file or any other desired format. If you have any specific requirements regarding the programming language or any additional considerations, please let me know. I'm here to assist you further and answer any questions you may have. Best regards,

$50 USD in 7 days