Skip to main content
added 90 characters in body
Source Link
JJoao
  • 12.8k
  • 1
  • 26
  • 45

File1 tax_id GeneID Symbol LocusTag Synonyms dbXrefs chromosome map_location description type_of_gene Symbol_from_nomenclature_authority Full_name_from_nomenclature_authority Nomenclature_status Other_designations Modification_date 7 5692769 NEWENTRY - - - - - Record to support submission of GeneRIFs for a gene not in Gene (Azotirhizobium caulinodans. Use when strain, subtype, isolate, etc. is unspecified, or when different from all specified ones in Gene.). other - - - - 20160818 9 1246500 repA1 pLeuDn_01 - - - - putative replication-associated protein protein-coding - - - - 20160813 9 1246501 repA2 pLeuDn_03 - - - - putative replication-associated protein protein-coding - - - - 20160716 9 1246502 leuA pLeuDn_04 - - - - 2-isopropylmalate synthase protein-coding - - - - 20160903 9 1246503 leuB pLeuDn_05 - - - - 3-isopropylmalate dehydrogenase protein-coding - - - - 20150520 9 1246504 leuC pLeuDn_06 - - - - isopropylmalate isomerase large subunit protein-coding - - - - 20160806 9 1246505 leuD pLeuDn_07 - - - - isopropylmalate isomerase small subunit protein-coding - - - - 20160730 9 1246509 ibp pBPS1_01 - - - - Ibp protein protein-coding - - - - 20150801 9 1246510 repA1 pBPS1_02 - - - - repA1 protein protein-coding - - - - 20160813

tax_id GeneID  Symbol  LocusTag        Synonyms        dbXrefs chromosome      map_location    description     type_of_gene    Symbol_from_nomenclature_authority      Full_name_from_nomenclature_authority Nomenclature_status      Other_designations      Modification_date
7       5692769 NEWENTRY        -       -       -       -       -       Record to support submission of GeneRIFs for a gene not in Gene (Azotirhizobium caulinodans.  Use when strain, subtype, isolate, etc. is unspecified, or when different from all specified ones in Gene.).     other   -       -       -       -       20160818
9       1246500 repA1   pLeuDn_01       -       -       -       -       putative replication-associated protein protein-coding  -       -       -       -       20160813
9       1246501 repA2   pLeuDn_03       -       -       -       -       putative replication-associated protein protein-coding  -       -       -       -       20160716
9       1246502 leuA    pLeuDn_04       -       -       -       -       2-isopropylmalate synthase      protein-coding  -       -       -       -       20160903
9       1246503 leuB    pLeuDn_05       -       -       -       -       3-isopropylmalate dehydrogenase protein-coding  -       -       -       -       20150520
9       1246504 leuC    pLeuDn_06       -       -       -       -       isopropylmalate isomerase large subunit protein-coding  -       -       -       -       20160806
9       1246505 leuD    pLeuDn_07       -       -       -       -       isopropylmalate isomerase small subunit protein-coding  -       -       -       -       20160730
9       1246509 ibp     pBPS1_01        -       -       -       -       Ibp protein     protein-coding  -       -       -       -       20150801
9       1246510 repA1   pBPS1_02        -       -       -       -       repA1 protein   protein-coding  -       -       -       -       20160813

File2 sacX arcB metB sprT adrB_2 fadD trpC ansP2 group_1428 plsX repA

sacX
arcB
metB
sprT
adrB_2
fadD
trpC
ansP2
group_1428
plsX
repA

File1 tax_id GeneID Symbol LocusTag Synonyms dbXrefs chromosome map_location description type_of_gene Symbol_from_nomenclature_authority Full_name_from_nomenclature_authority Nomenclature_status Other_designations Modification_date 7 5692769 NEWENTRY - - - - - Record to support submission of GeneRIFs for a gene not in Gene (Azotirhizobium caulinodans. Use when strain, subtype, isolate, etc. is unspecified, or when different from all specified ones in Gene.). other - - - - 20160818 9 1246500 repA1 pLeuDn_01 - - - - putative replication-associated protein protein-coding - - - - 20160813 9 1246501 repA2 pLeuDn_03 - - - - putative replication-associated protein protein-coding - - - - 20160716 9 1246502 leuA pLeuDn_04 - - - - 2-isopropylmalate synthase protein-coding - - - - 20160903 9 1246503 leuB pLeuDn_05 - - - - 3-isopropylmalate dehydrogenase protein-coding - - - - 20150520 9 1246504 leuC pLeuDn_06 - - - - isopropylmalate isomerase large subunit protein-coding - - - - 20160806 9 1246505 leuD pLeuDn_07 - - - - isopropylmalate isomerase small subunit protein-coding - - - - 20160730 9 1246509 ibp pBPS1_01 - - - - Ibp protein protein-coding - - - - 20150801 9 1246510 repA1 pBPS1_02 - - - - repA1 protein protein-coding - - - - 20160813

File2 sacX arcB metB sprT adrB_2 fadD trpC ansP2 group_1428 plsX repA

File1

tax_id GeneID  Symbol  LocusTag        Synonyms        dbXrefs chromosome      map_location    description     type_of_gene    Symbol_from_nomenclature_authority      Full_name_from_nomenclature_authority Nomenclature_status      Other_designations      Modification_date
7       5692769 NEWENTRY        -       -       -       -       -       Record to support submission of GeneRIFs for a gene not in Gene (Azotirhizobium caulinodans.  Use when strain, subtype, isolate, etc. is unspecified, or when different from all specified ones in Gene.).     other   -       -       -       -       20160818
9       1246500 repA1   pLeuDn_01       -       -       -       -       putative replication-associated protein protein-coding  -       -       -       -       20160813
9       1246501 repA2   pLeuDn_03       -       -       -       -       putative replication-associated protein protein-coding  -       -       -       -       20160716
9       1246502 leuA    pLeuDn_04       -       -       -       -       2-isopropylmalate synthase      protein-coding  -       -       -       -       20160903
9       1246503 leuB    pLeuDn_05       -       -       -       -       3-isopropylmalate dehydrogenase protein-coding  -       -       -       -       20150520
9       1246504 leuC    pLeuDn_06       -       -       -       -       isopropylmalate isomerase large subunit protein-coding  -       -       -       -       20160806
9       1246505 leuD    pLeuDn_07       -       -       -       -       isopropylmalate isomerase small subunit protein-coding  -       -       -       -       20160730
9       1246509 ibp     pBPS1_01        -       -       -       -       Ibp protein     protein-coding  -       -       -       -       20150801
9       1246510 repA1   pBPS1_02        -       -       -       -       repA1 protein   protein-coding  -       -       -       -       20160813

File2

sacX
arcB
metB
sprT
adrB_2
fadD
trpC
ansP2
group_1428
plsX
repA
added 1846 characters in body
Source Link
AudileF
  • 185
  • 3
  • 11

I have a large tab file with 15 columns (FILE1) and a list (FILE2) of names which should appear in the table. The problem is the name may appear in columns 4 to 10 in FILE1 and it may not be a case match.

I want a command which searches line for a hit and then print the whole line. Preferably this would not be case sensitive and would not print lines where the names in FILE2 are part of a larger word.

I have tried the following:

grep -Fwf FILE2 FILE1 > out 
xargs -I {} grep "^{}" FILE1 < FILE2 > out 

the first just copies FILE1 into out. The second give a blank out file.

I've also tried a few awk commands which will either give an empty out file or as above copy FILE1. I'm trying to improve my Linux skills at the moment so if possible, if you explain your method I would be very grateful.

File1 FILE1tax_id GeneID Symbol LocusTag Synonyms dbXrefs chromosome map_location description type_of_gene Symbol_from_nomenclature_authority Full_name_from_nomenclature_authority Nomenclature_status Other_designations Modification_date 7 5692769 NEWENTRY - - - - - Record to support submission of GeneRIFs for a gene not in Gene (Azotirhizobium caulinodans. Use when strain, subtype, isolate, etc. is unspecified, or when different from all specified ones in Gene.). other - - - - 20160818 9 1246500 repA1 pLeuDn_01 - - - - putative replication-associated protein protein-coding - - - - 20160813 9 1246501 repA2 pLeuDn_03 - - - - putative replication-associated protein protein-coding - - - - 20160716 9 1246502 leuA pLeuDn_04 - - - - 2-isopropylmalate synthase protein-coding - - - - 20160903 9 1246503 leuB pLeuDn_05 - - - - 3-isopropylmalate dehydrogenase protein-coding - - - - 20150520 9 1246504 leuC pLeuDn_06 - - - - isopropylmalate isomerase large subunit protein-coding - - - - 20160806 9 1246505 leuD pLeuDn_07 - - - - isopropylmalate isomerase small subunit protein-coding - - - - 20160730 9 1246509 ibp pBPS1_01 - - - - Ibp protein protein-coding - - - - 20150801 9 1246510 repA1 pBPS1_02 - - - - repA1 protein protein-coding - - - - 20160813

File2 FILE2sacX arcB metB sprT adrB_2 fadD trpC ansP2 group_1428 plsX repA

I have a large tab file with 15 columns (FILE1) and a list (FILE2) of names which should appear in the table. The problem is the name may appear in columns 4 to 10 in FILE1 and it may not be a case match.

I want a command which searches line for a hit and then print the whole line. Preferably this would not be case sensitive and would not print lines where the names in FILE2 are part of a larger word.

I have tried the following:

grep -Fwf FILE2 FILE1 > out 
xargs -I {} grep "^{}" FILE1 < FILE2 > out 

the first just copies FILE1 into out. The second give a blank out file.

I've also tried a few awk commands which will either give an empty out file or as above copy FILE1. I'm trying to improve my Linux skills at the moment so if possible, if you explain your method I would be very grateful.

File1 FILE1

File2 FILE2

I have a large tab file with 15 columns (FILE1) and a list (FILE2) of names which should appear in the table. The problem is the name may appear in columns 4 to 10 in FILE1 and it may not be a case match.

I want a command which searches line for a hit and then print the whole line. Preferably this would not be case sensitive and would not print lines where the names in FILE2 are part of a larger word.

I have tried the following:

grep -Fwf FILE2 FILE1 > out 
xargs -I {} grep "^{}" FILE1 < FILE2 > out 

the first just copies FILE1 into out. The second give a blank out file.

I've also tried a few awk commands which will either give an empty out file or as above copy FILE1. I'm trying to improve my Linux skills at the moment so if possible, if you explain your method I would be very grateful.

File1 tax_id GeneID Symbol LocusTag Synonyms dbXrefs chromosome map_location description type_of_gene Symbol_from_nomenclature_authority Full_name_from_nomenclature_authority Nomenclature_status Other_designations Modification_date 7 5692769 NEWENTRY - - - - - Record to support submission of GeneRIFs for a gene not in Gene (Azotirhizobium caulinodans. Use when strain, subtype, isolate, etc. is unspecified, or when different from all specified ones in Gene.). other - - - - 20160818 9 1246500 repA1 pLeuDn_01 - - - - putative replication-associated protein protein-coding - - - - 20160813 9 1246501 repA2 pLeuDn_03 - - - - putative replication-associated protein protein-coding - - - - 20160716 9 1246502 leuA pLeuDn_04 - - - - 2-isopropylmalate synthase protein-coding - - - - 20160903 9 1246503 leuB pLeuDn_05 - - - - 3-isopropylmalate dehydrogenase protein-coding - - - - 20150520 9 1246504 leuC pLeuDn_06 - - - - isopropylmalate isomerase large subunit protein-coding - - - - 20160806 9 1246505 leuD pLeuDn_07 - - - - isopropylmalate isomerase small subunit protein-coding - - - - 20160730 9 1246509 ibp pBPS1_01 - - - - Ibp protein protein-coding - - - - 20150801 9 1246510 repA1 pBPS1_02 - - - - repA1 protein protein-coding - - - - 20160813

File2 sacX arcB metB sprT adrB_2 fadD trpC ansP2 group_1428 plsX repA

added 147 characters in body
Source Link
AudileF
  • 185
  • 3
  • 11

I have a large tab file with 15 columns (FILE1) and a list (FILE2) of names which should appear in the table. The problem is the name may appear in columns 4 to 10 in FILE1 and it may not be a case match.

I want a command which searches line for a hit and then print the whole line. Preferably this would not be case sensitive and would not print lines where the names in FILE2 are part of a larger word.

I have tried the following:

grep -Fwf FILE2 FILE1 > out 
xargs -I {} grep "^{}" FILE1 < FILE2 > out 

the first just copies FILE1 into out. The second give a blank out file.

I've also tried a few awk commands which will either give an empty out file or as above copy FILE1. I'm trying to improve my Linux skills at the moment so if possible, if you explain your method I would be very grateful.

File1 FILE1

File2 FILE2

I have a large tab file with 15 columns (FILE1) and a list (FILE2) of names which should appear in the table. The problem is the name may appear in columns 4 to 10 in FILE1 and it may not be a case match.

I want a command which searches line for a hit and then print the whole line. Preferably this would not be case sensitive and would not print lines where the names in FILE2 are part of a larger word.

I have tried the following:

grep -Fwf FILE2 FILE1 > out 
xargs -I {} grep "^{}" FILE1 < FILE2 > out 

the first just copies FILE1 into out. The second give a blank out file.

I've also tried a few awk commands which will either give an empty out file or as above copy FILE1. I'm trying to improve my Linux skills at the moment so if possible, if you explain your method I would be very grateful.

I have a large tab file with 15 columns (FILE1) and a list (FILE2) of names which should appear in the table. The problem is the name may appear in columns 4 to 10 in FILE1 and it may not be a case match.

I want a command which searches line for a hit and then print the whole line. Preferably this would not be case sensitive and would not print lines where the names in FILE2 are part of a larger word.

I have tried the following:

grep -Fwf FILE2 FILE1 > out 
xargs -I {} grep "^{}" FILE1 < FILE2 > out 

the first just copies FILE1 into out. The second give a blank out file.

I've also tried a few awk commands which will either give an empty out file or as above copy FILE1. I'm trying to improve my Linux skills at the moment so if possible, if you explain your method I would be very grateful.

File1 FILE1

File2 FILE2

reformat question, add relevant tag
Source Link
AudileF
  • 185
  • 3
  • 11
Loading
Source Link
AudileF
  • 185
  • 3
  • 11
Loading