I have the following txt file:
Code:
source.txt:
<a href="/gp/product/bold.jsp?tp=&add=B007OZNZG0"><a href="https://www.amazon.com/gp/product/utility/edit-one-click-pref.html?ie=UTF8&query=*entries*%3D0%2C*Version*%3D1&returnPath=%2Fgp%2Fproduct%2FB007OZNZG0" id="oneClickSignInLinkID">Sign in</a> to turn on 1-Click ordering.
<tr><a href="/gp/product/bold.jsp?tp=&add=B007OZAJSDH"><td align="right" style="font-weight:....
...
which is essentially a bunch of junk with some links that contain "bold.jsp" (bolded above).
What I want to do is:
1- extract the bold parts (the links containing bold.jsp) and write them to a new txt file, one per line
2- add "http:www.amazon.com" to the beginning of each line
So the output would be:
Code:
output.txt:
http:www.amazon.com/gp/product/bold.jsp?tp=&add=B007OZNZG0
http:www.amazon.com/gp/product/bold.jsp?tp=&add=B007OZAJSDH
I am using Cygwin on Windows.
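For reference, here is a rough sketch of the two steps in Python (which can be installed under Cygwin). It is only one possible approach, and it makes some assumptions: the wanted links always appear as href="..." values that start with "/" and contain bold.jsp, and the file names are source.txt and output.txt as above. The names extract_links, LINK_RE and PREFIX are made up for this sketch. The prefix is kept exactly as written in step 2; change it if "http://www.amazon.com" is what is actually needed.

```python
import re
from pathlib import Path

# Prefix copied as written in the post; adjust if "http://" was intended.
PREFIX = "http:www.amazon.com"

# Capture href values that start with "/" and contain "bold.jsp".
# [^"]* cannot cross a closing quote, so each match stays inside one href.
LINK_RE = re.compile(r'href="(/[^"]*bold\.jsp[^"]*)"')


def extract_links(text, prefix=PREFIX):
    """Return every bold.jsp path found in text, with prefix prepended."""
    return [prefix + path for path in LINK_RE.findall(text)]


def main(src="source.txt", dst="output.txt"):
    """Read src, extract the links, and write them to dst, one per line."""
    text = Path(src).read_text(encoding="utf-8", errors="replace")
    with open(dst, "w", encoding="utf-8") as out:
        for link in extract_links(text):
            out.write(link + "\n")


if __name__ == "__main__":
    # Only run when the input file is present in the current directory.
    if Path("source.txt").exists():
        main()
```

Saved as, say, extract.py (a hypothetical name) next to source.txt, it would be run with "python extract.py". Absolute links such as the "Sign in" one are skipped because their href does not start with "/".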
Thank you for your help.