Where can I learn more about the best type of data to use for validating my hypothesis?

Are there any experts here who can guide me on the best real-time data to use for a hypothesis I am trying to validate? I am not a scientist. I am trying to assess real-time methane, CO2, and NO2 at a given latitude and longitude.